Chapter 6: The DFT & FFT Algorithms
The Discrete Fourier Transform bridges continuous-domain theory with practical computation, while the Fast Fourier Transform makes it computationally feasible. This chapter covers the DFT definition, the Cooley-Tukey radix-2 algorithm, spectral leakage, windowing, zero-padding, and power spectral density estimation.
6.1 The Discrete Fourier Transform
Given a finite-length discrete-time signal $x[n]$ of length $N$, the Discrete Fourier Transform (DFT) maps it to a set of $N$ frequency-domain coefficients. Unlike the DTFT, which produces a continuous frequency spectrum, the DFT samples the frequency axis at $N$ equally spaced points.
Definition — The DFT and IDFT
The **DFT** of a length-$N$ sequence $x[n]$ is:
$$X[k] = \sum_{n=0}^{N-1} x[n]\, e^{-j2\pi kn/N}, \quad k = 0, 1, \ldots, N-1$$
The **Inverse DFT (IDFT)** recovers the time-domain signal:
$$x[n] = \frac{1}{N} \sum_{k=0}^{N-1} X[k]\, e^{j2\pi kn/N}, \quad n = 0, 1, \ldots, N-1$$
The Twiddle Factor
The complex exponential kernel appears so frequently that it is given a special symbol, the twiddle factor:
$$W_N = e^{-j2\pi/N}$$
In this notation the DFT becomes $X[k] = \sum_{n=0}^{N-1} x[n]\, W_N^{kn}$. The twiddle factor satisfies important symmetry properties:
- Periodicity: $W_N^{k+N} = W_N^k$
- Symmetry: $W_N^{k+N/2} = -W_N^k$
- Recursion: $W_N^{2} = W_{N/2}$
Computational Complexity
Direct evaluation of the DFT requires $N$ multiplications and $N-1$ additions for each of the $N$ output bins, yielding an overall complexity of $\mathcal{O}(N^2)$. For a signal with $N = 10^6$ samples, this means $10^{12}$ complex multiplications — utterly impractical without the FFT.
Example — DFT of a Simple Sequence
Let $x = [1, 2, 3, 4]$ so $N = 4$ and $W_4 = e^{-j\pi/2} = -j$. Then:
- $X[0] = 1 + 2 + 3 + 4 = 10$
- $X[1] = 1 + 2(-j) + 3(-1) + 4(j) = -2 + 2j$
- $X[2] = 1 + 2(-1) + 3(1) + 4(-1) = -2$
- $X[3] = 1 + 2(j) + 3(-1) + 4(-j) = -2 - 2j$
Note: $X[3] = X[1]^*$ since $x[n]$ is real — the DFT of a real signal is conjugate-symmetric.
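The arithmetic above is easy to check numerically; a minimal sketch using `numpy.fft.fft`, which computes the same DFT sum defined earlier:

```python
import numpy as np

# DFT of x = [1, 2, 3, 4]: numpy.fft.fft evaluates
# X[k] = sum_n x[n] exp(-j 2 pi k n / N), matching the definition above
x = np.array([1.0, 2.0, 3.0, 4.0])
X = np.fft.fft(x)
print(np.round(X, 6))  # [10.+0.j -2.+2.j -2.+0.j -2.-2.j]
```

The conjugate symmetry $X[3] = X[1]^*$ is visible directly in the output.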
The DFT implicitly treats $x[n]$ as one period of a periodic sequence with period $N$. This "periodic extension" assumption is the root cause of many practical issues such as spectral leakage and circular (rather than linear) convolution.
6.2 The Cooley-Tukey FFT
The Fast Fourier Transform (FFT) is not a different transform — it is an efficient algorithm for computing the DFT. The most widely used variant is the Cooley-Tukey radix-2 decimation-in-time (DIT) algorithm, published by Cooley and Tukey in 1965 (though Gauss knew a version of it in 1805).
Divide and Conquer
Assume $N$ is a power of 2. Split $x[n]$ into even-indexed and odd-indexed subsequences:
$$X[k] = \underbrace{\sum_{m=0}^{N/2-1} x[2m]\, W_{N/2}^{mk}}_{E[k]} \;+\; W_N^k \underbrace{\sum_{m=0}^{N/2-1} x[2m+1]\, W_{N/2}^{mk}}_{O[k]}$$
Here $E[k]$ is the $N/2$-point DFT of the even samples and $O[k]$ is the $N/2$-point DFT of the odd samples. Using the symmetry $W_N^{k+N/2} = -W_N^k$, we obtain:
Theorem — Radix-2 DIT Butterfly
For $k = 0, 1, \ldots, N/2 - 1$:
$$X[k] = E[k] + W_N^k \cdot O[k]$$
$$X[k + N/2] = E[k] - W_N^k \cdot O[k]$$
Each **butterfly** requires one complex multiplication and two complex additions.
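The two butterfly equations translate directly into a recursive implementation. The following is a teaching sketch, not a production FFT (real libraries use an iterative, in-place formulation):

```python
import numpy as np

def fft_recursive(x):
    """Radix-2 decimation-in-time FFT; len(x) must be a power of 2."""
    N = len(x)
    if N == 1:
        return np.asarray(x, dtype=complex)
    E = fft_recursive(x[0::2])                       # N/2-point DFT of even samples
    O = fft_recursive(x[1::2])                       # N/2-point DFT of odd samples
    W = np.exp(-2j * np.pi * np.arange(N // 2) / N)  # twiddle factors W_N^k
    return np.concatenate([E + W * O,                # X[k],       k = 0..N/2-1
                           E - W * O])               # X[k + N/2]

x = np.arange(8, dtype=float)
print(np.allclose(fft_recursive(x), np.fft.fft(x)))  # True
```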
Complexity Analysis
The recursion splits the $N$-point DFT into two $N/2$-point DFTs plus $N/2$ butterfly operations. With $\log_2 N$ stages of splitting:
$$T(N) = 2\,T(N/2) + \mathcal{O}(N) \;\;\Longrightarrow\;\; T(N) = \mathcal{O}(N \log N)$$
For $N = 10^6$, this is roughly $2 \times 10^7$ operations instead of $10^{12}$ — a speedup of $50{,}000\times$.
Bit-Reversal Permutation
The DIT algorithm requires the input to be rearranged in bit-reversed order. For $N = 8$, the index mapping is:
| Original index | Binary | Reversed | New index |
|---|---|---|---|
| 0 | 000 | 000 | 0 |
| 1 | 001 | 100 | 4 |
| 2 | 010 | 010 | 2 |
| 3 | 011 | 110 | 6 |
| 4 | 100 | 001 | 1 |
| 5 | 101 | 101 | 5 |
| 6 | 110 | 011 | 3 |
| 7 | 111 | 111 | 7 |
In practice, most FFT libraries (NumPy, FFTW) handle the bit-reversal internally. There also exist **decimation-in-frequency (DIF)** algorithms that reverse the output instead of the input, and mixed-radix algorithms for non-power-of-2 sizes.
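The table's mapping can be generated programmatically; a small sketch (the helper name `bit_reverse_indices` is ours, not a library function):

```python
def bit_reverse_indices(N):
    """Indices 0..N-1 in bit-reversed order (N must be a power of 2)."""
    bits = N.bit_length() - 1
    # Write each index as a fixed-width binary string, reverse it, reparse
    return [int(format(i, f"0{bits}b")[::-1], 2) for i in range(N)]

print(bit_reverse_indices(8))  # [0, 4, 2, 6, 1, 5, 3, 7]
```

The printed permutation matches the "New index" column of the table above.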
DFT vs FFT Speed Comparison
Empirically compare the O(N^2) naive DFT against NumPy's O(N log N) FFT. Watch the speedup grow as N increases.
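In the spirit of this comparison, a minimal timing sketch (exact timings depend on hardware; the naive DFT here is the textbook $\mathcal{O}(N^2)$ matrix product):

```python
import time
import numpy as np

def naive_dft(x):
    """Direct O(N^2) DFT via the full matrix of twiddle factors."""
    N = len(x)
    n = np.arange(N)
    W = np.exp(-2j * np.pi * np.outer(n, n) / N)  # W[k, n] = W_N^{kn}
    return W @ x

rng = np.random.default_rng(0)
x = rng.standard_normal(2048)

t0 = time.perf_counter(); X_naive = naive_dft(x);  t_naive = time.perf_counter() - t0
t0 = time.perf_counter(); X_fast = np.fft.fft(x);  t_fft = time.perf_counter() - t0
print(f"naive: {t_naive:.4f} s, fft: {t_fft:.6f} s, "
      f"results match: {np.allclose(X_naive, X_fast)}")
```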
6.3 DFT Properties
Many properties of the DTFT carry over to the DFT, but with the critical distinction that all operations are circular (modulo $N$) rather than linear.
Theorem — Linearity
If $x_1[n] \xleftrightarrow{\text{DFT}} X_1[k]$ and $x_2[n] \xleftrightarrow{\text{DFT}} X_2[k]$, then:
$$\alpha\, x_1[n] + \beta\, x_2[n] \;\xleftrightarrow{\text{DFT}}\; \alpha\, X_1[k] + \beta\, X_2[k]$$
Theorem — Circular Shift
A circular time shift by $m$ samples corresponds to a linear phase in frequency:
$$x[(n - m) \bmod N] \;\xleftrightarrow{\text{DFT}}\; W_N^{mk}\, X[k]$$
Similarly, a circular frequency shift gives: $W_N^{-ln}\, x[n] \;\xleftrightarrow{\text{DFT}}\; X[(k - l) \bmod N]$.
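Both shift theorems are easy to verify numerically. A quick sketch of the time-shift case, using `np.roll` for the circular shift:

```python
import numpy as np

N, m = 8, 3
x = np.arange(N, dtype=float)
k = np.arange(N)

# np.roll(x, m)[n] == x[(n - m) mod N], i.e. a circular shift by m samples
lhs = np.fft.fft(np.roll(x, m))
rhs = np.exp(-2j * np.pi * m * k / N) * np.fft.fft(x)  # W_N^{mk} X[k]
print(np.allclose(lhs, rhs))  # True
```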
Theorem — Circular Convolution
The DFT of a pointwise product in one domain is circular convolution in the other:
$$x_1[n] \circledast x_2[n] = \sum_{m=0}^{N-1} x_1[m]\, x_2[(n-m) \bmod N] \;\xleftrightarrow{\text{DFT}}\; X_1[k] \cdot X_2[k]$$
To compute **linear** convolution via the DFT, zero-pad both sequences to length $N \geq N_1 + N_2 - 1$ before transforming.
Theorem — Parseval's Theorem for the DFT
The total energy in the time domain equals the total energy in the frequency domain (up to $1/N$):
$$\sum_{n=0}^{N-1} |x[n]|^2 = \frac{1}{N} \sum_{k=0}^{N-1} |X[k]|^2$$
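A one-line numerical check of the identity:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal(64)
X = np.fft.fft(x)

# Time-domain energy equals frequency-domain energy divided by N
ok = np.allclose(np.sum(np.abs(x) ** 2), np.sum(np.abs(X) ** 2) / len(x))
print(ok)  # True
```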
Example — Linear Convolution via DFT
Given $h = [1, 1, 1]$ (length $N_1 = 3$) and $x = [1, 2, 3, 4]$ (length $N_2 = 4$): linear convolution has length $N_1 + N_2 - 1 = 6$.
- Zero-pad both to length $N = 8$ (next power of 2 $\geq 6$)
- Compute $H = \text{FFT}(h_{\text{padded}})$ and $X = \text{FFT}(x_{\text{padded}})$
- $Y = H \odot X$ (element-wise product)
- $y = \text{IFFT}(Y) = [1, 3, 6, 9, 7, 4, 0, 0]$ — trim to the first 6 values: $[1, 3, 6, 9, 7, 4]$
This fast-convolution technique, extended to long signals by the **overlap-add** and **overlap-save** block methods, is the basis of fast filtering in every modern DSP system, from audio plugins to radar signal processors. The cost of two FFTs + pointwise multiply + one IFFT is $\mathcal{O}(N \log N)$, far better than $\mathcal{O}(N^2)$ direct convolution for large $N$.
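The four steps of the worked example map onto a short sketch (the helper name `fft_convolve` is ours, not a library function):

```python
import numpy as np

def fft_convolve(h, x):
    """Linear convolution via zero-padded FFTs (single-block fast convolution)."""
    L = len(h) + len(x) - 1                    # linear-convolution output length
    N = 1 << (L - 1).bit_length()              # next power of 2 >= L
    Y = np.fft.fft(h, N) * np.fft.fft(x, N)    # fft(a, N) zero-pads a to length N
    return np.real(np.fft.ifft(Y))[:L]         # real inputs: drop tiny imag parts

print(np.round(fft_convolve([1, 1, 1], [1, 2, 3, 4]), 6))  # [1. 3. 6. 9. 7. 4.]
```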
6.4 Spectral Leakage & Windowing
Why Does Leakage Happen?
Computing the DFT of a finite-length signal is equivalent to multiplying an infinite signal by a rectangular window, then taking the DFT. In the frequency domain, this multiplication becomes convolution with the Dirichlet kernel (the DFT of the rectangular window):
$$W_{\text{rect}}(\omega) = e^{-j\omega(N-1)/2} \cdot \frac{\sin(N\omega/2)}{\sin(\omega/2)}$$
The main lobe has width $4\pi/N$, and the sidelobes decay slowly at only $\sim 1/k$. When a signal's frequency does not land exactly on a DFT bin, energy "leaks" into all bins through these sidelobes.
Window Functions
Applying a tapered window before the DFT trades a wider main lobe (reduced frequency resolution) for much lower sidelobes (less leakage). The five most common windows:
Definition — Common Window Functions
- Hann (Hanning): $w[n] = 0.5 - 0.5\cos\!\left(\frac{2\pi n}{N-1}\right)$. Sidelobe level: $-31.5$ dB; main lobe width: $8\pi/N$.
- Hamming: $w[n] = 0.54 - 0.46\cos\!\left(\frac{2\pi n}{N-1}\right)$. Sidelobe level: $-42.7$ dB; main lobe width: $8\pi/N$.
- Blackman: $w[n] = 0.42 - 0.5\cos\!\left(\frac{2\pi n}{N-1}\right) + 0.08\cos\!\left(\frac{4\pi n}{N-1}\right)$. Sidelobe level: $-58.1$ dB; main lobe width: $12\pi/N$.
- Bartlett (triangular): $w[n] = 1 - \left|\frac{2n - (N-1)}{N-1}\right|$. Sidelobe level: $-26.5$ dB; main lobe width: $8\pi/N$.
- Nuttall (4-term): $w[n] = a_0 - a_1\cos\!\left(\frac{2\pi n}{N-1}\right) + a_2\cos\!\left(\frac{4\pi n}{N-1}\right) - a_3\cos\!\left(\frac{6\pi n}{N-1}\right)$. Sidelobe level: $-93.3$ dB; excellent for dynamic range.
Resolution vs. Leakage Trade-off
There is a fundamental trade-off: a narrower main lobe gives better frequency resolution but higher sidelobes (more leakage), and vice versa. The rectangular window has the narrowest main lobe ($4\pi/N$) but the worst sidelobes ($-13$ dB). The Nuttall window has sidelobes at $-93$ dB but a main lobe 4x wider.
**Rule of thumb:** Use Hann for general-purpose spectral analysis. Use Blackman or Nuttall when you need to detect weak signals near strong ones (high dynamic range). Use rectangular only when the signal is guaranteed to be exactly periodic in the window.
Spectral Leakage and Windowing
Visualize how different window functions suppress spectral leakage for a sinusoid whose frequency doesn't land on a DFT bin.
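Along the lines of this demo, a sketch comparing leakage floors for a deliberately off-bin tone (the 20-bin exclusion zone around the peak is an arbitrary choice for this illustration):

```python
import numpy as np

fs, N = 1000.0, 256
n = np.arange(N)
f0 = 52.6                                   # off-bin: bin spacing fs/N ~ 3.9 Hz
x = np.sin(2 * np.pi * f0 * n / fs)

floors = {}
for name, w in [("rect", np.ones(N)), ("hann", np.hanning(N))]:
    X = np.fft.rfft(x * w)
    mag_db = 20 * np.log10(np.abs(X) / np.abs(X).max() + 1e-12)
    peak = int(np.argmax(mag_db))
    # Worst leakage level well away from the main lobe
    far = np.concatenate([mag_db[:max(peak - 20, 0)], mag_db[peak + 20:]])
    floors[name] = far.max()
    print(f"{name}: far-sidelobe floor ~ {floors[name]:.1f} dB")
```

The Hann window's floor sits tens of dB below the rectangular window's, at the cost of a main lobe twice as wide.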
6.5 Zero-Padding
Zero-padding means appending zeros to a signal before computing the DFT. If $x[n]$ has $N$ samples and we compute an $M$-point DFT with $M > N$:
$$X_M[k] = \sum_{n=0}^{N-1} x[n]\, e^{-j2\pi kn/M}, \quad k = 0, \ldots, M-1$$
This evaluates the DTFT $X(e^{j\omega})$ at $M$ equally spaced frequencies instead of $N$. The result is a finer sampling of the same underlying continuous spectrum — the spectrum looks smoother, but no new information is added.
Theorem — Zero-Padding Interpolation
Zero-padding from $N$ to $M = \alpha N$ points interpolates the DTFT at $M$ points:
$$X_M[k] = X\!\left(e^{j2\pi k/M}\right), \quad k = 0, \ldots, M-1$$
The **frequency resolution** is still $\Delta f = f_s / N$ (determined by the original data length), NOT $f_s / M$.
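A direct numerical check of this theorem: the $M$-point FFT of the zero-padded signal equals the DTFT of the original $N$ samples evaluated at $\omega_k = 2\pi k/M$:

```python
import numpy as np

rng = np.random.default_rng(1)
N, M = 16, 64
x = rng.standard_normal(N)

X_pad = np.fft.fft(x, M)      # fft(x, M) zero-pads x from N to M samples
k, n = np.arange(M), np.arange(N)
# DTFT of the original N samples, sampled at M equally spaced frequencies
dtft = np.array([np.sum(x * np.exp(-2j * np.pi * kk * n / M)) for kk in k])
print(np.allclose(X_pad, dtft))  # True
```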
Example — When Zero-Padding Helps
Zero-padding is genuinely useful for:
- Making the DFT length a power of 2 for efficient FFT computation
- Producing smoother-looking spectral plots for visualization
- Enabling fast linear convolution (pad to $N_1 + N_2 - 1$)
- Improving peak location estimation by interpolating between original bins
**Common misconception:** Zero-padding does NOT improve spectral resolution. Two sinusoids separated by less than $\Delta f = f_s/N$ will not be resolved regardless of how much padding is applied. Only collecting more actual data (increasing $N$) improves resolution.
Zero-Padding: Interpolation, Not Resolution
Demonstrate that zero-padding smooths the spectrum but cannot resolve two closely-spaced tones. Only more data improves true resolution.
6.6 Welch Periodogram
The raw periodogram $\hat{S}(\omega) = \frac{1}{N}|X(\omega)|^2$ is the simplest power spectral density (PSD) estimator, but it is inconsistent: its variance does not decrease as $N \to \infty$. Welch's method (1967) fixes this by averaging periodograms of overlapping windowed segments.
Definition — Welch's Method
- Divide the signal of length $N$ into $K$ overlapping segments of length $L$, with overlap $D$ samples.
- Apply a window function $w[n]$ to each segment $x_i[n]$.
- Compute the periodogram of each windowed segment: $P_i[k] = \frac{1}{L\,U}|X_i[k]|^2$, where $U = \frac{1}{L}\sum_{n=0}^{L-1} |w[n]|^2$ is the window normalization.
- Average the $K$ periodograms: $\hat{S}_{\text{Welch}}[k] = \frac{1}{K}\sum_{i=1}^{K} P_i[k]$.
Variance Reduction
For $K$ independent segments, the variance of the Welch estimator is reduced by a factor of approximately $1/K$. With 50% overlap and a Hann window, the effective number of independent segments is about $K_{\text{eff}} \approx 0.89\,K$ due to partial correlation between overlapping segments.
Theorem — Bias-Variance Trade-off in PSD Estimation
Shorter segments $\Rightarrow$ more averaging $\Rightarrow$ lower variance, but:
$$\Delta f = \frac{f_s}{L}$$
so shorter segments give **worse frequency resolution**. This is a fundamental trade-off: you cannot simultaneously have fine frequency resolution and low estimation variance.
Example — Typical Welch Parameters
For a signal sampled at $f_s = 44100$ Hz with $N = 441000$ samples (10 seconds):
- Segment length $L = 4096$ → $\Delta f = 10.8$ Hz
- 50% overlap: $D = 2048$
- Number of segments: $K = \lfloor(N - L)/(L - D)\rfloor + 1 = 214$
- Variance reduction: $\sim 214\times$ compared to raw periodogram
Welch PSD Estimation
Compare the noisy raw periodogram against the smooth Welch PSD estimate for a signal with two sinusoids buried in noise.
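The four steps of the definition map onto a short NumPy implementation. This is a sketch (in practice `scipy.signal.welch` does the same job with more options); the extra $1/f_s$ factor scales the result to a density per Hz, matching the PSD normalization in Section 6.7:

```python
import numpy as np

def welch_psd(x, fs, L=1024, overlap=0.5, window=np.hanning):
    """Welch PSD estimate: segment, window, periodogram, average."""
    w = window(L)
    U = np.mean(w ** 2)                    # window power normalization
    step = int(L * (1 - overlap))
    segs = [x[i:i + L] for i in range(0, len(x) - L + 1, step)]
    # Periodogram of each windowed segment, then average across segments
    P = np.mean([np.abs(np.fft.rfft(s * w)) ** 2 for s in segs], axis=0)
    return np.fft.rfftfreq(L, d=1 / fs), P / (fs * L * U)

# Tone at 100 Hz in unit-variance white noise, 10 s at fs = 1 kHz
fs = 1000.0
rng = np.random.default_rng(0)
n = np.arange(int(10 * fs))
x = np.sin(2 * np.pi * 100 * n / fs) + rng.standard_normal(n.size)
f, P = welch_psd(x, fs)
print(f[np.argmax(P)])  # peak near 100 Hz
```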
6.7 Practical Tips
Choosing N
The DFT length $N$ determines the frequency resolution:
$$\Delta f = \frac{f_s}{N}$$
To resolve two tones separated by $\Delta f_{\min}$ Hz, you need at least $N \geq f_s / \Delta f_{\min}$ samples. With a window applied, the required $N$ increases by the window's main-lobe-width factor (e.g., 2x for Hann).
Padding to Power of 2
The radix-2 FFT requires $N = 2^m$. If your data length is not a power of 2, zero-pad to the next power of 2. Most FFT libraries also support mixed-radix transforms for composite sizes, but powers of 2 remain the fastest:
$$N_{\text{fft}} = 2^{\lceil \log_2 N \rceil}$$
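A bit-twiddling sketch of this formula in Python (the helper name `next_pow2` is ours):

```python
def next_pow2(n):
    """Smallest power of 2 that is >= n (assumes integer n >= 1)."""
    return 1 << (n - 1).bit_length()

print([next_pow2(n) for n in (1, 5, 64, 100, 441000)])
# [1, 8, 64, 128, 524288]
```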
Example — Quick Reference Table
| Parameter | Formula | Notes |
|---|---|---|
| Freq. resolution | $\Delta f = f_s / N$ | Fundamental limit |
| Max frequency | $f_{\max} = f_s / 2$ | Nyquist limit |
| Bin spacing | $f_k = k \cdot f_s / N$ | $k = 0, \ldots, N/2$ |
| Record length | $T = N / f_s$ | Observation time |
| FFT complexity | $\frac{N}{2}\log_2 N$ | Complex multiplies |
Common Pitfalls
- Forgetting the periodic assumption: The DFT treats the input as periodic with period $N$. Discontinuities at the boundaries cause leakage — always apply a window.
- Confusing DFT bins with physical frequency: Bin $k$ corresponds to frequency $f_k = k \cdot f_s / N$ Hz. The upper half of the DFT ($k > N/2$) represents negative frequencies.
- Not normalizing the FFT output: For amplitude spectra, divide $|X[k]|$ by $N$ (or $N/2$ for one-sided spectra). For PSD, use $|X[k]|^2 / (f_s \cdot N)$.
- Assuming zero-padding improves resolution: It does not. Only more data or parametric methods can do that.
Frequency Resolution Summary
Bringing it all together, the achievable frequency resolution depends on three factors:
- Data length: $\Delta f = f_s / N$ — the fundamental limit.
- Window function: Widens the main lobe by a factor of 2x (Hann) to 4x (Nuttall) relative to the rectangular window, effectively degrading resolution.
- SNR: In practice, noise limits resolution more than the theoretical $\Delta f$ — use Welch averaging to improve PSD estimates.
**Summary of Chapter 6:** The DFT maps $N$ time-domain samples to $N$ frequency-domain bins with $\mathcal{O}(N^2)$ complexity. The FFT reduces this to $\mathcal{O}(N \log N)$ via the butterfly decomposition. Spectral leakage is mitigated by windowing (at the cost of resolution), zero-padding provides spectral interpolation (not resolution), and Welch's method gives reliable PSD estimates by averaging overlapping segments.