The Fourier Transform — From Signals to Spectra
In 1822 Joseph Fourier showed that any periodic function can be expressed as an infinite sum of sines and cosines. Extended to non-periodic functions, this insight became the Fourier transform — perhaps the single most versatile mathematical tool in applied science. It decomposes any signal into its frequency components, enabling engineers and scientists to filter noise, compress images, scan human brains, detect gravitational waves, and build the modern internet. This article develops the theory from first principles and traces the transform through its most important applications.
1. Fourier Series: Decomposing Periodic Functions
For a function f(t) that is periodic with period T (so f(t+T) = f(t) for all t), the Fourier series representation is:
In complex exponential form (more compact and general):
The frequencies present are discrete multiples of the fundamental frequency f₀ = 1/T: {0, f₀, 2f₀, 3f₀, ...}. A square wave of period T and amplitude A has only odd harmonics:
This decomposition is why a square wave from a synthesiser sounds harsher than a sine wave — it contains energy at many harmonics that stimulate multiple resonances in the human ear.
2. The Continuous Fourier Transform
The Fourier series works for periodic functions. For a non-periodic signal f(t) that exists for all time, we take the limit T → ∞. The discrete frequencies merge into a continuum, and the Fourier series sum becomes an integral — the Fourier transform:
F(ω) is generally a complex-valued function of frequency. Its magnitude |F(ω)| is the amplitude spectrum — how much of each frequency is present. The argument arg(F(ω)) is the phase spectrum — the relative timing of each frequency component.
Common Transform Pairs
3. Key Properties: Linearity, Shift, Scale
The Fourier transform satisfies several fundamental properties that make it powerful:
Uncertainty Principle
A signal cannot be arbitrarily localised in both time and frequency simultaneously. The Heisenberg-Gabor uncertainty principle for signals states:
4. Convolution Theorem
The convolution of two functions f and g is defined as:
Convolution appears everywhere: the output of a linear time-invariant (LTI) system to input f(t) is the convolution of f with the system's impulse response h(t). A room's acoustic echo is the convolution of the dry signal with the room impulse response. Image blurring is 2D convolution with a blur kernel.
The convolution theorem is one of the most useful results in all of signal processing:
This is transformative: instead of computing an O(N²) convolution directly, you can FFT both signals (O(N log N)), multiply them pointwise (O(N)), and inverse FFT (O(N log N)). Total: O(N log N) — a dramatic speedup for large signals.
Filtering in the Frequency Domain
A low-pass filter H(ω) = 1 for |ω| ≤ ω_c, 0 otherwise (ideal brick-wall filter) removes all frequency content above the cut-off frequency ω_c. Its time-domain impulse response is a sinc function:
Real filters use windowed sincs (Hann, Hamming, Kaiser windows) to avoid the Gibbs phenomenon — the ringing artifacts caused by the sharp cut-off in the ideal filter.
5. Parseval's Theorem and Energy Spectra
Parseval's theorem states that the total energy of a signal is the same whether computed in the time domain or the frequency domain:
The function |F(ω)|² is the power spectral density (PSD) — it shows how signal power is distributed across frequencies. Parseval's theorem is physically the statement that the Fourier transform is a unitary operator — it preserves inner products (and hence norms, hence energy).
Power Spectral Density in Practice
For a random stationary process x(t), the PSD S_xx(f) is defined as the Fourier transform of the autocorrelation function R_xx(τ) — the Wiener-Khinchin theorem:
6. Discrete Fourier Transform and the FFT Algorithm
Digital computers work with discrete, finite sequences. The Discrete Fourier Transform (DFT) operates on N samples x[0], x[1], ..., x[N−1]:
The Fast Fourier Transform (FFT)
Naive DFT computation requires O(N²) complex multiplications. For N = 10⁶ samples, this is 10¹² operations — intractable. The Cooley-Tukey FFT algorithm (1965) reduces this to O(N log₂ N) by recursively splitting the DFT:
The FFT is routinely called one of the most important algorithms of the 20th century. It enabled real-time digital signal processing, digital audio workstations, software radio, and modern telecommunications.
7. Real-World Applications
Audio Equalisation and Music Production
A parametric equaliser decomposes the audio signal into frequency bands (using an FFT or filterbank), adjusts the gain at each frequency, and reconstructs the signal. The convolution theorem enables modern convolution reverb: record the impulse response of a concert hall (firing a starter pistol), then convolve any dry recording with it — placing that recording acoustically inside the concert hall. This requires FFT-based convolution to run in real time on a laptop.
MRI — Magnetic Resonance Imaging
In MRI, the radiofrequency signals detected by the receiver coil are actually the Fourier transform of the spatial spin density of the tissue. The scanner systematically samples this Fourier space (called k-space) and then performs a 2D inverse FFT to reconstruct the spatial image:
Incomplete k-space sampling (compressed sensing MRI) uses the sparsity of images in the wavelet domain to reconstruct full-resolution images from far fewer measurements, dramatically reducing scan time.
JPEG Image Compression
JPEG divides an image into 8×8 pixel blocks and applies the Discrete Cosine Transform (DCT) — a variant of the Fourier transform using only cosines:
The key insight: the human visual system is much more sensitive to low-frequency information (overall brightness and colour gradients) than high-frequency detail. JPEG exploits this by aggressively quantising the high-frequency DCT coefficients, achieving compression ratios of 10:1 or more with minimal perceptible quality loss.
Radio: Modulation and Software-Defined Radio
Every radio transmission involves the Fourier transform. Amplitude Modulation (AM) multiplies the baseband audio signal m(t) by a carrier cos(ω_c t) — by the frequency- shift property, this translates the audio spectrum to centre around ω_c. A software-defined radio receiver digitises a wide swath of RF spectrum, then uses FFT-based filtering to select any channel digitally — replacing analogue hardware with mathematics.
OFDM (Orthogonal Frequency Division Multiplexing), the modulation scheme behind WiFi, 4G/5G, and digital TV, transmits data on hundreds of orthogonal subcarriers simultaneously. The modulator is a single IFFT; the demodulator is a single FFT. The entire modern wireless internet runs on the Fourier transform.
Gravitational Wave Detection (LIGO)
When LIGO detected the first gravitational wave signal GW150914 in 2015 — a 0.2-second chirp from two merging black holes — the signal had an amplitude of 10⁻²¹ strain (the detector changed length by 1/1000th of a proton diameter over 4 km). Extracting this from detector noise required:
- Matched filtering: convolve the data with theoretical waveform templates (one template per possible pair of masses) — an FFT-based convolution over weeks of data against thousands of templates
- Whitening: divide the FFT of the data by the square root of the noise PSD to equalise all frequency contributions
- Bandpass filtering: retain only 35–350 Hz (the sensitive band) by multiplication in frequency domain