How the Cochlea Hears: The Physics of Basilar Membrane Resonance
The human ear can distinguish more than 3,000 distinct pitches across a frequency range of roughly 20 Hz to 20,000 Hz, and it can do so even when sounds overlap in time and arrive from different directions. This extraordinary ability rests on a tiny fluid-filled spiral inside the inner ear called the cochlea. At its heart is the basilar membrane — a ribbon of tissue less than 35 mm long — that performs a real-time mechanical frequency analysis of sound using the physics of travelling waves and resonance. Understanding how the basilar membrane works illuminates not only normal hearing but also the mechanisms behind noise-induced deafness, cochlear implant design, and the origins of musical pitch perception.
Anatomy of the Cochlea
The cochlea is a bony, snail-shaped canal wound roughly two and a half turns, embedded in the temporal bone of the skull. It is divided along most of its length into three fluid-filled compartments by two membranes: the basilar membrane (below) and Reissner's membrane (above). The uppermost compartment is the scala vestibuli, the lower one the scala tympani, and the middle compartment — the scala media — contains endolymph, a fluid with an unusually high potassium concentration (~150 mM K⁺) maintained by specialised cells in the stria vascularis.
The endocochlear potential is a resting DC voltage of approximately +80 mV inside the scala media relative to the perilymph in the adjacent scalae. This potential is generated by the stria vascularis acting as a biological battery, and it provides the driving force for mechanotransduction. The hair cells sit on the basilar membrane within the organ of Corti, with their stereocilia in contact with (or very close to) the overlying tectorial membrane. There are approximately 3,500 inner hair cells (IHCs) arranged in a single row, and about 12,000 outer hair cells (OHCs) in three rows.
The Travelling Wave and Tonotopic Mapping
Sound enters the cochlea as a pressure wave via the stapes footplate pressing on the oval window. This pressure difference between scala vestibuli and scala tympani forces the basilar membrane to deflect. The resulting disturbance does not simply resonate the entire membrane at once; instead it propagates as a travelling wave from base to apex, exactly as Georg von Békésy observed in cadaver cochleae (Nobel Prize, 1961).
The basilar membrane is not uniform. At the base it is narrow (~0.1 mm) and very stiff; at the apex it is wide (~0.5 mm) and flexible. Stiffness decreases roughly exponentially from base to apex, by a factor of about 10,000 over the 35 mm length. This gradient means that the local resonant frequency — the frequency at which a given position responds maximally — decreases systematically from base to apex:
f_base ≈ 20,000 Hz (base, x = 0)
f_apex ≈ 20 Hz (apex, x = 35 mm)
d ≈ 7 mm (space constant; each 7 mm halves the CF)
Equivalently: Δx / octave ≈ 4–5 mm (the cochlear map is nearly log-linear)
A 1,000 Hz tone, for example, produces a travelling wave that peaks approximately 20 mm from the base. This orderly spatial arrangement of frequency preference is called the tonotopic map, and it is preserved through the entire auditory pathway from cochlear nucleus to primary auditory cortex (Heschl's gyrus). The brain therefore "reads" frequency by detecting which population of neurons is most active — a place code.
The shape of the travelling wave is asymmetric: it builds up slowly as it approaches the characteristic place and then drops off steeply on the apical side. The envelope of basilar membrane displacement can be modelled with the WKB (Wentzel-Kramers-Brillouin) approximation, treating the membrane as a slow-varying resonating medium:
where k(x) is the local wave number, Re{k} gives wavelength, Im{k} gives spatial damping.
Near the best place, Im{k} becomes large and negative ⇒ rapid amplitude decay.
Hair Cell Mechanotransduction
When the basilar membrane deflects, a shearing motion occurs between it and the overlying tectorial membrane, bending the stereocilia bundles on top of the hair cells. Each hair cell bears a staircase of 50–300 stereocilia connected by fine protein filaments called tip links. Deflection toward the tallest stereocilium stretches the tip links, mechanically opening transducer channels at the tips. These are members of the TMC (transmembrane channel-like) family, though the exact molecular identity remained contested until the mid-2010s.
The channel opening allows K⁺ and Ca²⁺ to flood into the hair cell, driven by the large electrochemical gradient between endolymph (+80 mV, ~150 mM K⁺) and the hair cell interior (~−60 mV, ~5 mM K⁺). The resulting inward current depolarises the inner hair cell, triggering Ca²⁺-dependent exocytosis of glutamate at the ribbon synapse at the base of the cell, which fires the afferent auditory nerve fibre within microseconds. The speed is enabled by a specialised ribbon structure that tethers synaptic vesicles close to the Ca²⁺ channels, allowing release within ~1 ms of stimulus onset.
P_open(x) = 1 / (1 + A × exp(−x / x_0))
where x = displacement of stereocilia bundle (nm)
I_max ≈ 1 nA (per hair cell at saturation)
Half-maximum displacement x_0 ≈ 10–30 nm (extremely sensitive)
Active Amplification by Outer Hair Cells
The passive mechanics described by von Békésy produced tuning curves far too broad to explain the sharp frequency selectivity observed in living animals. The missing ingredient is the cochlear amplifier — an active feedback mechanism provided by the outer hair cells. OHCs are unique among mechanosensory cells in that they can rapidly change their length in response to changes in membrane voltage, a property called electromotility. This is driven by the motor protein prestin (SLC26A5), which undergoes a conformational change that shortens or elongates the OHC by up to 4% at audio frequencies.
When the basilar membrane moves toward the scala vestibuli (upward), stereocilia on the OHCs are deflected, the cell depolarises, and prestin causes the cell to shorten — which in turn amplifies the basilar membrane movement. This positive feedback, tuned to the local resonant frequency, can boost basilar membrane velocity by up to 40 dB (a factor of 100) at low sound levels. The gain decreases at high sound levels (compression), which is why the auditory system operates linearly over a limited range but compresses intensity over its full 120 dB dynamic range:
At high levels: BM velocity ∝ p^0.2–0.3 (compressive)
Overall: input SPL 0–120 dB ⇒ BM response spans ~40–50 dB
Equivalent to: 1 trillion-fold intensity range ⇒ ~300-fold displacement range
This nonlinear compression also underlies two-tone suppression (a second tone can reduce the response to the first), combination tones, and distortion-product otoacoustic emissions — all measurable phenomena that provide non-invasive windows into cochlear function.
Real-World Applications
The physics of the cochlea has practical implications across medicine and engineering:
- Cochlear implants. When hair cells are destroyed by genetic disease, ageing, or noise exposure, a cochlear implant electrode array is inserted along the scala tympani. It drives up to 22 electrode contacts at different positions, exploiting the tonotopic map to restore pitch perception. Signal processing mimics the basilar membrane's frequency analysis by splitting the audio stream into bands using a bank of bandpass filters.
- Hearing aid design. Modern hearing aids apply frequency-specific gain to compensate for the loss of outer hair cell amplification, which is the most common form of sensorineural hearing loss. Wide dynamic range compression in digital hearing aids emulates the cochlear compressive nonlinearity.
- Audiological diagnostics. Otoacoustic emissions (OAEs) — sounds emitted by the cochlea's active amplifier — are used for universal newborn hearing screening. Distortion product OAEs and transient-evoked OAEs are measured quickly and non-invasively, providing a functional test of outer hair cell integrity even in infants.
- Acoustic engineering. The cochlear frequency analysis inspired early analogue filter-bank designs and continues to inform auditory-inspired speech processing, psychoacoustic audio codecs (MP3, AAC), and machine-hearing algorithms.
Frequently Asked Questions
What is the basilar membrane?
The basilar membrane is a stiff, ribbon-like structure inside the cochlea that varies in width and stiffness along its length, enabling it to respond maximally to different sound frequencies at different positions — acting as a biological frequency analyser.
How does the cochlea separate different frequencies?
A sound wave entering the cochlea launches a travelling wave along the basilar membrane. Because stiffness decreases and mass increases from base to apex, each frequency reaches maximum amplitude at a specific location — the tonotopic map — so the brain can identify pitch by which nerve fibres are most active.
What are hair cells and how do they convert sound to nerve signals?
Hair cells are mechanosensory cells sitting on the basilar membrane. When the membrane deflects, stereocilia on the hair cell tips bend, opening ion channels that allow potassium and calcium to flood in, depolarising the cell and triggering neurotransmitter release onto auditory nerve fibres within about one millisecond.
What is the tonotopic map?
The tonotopic map is the orderly spatial arrangement of frequency sensitivity along the cochlea and throughout the auditory pathway. High frequencies map to the base (roughly 20,000 Hz), low frequencies to the apex (roughly 20 Hz), and this organisation is preserved all the way to the auditory cortex.
Why can humans hear such a wide range of sound intensities?
The auditory system spans roughly 120 dB (a factor of one trillion in intensity) thanks to logarithmic compression by outer hair cell amplification, mechanical nonlinearities, and neural adaptation. The decibel scale itself reflects this logarithmic perception of loudness.
What is otoacoustic emission?
Otoacoustic emissions are faint sounds produced by the cochlea itself, caused by the active mechanical feedback of outer hair cells. They can be measured non-invasively with a sensitive microphone placed in the ear canal and are routinely used to screen hearing in newborns.
How does noise-induced hearing loss damage the cochlea?
Intense sound causes excessive deflection of stereocilia, which can break their tip links or overwhelm the hair cell's ionic balance. Outer hair cells at the base — tuned to high frequencies — are most vulnerable, explaining why noise damage typically affects high-frequency hearing first and produces the characteristic 4 kHz notch on an audiogram.
What is the role of the tectorial membrane?
The tectorial membrane is a gelatinous structure that overlies the hair cells. When the basilar membrane vibrates, the shearing motion between it and the tectorial membrane deflects the stereocilia of the outer hair cells, initiating the transduction process and contributing to frequency tuning.
How does a cochlear implant restore hearing?
A cochlear implant bypasses damaged hair cells by electronically stimulating the auditory nerve directly. An external microphone and processor split sound into frequency bands and drive an array of electrodes positioned at different locations along the cochlea, mimicking the tonotopic map to convey pitch information.
What is the difference between inner and outer hair cells?
Inner hair cells are the primary sensory cells, transmitting about 95% of auditory information to the brain. Outer hair cells act as a biological amplifier: they change length in response to voltage changes via the protein prestin (electromotility), boosting basilar membrane motion by up to 40 dB at low sound levels and sharpening frequency tuning.
Try It Yourself
The best way to build intuition for basilar membrane physics is to watch it in motion. Explore the related simulations:
- Cochlea Hearing — visualise the travelling wave and tonotopic map interactively.
- Acoustic Lens — explore how curved surfaces focus and redirect sound waves.
- Beat Frequency — hear and see how two close frequencies interfere to produce beats, a key psychoacoustic phenomenon.