Module 3: Echolocation & Sonar

Only two major mammalian lineages evolved biological sonar: bats and toothed whales (odontocetes). In odontocetes the system is considerably more powerful, emitting clicks at up to 230 dB re 1 μPa (sperm whale) — the loudest biological sound ever recorded — and reaching effective ranges of hundreds of meters for fish-sized prey and kilometers for large schools. The peak frequencies of 100–200 kHz give wavelengths of 1 cm or less in seawater, enabling discrimination of targets differing by 1 mm at 5 m range. This module derives the acoustic physics of click production, the melon acoustic lens, the sonar equation, and the exquisite signal-processing capabilities revealed by behavioral experiments on bottlenose dolphins.

1. Click Production: Phonic Lips and the Museau de Singe

For most of the 20th century the laryngeal origin of dolphin clicks was assumed by analogy with human phonation. Careful anatomical and behavioral work from the 1990s onward (Cranford, Amundin, Norris) established that odontocete sound production occurs in the phonic lips (also called dorsal bursae or “monkey lips” — museau de singe, for the ape-face appearance of the structure in sperm whale dissections). These are paired elastic structures embedded in the upper nasal passages.

Driven air flow from the lungs causes the phonic lips to open and close at a rate set by their elastic properties. Each closure generates a pressure pulse: the “click” itself. For toothed whales other than sperm whales, clicks are broadband with peak frequency ~100–200 kHz and duration ~30–70 µs. Sperm whales produce long (~10 ms) multi-pulse clicks where the initial phonic-lip pulse reflects repeatedly off distal and frontal air sacs to form the famous “inter-pulse interval” signature used to estimate body length via the bent-horn model.

1.1 Source Level and Radiated Power

The source level (SL) is the on-axis sound pressure level at a nominal reference distance of 1 m. For a bottlenose dolphin, SL ≈ 220 dB re 1 μPa. For a sperm whale, SL reaches 230 dB re 1 μPa (broadside) or even 236 dB on-axis (Møhl et al. 2003). The radiated intensity is:

\[ I = \frac{p^2}{\rho c},\quad p = p_{ref}\cdot 10^{SL/20} \]

For SL = 230 dB, \(p = 10^{-6}\cdot 10^{11.5} = 3.16\times10^{5}\,\text{Pa}\)— over 3 atmospheres of peak acoustic pressure. Intensity:\(I = (3.16\times10^5)^2/(1025\cdot 1500) \approx 6.5\times10^4\,\text{W/m}^2\). Integrated over the ~5° beamwidth, the sperm whale radiates roughly 25 kW of peak acoustic power per click — more than a large pop-concert loudspeaker.

2. The Melon: Biological Acoustic Lens

The melon — encountered already in Module 0 — occupies the forehead of toothed whales and functions as an acoustic lens. Its graded lipid composition produces a smoothly varying sound speed \(c(\mathbf{x})\) from ~1310 m/s at the center to ~1500 m/s at the edges, setting up a refractive-index profile that focuses sound from the phonic lips into a narrow forward beam.

2.1 Eikonal Ray Equation

In the high-frequency (ray optics) limit, sound propagation is governed by the eikonal equation:

\[ |\nabla S|^2 = n^2(\mathbf{x}) = (c_0/c(\mathbf{x}))^2 \]

where \(S(\mathbf{x})\) is the phase (travel-time) surface and \(n(\mathbf{x})\) is the acoustic index of refraction.

The ray trajectory \(\mathbf{r}(s)\) satisfies:

\[ \frac{d}{ds}\left(n\,\frac{d\mathbf{r}}{ds}\right) = \nabla n \]

Rays bend toward regions of higher n (lower c), i.e. toward the melon center.

With the center of the melon having the lowest sound speed, rays emitted from the phonic lips are bent toward the optical axis. The melon acts as a positive (converging) lens. If the composition is tuned carefully, the lens is nearly aplanatic: all rays from the phonic lips emerge parallel (a collimated beam) rather than diverging.

2.2 Sperm Whale Bent-Horn Model

The sperm whale's spermaceti organ complicates the picture. Sound is generated at the museau de singe at the front of the head and directed backward through the spermaceti organ (the low-density oil-filled component), bouncing off a frontal air sac and then forward through the junk (denser, segmented organ below spermaceti) out the front of the head. The multiple internal reflections both shape the temporal structure of the click (the characteristic multi-pulse form) and amplify the on-axis intensity. The inter-pulse interval (IPI) between the first and second pulses of a click is proportional to twice the spermaceti organ length:

\[ \text{IPI} = \frac{2 L_{spermaceti}}{c_{oil}} \]

With \(c_{oil} \approx 1370\,\text{m/s}\), an IPI of 5 ms corresponds to \(L \approx 3.4\,\text{m}\).

Whaling records correlate spermaceti organ length with whole body length (~23%), so passively recording clicks allows acoustic “measurement” of sperm whales in the wild without capture.

3. The Sonar Equation

Echolocation performance is captured by the passive sonar equation, borrowed directly from ASW (anti-submarine warfare) acoustics:

\[ \text{EL} = \text{SL} - 2\,\text{TL} + \text{TS} \]

with detection requiring EL > NL + DT (noise level + detection threshold)

Terms: EL echo level at receiver (dB), SL source level (dB re 1 μPa @ 1 m), TL one-way transmission loss (dB), TStarget strength (dB). The factor of 2 on TL accounts for the round trip.

3.1 Transmission Loss

Transmission loss has two components: geometric spreading and absorption:

\[ \text{TL}(r) = 20 \log_{10}(r/r_0) + \alpha(f)\,r \]

Spherical spreading: 20 dB per decade. Absorption \(\alpha(f)\) rises sharply with frequency.

Seawater absorption is dominated by two relaxation processes (boric acid at ~1 kHz and magnesium sulfate at ~100 kHz) plus viscous bulk absorption at very high frequency. Typical values:

  • \(f = 20\,\text{Hz}\) (blue whale): \(\alpha \approx 1\,\text{dB/1000 km}\)
  • \(f = 2\,\text{kHz}\): \(\alpha \approx 0.1\,\text{dB/km}\)
  • \(f = 20\,\text{kHz}\) (sperm whale): \(\alpha \approx 3\,\text{dB/km}\)
  • \(f = 120\,\text{kHz}\) (bottlenose): \(\alpha \approx 40\,\text{dB/km}\)

Thus high-frequency dolphin clicks are limited to ~100 m effective range (the absorption alone accounts for 4 dB at that distance), while low-frequency whale calls can propagate for thousands of kilometers. This constraint shapes the acoustic behavior of each species: dolphin sonar is short-range & high resolution; large-whale vocalizations are long-range & low resolution.

3.2 Target Strength

The target strength quantifies how much of the incident sound a target reflects back toward the source:

\[ \text{TS} = 10 \log_{10}\!\left(\frac{\sigma_{bs}}{4\pi\,r_0^2}\right) \]

\(\sigma_{bs}\): backscattering cross section (m2).

For fish with gas-filled swimbladders, TS at the swimbladder resonance frequency can exceed −25 dB; at other frequencies TS is dominated by flesh impedance contrast (low). Cephalopods (squid, octopus) lack gas inclusions and have tissue impedance nearly matched to seawater — they are acoustically “cryptic,” with TS often below −55 dB. Despite this, sperm whales subsist almost entirely on deep-sea squid, demonstrating extraordinary acoustic discrimination.

4. Click Trains, ICI, and the Buzz Phase

A foraging odontocete does not emit a continuous acoustic stream but rather discrete clicks separated by an inter-click interval (ICI). The ICI is constrained by the round-trip time to the most distant echo:

\[ \text{ICI} \gtrsim \frac{2 d_{target}}{c} \]

ensuring each echo returns before the next click is emitted.

At \(d = 100\,\text{m}\), the round-trip time is \(2\cdot 100/1500 = 133\,\text{ms}\): the animal can emit ~7 clicks/s. As the whale approaches its prey, ICI shrinks proportionally. In the final stage of a capture, the toothed whale transitions from ordinary clicks to a very rapid sequence called the buzz phase, with ICIs of 2 ms or less (corresponding to distances of ~1.5 m). The buzz is an acoustic analog of the rapid fire-control burst a sonar would use on a closing target.

4.1 Discrimination Limits

Bottlenose dolphin psychophysical experiments (Au 1993, Branstetter et al. 2007) show remarkable discrimination:

  • Distinguish solid vs hollow targets differing by 1-mm wall thickness at 5 m range
  • Discriminate cylinders of the same size but differing materials (Al vs PVC vs steel) — apparently using material resonance signatures
  • Classify targets through sediment, clay, or water-bottom clutter — target strength discrimination of ~1 dB
  • Identify individual objects in dense fish schools

4.2 Signature Whistles

Beyond echolocation, dolphins produce tonal whistles in the 1–25 kHz band. The signature whistle, described by Caldwell & Caldwell (1965), is a frequency-modulated pattern unique to each individual — functionally an acoustic “name.” Mother dolphins appear to teach their calves signature whistles, and dolphins use each other's signature whistles to address specific individuals (King & Janik 2013). This is one of the clearest demonstrations of referential vocal learning outside humans.

5. Hearing: The Auditory Pathway

The cetacean auditory system bears little resemblance to that of a terrestrial mammal. There is no pinna (external ear); the external auditory meatus is reduced to a tiny canal filled with wax. Sound reaches the inner ear via a novel pathway: entering through the lower jaw, propagating through a fat-filled channel along the mandible, and coupling into the middle ear bones. This “acoustic fat” has impedance closely matched to seawater, serving as an efficient sound conductor from the exterior into the acoustically isolated ear bones.

5.1 Tympano-Periotic Complex

The middle and inner ears of cetaceans sit in a bony capsule (the tympano-periotic complex) that is isolated from the rest of the skull by air-filled sinuses and a ligamentous suspension. This isolation is crucial for directional hearing: without it, sound arriving through the water-filled skull would reach both ears almost simultaneously, destroying the interaural time difference cues that mammals normally use for sound localization. By suspending each ear capsule in its own acoustic enclosure, cetaceans preserve binaural directional hearing at frequencies up to 150 kHz.

5.2 Ultrasonic Hearing Range

Audiograms for bottlenose dolphins show a hearing range from ~75 Hz to ~150 kHz, with peak sensitivity at 40–80 kHz. Humans hear up to ~20 kHz; dogs and cats reach ~50–60 kHz; only bats (to ~200 kHz) and toothed whales approach these upper limits. Low-frequency hearing is less developed; this limits cetacean sensitivity to many anthropogenic noise sources in the 100–1000 Hz shipping band.

5.3 Interaural Time and Intensity Differences

With the two ears separated by ~20 cm on a bottlenose dolphin, interaural time differences (ITD) for sounds arriving from the side are:

\[ \Delta t_{ITD} = \frac{d_{ears}}{c} \approx \frac{0.2}{1500} \approx 130\,\mu\text{s} \]

At 120 kHz (echo frequency), the period is 8 µs; ITD resolution finer than this has been demonstrated behaviorally, indicating that dolphins exploit phase differences at very fine temporal resolution, much as humans do at lower frequencies. Interaural intensity differences (IID) from head-shadow effects are also used; combined, these provide angular resolution of ~2°.

6. Convergent Evolution with Bats

Bats (order Chiroptera) are the other major mammalian lineage that evolved biological sonar. Despite diverging from cetaceans roughly 85 Mya at the base of the Laurasiatheria, the two groups show extraordinary convergence in their echolocation systems:

  • Both use high-frequency, short-duration broadband clicks or chirps
  • Both exhibit “buzz phases” with shrinking inter-pulse intervals during prey capture
  • Both have modified middle ear ossicles and isolated inner ear capsules
  • Both possess specialized laryngeal (bats) or nasal (whales) structures for click production
  • Both use target strength and spectral content to identify prey

Remarkably, molecular studies by Liu et al. (2010) demonstrated that the gene Prestin — encoding the motor protein driving outer hair cell motility that determines cochlear frequency selectivity — shows near-identical amino acid substitutions in bats and toothed whales. This is one of the clearest documented cases of molecular convergence in mammalian evolution: echolocation selected for the same biochemistry independently in two lineages separated by 85 Myr.

5. Echolocation System Diagram

Dolphin Echolocation: Click Production, Melon Focus, Echo Returnblowholephonic lipsair sacsMelon (graded lipid)c: 1310 m/s → 1500 m/sskull (reflector)Forward beam ~10° half-widthpeak ~120 kHz, SL ≈ 220 dBtarget fishTS ≈ -35 dBEcho returns to mandible / auditory bullamandibular fat body (sound-conducting)ear (isolatedfrom skull)Click generated at phonic lips → focused by graded-index melon → radiated forward → echoes enter mandible fat → auditory bullaRound-trip sets inter-click interval ICI = 2d/c; buzz phase (ICI ≤ 2 ms) during final approachResolution limit λ ≈ 1 cm at 120 kHz — ~mm-scale discrimination demonstrated experimentally

6. Simulation: Sonar Physics and Melon Ray-Tracing

This simulation (i) computes echo level vs range from the sonar equation for bottlenose and sperm whales including absorption; (ii) plots inter-click interval vs target range and identifies the buzz-phase regime; (iii) lists target strengths of representative cetacean prey; and (iv) traces acoustic rays through a simple axisymmetric graded-index melon model, showing how the graded lipid profile focuses sound into a forward beam.

Python
script.py148 lines

Click Run to execute the Python code

Code will be executed with Python 3 on the server

Key Observations

  • Panel 1: At 120 kHz, absorption kills bottlenose signal beyond ~150 m; sperm whale's 20 kHz clicks retain signal past 1 km.
  • Panel 2: ICI shrinks linearly with target distance. Buzz phase (ICI < 2 ms) corresponds to prey within ~1.5 m.
  • Panel 3: Squid have very low TS (no swimbladder); sperm whales still detect them reliably via exquisite SNR.
  • Panel 4: The graded-index melon bends diverging rays from the phonic lips into a collimated forward beam — a biological Luneburg lens.

Module Summary

Click Generation

Phonic lips (museau de singe) driven by pressurized air from lungs; not laryngeal

Peak Frequencies

Bottlenose 100–200 kHz; sperm whale broadband 5–25 kHz; λ ≈ 1–25 cm in seawater

Source Levels

Bottlenose 220 dB; sperm whale up to 236 dB (loudest biological sound on Earth)

Melon Lens

Graded lipid (1310–1500 m/s) focuses sound into forward beam — biological Luneburg lens

Sonar Equation

EL = SL – 2TL + TS; TL = 20 log(r) + α(f)·r; detection when EL > NL + DT

Absorption

Rises sharply with f: 120 kHz → 40 dB/km; 20 Hz → 1 dB/1000 km

ICI & Buzz

ICI = 2d/c; buzz phase at ICI ≤ 2 ms during final prey pursuit

Signature Whistles

Individual acoustic “names”; vocally learned — referential labeling outside humans

References

  1. Au, W.W.L. (1993). The Sonar of Dolphins. Springer.
  2. Au, W.W.L. & Hastings, M.C. (2008). Principles of Marine Bioacoustics. Springer.
  3. Cranford, T.W., Amundin, M. & Norris, K.S. (1996). Functional morphology and homology in the odontocete nasal complex. Journal of Morphology, 228(3), 223–285.
  4. Norris, K.S. & Harvey, G.W. (1972). A theory for the function of the spermaceti organ of the sperm whale. NASA Special Publication 262, 397–417.
  5. Møhl, B., Wahlberg, M., Madsen, P.T., Heerfordt, A. & Lund, A. (2003). The monopulsed nature of sperm whale clicks. JASA, 114(2), 1143–1154.
  6. Madsen, P.T. & Surlykke, A. (2013). Functional convergence in bat and toothed whale biosonars. Physiology, 28(5), 276–283.
  7. Branstetter, B.K. & Finneran, J.J. (2008). Comodulation masking release in bottlenose dolphins. JASA, 124(1), 625–633.
  8. Caldwell, M.C. & Caldwell, D.K. (1965). Individualized whistle contours in bottle-nosed dolphins. Nature, 207, 434–435.
  9. King, S.L. & Janik, V.M. (2013). Bottlenose dolphins can use learned vocal labels to address each other. PNAS, 110(32), 13216–13221.
  10. Madsen, P.T. et al. (2002). Sperm whale sound production studied with ultrasound time/depth-recording tags. JEB, 205, 1899–1906.