Module 2

Anatomy & Physiology

Where the steady-state response is generated, why the modulation rate decides which part of the pathway dominates, and what makes the response frequency-specific.

The response has no single generator

It is tempting to look for “the” source of the ASSR, but the scalp-recorded response is the summed activity of a distributed network. Source-localisation work shows that cortical and subcortical regions are both active, and — importantly — the number and location of the contributing generators changes with the modulation frequency of the stimulus [11]. The practical consequence is that choosing a modulation rate is, in effect, choosing which part of the auditory pathway you are listening to.

Modulation rate selects the generator

Animal recordings give the clearest picture of why rate matters. Neurons in the inferior colliculus respond best to modulation rates in the region of 20–40 Hz, whereas neurons in the cochlear nucleus respond best to rates above about 80 Hz[12]. Mapped onto the scalp response, this means modulation rates around 40 Hz are weighted toward the midbrain and higher (auditory cortex, thalamus, inferior colliculus), while rates above roughly 80 Hz are weighted toward the brainstem — the cochlear nucleus and superior olivary complex[9].

Source-imaging studies in humans reach the same conclusion from the other direction: amplitude-modulated tones near 80–90 Hz mainly elicit subcortical activity, while tones near 40 Hz mainly elicit cortical activity [11]. This single fact — rate selects the generator — explains the entire clinical division between the two responses introduced in Module 1.

Why this matters at the bedside

Because the 40-Hz response depends on cortical and thalamic generators, it is sensitive to the state of those structures: it shrinks with sleep, sedation, and anaesthesia, and is unreliable in the immature nervous system. The 80–90-Hz response, arising from robust brainstem generators, is comparatively indifferent to sleep and age — which is exactly why the higher rates are used for objective threshold estimation in sleeping infants [9].

See it

Select a modulation-rate band below to see which generators dominate the scalp response. At high rates the brainstem nuclei carry the response; at low rates the weighting shifts up to the midbrain, thalamus, and cortex.

CochleaCochlear nucleusSuperior olivary complexInferior colliculusMedial geniculate bodyAuditory cortexcortexcochlea

≈ 80–90 Hz. Brainstem-weighted (cochlear nucleus, superior olivary complex). Robust during sleep and across ages — the choice for objective threshold estimation in infants.

Marker size and bar length show each generator's relative contribution to the scalp response at the selected modulation rate. Schematic — see Module 2 for the cited basis.

The 40-Hz response and the superposition account

There is a long-standing question of whether a steady-state response is simply a train of overlapping transient responses, or a genuine entrained oscillation. Galambos and colleagues originally proposed that the 40-Hz response represents the superposition of middle-latency responses: the major MLR waves are separated by roughly 25 ms, so when stimuli arrive every 25 ms — a 40-Hz rate — successive responses overlap in phase and reinforce one another[1].

Deconvolution experiments, which recover the underlying transient response from a rapid stimulus train, have largely supported this account for the 40-Hz response: a synthetic ASSR built from the subject’s own ABR and MLR waves reproduces the recorded 40-Hz steady-state response well [3]. The superposition and entrainment views are best treated as complementary rather than mutually exclusive; for a clinical reader, the key point is that the steady-state response is firmly anchored in the same transient physiology as the ABR and MLR.

Place specificity along the cochlea

Frequency-specific threshold estimation only works if a carrier tone excites a restricted region of the basilar membrane. The tonotopic organisation of the cochlea — and its preservation along the auditory pathway — supports processing in parallel, frequency-specific channels, which is the anatomical basis for testing several carriers at once [11].

This place specificity has been measured directly. Using high-pass masking and derived-band techniques with simultaneously presented carriers at 500, 1000, 2000, and 4000 Hz, Herdman and colleagues found that each carrier’s response was generated within about half an octave of its carrier frequency [10]. In other words, the four-carrier recording really does probe four distinct cochlear places — the assumption behind an estimated audiogram.

The frequency-specificity trade-off

There is a genuine tension built into the stimulus. ASSR amplitude is not constant across carriers: it decreases as carrier frequency rises, with one MEG study reporting the response at 250 Hz roughly three times larger than at 4000 Hz [11]. Larger responses are easier to detect, which favours low frequencies.

At the same time, modulating a pure tone necessarily introduces additional spectral energy on either side of the carrier — so-called spectral splatter — and deeper or faster modulation spreads that energy further, eroding the very frequency specificity the carrier was chosen for [9]. Stimulus design therefore balances three competing goals: a detectable response, a place-specific one, and an efficient recording. The Recording Technique module takes up how amplitude, frequency, and mixed modulation make that trade.