You are on page 1of 9

Neuroscience 337 (2016) 306–314


ALEXANDRE LEHMANN, b,c,d,ey Key words: rhythmic entrainment, human brainstem, audi-
DIANA JIMENA ARIAS a,b*y AND tory cortex, electro-encephalography, temporal regularity,
MARC SCHÖNWIESNER b,c,d stimulus omissions.
University of Quebec at Montreal, Montreal, QC, Canada
International Laboratory for Brain, Music and Sound
Research (BRAMS), Montreal, QC, Canada INTRODUCTION
Center for Research on Brain, Language and Music (CRBLM), Neurons in the auditory cortex can synchronize their
Montreal, QC, Canada activity to a regular stimulation (Nozaradan et al., 2011,
Department of Psychology, University of Montreal, Montreal, 2012). Such neurons may underlie a pattern-forming
QC, Canada dynamical system or a bank of oscillators that couple to
Department of Otolaryngology Head & Neck Surgery, external rhythms and thus enable entrainment to different
McGill University, Montreal, QC, Canada rates of the auditory input. Entrainment to stimulus regu-
larities may underlie cognitive functions such as the per-
Abstract—Neurons in the auditory cortex synchronize their ception of beat in music (Jones, 1974; Large and Kolen,
responses to temporal regularities in sound input. This cou- 1994; Gasser and Eck, 1996; Eck, 1999; Large and
pling or ‘‘entrainment” is thought to facilitate beat extraction Jones, 1999; van Noorden and Moelants, 1999; Large
and rhythm perception in temporally structured sounds, and Grondin, 2008) and the ability to synchronize move-
such as music. As a consequence of such entrainment, ments with predictable, periodic sounds (Deutsch, 1983;
the auditory cortex responds to an omitted (silent) sound Peper et al., 1995; Jones and Pfordresher, 1997; Jones
in a regular sequence. Although previous studies suggest and Yee, 1997; Scheirer, 1998; Pressing, 1999; Repp,
that the auditory brainstem frequency-following response 1999; van Noorden and Moelants, 1999; Snyder and
(FFR) exhibits some of the beat-related effects found in the
Krumhansl, 2001). Several models have been proposed
cortex, it is unknown whether omissions of sounds evoke
a brainstem response. We simultaneously recorded cortical for the extraction of temporal structure, such as the afore-
and brainstem responses to isochronous and irregular mentioned bank of oscillators (Large et al., 2015), but also
sequences of consonant–vowel syllable /da/ that contained models based on note duration (Longuet-Higgins and
sporadic omissions. The auditory cortex responded Lee, 1982), autocorrelation (Brown, 1993; Vos et al.,
strongly to omissions, but we found no evidence of evoked 1994), pulse saliency (Parncutt, 1994), comb filtering
responses to omitted stimuli from the auditory brainstem. (Scheirer, 1998), neural networks (Gasser et al., 1999),
However, auditory brainstem responses in the isochronous rule-based systems (Desain and Honing, 1999; Eck,
sound sequence were more consistent across trials than in 2001), and information theory (Casagrande and Eck,
the irregular sequence. These results indicate that the audi- 2005). None of these models is tied to specific neural pro-
tory brainstem faithfully encodes short-term acoustic prop-
cessing stages, perhaps because the neural basis of beat
erties of a stimulus and is sensitive to sequence
regularity, but does not entrain to isochronous sequences induction and rhythm perception is complex and involves
sufficiently to generate overt omission responses, even for many auditory and non-auditory brain regions (Schubotz
sequences that evoke such responses in the cortex. These and Cramon, 2001; Grahn and Brett, 2007; Chen et al.,
findings add to our understanding of the processing of 2008, 2009; Grahn, 2009; Teki et al., 2011; Konoike
sound regularities, which is an important aspect of human et al., 2012; Grahn and Rowe, 2013).
cognitive abilities like rhythm, music and speech percep- In particular, it is unclear at the moment to what extent
tion. Ó 2016 IBRO. Published by Elsevier Ltd. All rights auditory brainstem structures contribute to beat
reserved. extraction. Auditory brainstem responses exhibit some
of the beat-related effects found in the cortex. Brainstem
responses to sound onsets falling on a beat are larger
than responses to off-beat onsets (Tierney and Kraus,
2013). Individuals who are able to tap in synchrony with
*Correspondence to: A. Lehmann, Department of Otolaryngology a simple beat also have less variable timing and greater
Head & Neck Surgery, McGill University, Montreal, QC, Canada. between-trial phase-locking of the auditory brainstem
E-mail address: (D. J. Arias). response than those less able to tap in synchrony
Shared first authorship: these authors contributed equally to this (Tierney and Kraus, 2013). Auditory brainstem responses
work. are also modulated by musical expertise in general (Wong
Abbreviations: FFR, frequency-following response; FFT, Fourier
transform; Fpz, fronto-central electrode; RMS, root mean square et al., 2007) and respond to other features of music, such
amplitude; SOA, stimulus onset asynchrony. as intervals (Lee et al., 2009; Lerud et al., 2014).
0306-4522/Ó 2016 IBRO. Published by Elsevier Ltd. All rights reserved.

A. Lehmann et al. / Neuroscience 337 (2016) 306–314 307

Here we asked whether the human auditory brainstem Procedure

responds to infrequent sound omissions and whether
The experiment was conducted in a sound-attenuated
regularity in the presented sequence of sounds
and electromagnetically shielded room. Participants
modulates this response. Responses evoked when a
were comfortably seated and watched a silent subtitled
sound is omitted from a regular sequence are thought to
movie while auditory-evoked responses were recorded.
indicate that neural populations have become entrained
They were instructed to remain immobile during the
to the temporal structure of the sequence (Snyder and
recording periods to minimize muscle artefacts. The
Large, 2005; Zanto et al., 2006). Perhaps the most basic
experiment started with a 2-min recording of click-
form of an omission response occurs when sounds are
evoked ABR to verify the integrity of the peripheral and
infrequently omitted from an isochronous sequence of
brainstem auditory processing. The test consisted of a
identical sounds. Such events evoke a cortical mismatch
series of 2500 clicks of 100-ls duration presented
response (Näätänen, 1975; Tervaniemi et al., 1994; Yabe
binaurally at 75-dB SPL at a presentation rate of 20 Hz
et al., 1997; Näätänen et al., 2007).
through insert earphones (ER2, Etymotic Research,
We measured auditory-evoked responses to
isochronous and temporally jittered sequences of a
Participants then completed two experimental
consonant–vowel syllable, simultaneously at the
conditions of 54 min each, in which /da/ stimuli were
auditory brainstem and cortical level. Stimuli were
presented binaurally via insert earphones at 72.5-dB
pseudo-randomly omitted and we compared responses
SPL and at an average presentation rate of 11.1 Hz.
to stimuli and omissions between brainstem and cortex
Stimulus presentation was controlled by custom Matlab
and between temporally regular and irregular
scripts (The MathWorks, and
sequences. If portions of the auditory brainstem track
stimuli were generated and played digitally by a
temporal regularity in sound sequences and contribute
dedicated signal processor (RX6, Tucker-Davis
to beat extraction, then they should respond to omitted
Technologies, at a 50-kHz sampling rate.
sounds during isochronous but not irregular sequences.
Within conditions, stimuli were presented in either an
An absence of such responses to omitted stimuli in both
isochronous or a temporally jittered (irregular) sequence
conditions would imply that the brainstem does not
(Fig 1). In the isochronous condition, all sounds were
contribute to these brain mechanisms, or that the
presented with a stimulus onset asynchrony (SOA) of
contribution does not result in an evoked response. If
90 ms (inter-stimulus interval of 50 ms), whereas in the
brainstem structures entrain to repetitive sound
irregular condition the SOA was randomly selected from
sequences, we would also expect a more robust
a uniform distribution ranging from 57 ms to 127 ms in
frequency encoding of these responses during
each trial (inter-stimulus intervals between 17–87 ms).
isochronous than irregular sequences, as well as higher
The average onset asynchrony was 90 ms in both
neural consistency. In the cortex, we expected to
conditions. Stimuli were infrequently (8%) omitted from
replicate previous findings in the form of a typical tri-
each sequence. Omissions were always separated by 8
phasic response evoked by omissions.
to 18 /da/ stimuli. Five silent blocks of 160 trials without
stimulus presentation were inserted at pseudorandom
EXPERIMENTAL PROCEDURES times during both conditions.
Each condition consisted of 31250 /da/ trials, 2500
Participants omission trials and 2560 silent trials.
Fifteen healthy normal-hearing participants (3 males)
between the ages of 22 and 41 years were recruited to
EEG recordings
take part in the experiment. Participants gave informed
consent and received a financial compensation for their EEG signals were recorded from five active sintered Ag-
time. One participant was excluded due to a noisy EEG AgCl electrodes placed at the hairline fronto-central
signal. The experimental procedure conformed to the electrode (Fpz), the vertex (Cz), the seventh cervical
World Medical Association’s Declaration of Helsinki and (C7), and the mastoids (M1 and M2). Active electrodes
was approved by the Research Ethics Committee of contain the first amplifier stage within the electrode
the Faculty of Arts and Sciences of the University of cover and provide impedance transformation on the
Montreal. electrode to prevent interference currents from
generating significant impedance-dependent nuisance
voltages. We therefore did not control electrode
impedances but rather kept direct-current offset close to
A synthetic consonant–vowel syllable, /da/, was used to zero during electrode placement. Reference-free
evoke brainstem and cortical responses. Evoked and electrode signals were amplified, sampled at 16384 Hz
frequency-following responses (FFRs) to this stimulus (ActiveTwo amplifier, BioSemi,, and
have been studied extensively (Russo et al., 2004; Skoe stored for offline analysis.
and Kraus, 2010a). In a pilot experiment, this stimulus
evoked larger-amplitude responses than other simple
Data analysis
and complex tones and syllables. The stimulus is 40-ms
long and its fundamental frequency varies slightly Data analysis was performed in Matlab (The Mathworks,
between 117 and 121 Hz over this duration. using eeglab (Delorme
308 A. Lehmann et al. / Neuroscience 337 (2016) 306–314

A Isochronous condition measured the spectral amplitude of frequency bins from

105 Hz to 135 Hz (corresponding to the stimulus F0). In
da da sion
da silence addition, we measured the root mean square amplitude
(RMS) for each FFR in the same time window of 5 ms
40ms 50ms 40ms 160 x SOA to 50 ms after stimulus onset. Furthermore, we measured
SOA = 90ms
trial-by-trial consistency of the responses to the sound
stimuli in the isochronous and irregular conditions by com-
B Irregular condition
puting correlation coefficients between even- and odd-
numbered trials (Hornickel and Kraus, 2013). Response
omis consistency was found to be lower in non-bilingual adults
da da sion da silence
(Krizman et al., 2014), and poor readers (Hornickel and
min.17ms mean 50ms max. 87ms 160 x SOA Kraus, 2013). It has been interpreted as increased neural
variability as shown in experimental-animal models
Fig. 1. Presentation of stimuli in isochronous and irregular (Centanni et al., 2014).
sequences. Both sequences contain /da/ syllables, infrequent syllable
Spectral amplitude measures were entered into two
omissions, and blocks of 160 silent SOAs. (A) In the isochronous
condition stimuli were presented with an SOA of 90 ms (ISI 50 ms). different repeated-measures ANOVAs with factors
(B) In the irregular condition the SOA varied between 57 and 127 ms ‘‘sequence regularity” (isochronous or irregular) and
with an average of 90 ms, and the ISI varied between 17 ms and stimulus type: the first ANOVA compared omissions and
87 ms with an average of 50 ms. silent blocks to check for effects of entrainment
(increased response amplitude within sound sequence),
and Makeig, 2004), erplab (Lopez-Calderon and Luck, the second ANOVA compared prior /da/ and post /da/
2014) and letswave ( stimuli to check for effects of adaptation (increased
wave) toolboxes, as well as custom Matlab scripts. Fol- response amplitudes after omissions). The same
lowing published procedures, responses were bandpass procedure was applied to the RMS measures.
filtered into a high- or low-frequency band in order to iso- Response consistency was compared across conditions
late subcortical and cortical components of the EEG sig- in a repeated measures ANOVA with factors ‘‘sequence
nal (Musacchia et al., 2008; Skoe and Kraus, 2010a; regularity” (isochronous or irregular) and stimuli type
Lehmann et al., 2015). For the subcortical response, data (prior /da/ or post /da). Post-hoc t-tests were used to
from a Fpz were re-referenced to the seventh cervica detect significant differences at 5% alpha level.
(C7), as this montage has been shown to elicit a brain-
stem FFR with a better signal-to-noise ratio than the tradi- Cortical Event-Related potentials and steady-state-
tional montage (Cz referenced to mastoids) used for evoked potentials. To analyze cortical responses, EEG
auditory-evoked potentials (King and Sininger, 1992; signals from electrode Cz were re-referenced to the
Krishnan et al., 2005; Cousineau et al., 2015). For the cor- average signal recorded from the left and right mastoid,
tical response, recording from the vertex electrode (Cz) bandpass-filtered between 0.1 Hz and 50 Hz, and
was referenced to linked mastoids, as done for the corti- downsampled to 256 Hz.
cal components by a previous studies that simultaneously In order to compute event-related potentials, epochs
recorded cortical and subcortical activity (Musacchia spanning from -100 to 600 ms relative to the onset of
et al., 2008). omissions and relative to the onset of /da/ stimuli. We
selected /da/ stimuli that were preceded and followed by
Brainstem frequency-following responses. To analyze four other /da/ stimuli to exclude inconsistent responses
auditory brainstem responses, continuous EEG due to adjacent omissions or silent trials. Epochs
recordings from the hairline electrode (Fpz) were re- containing artifacts were rejected by visual inspection
referenced to C7, bandpass-filtered off-line between and by using EEGLAB’s automatic iterative rejection
100 Hz and 2 kHz, and split into epochs spanning from procedure with an initial threshold of five standard
20 ms to +70 ms relative to stimulus onset (Krishnan deviations (Delorme and Makeig, 2004). Individual peak
et al., 2005; Skoe and Kraus, 2010a). Epochs containing amplitudes of the most prominent negative peak and the
artifacts were rejected by visual inspection and by using most prominent positive peak were extracted from
eeglab’s automatic iterative rejection procedure with an responses to omissions in the isochronous and irregular
initial threshold of five standard deviations (Delorme and conditions and the peak-to-peak amplitude differences
Makeig, 2004). FFRs were obtained by averaging epochs were computed and compared between conditions with
separately for each condition and trial type: responses to / a t-test.
da/ stimuli preceding an omission (‘‘prior /da/”), those just We also computed steady-state-evoked potentials
after an omission (‘‘post /da/”), the omissions themselves (SSEPs) to capture neural entrainment to the
(‘‘omission”), and silent trials excluding the first 50 trials of presentation rate of the stimuli (11.1 Hz). For each
each silent block (‘‘silence”). participant, 15 epochs of 33-s time-locked to the onset
Because the sustained period of the FFR reflects of a /da/ stimulus were extracted from the isochronous
phase-locking to characteristics of the stimulus and irregular conditions such that no epoch contained
waveform (Song et al., 2008; Skoe and Kraus, 2010a), silent trials. Those epochs were averaged and
we computed a fast Fourier transform (FFT) of each transformed into the frequency domain using a FFT.
FFR between 5 ms and 50 ms after stimulus onset and The signal-to-noise ratio of the frequency-domain signal
A. Lehmann et al. / Neuroscience 337 (2016) 306–314 309

was increased by subtracting, for each frequency bin, the Cortical SSEP. In the cortex, we found a clear peak in
average amplitude measured at three neighboring the spectral amplitude at the repetition rate of the stimuli
frequency bins (Nozaradan et al., 2011). The spectral in the isochronous sequence, but not in the irregular
amplitude at the presentation rate was compared sequence (t13 = 4.261, p = <.05; see Fig. 3C).
between isochronous and irregular conditions using a
two-tailed paired-sample t-test. Pre vs post. In addition, the FFR analysis showed
enhanced responses to stimuli presented directly after
an omission compared to those presented before an
omission in both conditions (Fig. 3B), in the form of
higher spectral amplitude at the fundamental frequency
Cortex but not brainstem responds to omitted of the stimuli (F1,13 = 12.202, p = <.05, isochronous
sounds sequence: t13 = 3.160, p = <.05, irregular sequence:
t13 = 2.284, p = <.05) and higher RMS amplitude
In the cortex, we found a clear evoked response to
(F1,13 = 59.039, p = <.05, isochronous sequence:
omitted stimuli in both regular and jitter sequences
t13 = 4.123, p = <.05, irregular sequence:
(Fig. 2A). Negative-to-positive peak-to-peak amplitude
t13 = 7.863, p = <.05).
was larger for omissions occurring in the isochronous
compared to the irregular sequence (t13 = 6,200,
p = <.05). In the brainstem, no response was apparent DISCUSSION
to omissions in either condition, and brainstem activity in To trace entrainment-related processes along the
omission trials did not differ from activity in the silent auditory pathway, we simultaneously recorded cortical
intervals (Repeated measures ANOVA, interaction effect and subcortical responses to isochronous and irregular
of condition* stimuli type, F1,13 = 1.395, p > .05, sequences of sounds containing sporadic omissions.
Fig. 3B). Sounds flanking omissions evoked clear We asked whether early brainstem centers respond to
responses in both conditions (Fig. 2B). Note that here infrequent sound omissions and whether presentation
the term ‘‘omission” is used generally to address cortical regularity modulates the auditory brainstem response.
and brainstem activity obtained from the presentation of We observed responses evoked by omissions in the
an omitted stimulus, and therefore should not be auditory cortex, but not in the brainstem. However,
confused with the terms omission N1 or omission MMN brainstem responses showed higher inter-trial
obtained from other protocols (Yabe et al., 1997; consistency when stimuli were presented in an
SanMiguel et al., 2013), because cortical omission isochronous sequence than in an irregular sequence.
responses may indicate several high-level mechanisms, We also observed a steady-state-evoked response at
such as predictive coding (see introduction), but brain- the average presentation rate in the cortex during
stem omission responses, if they exist, may indicate isochronous, but not jittered presentation. Lastly we
other, perhaps lower-level, mechanisms. found stronger adaptation in the cortex and brainstem of
responses to stimuli that preceded omissions than those
that followed omissions.
Enhanced brainstem and cortical encoding of
isochronous sound sequence Cortex, but not brainstem responds to omitted
Despite the lack of evoked responses to omissions in the
brainstem, we found effects of stimulus sequence We did not observe responses evoked by omissions in
regularity. The intertrial consistency of brainstem the brainstem, even though auditory cortex responded
responses to sounds presented in the isochronous strongly to these omissions (Busse and Woldorff, 2003;
sequence was higher than that in the irregular sequence Zanto et al., 2006; Honing et al., 2012). Cortical omission
(repeated measures ANOVA, main effect of condition, responses showed the expected latency and morphology
F1,13 = 40.504, p < .05, Fig. 3A). In order to test (Bendixen et al., 2012), and were larger in the isochro-
whether this effect could simply be explained by the nous than in the irregular sequence, likely because
length of the prior silent interval, we compared the entrainment and the build-up of temporal expectancy
consistency obtained in the irregular condition to a increases with sequence regularity (Schwartze et al.,
worst-case scenario where each of the two bins used to 2011).
compute consistency were composed of the larger and Similar findings were reported by Oceák and
smaller SOA trials respectively. This worst-case colleagues, who found that the amplitude of standard
scenario consistency measure did not differ significantly minus omission responses was significantly higher for
from the originally computed one (t26 = 0.1013, isochronous SOA compared to random SOA (Oceák
p = 0.92). et al., 2006). Although the interpretation of omission
Additionally, sound stimuli following omissions (‘post / responses is still under debate (Predictive coding:
da/’) showed higher consistency than stimuli preceding Cornella et al., 2015; regularity violation: Näätänen et al.,
omissions (‘prior /da/’) (main effect of stimulus type, 2007; sound offset rebound: Horváth et al., 2010; loudness
F1,13 = 42.942, p = <.05, isochronous sequence: changes: Oceák et al., 2006; Fishman, 2014;), there is
t13 = 6.187, p = <.05, irregular sequence: mounting evidence for predictive coding in the auditory
t13 = 3.802, p = <.05). system (Bendixen et al., 2012) and computational models
310 A. Lehmann et al. / Neuroscience 337 (2016) 306–314

A Isochronous B Isochronous
4 Irregular 4

Amplitude[ µV]

Amplitude [µV]
2 2

0 0

-2 -2

-100 0 100 200 300 400 500 -100 0 100 200 300 4 00 500
Time (ms) Time (ms)

C Isochronous Prior /da/ D Irregular Prior /da/

0.6 Post /da/ 0.6 Post /da/
0.5 Omission 0.5 Omission
0.4 0.4
Amplitude [µV]

Amplitude [µV]
0.3 0.3
0.2 0.2
0.1 0.1
0 0
-0.1 -0.1
-0.2 -0.2
-0.3 -0.3
-0.4 -0.4
-10 0 10 20 30 40 50 60 -10 0 10 20 30 40 50 60
Time (ms) Time (ms)

Fig. 2. Grand average of cortical and brainstem potentials measured in the different stimulus conditions. Cortical responses to omissions (A) and to
/da/ stimuli (B) in the isochronous (black line) and the irregular condition (gray dashed line). Responses to the ongoing sound sequence have been
subtracted to omissions. Brainstem responses to /da/ stimuli just before an omission (pale gray dashed line), to /da/ stimuli right after an omission
(black line) and to the omission (dark gray line) in the isochronous (C) and irregular conditions (D). No omission-evoked response was apparent.

Fig. 3. (A) Consistency index of /da/ events in brainstem. It corresponds to the coefficient of correlation between odd and even trials within each
condition. Isochronous sequences (gray bars) elicited responses of higher inter-trial consistency than irregular sequences (white bars). The highest
consistency was observed for stimuli directly following an omission (Post). (B) Spectral FFR amplitudes at stimulus F0 (120 Hz) for different
stimulus conditions in isochronous (gray bars) and irregular sequences (white bars). Responses to stimuli directly following an omission (Post) were
larger than those preceding omissions (Prior). No amplitude differences were found between omissions and silent periods. (C) Spectral SSEP
amplitudes at average stimulus presentation rate in the cortex. Values were extracted from epochs containing stimuli and omissions but excluding
silent blocks. No response was observed in the irregular condition. All panels show mean values ± SEM. Asterisks highlight significant differences
at p < .05.

of such mechanisms account for omission responses changes in wave V (thought to arise from the inferior col-
(Wacongne et al., 2012; Schröger et al., 2014). The lack liculus) or earlier potentials (Slabu et al., 2010; Grimm
of omission responses in the auditory brainstem in either et al., 2011). There is conflicting evidence for the larger
sequence presentations conditions supports the notion question whether brainstem processing is affected by
that the auditory brainstem faithfully encodes short-term cognitive and task demands such as attentional focus
acoustic properties of a stimulus (Galbraith et al., 1995; and listening context (Chandrasekaran et al., 2009; Skoe
Skoe and Kraus, 2010a), but does not contribute and Kraus, 2010b; Lehmann and Schönwiesner, 2014)
significantly to extracting regularities in sound input and or not (Varghese et al., 2015).
to predicting future input based on such regularities. This In the present paper, omission duration and frequency
hypothesis is supported by previous results. Two previous was suitable to elicit cortical omission responses.
studies examined ABRs in a predictive coding paradigm However, because auditory cortex tends to respond
and found no evidence for responses evoked by sound better to slower amplitude modulations than auditory
A. Lehmann et al. / Neuroscience 337 (2016) 306–314 311

brainstem nuclei (Giraud et al., 2000), it is possible that beta oscillations (Iversen et al., 2009; Fujioka and
omissions in faster stimulus sequences would have trig- Trainor, 2012), and gamma oscillations (Snyder and
gered brainstem omission responses. Large, 2005; Zanto et al., 2006). Our study clearly shows
a difference in cortical SSEPs between isochronous and
Consistency of brainstem responses depends on irregular conditions, but the relationship between sus-
sequence regularity tained cortical oscillation and parametric variations of tim-
ing variability remains to be investigated.
Even if cognitive activity changes brainstem processing in
the short term, such a modulation is likely mediated by Differential adaptation mechanisms in cortex and
efferent projections (Tzounopoulos and Kraus, 2009) brainstem
which modulate ongoing subcortical activity, but may not
generate activity, and would thus be insufficient to pro- The observed larger responses to stimuli after an
duce overt responses to omissions. Our finding of higher omission in both the cortical and subcortical recordings
consistency of the brainstem response to sounds pre- may be due to adaptation (Thompson and Spencer,
sented in an isochronous sequence indicates that, while 1966; Ulanovsky et al., 2003). Neural adaptation at the
no overt omission response was present, the responses brainstem level has been observed in the transient audi-
to sound stimuli were modulated by sequence regularity. tory brainstem response to clicks (Lasky, 1997; Hood,
To rule out that the lower consistency in the irregular con- 1998) but for the first time it is reported here in the FFR.
dition is due to bottom-up adaptation effects, we used a This is not trivial since the FFR is not predictable from
worst-case scenario computation of consistency, and the response to click and tones and may involve distinct
showed that this entrainment effect cannot be explained neural generators (Galbraith et al., 1998; Palmer and
solely by the variability in SOA across trials. Shamma, 2004; Johnson et al., 2008; Skoe and Kraus,
Higher response consistency has been associated 2010a).
with more robust processing of sounds and increased
temporal precision of encoding (Schatteman et al., Subcortical involvement in beat processing
2008; Anderson et al., 2012). Studies in animals suggest
that thalamic neurons entrain to repetitive sound stimula- The aforementioned mechanisms – response to omitted
tion at low repetitions rates (Gao et al., 2009). It may thus stimuli, modulation by sequence regularity and
be that a lower rate would have resulted in measurable adaptation – differentially observed in cortical and
omission responses. Recording brainstem responses to brainstem responses, underlie the capacity to extract
sequences with long SOA is difficult due to the large num- and process beat/rhythm, and therefore point to a
ber of repetitions required for sufficient signal quality. It selective involvement of the brainstem (Zanto et al.,
has been demonstrated that cortical responses to an 2006). However the protocol used in the current study
unexpected offset of regular sequences are faster when does not allow us to elaborate on such processes
the sequence is attended than during passive listening because our SOA (90 ms period) was below the range
(Yamashiro et al., 2009; Andreou et al., 2015), however, of human tempo perception (200 ms/300 BPM –
this was not the case for isochronous sequences. We 1000 ms/60 BPM: Drake and Botte, 1993).
thus think that attending to the sequences would likely In sum, our results suggest that stimulus omissions in
not have increased the likelihood of measuring an overt fast sound sequences do not trigger measurable omission
omission response. responses in the human ABR. The human auditory
brainstem response selectively exhibits relevant
properties related to beat extraction (sequence
Cortical responses are modulated by sequence regularity modulation, adaptation).
Cortical event-related responses are modulated by the Acknowledgments—This work was funded by the National
temporal structure of sound sequences (Luck, 2005; Science and Engineering Council of Canada. We thank Dr. Nina
Lakatos et al., 2008; Schwartze et al., 2011). Here we Kraus for providing the /da/ stimulus. MS was supported by a
career grant from the Fonds de Recherche du Quebec – Sante.
found cortical steady-state-evoked potentials (SSEPs)
DJA was supported by an undergraduate research stipend from
with clear spectral amplitude at the average rate of the National Science and Engineering Council of Canada.
stimulation in the isochronous sequence but not in the
irregular sequence. Such sustained activity in response
to isochronous sequences may have contributed to the
observed increased intertrial consistency in brainstem
response, potentially through corticofugal connections Abecasis D, Brochard R, Del Rı́o D, Dufour A, Ortiz T (2009) Brain
(Tzounopoulos and Kraus, 2009). A sustained cortical lateralization of metrical accenting in musicians. Ann N Y Acad Sci
oscillation at the beat frequency may underlie several pre- 1169:74–78. available at:
vious results (Arnal and Giraud, 2012; Merchant et al., 19673756 [accessed August 3, 2015].
2015), including that sounds falling on a beat of a rhythmic Anderson S, Parbery-Clark A, White-Schwoch T, Kraus N (2012)
Aging affects neural precision of speech encoding. J Neurosci
tone sequence evoke higher-amplitude event-related
potentials (Brochard et al., 2003; Abecasis et al., 2009; Andreou LV, Griffiths TD, Chait M (2015) Sensitivity to the temporal
Potter et al., 2009; Winkler et al., 2009; Schaefer et al., structure of rapid sound sequences – an MEG study. Neuroimage
2011; Tierney and Kraus, 2014), evoked and induced 110:194–204.
312 A. Lehmann et al. / Neuroscience 337 (2016) 306–314

Arnal LH, Giraud AL (2012) Cortical oscillations and sensory [accessed
predictions. Trends Cogn Sci 16:390–398. August 3, 2015].
10.1016/j.tics.2012.05.003. Fishman YI (2014) The mechanisms and meaning of the mismatch
Bendixen A, SanMiguel I, Schröger E (2012) Early negativity. Brain Topogr 27:500–526.
electrophysiological indicators for predictive processing in Fujioka T, Trainor L (2012) Internalized timing of isochronous sounds
audition: a review. Int J Psychophysiol 83:120–131. available at: is represented in neuromagnetic beta oscillations. J Neurosci 32:1791–1802. available at:
Brochard R, Abecasis D, Potter D (2003) The ‘‘Ticktock” of our 22302818 [accessed August 3, 2015].
internal clock direct brain evidence of subjective accents in Galbraith GC, Arbagey PW, Branski R, Comerci N, Rector PM (1995)
isochronous sequences. Psychology:362–366. available at: http:// Intelligible speech encoded in the human brain stem frequency- [accessed August 3, following response. Neuroreport 6:2363–2367.
2015]. Galbraith GC, Bhuta SM, Choate AK, Kitahara JM, Mullen Jr TA
Brown J (1993) Determination of the meter of musical scores by (1998) Brain stem frequency-following response to dichotic
autocorrelation. J Acoust Soc Am 94:1953–1957. available at: vowels during attention. Neuroreport 9:1889–1893. Gao L, Meng X, Ye C, Zhang H, Liu C, Dan Y, Poo M-M, He J, Zhang
407518 [accessed August 3, 2015]. X (2009) Entrainment of slow oscillations of auditory
Busse L, Woldorff MG (2003) The ERP omitted stimulus response to thalamic neurons by repetitive sound stimuli. J Neurosci
‘‘no-stim” events and its implications for fast-rate event-related 29:6013–6021.
fMRI designs. Neuroimage 18:856–864. Gasser M, Eck D (1996) Representing rhythmic patterns in a network
Casagrande N, Eck D (2005) Finding meter in music using an of oscillators. In: Proceedings of the 4th International Conference
autocorrelation phase matrix and Shannon finding meter in music on Music Perception and Cognition. New Jersey: Lawrence
using an autocorrelation phase matrix and Shannon entropy. In: Erlbaum Associates Hillsdale, NJ. p. 361–366. Available at:
Proceedings of the 6th International Conference on Music
Information Retrieval (ISMIR). London: University of London. p. 4792&rep=rep1&type=pdf [accessed August 3, 2015].
504–509. Gasser M, Eck D, Port R (1999) Meter as Mechanism: A Neural
Centanni TM, Chen F, Booker AM, Engineer CT, Sloan AM, Network Model that Learns Metrical Patterns. Connect Sci.
Rennaker RL, LoTurco JJ, Kilgard MP (2014) Speech sound available at:
processing deficits and training-induced neural plasticity in rats 095400999116331 [accessed August 3, 2015].
with dyslexia gene knockdown. PLoS One 9. Giraud A, Lorenzi C, Ashburner J, Wable J, Johnsrude I, Frackowiak
Chandrasekaran B, Hornickel J, Skoe E, Nicol T, Kraus N (2009) R, Kleinschmidt A, Wolfgang J, Lorenzi C, Ashburner J,
Context-dependent encoding in the human auditory brainstem Johnsrude I, Frackowiak R (2000) Representation of the
relates to hearing speech in noise: implications for developmental temporal envelope of sounds in the human brain. J
dyslexia. Neuron 64:311–319. available at: http://www. Neurophysiol 84:1588–1598. Grahn Ja (2009) The role of the basal ganglia in beat perception:
pmcentrez&rendertype=abstract [accessed September 7, 2014]. neuroimaging and neuropsychological investigations. Ann N Y
Chen JL, Penhune VB, Zatorre RJ (2008) Listening to musical Acad Sci 1169:35–45. available at:
rhythms recruits motor regions of the brain. Cereb Cortex pubmed/19673753 [accessed July 28, 2015].
18:2844–2854. available at: Grahn J, Brett M (2007) Rhythm and beat perception in motor areas
pubmed/18388350 [accessed August 3, 2015]. of the brain. Cogn Neurosci J 19:893–906.
Chen JL, Penhune VB, Zatorre RJ (2009) The role of auditory and Grahn JA, Rowe JB (2013) Finding and feeling the musical beat:
premotor cortex in sensorimotor transformations. Ann N Y Acad striatal dissociations between detection and prediction of
Sci 1169:15–34. available at: regularity. Cereb Cortex 23:913–921. available at: http://www.
pubmed/19673752 [accessed August 3, 2015].
Cornella M, Bendixen A, Grimm S, Leung S, Schröger E, Escera C pmcentrez&rendertype=abstract [accessed August 3, 2015].
(2015) Spatial auditory regularity encoding and prediction: human Grimm S, Escera C, Slabu L, Costa-Faidella J (2011)
middle-latency and long-latency auditory evoked potentials. Brain Electrophysiological evidence for the hierarchical organization of
Res 1626:21–30. available at: auditory change detection in the human brain. Psychophysiology
retrieve/pii/S0006899315003078.. 48:377–384.
Cousineau M, Bidelman GM, Peretz I, Lehmann A (2015) On the Honing H, Merchant H, Háden GP, Prado L, Bartolo R (2012) Rhesus
relevance of natural stimuli for the study of brainstem correlates: monkeys (Macaca mulatta) detect rhythmic groups in music, but
the example of consonance perception. PLoS One 10:e0145439. not the beat. PLoS One 7:e51369. available at: http://www.
available at: & pmcentrez&rendertype=abstract [accessed March 25, 2015].
Delorme A, Makeig S (2004) EEGLAB: an open source toolbox for Hood LJ (1998) Clinical applications of the auditory brainstem
analysis of single-trial EEG dynamics including independent response. Singular Publishing Group. available at:
component analysis. J Neurosci Methods 134:9–21.
Desain P, Honing H (1999) Computational models of beat induction: Hornickel J, Kraus N (2013) Unstable representation of sound: a
the rule-based approach. J New Music Res 28:29–42. available biological marker of dyslexia. J Neurosci 33:3500–3504. available
at: at:
3123 [accessed August 3, 2015]. 12.2013.
Deutsch D (1983) The generation of two isochronous sequences in Horváth J, Müller D, Weise A, Schröger E (2010) Omission mismatch
parallel. Percept Psychophys 34:331–337. negativity builds up late. Neuroreport 21:537–541.
Drake C, Botte M-C (1993) Tempo sensitivity in auditory sequences: Iversen JR, Repp BH, Patel AD (2009) Top-down control of rhythm
Evidence for a multiple-look model. Percept Psychophys perception modulates early auditory responses. Ann N Y Acad Sci
54:277–286. 1169:58–73. available at:
Eck D (1999) Learning simple metrical preferences in a network of 19673755 [accessed July 2, 2015].
Fitzhugh-Nagumo oscillators. In: The Proceedings of the Twenty- Johnson KL, Nicol T, Zecker SG, Kraus N (2008) Developmental
First Annual Conference of the Cognitive Science Society. New plasticity in the human auditory brainstem. J Neurosci
Jersey: Lawrence Erlbaum Associates. 28:4000–4007. available at:
Eck D (2001) A positive-evidence model for rhythmical beat induction. abstract/28/15/4000 &
J New Music Res 30:187–200. available at: http://www. 28/15/4000.
A. Lehmann et al. / Neuroscience 337 (2016) 306–314 313

Jones MR (1974) Cognitive representations of serial patterns. Hum Luck SJ (2005) An introduction to the event-related potential
Inf Process Tutorials Perform Cogn 1. technique. 1st ed. MIT press.
Jones MR, Pfordresher PQ (1997) Tracking musical patterns using Merchant H, Grahn J, Trainor L, Rohrmeier M, Fitch WT (2015)
joint accent structure. Can J Exp Psychol Can Psychol Finding the beat: a neural perspective across humans and non-
expérimentale 51:271–291. human primates. Philos Trans R Soc Lond B Biol Sci
Jones MR, Yee W (1997) Sensitivity to time change: the role of 370:20140093. available at:
context and skill. J Exp Psychol Hum Percept Perform pubmed/25646516.
23:693–709. Musacchia G, Strait D, Kraus N (2008) Relationships between
King AJ, Sininger YS (1992) Electrode configuration for auditory behavior, brainstem and cortical encoding of seen and heard
brainstem response audiometry. Am J Audiol 1:63–67. available speech in musicians and non-musicians. Hear Res 241:34–42.
at: Näätänen R (1975) Selective attention and evoked potentials in
Konoike N, Kotozaki Y, Miyachi S, Miyauchi CM, Yomogida Y, humans: a critical review. Biol Psychol 2:237–307.
Akimoto Y, Kuraoka K, Sugiura M, Kawashima R, Nakamura K Näätänen R, Paavilainen P, Rinne T, Alho K (2007) The mismatch
(2012) Rhythm information represented in the fronto-parieto- negativity (MMN) in basic research of central auditory processing:
cerebellar motor system. Neuroimage 63:328–338. available at: a review. Clin Neurophysiol 118:2544–2590. available at: http:// [accessed July 1, [accessed July 9,
2015]. 2014].
Krishnan A, Xu Y, Gandour J, Cariani P (2005) Encoding of pitch in Nozaradan S, Peretz I, Missal M, Mouraux A (2011) Tagging the
the human brainstem is sensitive to language experience. Brain neuronal entrainment to beat and meter. J Neurosci
Res Cogn Brain Res 25:161–168. available at: http://www.ncbi. 31:10234–10240. available at: [accessed February 4, 2015]. pubmed/21753000 [accessed June 10, 2014].
Krizman J, Skoe E, Marian V, Kraus N (2014) Bilingualism increases Nozaradan S, Peretz I, Mouraux A (2012) Selective neuronal
neural response consistency and attentional control: evidence for entrainment to the beat and meter embedded in a musical
sensory and cognitive coupling. Brain Lang 128:34–40. rhythm. J Neurosci 32:17572–17581. available at: http://www.
Lakatos P, Karmos G, Mehta ADAD, Ulbert I, Schroeder CECECE [accessed July 15, 2015].
(2008) Entrainment of neuronal oscillations as a mechanism of Oceák A, Winkler I, Sussman E, Alho K (2006) Loudness summation
attentional selection. Science 320:110–113. available at: http:// and the mismatch negativity event-related brain potential in & http:// humans. Psychophysiology 43:13–20. Palmer A, Shamma S (2004) Physiological representations of
Large EW, Grondin S (2008) Resonating to musical rhythm: theory speech. In: Speech Processing in the Auditory
and experiment. Psychol Time:189–232. System. Springer. p. 163–230.
Large E, Jones M (1999) The dynamics of attending: how people Parncutt R (1994) Model of pulse salience and metrical accent a
track time-varying events. Psychol Rev 106:119–159. available perceptual in musical rhythms. Music Percept An Interdiscip J
at: [accessed 11:409–464.
August 3, 2015]. Peper CLE, Beek PJ, van Wieringen PCW (1995) Coupling strength
Large EW, Kolen JF (1994) Resonance and the perception of musical in tapping a 2: 3 polyrhythm. Hum Mov Sci 14:217–245.
meter. Conn Sci 6:177–208. Potter DD, Fenwick M, Abecasis D, Brochard R (2009) Perceiving
Large EW, Herrera JA, Velasco MJ (2015) Neural networks for beat rhythm where none exists: event-related potential (ERP)
perception in musical rhythm. Front Syst Neurosci 9:159. correlates of subjective accenting. Cortex 45:103–109. available
available at: at: [accessed
fcgi?artid=4658578&tool=pmcentrez&rendertype=abstract.. August 3, 2015].
Lasky RE (1997) Rate and adaptation effects on the auditory evoked Pressing J (1999) The referential dynamics of cognition and action.
brainstem response in human newborns and adults. Hear Res Psychol Rev 106:714.
111:165–176. Repp BH (1999) Detecting deviations from metronomic timing in
Lee KM, Skoe E, Kraus N, Ashley R (2009) Selective subcortical music: effects of perceptual structure on the mental timekeeper.
enhancement of musical intervals in musicians. J Neurosci Percept Psychophys 61:529–548.
29:5832–5840. available at: Russo N, Nicol T, Musacchia G, Kraus N (2004) Brainstem
pubmed/19420250 [accessed July 25, 2015]. responses to speech syllables. Clin Neurophysiol
Lehmann A, Schönwiesner M (2014) Selective attention modulates 115:2021–2030. available at:
human auditory brainstem responses: relative contributions articlerender.fcgi?artid=2529166&tool=pmcentrez&
of frequency and spatial cues. PLoS One 9:e85442. available rendertype=abstract [accessed November 19, 2014].
at: SanMiguel I, Saupe K, Schröger E (2013) I know what is missing
3893196&tool=pmcentrez&rendertype=abstract [accessed here: electrophysiological prediction error signals elicited by
August 7, 2014]. omissions of predicted” what” but not” when”. Front Hum
Lehmann A, Skoe E, Moreau P, Peretz I, Kraus N (2015) Impairments Neurosci 7.
in musical abilities reflected in the auditory brainstem: evidence Schaefer RS, Vlek RJ, Desain P (2011) Decomposing rhythm
from congenital amusia. Eur J Neurosci 42:1644–1650. http://dx. processing: electroencephalography of perceived and self- imposed rhythmic patterns. Psychol Res 75:95–106. available
Lerud KD, Almonte FV, Kim JC, Large EW (2014) Mode-locking at:
neurodynamics predict human auditory brainstem responses to 3036830&tool=pmcentrez&rendertype=abstract [accessed
musical intervals. Hear Res 308:41–49. available at: http://www. June 6, 2014]. [accessed August 3, 2015]. Schatteman TA, Hughes LF, Caspary DM, Caspary DM (2008) Aged-
Longuet-Higgins H, Lee C (1982) The perception of musical rhythms. related loss of temporal processing: altered responses to
Perception 11:115–128. available at: http://www.perceptionweb. amplitude modulated tones in rat dorsal cochlear nucleus.
com/perception/fulltext/p11/p110115.pdf [accessed August 3, Neuroscience 154:329–337.
2015]. Scheirer E (1998) Tempo and beat analysis of acoustic musical
Lopez-Calderon J, Luck SJ (2014) ERPLAB: an open-source signals. J Acoust Soc Am 103:588–601. available at: http://
toolbox for the analysis of event-related potentials. Front Hum
Neurosci 8:213. available at: http://www.pubmedcentral. [accessed August 3, 2015]. Schröger E, Bendixen A, Denham SL, Mill RW, Bo }hm TM, Winkler I
rendertype=abstract [accessed July 24, 2015]. (2014) Predictive regularity representations in violation detection
314 A. Lehmann et al. / Neuroscience 337 (2016) 306–314

and auditory stream segregation: from conceptual to Tierney A, Kraus N (2014) Neural entrainment to the rhythmic
computational models. Brain Topogr 27:565–577. available at: structure of music. J Cogn Neurosci:400–408. available at:
Schubotz R, von Cramon D (2001) Interval and ordinal properties of [accessed August 3, 2015].
sequences are associated with distinct premotor areas. Cereb Tzounopoulos T, Kraus N (2009) Learning to encode timing:
Cortex:210–222. available at: http://cercor.oxfordjournals. mechanisms of plasticity in the auditory brainstem. Neuron
org/content/11/3/210.short [accessed August 3, 2015]. 62:463–469. available at:
Schwartze M, Rothermich K, Schmidt-Kassow M, Kotz SA (2011) articlerender.fcgi?artid=2792730&tool=pmcentrez&
Temporal regularity effects on pre-attentive and attentive rendertype=abstract [accessed May 26, 2014].
processing of deviance. Biol Psychol 87:146–151. http://dx.doi. Ulanovsky N, Las L, Nelken I (2003) Processing of low-probability
org/10.1016/j.biopsycho.2011.02.021. sounds by cortical neurons. Nat Neurosci 6:391–398. available at:
Skoe E, Kraus N (2010a) Auditory brain stem response to complex [accessed March
sounds: a tutorial. Ear Hear 31:302–324. available at: http://www. 19, 2015]. van Noorden L, Moelants D (1999) Resonance in the perception of
pmcentrez&rendertype=abstract. musical pulse. J New Music Res 28:43–66. available at: http://
Skoe E, Kraus N (2010b) Hearing it again and again: on-line [accessed
subcortical plasticity in humans. PLoS One 5:e13645. available August 3, 2015].
at: Varghese L, Bharadwaj HM, Shinn-Cunningham BG (2015) Evidence
2964325&tool=pmcentrez&rendertype=abstract [accessed against attentional state modulating scalp-recorded auditory
June 18, 2014]. brainstem steady-state responses. Brain Res 1626:146–164.
Slabu L, Escera C, Grimm S, Costa-Faidella J (2010) Early change available at:
detection in humans as revealed by auditory brainstem and S000689931500520X.
middle-latency evoked potentials. Eur J Neurosci 32:859–865. Vos P, Van Dijk A, Schomaker L (1994) Melodic cues for metre.
Snyder JS, Krumhansl CL (2001) Tapping to ragtime: Perception-London 23:965–976. available at: http://www.
cues to pulse finding. Music Percept 18:455–489. available at: [accessed August 3, 2015].
UA&db_id=WOS&SID=N1keaj3j11cakI5aIIi&search_mode= Wacongne C, Changeux J-P, Dehaene S (2012) A neuronal model of
CitedFullRecord&isickref=120772261. predictive coding accounting for the mismatch negativity. J
Snyder JS, Large EW (2005) Gamma-band activity reflects the metric Neurosci 32:3665–3678.
structure of rhythmic tone sequences. Cogn Brain Res Winkler I, Denham SL, Nelken I (2009) Modeling the auditory scene:
24:117–126. predictive regularity representations and perceptual objects.
Song JH, Skoe E, Wong PCM, Kraus N (2008) Plasticity in the adult Trends Cogn Sci 13:532–540. available at: http://www.ncbi.nlm.
human auditory brainstem following short-term linguistic training. [accessed July 15, 2014].
J Cogn Neurosci 20:1892–1902. available at: http://www. Wong PCM, Skoe E, Russo NM, Dees T, Kraus N (2007) Musical experience shapes human brainstem encoding of linguistic pitch
pmcentrez&rendertype=abstract. patterns. Nat Neurosci 10:420–422. available at: http://www.ncbi.
Teki S, Grube M, Kumar S, Griffiths TD (2011) Distinct neural [accessed July 6, 2015].
substrates of duration-based and beat-based auditory timing. J Yabe H, Tervaniemi M, Reinikainen K, Näätänen R (1997) Temporal
Neurosci 31:3805–3812. available at: http://www.pubmedcentral. window of integration revealed by MMN to sound omission. Neuroreport:8. available at:
rendertype=abstract [accessed July 28, 2015]. Fulltext/1997/05260/Temporal_window_of_integration_revealed_
Tervaniemi M, Saarinen J, Paavilainen P, Danilova N, Näätänen R by_MMN_to.35.aspx.
(1994) Temporal integration of auditory information in sensory Yamashiro K, Inui K, Otsuru N, Kida T, Kakigi R (2009) Automatic
memory as reflected by the mismatch negativity. Biol Psychol auditory off-response in humans: an MEG study. Eur J Neurosci
38:157–167. 30:125–131.
Thompson RF, Spencer WA (1966) Habituation: a model Zanto T, Snyder J, Large E (2006) Neural correlates of rhythmic
phenomenon for the study of neuronal substrates of behavior expectancy. Adv Cogn Psychol 2:221–231. available at: ~large/
73:16–43. Publications/ZantoSnyderLarge2006.pdf" xlink:type="simple"
Tierney A, Kraus N (2013) The ability to move to a beat is linked to the id="ir330">
consistency of neural responses to sound. J Neurosci SnyderLarge2006.pdf [accessed August 3, 2015].

(Accepted 8 September 2016)

(Available online 22 September 2016)