Sound Repetition Rate in the Human Auditory Pathway

J Neurophysiol
88: 1433–1450, 2002; 10.1152/jn.00156.2002.
Sound Repetition Rate in the Human Auditory Pathway:
Representations in the Waveshape and Amplitude of fMRI Activation
MICHAEL P. HARMS1,2 AND JENNIFER R. MELCHER1,2,3
Eaton-Peabody Laboratory, Massachusetts Eye and Ear Infirmary, Boston 02114; 2Harvard–Massachusetts Institute of
Technology Division of Health Sciences and Technology, Speech and Hearing Bioscience and Technology Program,
Cambridge 02139; and 3Department of Otology and Laryngology, Harvard Medical School, Boston, Massachusetts 02115
1
Received 5 March 2002; accepted in final form 14 May 2002
INTRODUCTION
It is well known from human psychophysical experiments
that the perception of a succession of sounds depends strongly
on the rate of sound presentation. For instance, when bursts of
noise are presented repeatedly at a low rate (e.g., ⬍10/s), each
burst can be separately resolved (Miller and Taylor 1948;
Address for reprint requests: M. P. Harms, Eaton Peabody Laboratory, 243
Charles St., Boston, MA 02114 (E-mail: [email protected]).
www.jn.org
Symmes et al. 1955). In contrast, bursts presented at a higher
rate fuse to form a single, modulated percept. In experiments
where multiple series of sounds are presented simultaneously
(e.g., a series of high and a series of low frequency tone bursts),
the rate of sound presentation influences whether the series are
perceived as single or separate streams, as well as the perceived temporal pattern within each stream (Bregman 1990;
Royer and Robin 1986). The dependencies on rate observed in
controlled psychophysical experiments such as these suggest
that rate plays an important role in the perception of the more
complex acoustic conditions encountered in everyday life.
Since repetition rate plays so basic a role in determining how
sounds are heard, it is not surprising that there have been
numerous neurophysiological studies of rate in animals. Broad
trends concerning the coding of rate in the auditory pathway
have emerged from this work. For instance, the highest repetition rates at which neurons respond faithfully to each successive sound in a train (or each successive cycle of amplitude
modulated stimuli) tends to decrease from brain stem to thalamus to cortex (e.g., Creutzfeldt et al. 1980; Langner 1992;
Schreiner and Langner 1988). In cortex, the neural coding of
low and high rates may be accomplished by different populations of neurons, one coding low-rate stimuli through stimulussynchronized activity and the other coding high rates in the
overall amount of discharge activity (Lu and Wang 2000; Lu et
al. 2001). While the animal work has shed light on the neural
representations of repetition rate, the degree to which the
animal findings extend to humans remains uncertain because of
interspecies differences, anesthesia differences, and a paucity
of data in humans that can serve as a link to the animal work.
In the end, direct neurophysiological data in human listeners is
important if we are to understand how repetition rate is represented in the activity patterns of the human brain.
Most previous neurophysiological studies of repetition rate
in humans have used noninvasive techniques for probing brain
function, such as evoked potential and evoked magnetic field
measurements. The evoked response work has examined averaged responses at short, middle, and long latencies to various
types of brief stimuli (e.g., clicks, tone and noise bursts)
presented at different rates (Näätänen and Picton 1987; Picton
et al. 1974; Thornton and Coleman 1975). A particular strength
The costs of publication of this article were defrayed in part by the payment
of page charges. The article must therefore be hereby marked ‘‘advertisement’’
in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
0022-3077/02 $5.00 Copyright © 2002 The American Physiological Society
1433
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
Harms, Michael P. and Jennifer R. Melcher. Sound repetition rate
in the human auditory pathway: representations in the waveshape and
amplitude of fMRI activation. J Neurophysiol 88: 1433–1450, 2002;
10.1152/jn.00156.2002. Sound repetition rate plays an important role
in stream segregation, temporal pattern recognition, and the perception of successive sounds as either distinct or fused. This study was
aimed at elucidating the neural coding of repetition rate and its
perceptual correlates. We investigated the representations of rate in
the auditory pathway of human listeners using functional magnetic
resonance imaging (fMRI), an indicator of population neural activity.
Stimuli were trains of noise bursts presented at rates ranging from low
(1–2/s; each burst is perceptually distinct) to high (35/s; individual
bursts are not distinguishable). There was a systematic change in the
form of fMRI response rate-dependencies from midbrain to thalamus
to cortex. In the inferior colliculus, response amplitude increased with
increasing rate while response waveshape remained unchanged and
sustained. In the medial geniculate body, increasing rate produced an
increase in amplitude and a moderate change in waveshape at higher
rates (from sustained to one showing a moderate peak just after train
onset). In auditory cortex (Heschl’s gyrus and the superior temporal
gyrus), amplitude changed somewhat with rate, but a far more striking
change occurred in response waveshape—low rates elicited a sustained response, whereas high rates elicited an unusual phasic response that included prominent peaks just after train onset and offset.
The shift in cortical response waveshape from sustained to phasic with
increasing rate corresponds to a perceptual shift from individually
resolved bursts to fused bursts forming a continuous (but modulated)
percept. Thus at high rates, a train forms a single perceptual “event,”
the onset and offset of which are delimited by the on and off peaks of
phasic cortical responses. While auditory cortex showed a clear,
qualitative correlation between perception and response waveshape,
the medial geniculate body showed less correlation (since there was
less change in waveshape with rate), and the inferior colliculus
showed no correlation at all. Overall, our results suggest a population
neural representation of the beginning and the end of distinct perceptual events that is weak or absent in the inferior colliculus, begins to
emerge in the medial geniculate body, and is robust in auditory cortex.
1434
M. HARMS AND J. MELCHER
J Neurophysiol • VOL
METHODS
Four series of experiments were conducted. The first two examined
the effect of repetition rate on the response to a noise burst train in the
inferior colliculus (IC), Heschl’s gyrus (HG), and the superior temporal gyrus (STG; exp. I), or the IC and medial geniculate body
(MGB; exp. II). The remaining experiments (exps. III and IV) were
aimed at understanding one of the findings from exp. I, namely an
unusual form of temporal response in the cortex to trains with a high
repetition rate.
A total of 12 subjects participated in these experiments. They
ranged in age from 19 to 35 yr (mean ⫽ 25 yr). Ten of the subjects
were male. Nine were right-handed. Subjects had no known audiological or neurological disorders.
This study was approved by the institutional committees on the use
of human subjects at the Massachusetts Institute of Technology,
Massachusetts Eye and Ear and Infirmary, and Massachusetts General
Hospital. All subjects gave their written informed consent.
Exps. I and II: noise burst trains with different burst
repetition rates
Nine subjects participated in a total of 11 imaging sessions for exps.
I and II (exp. I: 5 sessions, subjects 1–5; exp. II: 6 sessions, subjects
2, 5, and 6 –9).
The stimuli were bursts of uniformly distributed white noise. Individual noise bursts in all four experiments were 25 ms in duration
(full-width half-maximum), with a rise/fall time of 2.5 ms. The bursts
were presented at repetition rates of 1, 2, 10, and 35/s (exp. I) or 2, 10,
20, and 35/s (exp. II). The 1/s rate was used in only three of the five
sessions of exp. I. The spectrum of the noise stimulus at the subjects’
ears was low-pass (6-kHz cutoff), reflecting the frequency response of
the acoustic system.
Noise bursts were presented in 30-s trains alternated with 30-s “off”
periods, during which no auditory stimulus was presented (Fig. 1,
top). Four alternations between “train on” and “off” periods constituted a single scanning “run” (total duration 240 s). For all but two
sessions (in exp. I), each of the four rates was presented once during
each run, and their order was varied across runs. Within a train, the
repeated noise bursts were identical (i.e., “frozen”), but the noise
bursts differed across trains and runs. For the other two sessions, the
same rate was presented throughout a run, and this rate was varied
across runs. For these two sessions, the noise burst was frozen within
a run, but differed across runs. In each session, the total number of
train presentations at each rate was between 8 –13.
Exp. III: small numbers of noise bursts
To investigate how the initial bursts of a train contribute to cortical
responses to the onset of a train, we examined the responses to a single
noise burst and short clusters of noise bursts. Responses were collected in three imaging sessions with three subjects (exp. III; subjects
2, 5, and 10).
Either one noise burst or a cluster of noise bursts (2 or 5) was
presented once every 18 s, constituting a single “trial” (Fig. 1, bottom). For the clusters of five noise bursts, the interstimulus interval
(ISI, onset-to-onset) between noise bursts was 28.6 ms, equivalent to
the ISI for a rate of 35/s. For clusters of two noise bursts, two different
ISIs were used: 500 ms (2/s rate) and 28.6 ms (35/s rate). For two
sessions, the same stimulus was used in all of the trials for a given run
(12 runs total; 270 s per run; 45 total repetitions per trial type). In the
third session (subject 10), the stimulus was randomized across trials (7
runs; 288 s per run; 28 repetitions per trial type).
Exp. IV: noise burst trains with different durations
The effect of train duration was examined in two imaging sessions
with two subjects (exp. IV; subjects 11 and 12). Trains of four
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
of evoked potential and magnetic field measurements is that they
can be used to examine responses to individual stimuli within a
train up to much higher rates than with other noninvasive brain
imaging techniques (see following paragraph). A limitation, however, is that the sites of response generation cannot always be
reliably localized. Evoked magnetic field examinations of repetition rate are further limited in that they provide information
mainly concerning cortical areas because of inherent limitations in
probing subcortical function (Erné and Hoke 1990).
Positron emission tomography (PET) and functional magnetic resonance imaging (fMRI), two techniques for spatially
mapping brain activity, have also been used to examine the
dependence of human brain activation on repetition rate. Compared with evoked potential and magnetic field measurement,
fMRI lacks the temporal resolution needed to separately resolve the responses produced by individual stimuli in a train
(except at extremely low rates, e.g., approximately 0.1/s), and
the temporal resolution of PET is even less. An important
advantage, however, is that both PET and fMRI enable activation to be directly localized to brain stem, thalamic, and
cortical structures of the auditory pathway (Griffiths et al.
2001; Guimaraes et al. 1998; Lockwood et al. 1999; Melcher et
al. 1999). The localization provided by fMRI is particularly
precise because of the technique’s high spatial resolution and
direct mapping to anatomy. Despite the fact that fMRI and PET
can show activation at different stages of the auditory pathway,
previous PET and fMRI studies varying rate have focused
largely on cortical areas (Binder et al. 1994; Dhankhar et al.
1997; Frith and Friston 1996; Giraud et al. 2000; Price et al.
1992; Rees et al. 1997; Tanaka et al. 2000). Additionally, most
of the studies focused on the low “rates” characteristic of
speech (e.g., ⬍2.5 words or syllables/s) because they were
directed at understanding speech processing. Overall, there is
limited PET or fMRI data concerning the representations of
rate within the human auditory pathway. Specifically, there is
little information concerning the transformation of rate representations from structure to structure within the pathway for a
wide range of psychophysically relevant rates.
The present fMRI study compared the representation of
repetition rate across cortical and subcortical structures of the
human auditory pathway using a wide range of rates. Stimuli
were trains of repeated noise bursts with repetition rates ranging from low (where each burst could be resolved individually)
to high (where individual bursts were not distinguishable and
the train was perceived as a continuous, but modulated, sound).
Noise bursts were chosen as the elemental stimulus based on
the assumption that broadband sound would elicit robust responses by activating neurons across a wide range of characteristic frequencies. fMRI was selected for its high spatial
resolution, its localizing capabilities, and its higher temporal
resolution (approximately 2 s) compared with PET (⬎10 s).
The latter feature proved important because one of the most
striking differences in rate representation across structures
occurred in the temporal dynamics of the fMRI response.
Portions of this work were presented at the annual meeting
of the Society for Neuroscience (1997), the 21st annual meeting of the Association for Research in Otolaryngology (1998),
the 4th and 5th International Conferences on Functional Mapping of the Human Brain (1998 and 1999), and in M. P. Harms’
doctoral thesis (Massachusetts Institute of Technology, 2002).
RATE REPRESENTATION IN HUMAN AUDITORY PATHWAY
1435
Task
Subjects were instructed to listen to the noise burst stimuli. At the
end of each scanning run, subjects reported their alertness on a
qualitative scale ranging from 1 (fell asleep during run) to 5 (highly
alert). Alertness ratings were almost always in the 3–5 range, and
were never 1.1
For exp. II, subjects performed the following task to further ensure
that they remained attentive: they indicated whenever they detected an
occasional increment or decrement in intensity (of 6 dB, lasting 1 s)
by raising or lowering their index finger. Each subject identified more
than 90% of the intensity changes.
Imaging
different durations (15, 30, 45, and 60 s) were presented with an “off”
period of 40 s following each train. Noise burst repetition rate within
each train was always 35/s. Each train duration was presented once
per run (8 –9 runs; 310 s per run) with the order of durations randomized across runs. Supplementary information concerning the effects of
train duration was obtained in two additional experiments that used a
single, long-train duration (60 s) and 35/s noise bursts.
Acoustic stimulation
Separately for each ear, the subject’s threshold of hearing to 10/s
noise bursts was determined in the scanner room. For all experiments,
the stimuli were presented binaurally at 55 dB above this threshold.
During both threshold determination and functional imaging, there
was an on-going low-frequency background noise produced primarily
by the pump for the liquid helium (used to supercool the magnet coils)
(Ravicz et al. 2000). This sound reaches levels of ⬃80 dB SPL in the
frequency range of 50 –300 Hz. Additionally during functional imaging, each image acquisition generated a “beep” of approximately 115
dB SPL at 1.0 kHz (1.5 T scanner) or ⬃130 dB SPL at 1.4 kHz (3 T
scanner, see Imaging). The stimuli for all experiments were clearly
audible during functional imaging. For the low-rate trains, an individual burst was occasionally masked by a coinciding imaging
“beep.” However, because the imaging was cardiac gated (see Imaging), this coincidence of a noise burst with image acquisition occurred
only infrequently throughout the low-rate trains.
Noise bursts were delivered through a headphone assembly that
provided approximately 30 dB of attenuation at the primary frequency
of the scanner-generated sounds (1.0 or 1.4 kHz; Ravicz and Melcher
2001). Specifically, the noise bursts were produced by a digital-toanalog board (running under LabView), amplified, and fed to a pair of
audio transducers housed in a shielded box adjacent to the scanner.
The output of the transducers reached the subject’s ears via air-filled
tubes that were incorporated into sound attenuating earmuffs.
J Neurophysiol • VOL
1
The subjects for the first two sessions of exp. III did report difficulties in
maintaining a high level of alertness due to the sparseness and uniformity of
the stimulus trials. Therefore to help maintain alertness, the subject (subject
10) for the third session was instructed to count the number of trials per run.
2
While there are also approaches for reducing the impact of the background
acoustic noise on auditory activation during multislice imaging (Edmister et al.
1999; Hall et al. 1999), these sacrifice temporal resolution or data-taking
efficiency (e.g., Melcher et al. 1999).
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
FIG. 1. Schematic of the stimulus paradigm for exps. I-III. In exps. I and II,
trains of noise bursts at a given repetition rate were presented for 30 s, followed
by a 30 s “off” period. This alteration was repeated 4 times for each imaging
“run,” typically using a different repetition rate for each “train on” period. Tick
marks represent an image acquisition (approximately every 2 s). The expanded
view uses a smaller time scale to illustrate the stimulus—in this case a portion
of a prolonged 10/s noise burst train. In exp. III, “trials” consisting of either 1,
2, or 5 noise bursts were presented once every 18 s (15–16 trials per run). The
interstimulus interval for the 2 noise bursts was either 500 ms or 28.6 ms (i.e.,
2 NBs@2/s and 2 NBs@35/s; in this case the expanded view shows the
complete stimulus for a given trial). The trials with 5 noise bursts used an
interstimulus interval of 28.6 ms.
Subjects were imaged using a 1.5 or 3 Tesla whole-body scanner
(General Electric) and a head coil (transmit/receive). The scanners
were retrofitted for high-speed imaging (i.e., single-shot echo-planar
imaging; Advanced NMR Systems, Inc.). Exps. I and II were conducted at 1.5 T. Exps. III and IV were conducted at 3 T (except for one
of the supplementary sessions of exp. IV). Subjects rested supine in
the scanner. To reduce head motion, a bite bar was custom-molded to
the subject’s teeth and mounted to the head coil, or pillow and foam
were packed snugly around the head. Each imaging session lasted
approximately 2 h and included the following procedures:
1) Contiguous sagittal images of the whole head were acquired.
2) An automated, echo-planar– based shimming procedure was
performed to increase magnetic field homogeneity within the brain
regions to be functionally imaged (Reese et al. 1995).
3) The brain slice to be functionally imaged was selected using the
sagittal images as a reference. For exps. I, III, and IV, the selected
(near-coronal) slice intersected the IC and the posterior aspect of HG
and STG in both hemispheres (Fig. 2, left and middle). When multiple
transverse temporal gyri were present, the anterior one was intersected. Based on these criteria, we expect that a portion of primary
auditory cortex was intersected in both hemispheres of all subjects for
exps. I, III, and IV (Rademacher et al. 1993, 2001). For exp. II, the
slice intersected the IC and MGB (located just ventral and lateral to
the cerebral aqueduct; Fig. 2, right). A single slice, rather than
multiple slices, was imaged in all experiments to reduce the impact of
scanner-generated acoustic noise on auditory activation.2
4) A T1-weighted, high-resolution anatomical image was acquired
of the selected brain slice for subsequent overlay of the functional data
[thickness ⫽ 7 mm; in-plane resolution ⫽ 1.6 ⫻ 1.6 mm; TR ⫽ 10 s,
TI ⫽ 1200 ms, TE ⫽ 40 ms (exps. I and II) or 57 ms (exps. III and
IV)]. A second high-resolution anatomical image was acquired at the
end of the session after functional imaging. A comparison of the initial
and final T1 images allowed for a gross check of subject movement
over the session.
5) Functional images of the selected slice were acquired using a
blood oxygenation level– dependent (BOLD) sequence (sessions at
1.5 T: asymmetric spin echo, TE ⫽ 70 ms, ␶ offset ⫽ ⫺25 ms, flip ⫽
90°; sessions at 3 T: gradient echo, TE ⫽ 40 or 30 ms, flip ⫽ 90 or
60°). Slice thickness was 7 mm. In-plane resolution was 3.1 ⫻ 3.1
mm. The beginning of each scanning “run” included four discarded
images to ensure that image signal level had approached a steady
state. During the remainder of the run, functional images of the
selected slice were acquired repeatedly (Fig. 1).
1436
M. HARMS AND J. MELCHER
Functional imaging was performed using a cardiac gating method
that increases the detectability of activation in the inferior colliculus
(Guimaraes et al. 1998) [except for one session of exp. III that used a
fixed interimage interval (TR) of 2 s]. Image acquisitions were synchronized to every other QRS complex in the subject’s electrocardiogram, and the interimage interval (TR) was recorded. The average TR
across all sessions was 2,035 ms (the average within a session varied
from 1,521 to 2,650 ms). Fluctuations in heart rate lead to variations
in TR that result in image-to-image variations in image signal strength
(i.e., T1 effects). Using the measured TR values, image signal was
corrected to account for these variations (Guimaraes et al. 1998).
Analysis
The images for each scanning run were
corrected for any movements of the head that may have occurred over
the course of the imaging session. Each functional image of a session
was translated and rotated to fit the first image of the first functional
run using standard software (SPM95; without spin history correction;
Friston et al. 1995, 1996). Because only one functional slice was
acquired, these corrections for motion were necessarily limited to
adjustments within the imaging plane. In most cases, the motion
correction algorithm was well-behaved and resulted in an improvement in image alignment. However, for one session, the algorithm
introduced some clearly artifactual movement, so the premotion corrected data were utilized. Additionally, we did not include the MGB
of one subject in the analysis, because the image translations calculated by the motion-correction algorithm were smaller than the movement evident at the location of the MGB in the T1 anatomical images
acquired pre- and post- functional imaging. A similar discrepancy did
not occur for the IC of this subject, so the IC data were included. The
images for each run were further processed in two ways to enhance
the likelihood of detecting activation. 1) Image signal versus time for
each voxel was corrected for linear or quadratic drifts in signal
strength over each run (i.e., drift-corrected). 2) Image signal versus
time for each voxel and run was normalized such that the timeaverage signal had the same (arbitrary) value for all voxels and runs.
(Specifically, the signal vs. time data were ratio normalized based on
the intercept of a least-square quadratic fit to the data). This normalization was done to eliminate artificial discontinuities in the signal
level between runs in the subsequently concatenated data. All subse-
IMAGE PREPROCESSING.
J Neurophysiol • VOL
quent analyses were performed on the drift-corrected, normalized
images.
Maps of activation to noise burst
trains (exps. I, II, and IV) were derived as follows. First, each image
was assigned to either a “train on” or “off” period. Stimulus-evoked
changes in image signal typically have a delay of 4 – 6 s (Bandettini et
al. 1993; Buckner et al. 1996; Kwong et al. 1992). To account for this
(hemodynamic) delay, the first three images taken after the onset of a
noise burst train were assigned to the preceding “off” period, and the
first three images after the train offset were assigned to the preceding
“train on” period. For each rate, the images assigned to each “train on”
period and its following “off” period were concatenated into a single
file. Image signal strength during train on versus off periods was then
compared for each voxel using an unpaired t-test (Press et al. 1992).
The P value result of this statistical test was plotted for each voxel
with P ⱕ 0.01 to yield a spatial map of activation. P values were not
corrected to account for the correlated nature of fMRI time-series
(Purdon and Weisskoff 1998), nor were they adjusted for the repeated
application (voxel-by-voxel) of a statistical test (Friston et al. 1994).
GENERATING ACTIVATION MAPS.
Responses were analyzed quantitatively within four anatomically defined regions of interest (ROIs):
the IC, MGB, HG, and STG. These ROIs, which were defined
separately for each hemisphere, were first identified in the highresolution anatomical images (in-plane resolution: 1.6 ⫻ 1.6 mm) of
the functional imaging plane. These “high resolution” ROIs were
down-sampled to the same resolution as the functional images (3.1 ⫻
3.1 mm), to yield the ROIs used for all subsequent analyses. The ROI
borders were defined as follows:
IC. In exps. I, III, and IV, the IC were readily identified as distinct
anatomical circular areas (e.g., Fig. 3). For exp. II, only the caudal
edge of the IC were distinguishable (e.g., Fig. 6), so the area of each
IC ROI was defined as a circle sized to fit this visible edge. The IC
ROIs were defined liberally to include voxels at the edge of the IC.
MGB. The ambient cistern defined the caudal border of the MGB
ROI. The distance from this caudal border to the rostral edge of the
MGB ROI was determined from atlases of histologically prepared
brains, as was the distance between the midline and medial edge of the
MGB ROI. These distances were computed by first normalizing atlas
measurements to maximum brain width, and then multiplying the
normalized atlas measurements by the maximum width of the imaged
brain slice for each subject. The lateral edge of the ambient cistern
DEFINING REGIONS OF INTEREST.
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
FIG. 2. Functional imaging planes superimposed on sagittal, anatomical images. In exps. I, III, and IV, the plane (thick white
line) passed through the inferior colliculi (left) and Heschl’s gyri (middle). In exp. II, the plane passed through the inferior colliculi
(located just lateral to the brachium of the inferior colliculi) and the medial geniculate bodies of the thalamus (located just ventral
and lateral to the cerebral aqueduct; right).
RATE REPRESENTATION IN HUMAN AUDITORY PATHWAY
defined the lateral edge of the MGB ROI. The lateral extent of the ROI
was liberal in that it probably included a portion of the lateral
geniculate body in some subjects. Correspondingly, activation generally did not reach the lateral-most edge of the MGB ROI.
HG. When HG was visible as a “mushroom” protruding from the
surface of the superior temporal plane, the lateral edge of this mushroom defined the lateral edge of the HG ROI. The medial edge of the
ROI was the medial-most aspect of the Sylvian fissure. When a
distinct mushroom was not present, the HG ROI covered approximately the medial third of the superior temporal plane (extending
from the medial-most aspect of the Sylvian fissure). In the inferiorsuperior dimension, the HG ROI extended superiorly to the edge of
the overlying parietal lobe and inferiorly so as to entirely encompass
any activation centered on HG.
STG. The STG ROI was defined as the superior temporal cortex
lateral to the HG ROI. The definition of the inferior and superior
borders was the same as for the HG ROI.
The resulting average number of total voxels (3.1 ⫻ 3.1 ⫻ 7 mm)
per hemisphere in the ROIs was 5 (IC), 10 (MGB), 29 (HG), or 36
(STG).
CALCULATING RESPONSE TIME COURSES. The time course of response was computed for specific voxels within each ROI. For exps.
I and II, the voxels (3.1 ⫻ 3.1 ⫻ 7 mm) were chosen based on the
activation maps for a particular “reference rate” (i.e., the rate that
typically produced the strongest activation in the maps): 35/s for IC,
20/s for MGB, 10/s for HG, and 2/s for STG. For each IC and MGB
ROI, we used the single voxel with the lowest P value at the reference
rate. For each HG and STG ROI, we averaged the responses of the
four voxels (not necessarily contiguous) with the lowest P values at
J Neurophysiol • VOL
the reference rate.3 Note that for a given structure, session, and
hemisphere, the same voxels were used in computing the response
time course at each rate. For exps. III and IV, which focused on
auditory cortex, the same number of lowest P value voxels (four in
each hemisphere) were selected for analysis within the HG and STG
ROIs. However, the activation map for selecting voxels was based on
a single run of music (4 repetitions of the first 30 s of the fourth
movement in Beethoven’s Symphony No. 7). Music was used because
1) it typically evokes larger magnitude cortical responses than noise
burst trains, so robust cortical activation maps could be obtained with
a single run, thereby allowing more time for collecting responses to
the primary stimuli of interest in exps. III and IV, and 2) the dominant
amplitude modulation frequencies of the music stimulus (⬍5 Hz)
were comparable with the “reference rates” for auditory cortex.4
For exps. I and II, response time courses were computed as follows.
Because cardiac gating results in an irregular temporal sampling, the
time series for each imaging “run” and voxel was linearly interpolated
to a consistent 2-s interval between images, using recorded interimage
intervals to reconstruct when each image occurred. These data were
then temporally smoothed using a three-point, zero-phase filter (with
coefficients 0.25, 0.5, 0.25). A response “block” was defined to
include the 10 s prior to a noise burst train, the period coinciding with
the train, and the off period following the train. These response blocks
were averaged according to rate to give an average signal versus time
waveform for each rate, session, and hemisphere. The signal at each
time point was then converted to a percent change in signal relative to
a baseline. The baseline was defined as the average signal from t ⫽
⫺6 to 0 s, with time t ⫽ 0 s corresponding to the onset of the noise
burst train. Finally, for each rate, the percent change time courses
were averaged across sessions and hemispheres.5 For exp. IV, response time courses were calculated the same way, except response
blocks were averaged according to train duration.
For exp. III, time courses for each stimulus were computed as
described above, with the following exceptions: 1) no temporal
smoothing was applied (to avoid disproportionally altering the responses, which were expected to be brief in duration) and 2) the
baseline signal level for converting time courses to percent change
was based on the average of just two time points, t ⫽ ⫺2 to 0 s. [This
was done because the “off” period between stimuli (18 s) was less for
this experiment than for the others (30 or 40 s) and we wanted to avoid
including time points where the response may have not yet returned to
baseline from the preceding stimulus.]
For exps. I and II, response
magnitude in each auditory structure was quantified using two measures computed from the percent change time courses. “Time-average” percent change, a measure of the overall response strength during
a noise burst train, was computed as the mean percent change from
QUANTIFYING RESPONSE MAGNITUDE.
3
In a supplementary analysis, we determined that response waveshape was
not sensitive to our voxel selection criteria. For comparison, time courses were
computed using all voxels with P ⬍ 0.01 at the reference rate (instead of just
the voxels showing the strongest activation). As expected, the resulting responses were reduced in magnitude. However, their waveshape was unaffected.
4
It is unlikely that the results for exps. III and IV were affected by the use
of a music stimulus, rather than noise burst trains, to select particular voxels
within an ROI. In particular, in several sessions (data not shown), in which the
stimuli included both music and trains of 2/s and 35/s noise bursts, we obtained
similar responses for the noise burst trains in HG and STG, irrespective of
whether the reference voxels were chosen using activation maps based on
music or 2/s noise bursts. Typically, at least two of the four reference voxels
were in common between the two activation maps. Importantly, the cortical
response time courses for music and 2/s noise burst trains are similar in shape.
5
In a supplementary analysis, we found that the waveshape of the average
responses (Fig. 4) was unaffected when the responses for individual sessions
and hemispheres were first normalized (by dividing by their maximum value)
and then averaged. This result indicates that average response waveshape was
not unduly influenced by just a small subset of individual responses.
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
FIG. 3. Activation maps for the inferior colliculus (IC) (2 subjects, exp. I).
Stimuli were noise burst trains with repetition rates of 2, 10, or 35/s. Each
panel shows a T1-weighted anatomic image (grayscale) and superimposed
activation map (color). Rectangle superimposed on the diagrammatic image
(bottom, right) indicates the area shown in each panel. For the activation maps,
regions are colored according to the result of a t-test comparison of image
signal strength during “train on” and “off” periods. In this and all subsequent
figures, blue and yellow correspond to the lowest (P ⫽ 0.01) and highest (P ⫽
2 ⫻ 10⫺9) significance levels, respectively. (Areas with P ⬎ 0.01 are not
colored). The activation maps (based on functional images with an in-plane
resolution of 3.1 ⫻ 3.1 mm) and anatomic images (1.6 ⫻ 1.6 mm) have been
interpolated. Images are displayed in radiological convention, so the subject’s
right is displayed on the left. R, right; L, left.
1437
1438
M. HARMS AND J. MELCHER
t ⫽ 4 to 30 s. “Onset” percent change, a measure of the response
amplitude near the beginning of a noise burst train, was computed as
the maximum percent change from t ⫽ 4 to 10 s. Since “time-average”
and “onset” percent change were calculated from the percent change
time courses, they indicate image signal deviations relative to a 6-s
baseline immediately preceding the stimulus (i.e., the baseline period
used in calculating the time courses). For exp. III, peak percent change
was defined as the maximum value in the percent change time courses.
RESULTS
Response to noise burst trains: effect of burst repetition rate
FIG. 4. Response time courses averaged across sessions and hemispheres [solid lines; IC, n ⫽ 22 for 2, 10,
and 35/s, n ⫽ 12 for 20/s; medial geniculate body (MGB),
n ⫽ 10 for all rates; HG and STG, n ⫽ 10 for 2, 10, and
35/s, n ⫽ 6 for 1/s]. Dashed lines give the mean ⫾ SE at
each time point. Note that the vertical scale for the IC and
MGB responses differs slightly from the scale for Heschl’s gyrus (HG) and superior temporal gyrus (STG).
J Neurophysiol • VOL
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
INFERIOR COLLICULUS. Activation maps for the IC showed an
increase in activation with increasing burst repetition rate.
Figure 3 demonstrates this increase for two sessions from
exp. I. The maps show activation that is greatest at 35/s, less at
10/s, and absent at 2/s. (Activation strength is reflected in the
maps as a lower P value from the statistical comparison of
image signal level during train “on” and “off” periods). Greater
IC activation at higher rates is also demonstrated by the maps
in Fig. 6, which correspond to two sessions from exp. II.
Figure 4 (left) shows the time course of the responses in the
IC averaged across all sessions. At all rates, the response was
“sustained” in that image signal increased when the noise burst
train was turned on, remained elevated while the train was on,
and decreased once the train was turned off. The amplitude of
the sustained response during the “train on” period increased
with increasing rate.
The increase in response amplitude was quantified using two
measures: maximum percent signal change near the beginning
of the “train on” period (“onset” percent change), and percent
signal change time-averaged over the on period (“time-average” percent change; defined in METHODS). On average, both
measures increased with increasing rate (Fig. 5, top left). Onset
and time-average percent change showed a significant increase
from 2/s to 10/s (P ⫽ 0.01, onset; P ⫽ 0.05, average; paired
t-test), and from 10/s to 35/s (P ⫽ 0.02, onset; P ⫽ 0.006,
average). Plots of percent change versus rate for individual IC
also showed an overall trend of increasing percent change with
increasing rate (Fig. 5, top middle and right). For 19 of 22 IC,
the response at 35/s was greater than the response at 2/s (for
both measures).
For the rates that overlapped between exps. I and II (2, 10,
and 35/s) there was no significant difference between the
percent signal change values (P ⬎ 0.1, t-test), suggesting that
the two main differences between these experiments (intensity
detection task and imaging plane) did not have a strong effect
on inferior colliculus responses. There was no significant difference between the values obtained from the left and right IC
(P ⬎ 0.3, paired t-test, collapsing the data across all rates).
In summary, the IC showed a sustained response to noise
RATE REPRESENTATION IN HUMAN AUDITORY PATHWAY
1439
FIG. 5. Response magnitude vs. repetition rate in the IC, MGB, HG, and
STG. Left: time-average and onset percent change averaged across sessions
and hemispheres. Bars indicate the SE (see Fig. 4 legend for the number of
sessions and hemispheres represented by each data point). Middle and right:
Time-average and onset percent change for each session and hemisphere vs.
rate. To facilitate comparison of the trends across rate, each curve has been
displaced vertically by adding a constant (specific to each curve), such that the
resulting mean of the values for 2, 10, and 35/s is equal to the population mean
for these rates (left). In all plots, the repetition rate axis uses a categorical scale.
Note that there are no data at 20/s for all of the HG and STG curves and for
10 of the IC curves.
burst trains. The amplitude of this response increased with
increasing burst repetition rate.
MEDIAL GENICULATE BODY. In contrast with the IC, activation
maps for the MGB usually showed a nonmonotonic change in
activation with rate. The trend for the MGB is illustrated by the
maps for two sessions in Fig. 6. The maps show an increase in
MGB activation with increasing rate in the 2/s–20/s range, but
a decrease from 20/s to 35/s.
The trend in the activation maps parallels the rate dependence of time-average percent signal change in the MGB, but
not onset percent change. The close correspondence between
time-average percent change and activation maps is to be
expected since the maps are based on a comparison of timeaverage signal levels during “train on” and “off” periods.
Time-average percent change increased significantly from 2/s
to 20/s (P ⫽ 0.005, paired t-test), and decreased from 20/s to
35/s (P ⫽ 0.03; Fig. 5, left). Onset percent signal change
showed the same trend from 2/s to 20/s (i.e., a significant
increase; P ⬍ 0.001), but not at high rates in that there was no
difference between 20/s and 35/s (P ⫽ 0.9). The different rate
dependence for onset versus time-average percent change is
also apparent overall in the plots for individual MGBs (Fig. 5,
J Neurophysiol • VOL
FIG. 6. Activation maps for the MGB and IC (2 subjects, exp. II). Stimuli
were noise burst trains with repetition rates of 2, 10, 20, or 35/s. See Fig. 3
legend.
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
middle and right), despite the intersession variability in the
precise trends between the rates. Neither onset nor time-average percent change differed significantly between the left and
right MGBs (P ⬎ 0.15, paired t-test, collapsing the data across
all rates).
The different rate-dependencies for onset and time-average percent change indicate that the time course of the MGB
response varies with rate. This variation is illustrated in Fig.
4 (2nd column). On average, responses to a 35/s train peaked
just after train onset and then declined by approximately
50% during the remainder of the train. This moderate decrease in the response differs from the largely sustained
responses at the lower rates of 2/s, 10/s, and 20/s. A quantitative comparison of onset percent change to the percent
change at the end of the train (i.e., at 30 s) confirmed the
response difference at the highest rate. For the 35/s train,
response amplitude was significantly less at the end of the
train (P ⫽ 0.04, paired t-test), consistent with a response
decrease. In contrast, there was no significant difference at
the lower rates (P ⬎ 0.1), consistent with a sustained
response. Thus MGB responses varied over the course of
high, but not lower rate trains.
The rate dependencies seen in time-average percent
change and the activation maps can be explained in terms of
the time course and onset amplitude of MGB responses.
1440
M. HARMS AND J. MELCHER
Between 2/s and 20/s, the increase in time-average percent
change (and activation in the maps) is largely attributable to
the increase in sustained response amplitude, which is simultaneously reflected as an increase in onset percent
change. Given that 20/s and 35/s evoke equal onset responses, the decrease in time-average percent change (and
in the maps) between these two rates can be primarily
attributed to a change in the response to the latter portion of
the train (i.e., the change from a sustained response to one
with a moderate decrease following onset).
In summary, between 2/s and 20/s, onset percent change
increased with increasing rate in MGB, while response time
courses remained primarily sustained. Between 20/s and 35/s,
there was no change in onset amplitude, but the response
waveshape changed from sustained to moderately decreasing
following the train onset.
HESCHL’S GYRUS AND SUPERIOR TEMPORAL GYRUS. A nonmonotonic relationship between rate and activation was apparent in the activation maps for HG and STG (Fig. 7). The maps
showed an activation increase from 1/s to 2/s, and a decrease
from 10/s to 35/s.
As expected, the trends in the activation maps paralleled
the rate dependence of time-average percent signal change.
In HG, time-average percent change increased from 2/s to
10/s (P ⫽ 0.05, paired t-test), but decreased markedly from
10/s to 35/s (P ⬍ 0.001; Fig. 5, left). These trends were
observed consistently in individual HG (Fig. 5, middle). In
STG, the rate of greatest time-average percent change (2/s)
was less than in HG (10/s; Fig. 5, left). For the six STG with
J Neurophysiol • VOL
6
The time courses in Fig. 4 are based on responses averaged over the
entirety of each experimental session, so they do not address whether response
waveshape might have changed over the repeated train presentations within a
session. To test for such changes in cortex, we computed response time courses
for each session based on the three initial and three final presentations of the
2/s and 35/s trains. For each rate, the waveshapes of these “initial” and “final”
time courses, averaged across sessions, were not substantially different in
either HG or STG, indicating that systematic, within-session waveshape
changes did not occur (see Harms 2002, p. 32 for details).
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
FIG. 7. Activation maps for HG and STG (2 subjects, exp. I). Stimuli were
noise burst trains with repetition rates of 1, 2, 10, or 35/s. See Fig. 3 legend.
1/s data, time-average percent change at 2/s was greater than
at 1/s (P ⫽ 0.003, paired t-test). Time-average percent
change in STG tended to decrease from 2/s to 10/s and from
10/s to 35/s, so that the overall decrease from 2/s to 35/s was
significant (P ⫽ 0.002; Fig. 5).
Onset percent change again showed differences compared
with time-average percent change. In HG, the difference
was primarily one of degree. Onset percent change at 10/s
was significantly greater than both 2/s and 35/s (P ⫽ 0.01,
paired t-test; Fig. 5, left and right), but the decrease from
10/s to 35/s averaged only 20% for onset percent change
compared with 50% for time-average percent change. In
STG, onset and time-average percent change had overall
different trends. Whereas time-average percent change decreased from 2/s to 35/s (P ⫽ 0.002), onset percent change
was unchanged over this range (P ⫽ 0.4; Fig. 5, left and
right).
A dramatic rate-dependent change in response waveshape
accounts for the differences between onset and time-average
percent change in HG and STG. At low rates, responses
were sustained, whereas at high rates they were not (Fig. 4,
3rd and 4th columns).6 At the highest rate of 35/s, image
signal increased to a peak occurring 6 s following train onset
(“on-peak”), declined substantially over the next 8 s, increased slightly over the remainder of the on period, and
peaked again 6 s following train offset (“off-peak”). The
most prominent features of this “phasic” response are the
peaks just after train onset and offset. In HG, the reduction
in time-average percent change between 10/s and 35/s was
partly because of 1) the decrease in onset percent change,
and 2) the more dramatic signal decline during the on period
for the 35/s train. In STG, onset percent change did not vary
significantly with rate, so the decline in time-average percent change at high rates was primarily due to the change in
response waveshape.
While the rate-dependencies of the HG and STG responses
paralleled each other, there were also clear differences between
the two structures. In both HG and STG, the signal decline
during the on period became increasingly pronounced with
increasing repetition rate, so responses had an increasingly
phasic appearance. However, at any given rate, the magnitude
of the signal decline was greater in STG. In STG, the magnitude of the decline (measured as a percentage decrease from
the onset peak to the value at 14 s in the average response time
courses; Fig. 4) was 22, 25, 58, and 93% for 1/s, 2/s, 10/s, and
35/s respectively. In HG, the corresponding values were all
less: 15, 13, 32, and 78%.
In light of the phasic response for high-rate trains, it is not
surprising that the activation maps frequently showed little
evidence of activity at the 35/s rate (e.g., Fig. 7, top left). The
activation maps were based on the difference in time-average
signal between “train on” and “off” periods. It is clear that for
RATE REPRESENTATION IN HUMAN AUDITORY PATHWAY
1441
However, the amplitude of the on-peak did not vary significantly with rate.
Response to small numbers of noise bursts
the response to the 35/s train (Fig. 4), the difference between
the time-average of these two periods will be close to zero,
even though cortex is responding robustly (albeit transiently).
Thus for cortex, the activation maps, calculated using a standard method, provide only a partial picture of cortical rate
dependencies.
In HG and STG there were right–left differences in response
magnitude. In HG, both time-average and onset percent
changes were greater on the right for 16 of 18 possible comparisons (P ⬍ 0.001, paired t-test, collapsing the data across all
rates). In STG, the same trend was apparent, but was weaker
(right greater in 12 of 18 cases; P ⬍ 0.02). These right–left
differences may reflect a functional difference between right
and left auditory cortex. Alternatively, they may reflect a
functional difference in auditory cortex in the anterior–posterior dimension. For example, since HG tends to be shifted more
anteriorly on the right compared with the left hemisphere
(Leonard et al. 1998; Penhune et al. 1996; Rademacher et al.
2001), the imaged slice may have sampled different regions of
primary cortex in the two hemispheres (Morosan et al. 2001).
In summary, responses in HG were sustained at low rates,
but became phasic at high rates. The most prominent features
of this phasic response are signal peaks just after train onset
and offset. The amplitude of the on-peak (onset percent
change) was greatest at 10/s. Responses in STG also showed a
progression from sustained to phasic with increasing rate.
J Neurophysiol • VOL
7
In making these comparisons, we took into account the two main differences between exps. I and III. First, because the responses in exp. III were
computed without temporal smoothing, we recomputed the onset percent
change values from exp. I without temporal smoothing. The resulting mean
onset percent change values for a 35/s train increased by 20% for both HG and
STG. Second, we took into account the difference in imaging parameters (exp.
I: 1.5 T scanner, ASE sequence; exp. III: 3 T scanner, GE sequence). The effect
of this difference was estimated empirically using responses to music obtained
under the same conditions as exp. I (11 sessions) and exp. III (11 sessions).
Time-average percent change was, on average, about 25% greater for the exp.
III conditions, so the onset amplitude from exp. I (calculated without temporal
smoothing) was revised up by this percentage before comparison with the
single and multiple noise burst responses.
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
FIG. 8. Top: average response time courses in HG and STG to either a
single noise burst, or a cluster of 2 or 5 noise bursts. Each trace is an average
across both hemispheres of 3 subjects. Bottom: normalized peak response for
each subject and hemisphere. Dashed line indicates the prediction from a
model in which each successive noise burst evokes a response identical to the
1 NB response and the responses to each burst add.
To investigate how the initial bursts of a train contribute to
cortical responses at train onset, we examined the responses to
a single noise burst, and clusters of two or five noise bursts
with a burst-to-burst ISI of 28.6 ms (35/s rate) or 500 ms (2/s
rate). Both single and clustered noise bursts elicited measurable responses in HG and STG. The responses, averaged across
subjects and hemispheres, peaked 4 – 6 s after the stimulus, and
then returned to baseline by 8 –10 s (Fig. 8, top). After 8 –10 s,
the average response dipped below baseline. However, this
response feature, unlike the others, was dominated by the data
for only one of the three subjects (subject 2).
Figure 8 (bottom) shows normalized peak response versus
number of noise bursts for each subject and hemisphere. These
normalized responses were quantified as the peak percent signal change in the response time course (which always occurred
at t ⫽ 4 or 6 s), divided by the peak percent change for a single
noise burst. The normalized peak response generally increased
with increasing number of noise bursts (Fig. 8, bottom). However, the response increase was always less than would be
predicted by a model in which each successive noise burst
evokes a response equivalent to the 1 NB response and the
responses to each burst add (i.e., linear growth). Similarly, for
every subject and hemisphere, the peak response to 5
NBs@35/s was ⬍2.5 times the response to 2 NBs@35/s. These
results are consistent with a model in which the responses to
noise bursts at the beginning of a train are greater than those
occurring later. The fact that the peak response for 2 NBs@2/s
was always greater than for 2 NBs@35/s indicates that any
decline in response from the first burst to the second was
greater at high, compared with low, rates.
We compared the mean peak percent change for single and
multiple noise bursts with the mean onset percent change for
35/s trains from exp. I to gain an appreciation for the proportion of the on-peak accounted for by the earliest noise bursts in
the train.7 In STG, we estimated that the mean peak percent
change for 1 NB and 5 NBs@35/s was approximately 40% and
65%, respectively, of the mean onset percent change. In HG,
the corresponding estimates were approximately 25% and
40%. These values indicate that the earliest noise bursts of a
high-rate train account for a substantial portion of the on-peak,
especially in STG.
1442
M. HARMS AND J. MELCHER
FIG. 9. Response time courses in HG (top)
and IC (bottom) to 35/s noise burst trains,
with durations of 15, 30, 45, and 60 s. Each
trace is an average across hemispheres for a
given subject. The off-peak in each HG response is indicated by an arrow.
By considering noise burst trains with different durations,
we tested whether the off-peak in cortical phasic responses is
specifically linked to train termination. Two subjects were
studied using 35/s noise burst trains with durations of 15, 30,
45, and 60 s. For both subjects and all durations, HG responses
showed a distinct off-peak after train offset (Fig. 9, top).
Regardless of train duration, the off-peak occurred approximately 6 s after train offset, indicating a strong coupling
between off-peak and train termination. A similarly strong
coupling between off-peak and train termination was also
found in STG for both subjects (not shown). One subject
(subject 11) was unusual in that the response in STG did not
show a clear off-peak for voxels selected by our standard
criteria. Nevertheless, there was a clear off-peak for other,
nearby voxels, and this off-peak always occurred approximately 6 s after train offset, regardless of train duration. In
contrast to cortex, IC responses were largely sustained for all
durations and showed no sign of an off-peak (Fig. 9, bottom).
Data for two additional subjects further support the strong
coupling between cortical off-peak and train termination.
These subjects, tested with a single train duration of 60 s,
showed off-peaks in both HG and STG that occurred approximately 6 s after train offset. All of the train duration data taken
together indicate that the cortical off-peak is specifically
evoked by the termination of high-rate noise burst trains.
DISCUSSION
fMRI responses to trains of noise bursts changed substantially with burst repetition rate in every studied structure,
although the nature of the changes was highly structure-dependent. In the IC, response amplitude following train onset increased with increasing rate while response waveshape remained unchanged (i.e., sustained). In the MGB, increasing
rate produced an increase in onset amplitude up to a point
where a further increase in rate instead produced a change in
waveshape (from a largely sustained response to one showing
a distinct peak just after train onset). In HG, the site of primary
auditory cortex, onset amplitude changed somewhat with rate,
but the most striking change occurred in response waveshape.
J Neurophysiol • VOL
At low rates the waveshape was sustained, while at high rates
it was strongly phasic in that there were prominent response
peaks just after train onset and offset. In STG, which includes
secondary auditory areas, onset amplitude showed no systematic dependence on rate, whereas response waveshape showed
a strong and dramatic rate dependence paralleling that in HG.
Overall, from midbrain to thalamus to cortex, there was a
systematic shift in the form of response rate-dependencies from
one of amplitude to one of waveshape.
Sustained response waveshapes (as seen in subcortical structures and for low rates in auditory cortex) are commonly
reported in the fMRI literature. In contrast, phasic responses
(as seen for higher rates in auditory cortex) are not, nor are
their individual signature features. One signature feature—the
prominent peak following stimulus onset— has been reported
for a few prolonged acoustic, odorant, and visual stimuli (Bandettini et al. 1997; Giraud et al. 2000; Jäncke et al. 1999; Sobel
et al. 2000) but is nevertheless a fairly uncommon feature for
responses in the fMRI literature. A second signature feature of
phasic responses—the peak following stimulus offset—is
highly unusual. To our knowledge, the only other reported
fMRI “off-response” occurred in a subregion of primary visual
cortex following the transition from steady white light to
darkness (Bandettini et al. 1997). The paucity of previous
reports of phasic fMRI responses may be partly an issue of
detection since phasic responses are poorly detected by some
of the most commonly-used analysis approaches (e.g., a t-test
comparison of stimulus “on” and “off” periods, or equivalently, correlation or analyses using the SPM software package
that assume a sustained response; Bandettini et al. 1993; Sobel
et al., 2000). It is also possible that phasic responses have not
been seen because they reflect neurophysiological mechanisms
that are only invoked in particular, largely unexplored stimulus
regimes.
It is widely assumed that different sound features (e.g.,
frequency, bandwidth, repetition rate) are represented in the
amplitude of fMRI activation or amplitude variations with
position (e.g., Giraud et al. 2000; Talavage et al. 2000; Wessinger et al. 2001; Yang et al. 2000). In contrast, the possibility
of representations in the temporal dimension is not generally
entertained, and this makes the wide variations in cortical
response waveshape of this study especially intriguing. A few
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
Response to high-rate (35/s) noise burst trains:
effect of train duration
RATE REPRESENTATION IN HUMAN AUDITORY PATHWAY
other studies have also reported covariations between sound
characteristics and temporal fMRI activation patterns in the
auditory system. For instance, Gaschler-Markefski et al. (1997)
examined the degree of temporal stationarity of auditory cortical fMRI responses and reported regional variations depending on stimulus and task. In studying fMRI responses to
amplitude modulated noise, Giraud et al. (2000) found an
increasingly prominent peak at stimulus onset with increasing
modulation rate, a result that strongly parallels the findings of
the present study (see Comparison to previous fMRI and PET
studies). The present study and these previous reports suggest
that fMRI temporal patterns— or more specifically the temporal variations in neural activity underlying these patterns—may
be an important way in which sound is represented in the
auditory system.
Role of rate per se in determining fMRI responses
fMRI responses and underlying neural activity
To understand the significance of the different fMRI response waveshapes, it is necessary to first consider the extent
to which waveshape is governed by neural, metabolic, and
hemodynamic factors. While the relationship between neural
activity and fMRI responses is not fully understood, it is
generally accepted that neural activity and image signal are
ultimately linked through a chain of metabolic and hemodynamic events. For the form of fMRI in this study (BOLD
fMRI), this linkage is as follows. When there is an increase in
neural activity in the form of synaptic events or neural discharges,8 there is a corresponding increase in local brain me8
We consider both synaptic events and neural discharges as “neural activity,” because there is evidence in favor of each as a contributor. 2DG studies
suggest that synaptic events may dominate the metabolic response (and hence
the fMRI response; Auker et al. 1983; Jueptner and Weiller 1995; Nudo and
Masterton 1986). A recent study by Logothetis et al. (2001), which simultaneously recorded intracortical activity and fMRI responses, also suggests that
synaptic activity may dominate. However, there are also reports of a strong
coupling between discharges and fMRI responses (Heeger et al. 2000; Rees et
al. 2000), possibly because discharges and synaptic activity were themselves
J Neurophysiol • VOL
tabolism and oxygen consumption (Sokoloff 1989). The increase in oxygen consumption is accompanied by an increase
in blood flow and blood volume in the active brain region.
However, the increase in flow dominates, such that the local
concentration of deoxygenated hemoglobin actually decreases,
which is important because deoxygenated hemoglobin is paramagnetic (Pauling and Coryell 1936) and thus influences local
image signal levels. The net effect of a decrease in deoxygenated hemoglobin is an increase in image signal. When the
entire chain of events is considered together, increases and
decreases in neural activity result in concordant changes in
image signal strength (Kwong et al. 1992; Ogawa et al. 1993;
Springer et al. 1999). Since hemodynamic changes occur over
the course of seconds, fMRI effectively provides a temporally
low-pass filtered view of neural activity. More specifically,
since fMRI is sampling activity over small volumes of brain
(i.e., voxels), the responses can be thought of as showing the
time-envelope of population neural activity on a voxel-byvoxel basis.
Previous work has shown that the relative timing and magnitude of stimulus-evoked changes in blood flow, blood volume, and oxygen consumption can influence the waveshape of
the fMRI response (Buxton et al. 1998; Mandeville et al.
1998). While this raises the possibility that changes in waveshape from sustained to phasic reflect changes in hemodynamics rather than underlying neural activity, we believe this to be
unlikely for both of the main components of the phasic response, namely the off-peak and the on-peak. It is particularly
unlikely that a hemodynamic explanation accounts for the
off-peak. Previous hemodynamic modeling and experimentation has not predicted an off-peak following stimulus termination, and we know of no plausible model that could generate
such a component. Therefore the emergence of an off-peak
with increasing repetition rate in auditory cortex is almost
certainly attributable to a rate-dependent increase in neural
activity at stimulus offset.
The other major feature that distinguishes phasic from sustained responses, namely the sharp decline in signal that forms
the prominent onset peak, requires more detailed consideration
because it is known that declines in signal can theoretically
occur over the course of a prolonged stimulus for completely
hemodynamic reasons (Buxton et al. 1998). However, measurements of BOLD signal, blood flow, and blood volume
responses have failed to illustrate a case for which purely
hemodynamic features could generate a signal decline as dramatic as those seen here (e.g., Hoge et al. 1999a; Mandeville et
al. 1999). Additionally, separate evidence works against the
idea that the signal decline is driven primarily by hemodynamic rather than neural influences. The reasoning follows
from the fact that the same voxels were capable of showing
either a phasic response (and the associated dramatic signal
decline) or a sustained response depending on the stimulus.
Since the time course of the phasic and sustained responses is
strongly correlated in those studies. It seems likely that the relative contribution of synaptic events versus discharges to the fMRI response will depend on
the specifics of the local neural circuitry (Bandettini and Ungerleider 2001),
and therefore may in fact vary across different regions of the brain (Auker et
al. 1983; Mathiesen et al. 1998), or even for different types of stimuli. For the
purposes of this discussion, we leave open the possibility that either or both
synaptic activity and discharges “contribute” to the fMRI responses we measured.
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
In this study, noise burst duration was held constant while
rate was varied, so overall stimulus energy and sound-time
fraction (STF) covaried with rate (resulting in an approximately 12-dB differential in sound pressure level for 2/s vs.
35/s noise burst trains). While this raises the possibility that the
wide range of response waveshapes in auditory cortex was due
primarily to changes in parameters other than rate, we do not
believe this to be the case for two reasons. First, in a separate
study, we have found that varying the intensity of 2/s or 35/s
noise bursts over a 20- to 30-dB range has no effect on
response waveshape (Harms et al. 2001). Second, we have
found that changing rate from 2/s to 35/s while holding STF
constant (and therefore varying burst duration) still produces a
change in waveshape from sustained to phasic (although STF
does have some influence on response waveshape; Harms
2002; Harms and Melcher 1999). In the case of response
amplitude, the precise rate dependencies might be somewhat
different if stimulus energy and STF were held constant instead
of burst duration, because varying energy alone can produce
changes in response amplitude (Hall et al. 2001; Sigalovsky et
al. 2001), as may also be the case for changes in STF.
1443
1444
M. HARMS AND J. MELCHER
very similar over the first 6 – 8 s, the “operational history” of
the hemodynamic system is presumably similar as well. In
light of this common initial response, a hemodynamic system
that could subsequently generate grossly different response
waveshapes seems unlikely, unless the differences reflect differences in underlying neural activity.
While response waveshape varied with rate within a structure, it also varied across structures for a given rate. It is known
that there can be spatial heterogeneity in tissue hemodynamics
(Chen et al. 1998; Davis et al. 1998), so the possibility that
regional variations in hemodynamics play some role in the
waveshape differences across structures cannot be discounted.
Still, the heterogeneities in hemodynamics that have been
documented are not sufficient to account for the dramatic
waveshape changes that occur across the pathway as a whole
(from the inferior colliculus to cortex).
Given that fMRI responses reflect the time-envelope of
population neural activity, the prominent declines in fMRI
signal that occur at high rates in MGB, HG, and STG provide
clear evidence for an overall decline in neural activity during
the first seconds of a train (⬍10 s). This decline likely includes
decreases in synaptic, as well as discharge activity, since both
forms of activity can be reflected in fMRI signals.
While an overall decrease in neural activity early in highrate trains is clear, the subsecond temporal details of activity
during this decrease remain unresolved because fMRI provides
a low-pass filtered view of neural activity. Previous electrophysiological data suggest various possible forms for the temporal details underlying the overall decline in neural activity.
For instance, it may be that the fMRI signal decline reflects a
burst-to-burst adaptation in neural activity in which each successive burst early in a train elicits progressively less activity
(Fruhstorfer et al. 1970; Ritter et al. 1968; Roth and Kopell
1969). Alternatively, a variant of this may occur in which
activity does not always decrease in a strictly progressive
fashion across consecutive bursts but sometimes shows an
increase from one burst to the next (e.g., facilitation or enhancement; Brosch et al. 1999; Budd and Michie 1994; Loveless et al. 1989). (As long as decreases from burst to burst
occur more often than increases, the time-envelope of neural
activity would still decrease.) Another possibility is that population neural activity is not synchronized to individual bursts
(Lu and Wang 2000; Lu et al. 2001) but instead occurs in
response to the train as a whole with an initial peak in activity
followed by a lower level of activity. All of these possibilities
would result in a decline in the time-envelope of population
neural activity and are therefore consistent with the prominent
declines seen in fMRI signal.
While we cannot conclusively determine the temporal details of activity during the declines in fMRI signal, it is worth
recognizing that aspects of our data are consistent with the idea
that there is a burst-to-burst adaptation in neural activity. For
instance, the fMRI responses to small numbers of noise bursts
are suggestive of an adaptation process because the fMRI
response amplitude did not increase in proportion to the number of bursts, but rather showed a slower than linear growth.
Whether neural activity and fMRI signal are coupled in an
approximately linear manner (and under what circumstances)
J Neurophysiol • VOL
9
One neural interpretation of the trends in onset amplitude follows from a
simple model in which there is a neural response to each successive burst in a
train and two competing effects of increasing rate: 1) an increased number of
bursts per unit time (working to increase total neural activity during the initial
seconds of the train, and hence onset amplitude) and 2) a decreased response
to individual bursts because of increased adaptation (working to decrease total
neural activity). If this latter effect due to adaptation were relatively unimportant at low rates, but dominant at high rates, the result would be an increase in
onset percent change with increasing rate at low rates and a plateau or decrease
at high rates—the trends we observed in MGB and HG.
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
fMRI response onset and neural adaptation
is an open question under active investigation (Boynton et al.
1996; Dale and Buckner 1997; Gratton et al. 2001; Hoge et al.
1999b; Logothetis et al. 2001; Vazquez and Noll 1998). However, if they are, the slower than linear growth in the fMRI
responses to small numbers of noise bursts suggests that the
neural activity produced by each successive burst in a train is
progressively less (i.e., there is burst-to-burst adaptation in
neural activity). If the decline in neural activity from burst to
burst were to continue until there is little or no burst-evoked
activity, the time-envelope of neural activity would decline
substantially following the onset of the train, and so correspondingly would the fMRI signal (as observed for high-rate
trains). Another aspect of the data consistent with neural adaptation is the growth in onset amplitude with rate. If neural
activity and fMRI response amplitude vary in direct proportion, onset amplitude may be viewed roughly as an indicator of
the time-average neural activity during the first seconds of a
train. If there were no adaptation and each successive burst in
a train produced an identical increase in neural activity, the
time-average neural activity during the first seconds of a train
would increase in direct proportion to rate, and onset amplitude
would be expected to do the same. Instead, a proportional
increase in onset amplitude did not occur in any structure. This
is most obvious at high rates in MGB, HG, and STG, where
onset amplitude either declined or did not change with increasing rate.9 However, it can also be seen at lower rates and in the
IC. For instance, an increase in rate from 2/s to 10/s increased
onset amplitude by less than twofold in every structure, well
short of the fivefold increase expected if growth were proportional to rate. This result is consistent with neural adaptation
occurring in all of the studied structures, even the IC where
fMRI response waveshapes are sustained and do not immediately suggest an underlying adaptation.
Looking across structures, the data indicate that any neural
adaptation increased with increasing position in the pathway.
For instance, at any given rate, the percentage decline in signal
following the on-peak increased progressively from IC to
MGB to auditory cortex (HG and STG), suggesting an increasing degree of adaptation in the underlying population neural
activity. An increase in adaptation across structures is also
suggested by the fact that the growth in onset amplitude with
rate falls increasingly short of predictions assuming no adaptation. For instance, the increase in onset amplitude from 2/s to
10/s falls increasingly short of the fivefold increase predicted in
the absence of adaptation as one moves from IC (1.69) to MGB
(1.42) to auditory cortex (HG, 1.26; STG, 0.92). Similarly, the
increase in onset amplitude from 10/s to 35/s falls increasingly
short of the 3.5-fold prediction [IC, 1.42; MGB, 1.36; auditory
cortex, 0.80 (HG), 0.99 (STG)]. Thus if there is burst-to-burst
adaptation in population neural activity early in a train, it
increases from IC to MGB to auditory cortex.
RATE REPRESENTATION IN HUMAN AUDITORY PATHWAY
J Neurophysiol • VOL
and HG, changes in onset amplitude as a function of repetition
rate were consistent with what might be predicted based on
microelectrode recordings in animals of neural spiking in response to amplitude-modulated trains.
RELATIONSHIP TO ELECTRIC RECORDINGS IN HUMANS. A trend
of generally greater adaptation at cortical versus brainstem
levels of the auditory pathway fits with data concerning two of
the most studied components of human auditory evoked potentials: wave V of the brain stem– evoked potential and the
long latency potential N1. Wave V is likely generated by
neurons projecting to the IC (Melcher and Kiang 1996; Møller
1998), while the primary generators of N1 have been localized
to auditory cortex (e.g., Näätänen and Picton 1987; Reite et al.
1994). Wave V and N1 show different sensitivities to stimulus
repetition rate. For example, in responses averaged over many
click presentations, a high click rate (⬎50/s) is required to
generate a 30 –50% amplitude decrement in wave V of the
brain stem– evoked potential (Jiang et al. 1991; Suzuki et al.
1986; Thornton and Coleman 1975). In contrast, similar decrements in N1 occur at much lower rates (0.5–2/s), as seen in
averaged responses and by comparing the individual responses
to successive stimuli in a train (Davis et al. 1966, 1972;
Fruhstorfer 1971; Fruhstorfer et al. 1970; Picton et al. 1977;
Ritter et al. 1968; Roth and Kopell 1969). The substantially
different rate sensitivities for wave V and N1 indicate greater
stimulus-to-stimulus adaptation in the cortical neurons generating N1 than in the brain stem neurons generating wave V.
Phasic response “off-peak” and neural off responses
The off-peak in the phasic response may be related to
electrophysiological “off-responses” observed intracortically
following the termination of stimulus trains. For instance,
following 175-ms click trains, Steinschneider et al. (1998)
found transient increases in cortical multiunit activity in awake
monkeys. Using depth electrodes implanted in the vicinity of
human auditory cortex, Chatrian et al. (1960) showed gross
potential responses following 3-s click trains. In both of these
electrophysiological studies, the rate dependence of the offresponse resembled our fMRI cortical data in that an offresponse was present for high, but not low, repetition rates.
Given this similarity, it seems quite possible that the fMRI and
electrophysiological off-responses to stimulus trains arise from
similar underlying neural mechanisms.
The fMRI off-peak to high-rate stimulus trains may also be
physiologically related to off-responses that occur in evoked
potential and magnetic field recordings following the cessation
of prolonged individual stimuli. Support for this idea comes
from the fact that a cortical fMRI off-peak can be elicited by a
prolonged noise burst (30-s duration; Sigalovsky et al. 2001).
In humans, extracranial evoked potential (Hillyard and Picton
1978; Pfefferbaum et al. 1971; Picton et al. 1978; Spychala et
al. 1969) and magnetic field recordings (Hari et al. 1987;
Lammertmann and Lutkenhoner 2001; Pantev et al. 1996) have
also shown responses to the offset of prolonged noise or tone
bursts (with durations ranging from 0.5 to 10 s). The generation
site of electrophysiological off-responses has been localized to
auditory cortex (Hari et al. 1987; Noda et al. 1998; Pantev et al.
1996). Therefore both the electrophysiological and fMRI offresponses imply increased activity in auditory cortical neurons
following sound offset, a form of response that has ample
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
RELATIONSHIP TO ELECTROPHYSIOLOGICAL DATA IN ANIMALS. A
trend of increasing adaptation with increasing position in the
auditory pathway has emerged from several animal neurophysiological studies explicitly designed to compare responses
across structures. For instance, microelectrode recordings of
the response to paired stimuli with different interstimulus intervals have shown an increase in recovery time with increasing level in the auditory pathway. In unanesthetized animals
(cats and rabbits), the average interval required for 50% recovery of the response to the second of two clicks is 2 ms in the
auditory nerve, cochlear nucleus, and superior olivary complex, but 7 ms in the inferior colliculus and 20 ms in auditory
cortex (Fitzpatrick and Kuwada 1999). In unanesthetized
guinea pig, Creutzfeldt et al. (1980) recorded responses to
amplitude modulated tones simultaneously from thalamic and
cortical neurons (specifically 9 thalamo-cortical unit pairs for
which the correlation of spontaneous activity suggested a direct
synaptic connection). Activity in the cortical neurons declined
more rapidly over successive cycles of the AM tone than did
the activity in the thalamic neurons, indicating greater adaptation in the cortical neurons. Finally, recording near-field potentials from the IC and auditory cortex in response to brief
noise bursts in unanesthetized chinchilla, Burkard et al. (1999)
found that the mean response amplitude (averaged across noise
burst presentations) decreased more in cortex than IC as repetition rate was increased. Their results are again consistent
with greater adaptation in cortex than lower structures in the
auditory pathway.
The extensive animal literature regarding modulation transfer functions (MTFs) also suggests a change in temporal response properties from the IC to auditory cortex. Here we
focus on studies that quantify their results in terms of rate
MTFs (rMTF; average firing rate vs. modulation frequency),
rather than temporal MTFs, since changes in the “synchronization” or phase locking of neural activity (in the absence of
average rate changes) are unlikely to be reflected in fMRI
activity. Furthermore, since most animal studies use shortduration stimulus trains (approximately 1 s), the most appropriate measure of the present study for comparison to the
animal results is onset amplitude. In the IC, the best modulation frequency (BMF; the frequency at which the rMTF has its
largest value) for individual neurons is generally greater than
approximately 30 Hz (Krishna and Semple 2000; Langner and
Schreiner 1988; Muller-Preuss et al. 1994). In contrast, BMFs
in auditory cortex tend to be less than approximately 20 Hz
(Bieser and Muller-Preuss 1996; Eggermont 1991; Schreiner
and Raggio 1996; Schreiner and Urbas 1988). These values are
consistent with this study in that onset amplitude steadily
increased in the IC for noise burst rates ⱕ35/s (the highest rate
employed), but peaked in HG at a lower rate (10/s). While the
variation in onset amplitude with rate in HG was rather small
(and in STG, onset amplitude did not vary at all), a similarly
weak “tuning” also holds in population neural activity, in that
the rMTF averaged across cortical neurons is primarily lowpass, or only weakly band-pass (Eggermont 1994, 1998; Eggermont and Smith 1995; Schreiner and Urbas 1986). The
relatively flat nature of the average rMTF in cortex probably
reflects a weak tuning in the rMTFs of many individual neurons (Eggermont 1998; Schreiner and Raggio 1996) but could
also be due in part to the summation of activity across sharply
tuned units having a wide range of BMFs. Overall, in both IC
1445
1446
M. HARMS AND J. MELCHER
Phasic response recovery
In addition to the prominent on- and off-peaks of fMRI
phasic responses, there is a third component, namely the steady
signal increase that begins approximately 10 s after train onset
and continues to the end of the train. A similar “signal recovery” has been observed in auditory cortex in response to tone
bursts (Robson et al. 1998) and in the supplementary motor
area in response to a finger tapping task (Nakai et al. 2000).
This signal recovery, which in this study can be seen in the
responses to intermediate (10/s) and high-rate (35/s) trains of
various durations (e.g., 30 – 60 s; Figs. 4 and 9), has no obvious
analog in electric- or magnetic-evoked responses to prolonged
auditory stimuli, since these responses do not generally increase over the course of the stimulus but remain fairly constant, or even decrease (Lammertmann and Lutkenhoner 2001;
Picton et al. 1978). However, most of the stimulus durations
used in this literature are far less than those used in this study,
so it is possible that an electric or magnetic analog would be
identified if longer stimulus durations were tried. It is also
possible that the phasic response signal recovery is related to
subjects’ anticipation of the termination of high-rate trains (and
thus may be loosely analogous to electrophysiological correlates of expectation such as the contingent negative variation;
Donchin et al. 1978). To explore this idea, we performed two
pilot experiments in which, for some runs, the subject’s attention was diverted to the visual domain with a highly demanding, ongoing visual task, while presenting a 35/s noise burst
train in our standard “30 s on, 30 s off” protocol. Comparison
of task versus no task showed the expected decrease in auditory
response amplitude in the visual task condition (Woodruff et
al. 1996), but no obvious change in the form of the signal
recovery. If the signal recovery does indeed reflect anticipation
of train termination, it would appear to be a “low-level” expectation that is not readily eliminated by overt diversion of
attention.
J Neurophysiol • VOL
Comparison to previous fMRI and PET studies—auditory
and nonauditory
Several imaging studies have examined rate effects in auditory cortex using stimuli limited to low rates. fMRI studies
presenting blocks of syllables or words at rates ⱕ2.5/s show
primarily sustained responses in auditory cortex and an increase in response amplitude with increasing rate (Binder et al.
1994; Dhankhar et al. 1997; Rees et al. 1997). Both the time
courses and amplitude variations are consistent with the results
of this study at low rates. PET studies have also investigated
the effect of changes in the repetition rate of words or long
duration (500 ms) tone bursts presented at rates ⱕ1.5/s (Frith
and Friston 1996; Price et al. 1992; Rees et al. 1997). Because
PET requires the collection of photon counts over an extended
duration (e.g., tens of seconds), response time courses cannot
be obtained, and results are reported only in terms of the
overall percent signal change between stimulus and control
periods. Computed in this “time-averaged” manner, these PET
rate studies report a monotonic relationship between rate and
percent change. The findings of this study are in agreement
with this trend at low repetition rates.
The fMRI studies of Tanaka et al. (2000) and Giraud et al.
(2000) examined a wider range of rates and presented data in
agreement with some aspects of the present study. Using trains
of tone bursts presented at rates ranging from 0.5 to 20/s,
Tanaka et al. found that “time-averaged” percent change in the
transverse temporal gyrus was maximal at a rate of 5/s, consistent with our observation of a maximum in time-average
percent change between 2–10/s in auditory cortex. To reduce
the influence of acoustic imaging noise on auditory activation,
Tanaka et al. took a very different approach from the present
study and used a paradigm that leaves a long interval (12 s)
between image acquisitions. The fact that Tanaka et al. still
found a “time-average” rate dependence in agreement with the
present study indicates that this dependence is not sensitive to
scanner-generated acoustic noise.10
Giraud et al. (2000) examined the time course, as well as the
amplitude of activation, and for high rates identified a cortical
response component that is likely analogous to the “on-peak”
in the phasic responses of the present study. The stimulation
paradigm presented AM noise and unmodulated noise in alternating 30 s blocks. Although the use of unmodulated noise as
the comparison condition differs from the present study, a
relationship between AM rate and changes in cortical response
waveshape was found that is consistent with the present results.
For auditory cortex and high AM rates (greater than approximately 32 Hz), image signal changes during AM presentations
were modeled best by a transient response at AM noise onset.
In contrast, for low rates, the signal changes were better modeled by a sustained response. Thus the Giraud et al. study, like
the present one, indicates the emergence of a prominent onpeak with increasing rate in auditory cortex. For rates greater
than those used in the present study, Giraud et al. also reported
rate-dependent changes in response waveshape for subcortical
10
The insensitivity of our reported rate dependencies to scanner noise is
further supported by separate replication of the time course findings for 2/s and
35/s noise burst trains using clustered image acquisition with a long interimage interval (8 –10 s; Edmister et al. 1999), combined with a sampling
approach that allows time courses to be reconstructed with high temporal
resolution (e.g., 2 s; Harms 2002; I. Sigalovsky, personal communication).
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
precedent at the single neuron level in auditory cortex (Abeles
and Goldstein 1972; Goldstein et al. 1968; He et al. 1997;
Howard et al. 1996; Recanzone 2000).
The fMRI and electrophysiological off-responses resemble
one another functionally in that both show a decrease in magnitude with decreasing sound duration. For instance, evoked
potential off-responses following tone bursts have been shown
to decrease in magnitude as burst duration is reduced from 9 to
1 s (Hillyard and Picton 1978; Picton et al. 1978), or from 2.5
to 0.5 s (Pfefferbaum et al. 1971). The fMRI off-response to
high-rate trains is lower in magnitude for a 15-s train compared
with longer duration trains (ⱖ30 s; Fig. 9). The duration
dependence of fMRI and electrophysiological off-responses
may indicate that a diminishing percentage of cortical neurons
respond to sound offset as sound duration decreases, or that the
neurons responding to the offset of sound do so less robustly as
duration decreases. The fact that the fMRI and electrophysiological data were obtained for very different ranges of duration
(15– 60 s vs. 0.5–9 s) suggests that changes in sound duration
may be reflected in cortical population neural activity in the
same basic way over a broad range of sound durations.
RATE REPRESENTATION IN HUMAN AUDITORY PATHWAY
Relationship between fMRI response waveshape
and sound perception
The fact that the various noise burst trains used in this study
differ considerably from a perceptual standpoint raises the
possibility that some of the observed trends in the fMRI responses are correlates of perception, rather than simply rate.
The most striking of the observed trends, the change in cortical
response waveshape, showed a qualitative correlation with the
perceptual attributes of the stimuli. For the low rate that elicited sustained responses (i.e., 2/s), the individual bursts in the
train were distinguishable. For the high rate that elicited the
most phasic responses (i.e., 35/s), the noise bursts fused so that
the overall train sounded continuous (although modulated).
Thus distinctly different cortical response waveshapes occurred for two distinctly different perceptual regimens.
J Neurophysiol • VOL
The possible correlation between fMRI response waveshape
and perception fits with the idea that population neural activity
in auditory cortex encodes the beginning and end of perceptually distinct acoustic “events.” At a 2/s rate, individual noise
bursts of a train are the distinguishable events, so there would
be successive neural responses to each burst, resulting in a
sustained fMRI response. In contrast, 35/s noise burst trains are
perceptually continuous and would be treated as a single event,
thus producing neural responses primarily at the onset and
offset of the train, and therefore the phasic fMRI response.
Coordinated fMRI and psychophysical experiments will be
necessary to more fully explore the potential coupling between
perception and neural activity as seen through fMRI response
waveshape.
The trend in fMRI response waveshape across structures
suggests that the coding of perceptually distinct acoustic events
in population neural activity occurs to different degrees depending on structure. While auditory cortex showed a clear,
qualitative correlation between perception and response waveshape, the MGB showed less correlation in that there was less
difference in waveshape for 2/s versus 35/s bursts. In the IC,
there was no apparent correlation since waveshape was the
same regardless of whether the individual bursts of a train were
distinct (as at 2/s) or fused (35/s). Thus any correlation between perception and response waveshape diminished from
high to lower stages of the pathway.
Overall, our results suggest a population neural representation of the beginning and the end of distinct perceptual events
that, while weak or absent in the midbrain, begins to emerge in
the thalamus and is robust in auditory cortex.
The authors thank J. Guinan, M. Tramo, I. Sigalovsky, C. Lane, P. Cariani,
and M. Hawley for numerous helpful comments and suggestions, and B. Norris
for considerable assistance in figure preparation.
This study was supported by National Institute on Deafness and Other
Communication Disorders Grants PO1DC-00119, RO3DC-03122, and
T32DC-00038 and a Martinos Scholarship.
REFERENCES
ABELES M AND GOLDSTEIN MH JR. Responses of single units in the primary
auditory cortex of the cat to tones and to tone pairs. Brain Res 42: 337–352,
1972.
AUKER CR, MESZLER RM, AND CARPENTER DO. Apparent discrepancy between
single-unit activity and [14C]deoxyglucose labeling in optic tectum of the
rattlesnake. J Neurophysiol 49: 1504 –1516, 1983.
BANDETTINI PA, JESMANOWICZ A, WONG EC, AND HYDE JS. Processing strategies for time-course data sets in functional MRI of the human brain. Magn
Reson Med 30: 161–173, 1993.
BANDETTINI PA, KWONG KK, DAVIS TL, TOOTELL RBH, WONG EC, FOX PT,
BELLIVEAU JW, WEISSKOFF RM, AND ROSEN BR. Characterization of cerebral blood oxygenation and flow changes during prolonged brain activation.
Hum Brain Map 5: 93–109, 1997.
BANDETTINI PA AND UNGERLEIDER LG. From neuron to BOLD: new connections. Nature Neurosci 4: 864 – 866, 2001.
BIESER A AND MULLER-PREUSS P. Auditory responsive cortex in the squirrel
monkey: neural responses to amplitude-modulated sounds. Exp Brain Res
108: 273–284, 1996.
BINDER JR, RAO SM, HAMMEKE TA, FROST JA, BANDETTINI PA, AND HYDE JS.
Effects of stimulus rate on signal responses during functional magnetic
resonance imaging of auditory cortex. Brain Res Cogn Brain Res 2: 31–38,
1994.
BLINKENBERG M, BONDE C, HOLM S, SVARER C, ANDERSEN J, PAULSON OB,
AND LAW I. Rate dependence of regional cerebral activation during performance of a repetitive motor task: a PET study. J Cereb Blood Flow Metab
16: 794 – 803, 1996.
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
areas, including midbrain and lower brain stem. The activation
in these areas was described as fairly weak, so the response rate
dependencies would be strengthened by replication. Nevertheless, their data suggest that midbrain and brain stem responses
may become more transient with increasing rate, but at much
higher rates than for cortex.
Robson et al. (1998) reported fMRI responses to an approximately 30-s train of 5/s tone bursts (100-ms burst duration)
that are remarkably similar to what we measured to 10/s noise
bursts in cortex in that they include 1) an initial on-peak
characterized by a 30 –50% signal decline, 2) a subsequently
slowly increasing signal, and 3) a possible small signal increase or prolongation following train termination. Robson et
al. (1998) found that the cortical fMRI response to the tone
burst train could be fairly well described by a model in which
there is a response to each burst in a train, but the magnitude
of the response decreases progressively with each successive
burst until a steady-state response level is reached.
PET and fMRI studies of rate effects in the visual, somatosensory, and motor systems are generally consistent with the
rate dependence of “time-averaged” percent change in auditory
cortex. Over the low rates (⬍5 Hz) used in studies of the motor
system, activation in primary motor cortex increases with
increasing rate of finger tapping or finger flexion-extension
movements (Blinkenberg et al. 1996; Jäncke et al. 1998; Rao et
al. 1996; Schlaug et al. 1996). For primary visual (Fox and
Raichle 1984, 1985; Kwong et al. 1992; Mentis et al. 1997;
Thomas and Menon 1998; Zhu et al. 1998) and somatosensory
(Ibanez et al. 1995; Takanashi et al. 2001) cortex, stimuli have
been presented over a wider rate range, comparable with the
range used in the present study. A consistent result in these
studies (see Kampe et al. 2000 for an exception) is that “timeaveraged” activation increases at low rates, peaks around 8 Hz,
and then declines, a trend similar to the one seen here for
primary auditory cortex. These visual and somatosensory studies do not provide information concerning the time course of
activation, so it is not clear whether the decline in time-average
activation at high rates reflects a change in response waveshape, as is the case for primary auditory cortex. Nevertheless,
the general trend for a decline in time-average activation beyond about 8 Hz raises the possibility that changes in response
waveshape from sustained to phasic with increasing rate are
not unique to auditory cortex, but instead are a general property
of sensory cortical areas.
1447
1448
M. HARMS AND J. MELCHER
J Neurophysiol • VOL
FRISTON KJ, ASHBURNER J, FRITH CD, POLINE J-B, HEATHER JD, AND FRACKOWIAK RSJ. Spatial registration and normalization of images. Hum Brain
Map 2: 165–189, 1995.
FRISTON KJ, WILLIAMS S, HOWARD R, FRACKOWIAK RSJ, AND TURNER R.
Movement-related effects in fMRI time-series. Magn Reson Med 35: 346 –
355, 1996.
FRISTON KJ, WORSLEY KJ, FRACKOWIAK RSJ, MAZZIOTTA JC, AND EVANS AC.
Assessing the significance of focal activations using their spatial extent.
Hum Brain Map 1: 210 –220, 1994.
FRITH CD AND FRISTON KJ. The role of the thalamus in “top down” modulation
of attention to sound. Neuroimage 4: 210 –215, 1996.
FRUHSTORFER H. Habituation and dishabituation of the human vertex response.
Electroencephalogr Clin Neurophysiol 30: 306 –312, 1971.
FRUHSTORFER H, SOVERI P, AND JARVILEHTO T. Short-term habituation of the
auditory evoked response in man. Electroencephalogr Clin Neurophysiol
28: 153–161, 1970.
GASCHLER-MARKEFSKI B, BAUMGART F, TEMPELMANN C, SCHINDLER F,
STILLER D, HEINZE HJ, AND SCHEICH H. Statistical methods in functional
magnetic resonance imaging with respect to nonstationary time-series: auditory cortex activity. Magn Reson Med 38: 811– 820, 1997.
GIRAUD AL, LORENZI C, ASHBURNER J, WABLE J, JOHNSRUDE I, FRACKOWIAK R,
AND KLEINSCHMIDT A. Representation of the temporal envelope of sounds in
the human brain. J Neurophysiol 84: 1588 –1598, 2000.
GOLDSTEIN MH JR, HALL JL II, AND BUTTERFIELD BO. Single-unit activity in
the primary auditory cortex of unanesthetized cats. J Acoust Soc Am 43:
444 – 455, 1968.
GRATTON G, GOODMAN-WOOD MR, AND FABIANI M. Comparison of neuronal
and hemodynamic measures of the brain response to visual stimulation: an
optical imaging study. Hum Brain Map 13: 13–25, 2001.
GRIFFITHS TD, UPPENKAMP S, JOHNSRUDE I, JOSEPHS O, AND PATTERSON RD.
Encoding of the temporal regularity of sound in the human brainstem.
Nature Neurosci 4: 633– 637, 2001.
GUIMARAES AR, MELCHER JR, TALAVAGE TM, BAKER JR, LEDDEN P, ROSEN
BR, KIANG NY-S, FULLERTON BC, AND WEISSKOFF RM. Imaging subcortical
auditory activity in humans. Hum Brain Map 6: 33– 41, 1998.
HALL DA, HAGGARD MP, AKEROYD MA, PALMER AR, SUMMERFIELD AQ,
ELLIOTT MR, GURNEY EM, AND BOWTELL RW. “Sparse” temporal sampling
in auditory fMRI. Hum Brain Map 7: 213–223, 1999.
HALL DA, HAGGARD MP, SUMMERFIELD AQ, AKEROYD MA, PALMER AR, AND
BOWTELL RW. Functional magnetic resonance imaging measurements of
sound-level encoding in the absence of background scanner noise. J Acoust
Soc Am 109: 1559 –1570, 2001.
HARI R, PELIZZONE M, MAKELA JP, HALLSTROM J, LEINONEN L, AND LOUNASMAA OV. Neuromagnetic responses of the human auditory cortex to on- and
offsets of noise bursts. Audiology 26: 31– 43, 1987.
HARMS MP. Sound temporal envelope and time-patterns of activity in the
human auditory pathway: an fMRI study. (PhD thesis). Cambridge, MA:
Massachusetts Institute of Technology, 2002.
HARMS MP AND MELCHER JR. Understanding novel fMRI time courses to
rapidly presented noise bursts. Neuroimage 9: S847, 1999.
HARMS MP, SIGALOVSKY IS, GUINAN JJ JR, AND MELCHER JR. Temporal
dynamics of fMRI responses in human auditory cortex: dependence on
stimulus type. Assoc Res Otolaryngol Abst 24: 185, 2001.
HE J, HASHIKAWA T, OJIMA H, AND KINOUCHI Y. Temporal integration and
duration tuning in the dorsal zone of cat auditory cortex. J Neurosci 17:
2615–2625, 1997.
HEEGER DJ, HUK AC, GEISLER WS, AND ALBRECHT DG. Spikes versus BOLD:
what does neuroimaging tell us about neuronal activity? Nature Neurosci 3:
631– 633, 2000.
HILLYARD SA AND PICTON TW. On and off components in the auditory evoked
potential. Percept Psychophys 24: 391–398, 1978.
HOGE RD, ATKINSON J, GILL B, CRELIER GR, MARRETT S, AND PIKE GB.
Stimulus-dependent BOLD and perfusion dynamics in human V1. Neuroimage 9: 573–585, 1999a.
HOGE RD, ATKINSON J, GILL B, CRELIER GR, MARRETT S, AND PIKE GB. Linear
coupling between cerebral blood flow and oxygen consumption in activated
human cortex. Proc Natl Acad Sci USA 96: 9403–9408, 1999b.
HOWARD MA III, VOLKOV IO, ABBAS PJ, DAMASIO H, OLLENDIECK MC, AND
GRANNER MA. A chronic microelectrode investigation of the tonotopic
organization of human auditory cortex. Brain Res 724: 260 –264, 1996.
HUDETZ AG, BISWAL BB, SHEN H, LAUER KK, AND KAMPINE JP. Spontaneous
fluctuations in cerebral oxygen supply. An introduction. Adv Exp Med Biol
454: 551–559, 1998.
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
BOYNTON GM, ENGEL SA, GLOVER GH, AND HEEGER DJ. Linear systems
analysis of functional magnetic resonance imaging in human V1. J Neurosci
16: 4207– 4221, 1996.
BREGMAN A. Auditory Scene Analysis. The Perceptual Organization of Sound.
Cambridge: MIT Press, 1990.
BROSCH M, SCHULZ A, AND SCHEICH H. Processing of sound sequences in
macaque auditory cortex: response enhancement. J Neurophysiol 82: 1542–
1559, 1999.
BUCKNER RL, BANDETTINI PA, O’CRAVEN KM, SAVOY RL, PETERSON SE,
RAICHLE ME, AND ROSEN BR. Detection of cortical activation during averaged single trials of a cognitive task using functional magnetic resonance
imaging. Proc Natl Acad Sci USA 93: 14878 –14883, 1996.
BUDD TW AND MICHIE PT. Facilitation of the N1 peak of the auditory ERP at
short stimulus intervals. Neuroreport 5: 2513–2516, 1994.
BURKARD RF, SECOR CA, AND SALVI RJ. Near-field responses from the round
window, inferior colliculus, and auditory cortex of the unanesthetized chinchilla: Manipulations of noise burst level and rate. J Acoust Soc Am 106:
304 –312, 1999.
BUXTON RB, WONG EC, AND FRANK LR. Dynamics of blood flow and oxygenation changes during brain activation: The balloon model. Magn Reson
Med 39: 855– 864, 1998.
CHATRIAN GE, PETERSEN MC, AND LAZARTE JA. Responses to clicks from the
human brain: some depth electrographic observations. Electroencephalogr
Clin Neurophysiol 12: 479 – 489, 1960.
CHEN W, ZHU XH, TOSHINORI K, ANDERSEN P, AND UGURBIL K. Spatial and
temporal differentiation of fMRI BOLD response in primary visual cortex of
human brain during sustained visual simulation. Magn Reson Med 39:
520 –527, 1998.
CREUTZFELDT O, HELLWEG FC, AND SCHREINER C. Thalamocortical transformation of responses to complex auditory stimuli. Exp Brain Res 39: 87–104,
1980.
DALE AM AND BUCKNER RL. Selective averaging of rapidly presented individual trials using fMRI. Hum Brain Map 5: 329 –340, 1997.
DAVIS H, MAST T, YOSHIE N, AND ZERLIN S. The slow response of the human
cortex to auditory stimuli: recovery process. Electroencephalogr Clin Neurophysiol 21: 105–113, 1966.
DAVIS H, OSTERHAMMEL A, WIER CC, AND GJERDINGEN DB. Slow vertex
potentials: interactions among auditory, tactile, electric and visual stimuli.
Electroencephalogr Clin Neurophysiol 33: 537–545, 1972.
DAVIS TL, KWONG KK, WEISSKOFF RM, AND ROSEN BR. Calibrated functional
MRI: mapping the dynamics of oxidative metabolism. Proc Natl Acad Sci
USA 95: 1834 –1839, 1998.
DHANKHAR A, WEXLER BE, FULBRIGHT RK, HALWES T, BLAMIRE AM, AND
SHULMAN RG. Functional magnetic resonance imaging assessment of the
human brain auditory cortex response to increasing word presentation rates.
J Neurophysiol 77: 476 – 483, 1997.
DONCHIN E, RITTER W, AND MCCALLUM WC. Cognitive psychophysiology: the
endogenous components of the ERP. In: Event-Related Brain Potentials in
Man, edited by Callaway E, Tueting P, and Koslow SH. New York:
Academic Press, 1978, p. 349 – 411.
EDMISTER WB, TALAVAGE TM, LEDDEN PJ, AND WEISSKOFF RM. Improved
auditory cortex imaging using clustered volume acquisitions. Hum Brain
Map 7: 89 –97, 1999.
EGGERMONT JJ. Rate and synchronization measures of periodicity coding in cat
primary auditory cortex. Hear Res 56: 153–167, 1991.
EGGERMONT JJ. Temporal modulation transfer functions for AM and FM
stimuli in cat auditory cortex. Effects of carrier type, modulating waveform
and intensity. Hear Res 74: 51– 66, 1994.
EGGERMONT JJ. Representation of spectral and temporal sound features in three
cortical fields of the cat. Similarities outweigh differences. J Neurophysiol
80: 2743–2764, 1998.
EGGERMONT JJ AND SMITH GM. Synchrony between single-unit activity and
local field potentials in relation to periodicity coding in primary auditory
cortex. J Neurophysiol 73: 227–245, 1995.
ERNÉ SN AND HOKE M. Short-latency evoked magnetic fields from the human
auditory brainstem. Adv Neurol 54: 167–176, 1990.
FITZPATRICK DC AND KUWADA S. Responses of neurons to click-pairs as
simulated echoes: auditory nerve to auditory cortex. J Acoust Soc Am 106:
3460 –3472, 1999.
FOX PT AND RAICHLE ME. Stimulus rate dependence of regional cerebral blood
flow in human striate cortex, demonstrated by positron emission tomography. J Neurophysiol 51: 1109 –1120, 1984.
FOX PT AND RAICHLE ME. Stimulus rate determines regional brain blood flow
in striate cortex. Ann Neurol 17: 303–305, 1985.
RATE REPRESENTATION IN HUMAN AUDITORY PATHWAY
J Neurophysiol • VOL
vates brain from striate through frontal cortex. Neuroimage 5: 116 –128,
1997.
MILLER GA AND TAYLOR WG. The perception of repeated bursts of noise. J
Acous Soc Am 20: 171–182, 1948.
MØLLER AR. Neural generators of the brainstem auditory evoked potentials.
Sem Hearing 19: 11–27, 1998.
MOROSAN P, RADEMACHER J, SCHLEICHER A, AMUNTS K, SCHORMANN T, AND
ZILLES K. Human primary auditory cortex: cytoarchitectonic subdivisions
and mapping into a spatial reference system. Neuroimage 13: 684 –701,
2001.
MULLER-PREUSS P, FLACHSKAMM C, AND BIESER A. Neural encoding of amplitude modulation within the auditory midbrain of squirrel monkeys. Hear
Res 80: 197–208, 1994.
NÄÄTÄNEN R AND PICTON T. The N1 wave of the human electric and magnetic
response to sound: a review and an analysis of the component structure.
Psychophysiology 24: 375– 425, 1987.
NAKAI T, MATSUO K, KATO C, TAKEHARA Y, ISODA H, MORIYA T, OKADA T
AND SAKAHARA H. Post-stimulus response in hemodynamics observed by
functional magnetic resonance imaging— difference between the primary
and sensorimotor area and the supplementary motor area. Magn Reson
Imaging 18: 1215–1219, 2000.
NODA K, TONOIKE M, DOI K, KOIZUKA I, YAMAGUCHI M, SEO R, MATSUMOTO
N, NOIRI T, TAKEDA N, AND KUBO T. Auditory evoked off-response: its
source distribution is different from that of on-response. Neuroreport 9:
2621–2625, 1998.
NUDO RJ AND MASTERTON RB. Stimulation-induced [14C]2-deoxyglucose
labeling of synaptic activity in the central auditory system. J Comp Neurol
245: 553–565, 1986.
OBRIG H, NEUFANG M, WENZEL R, KOHL M, STEINBRINK J, EINHAUPL K, AND
VILLRINGER A. Spontaneous low frequency oscillations of cerebral hemodynamics and metabolism in human adults. Neuroimage 12: 623– 639, 2000.
OGAWA S, MENON RS, TANK DW, KIM SG, MERKLE H, ELLERMAN JM, AND
UGURBIL K. Functional brain mapping by blood oxygenation level-dependent contrast magnetic resonance imaging: a comparison of signal characteristics with a biophysical model. Biophys J 64: 803– 812, 1993.
PANTEV C, EULITZ C, HAMPSON S, ROSS B, AND ROBERTS LE. The auditory
evoked “off” response: sources and comparison with the “on” and the
“sustained” responses. Ear Hear 17: 255–265, 1996.
PAULING L AND CORYELL CD. The magnetic properties and structure of
hemoglobin, oxyhemoglobin and carbonmonoxyhemoglobin. Proc Natl
Acad Sci USA 22: 210 –216, 1936.
PENHUNE VB, ZATORRE RJ, MACDONALD JD, AND EVANS AC. Interhemispheric
anatomical differences in human primary auditory cortex: probabilistic
mapping and volume measurement from magnetic resonance scans. Cereb
Cortex 6: 661– 672, 1996.
PFEFFERBAUM A, BUCHSBAUM M, AND GIPS J. Enhancement of the average
evoked response to tone onset and cessation. Psychophysiology 8: 332–339,
1971.
PICTON TW, HILLYARD SA, KRAUSZ HI, AND GALAMBOS R. Human auditory
evoked potentials. I: Evaluation of components. Electroencephalogr Clin
Neurophysiol 36: 179 –190, 1974.
PICTON TW, WOODS DL, BARIBEAU-BRAUN J, AND HEALEY TMG. Evoked
potential audiometry. J Otolaryngol 6: 90 –119, 1977.
PICTON TW, WOODS DL, AND PROULX GB. Human auditory sustained potentials. II. Stimulus relationships. Electroencephalogr Clin Neurophysiol 45:
198 –210, 1978.
PRESS WH, TEUKOLSKY SA, VETTERLING WT, AND FLANNERY BP. Numerical
Recipes in C: The Art of Scientific Computing: Cambridge: Cambridge
University Press, 1992.
PRICE C, WISE R, RAMSAY S, FRISTON K, HOWARD D, PATTERSON K, AND
FRACKOWIAK R. Regional response differences within the human auditory
cortex when listening to words. Neurosci Lett 146: 179 –182, 1992.
PURDON PL AND WEISSKOFF RM. Effect of temporal autocorrelation due to
physiological noise and stimulus paradigm on voxel-level false-positive
rates in fMRI. Hum Brain Map 6: 239 –249, 1998.
RADEMACHER J, CAVINESS JVS, STEINMETZ H, AND GALABURDA AM. Topographical variation of the human primary cortices: implications for neuroimaging, brain mapping, and neurobiology. Cereb Cortex 3: 313–329, 1993.
RADEMACHER J, MOROSAN P, SCHORMANN T, SCHLEICHER A, WERNER C,
FREUND HJ, AND ZILLES K. Probabilistic mapping and volume measurement
of human primary auditory cortex. Neuroimage 13: 669 – 683, 2001.
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
IBANEZ V, DEIBER MP, SADATO N, TORO C, GRISSOM J, WOODS RP, MAZZIOTTA
JC, AND HALLETT M. Effects of stimulus rate on regional cerebral blood flow
after median nerve stimulation. Brain 118: 1339 –1351, 1995.
JÄNCKE L, BUCHANAN T, LUTZ K, SPECHT K, MIRZAZADE S, AND SHAH NJS.
The time course of the BOLD response in the human auditory cortex to
acoustic stimuli of different duration. Brain Res Cogn Brain Res 8: 117–124,
1999.
JÄNCKE L, SPECHT K, MIRZAZADE S, LOOSE R, HIMMELBACH M, LUTZ K, AND
SHAH NJ. A parametric analysis of the ’rate effect’ in the sensorimotor
cortex: a functional magnetic resonance imaging analysis in human subjects.
Neurosci Lett 252: 37– 40, 1998.
JIANG ZD, WU YY, AND ZHANG L. Amplitude change with click rate in human
brainstem auditory-evoked responses. Audiology 30: 173–182, 1991.
JUEPTNER M AND WEILLER C. Review. Does measurement of regional cerebral
blood flow reflect synaptic activity?—implications for PET and fMRI.
Neuroimage 2: 148 –156, 1995.
KAMPE KK, JONES RA, AND AUER DP. Frequency dependence of the functional
MRI response after electrical median nerve stimulation. Hum Brain Map 9:
106 –114, 2000.
KRISHNA BS AND SEMPLE MN. Auditory temporal processing: responses to
sinusoidally amplitude-modulated tones in the inferior colliculus. J Neurophysiol 84: 255–273, 2000.
KWONG KK, BELLIVEAU JW, CHESLER DA, GOLDBERG IE, WEISSKOFF RM,
PONCELET BP, KENNEDY DN, HOPPEL BE, COHEN MS, TURNER R, CHENG
H-M, BRADY TJ, AND ROSEN BR. Dynamic magnetic resonance imaging of
human brain activity during primary sensory stimulation. Proc Natl Acad Sci
USA 89: 5675–5679, 1992.
LAMMERTMANN C AND LUTKENHONER B. Near-DC magnetic fields following a
periodic presentation of long-duration tonebursts. Clin Neurophysiol 112:
499 –513, 2001.
LANGNER G. Periodicity coding in the auditory system. Hear Res 60: 115–142,
1992.
LANGNER G AND SCHREINER CE. Periodicity coding in the inferior colliculus of
the cat. I. Neuronal mechanisms. J Neurophysiol 60: 1799 –1822, 1988.
LEONARD CM, PURANIK C, KULDAU JM, AND LOMBARDINO LJ. Normal variation in the frequency and location of human auditory cortex landmarks.
Heschl’s gyrus: where is it? Cereb Cortex 8: 397– 406, 1998.
LOCKWOOD AH, SALVI RJ, COAD ML, ARNOLD SA, WACK DS, MURPHY BW,
AND BURKARD RF. The functional anatomy of the normal human auditory
system: responses to 0.5 and 4.0 kHz tones at varied intensities. Cereb
Cortex 9: 65–76, 1999.
LOGOTHETIS NK, PAULS J, AUGATH M, TRINATH T, AND OELTERMANN A.
Neurophysiological investigation of the basis of the fMRI signal. Nature
412: 150 –157, 2001.
LOVELESS N, HARI R, HAMALAINEN M, AND TIIHONEN J. Evoked responses of
human auditory cortex may be enhanced by preceding stimuli. Electroencephalogr Clin Neurophysiol 74: 217–227, 1989.
LU T, LIANG L, AND WANG X. Temporal and rate representations of timevarying signals in the auditory cortex of awake primates. Nature Neurosci 4:
1131–1138, 2001.
LU T AND WANG X. Temporal discharge patterns evoked by rapid sequences of
wide- and narrowband clicks in the primary auditory cortex of cat. J Neurophysiol 84: 236 –246, 2000.
MANDEVILLE JB, MAROTA JJA, AYATA C, MOSKOWITZ MA, WEISSKOFF RM,
AND ROSEN BR. MRI measurement of the temporal evolution of relative
CMRO2 during rat forepaw stimulation. Magn Reson Med 42: 944 –951,
1999.
MANDEVILLE JB, MAROTA JJA, KOSOFSKY BE, KELTNER JR, WEISSLEDER R,
ROSEN BR, AND WEISSKOFF RM. Dynamic functional imaging of relative
cerebral blood volume during rat forepaw stimulation. Magn Reson Med 39:
615– 624, 1998.
MATHIESEN C, CAESAR K, AKGOREN N, AND LAURITZEN M. Modification of
activity-dependent increases of cerebral blood flow by excitatory synaptic
activity and spikes in rat cerebellar cortex. J Physiol (Lond) 512: 555–566,
1998.
MELCHER JR AND KIANG NY. Generators of the brainstem auditory evoked
potential in cat. III: identified cell populations. Hear Res 93: 52–71, 1996.
MELCHER JR, TALAVAGE TM, AND HARMS MP. Functional MRI of the auditory
system. In: Functional MRI, edited by Moonen CTW and Bandettini PA.
Berlin: Springer, 1999, p. 393– 406.
MENTIS MJ, ALEXANDER GE, GRADY CL, HORWITZ B, KRASUSKI J, PIETRINI P,
STRASSBURGER T, HAMPEL H, SCHAPIRO MB, AND RAPOPORT SI. Frequency
variation of a pattern-flash visual stimulus during PET differentially acti-
1449
1450
M. HARMS AND J. MELCHER
J Neurophysiol • VOL
SOKOLOFF L. Circulation and energy metabolism of the brain. In: Basic
Neurochemistry, edited by Siegel G, Agranoff B, Albers RW, and Molinoff
P. New York: Raven, 1989, p. 565–590.
SPRINGER CS, JR, PATLAK CS, PALYKA I, AND HUANG W. Principles of susceptibility contrast-based functional MRI: the sign of the functional MRI
response. In: Functional MRI, edited by Moonen CTW and Bandettini PA.
Berlin: Springer, 1999, p. 91–102.
SPYCHALA P, ROSE DE, AND GRIER JB. Comparison of the “on” and “off”
characteristics of the acoustically evoked response. Int Audiol 8: 416 – 423,
1969.
STEINSCHNEIDER M, RESER DH, FISHMAN YI, SCHROEDER CE, AND AREZZO JC.
Click train encoding in primary auditory cortex of the awake monkey:
evidence for two mechanisms subserving pitch perception. J Acoust Soc Am
104: 2935–2955, 1998.
SUZUKI T, KOBAYASHI K, AND TAKAGI N. Effects of stimulus repetition rate on
slow and fast components of auditory brain-stem responses. Electroencephalogr Clin Neurophysiol 65: 150 –156, 1986.
SYMMES D, CHAPMAN LF, AND HALSTEAD WC. The fusion of intermittent white
noise. J Acoust Soc Am 27: 470 – 473, 1955.
TAKANASHI M, ABE K, YANAGIHARA T, OSHIRO Y, WATANABE Y, TANAKA H,
HIRABUKI N, NAKAMURA H, AND FUJITA N. Effects of stimulus presentation
rate on the activity of primary somatosensory cortex: a functional magnetic
resonance imaging study in humans. Brain Res Bull 54: 125–129, 2001.
TALAVAGE TM, LEDDEN PJ, BENSON RR, ROSEN BR, AND MELCHER JR.
Frequency-dependent responses exhibited by multiple regions in human
auditory cortex. Hear Res 150: 225–244, 2000.
TANAKA H, FUJITA N, WATANABE Y, HIRABUKI N, TAKANASHI M, OSHIRO Y,
AND NAKAMURA H. Effects of stimulus rate on the auditory cortex using
fMRI with ’sparse’ temporal sampling. Neuroreport 11: 2045–2049, 2000.
THOMAS CG AND MENON RS. Amplitude response and stimulus presentation
frequency response of human primary visual cortex using BOLD EPI at 4 T.
Magn Reson Med 40: 203–209, 1998.
THORNTON ARD AND COLEMAN MJ. The adaptation of cochlear and brainstem
auditory evoked potentials in humans. Electroencephalogr Clin Neurophysiol 39: 399 – 406, 1975.
VAZQUEZ AL AND NOLL DC. Nonlinear aspects of the BOLD response in
functional MRI. Neuroimage 7: 108 –118, 1998.
WESSINGER CM, VANMETER J, TIAN B, VAN LARE J, PEKAR J, AND RAUSCHECKER JP. Hierarchical organization of the human auditory cortex revealed by functional magnetic resonance imaging. J Cogn Neurosci 13: 1–7,
2001.
WOODRUFF PWR, BENSON RR, BANDETTINI PA, KWONG KK, HOWARD RJ,
TALAVAGE T, BELLIVEAU J, AND ROSEN BR. Modulation of auditory and
visual cortex by selective attention is modality-dependent. Neuroreport 7:
1909 –1913, 1996.
YANG Y, ENGELIEN A, ENGELIEN W, XU S, STERN E, AND SILBERSWEIG DA. A
silent event-related functional MRI technique for brain activation studies
without interference of scanner acoustic noise. Magn Reson Med 43: 185–
190, 2000.
ZHU XH, KIM SG, ANDERSEN P, OGAWA S, UGURBIL K, AND CHEN W.
Simultaneous oxygenation and perfusion imaging study of functional activity in primary visual cortex at different visual stimulation frequency: quantitative correlation between BOLD and CBF changes. Magn Reson Med 40:
703–711, 1998.
88 • SEPTEMBER 2002 •
www.jn.org
Downloaded from http://jn.physiology.org/ by 10.220.33.4 on June 16, 2017
RAO SM, BANDETTINI PA, BINDER JR, BOBHOLZ JA, HAMMEKE TA, STEIN EA,
AND HYDE JS. Relationship between finger movement rate and functional
magnetic resonance signal change in human primary motor cortex. J Cereb
Blood Flow Metab 16: 1250 –1254, 1996.
RAVICZ ME AND MELCHER JR. Isolating the auditory system from acoustic
noise during functional magnetic resonance imaging: examination of noise
conduction through the ear canal, head, and body. J Acoust Soc Am 109:
216 –231, 2001.
RAVICZ ME, MELCHER JR, AND KIANG NY-S. Acoustic noise during functional
magnetic resonance imaging. J Acoust Soc Am 108: 1683–1696, 2000.
RECANZONE GH. Response profiles of auditory cortical neurons to tones and
noise in behaving macaque monkeys. Hear Res 150: 104 –118, 2000.
REES G, FRISTON K, AND KOCH C. A direct quantitative relationship between
the functional properties of human and macaque V5. Nature Neurosci 3:
716 –723, 2000.
REES G, HOWSEMAN A, JOSEPHS O, FRITH CD, FRISTON KJ, FRACKOWIAK RSJ,
AND TURNER R. Characterizing the relationship between BOLD contrast and
regional cerebral blood flow measurements by varying the stimulus presentation rate. Neuroimage 6: 270 –278, 1997.
REESE TG, DAVIS TL, AND WEISSKOFF RM. Automated shimming at 1.5 T
using echo-planar image frequency maps. J Magn Reson Imaging 5: 739 –
745, 1995.
REITE M, ADAMS M, SIMON J, TEALE P, SHEEDER J, RICHARDSON D, AND
GRABBE R. Auditory M100 component 1: relationship to Heschl’s gyri.
Brain Res Cogn Brain Res 2: 13–20, 1994.
RITTER W, VAUGHAN JHG, AND COSTA LD. Orienting and habituation to
auditory stimuli: a study of short term changes in average evoked responses.
Electroencephalogr Clin Neurophysiol 25: 550 –556, 1968.
ROBSON MD, DOROSZ JL, AND GORE JC. Measurements of the temporal fMRI
response of the human auditory cortex to trains of tones [published erratum
appears in. Neuroimage 8: 228, 1998]. Neuroimage 7: 185–198, 1998.
ROTH WT AND KOPELL BS. The auditory evoked response to repeated stimuli
during a vigilance task. Psychophysiology 6: 301–309, 1969.
ROYER FL AND ROBIN DA. On the perceived unitization of repetitive auditory
patterns. Percept Psychophys 39: 9 –18, 1986.
SCHLAUG G, SANES JN, THANGARAJ V, DARBY DG, JANCKE L, EDELMAN RR,
AND WARACH S. Cerebral activation covaries with movement rate. Neuroreport 7: 879 – 883, 1996.
SCHREINER CE AND LANGNER G. Coding of temporal patterns in the central
auditory nervous system. In: Auditory Function: Neurobiological Bases of
Hearing, edited by Edelman GM, Gall WE, and Cowan WM. New York:
John Wiley, 1988, p. 337–361.
SCHREINER CE AND RAGGIO MW. Neuronal responses in cat primary auditory
cortex to electrical cochlear stimulation. II. Repetition rate coding. J Neurophysiol 75: 1283–1300, 1996.
SCHREINER CE AND URBAS JV. Representation of amplitude modulation in the
auditory cortex of cat. I. The anterior auditory field (AAF). Hear Res 21:
227–241, 1986.
SCHREINER CE AND URBAS JV. Representation of amplitude modulation in the
auditory cortex of the cat. II. Comparison between cortical fields. Hear Res
32: 49 – 64, 1988.
SIGALOVSKY I, HAWLEY ML, HARMS MP, AND MELCHER JR. Sound level
representations in the human auditory pathway investigated using fMRI.
Neuroimage 13: S939, 2001.
SOBEL N, PRABHAKARAN V, ZHAO Z, DESMOND JE, GLOVER GH, SULLIVAN EV,
AND GABRIELI JDE. Time course of odorant-induced activation in the human
primary olfactory cortex. J Neurophysiol 83: 537–551, 2000.