J Comp Physiol A (1997) 181: 665±676 Ó Springer-Verlag 1997 ORIGINAL PAPER G. Langner á M. Sams á P. Heil á H. Schulze Frequency and periodicity are represented in orthogonal maps in the human auditory cortex: evidence from magnetoencephalography Accepted: 25 July 1997 Abstract Timbre and pitch are two independent perceptual qualities of sounds closely related to the spectral envelope and to the fundamental frequency of periodic temporal envelope ¯uctuations, respectively. To a ®rst approximation, the spectral and temporal tuning properties of neurons in the auditory midbrain of various animals are independent, with layouts of these tuning properties in approximately orthogonal tonotopic and periodotopic maps. For the ®rst time we demonstrate by means of magnetoencephalography a periodotopic organization of the human auditory cortex and analyse its spatial relationship to the tonotopic organization by using a range of stimuli with dierent temporal envelope ¯uctuations and spectra and a magnetometer providing high spatial resolution. We demonstrate an orthogonal arrangement of tonotopic and periodotopic gradients. Our results are in line with the organization of such maps in animals and closely match the perceptual orthogonality of timbre and pitch in humans. Key words Periodicity pitch á Tonotopy á Auditory cortex á Magnetoencephalography á Neuronal map Abbreviations AC auditory cortex á BMF best modulation frequency á CF characteristic frequency á ECD equivalent current dipole á f0 fundamental frequency á MEG magnetoencephalography á SF sustained ®eld G. Langner (&) á P. Heil á H. Schulze Institute of Zoology, Technical University Darmstadt, Schnittspahnstr. 3, D-64287 Darmstadt, Germany Fax: +49-6151/164808, e-mail: [email protected] M. Sams Low Temperature Laboratory, Helsinki University of Technology, Finland Introduction The relationships between frequency, periodicity, pitch, and timbre Acoustic signals used by humans and animals for communication are often harmonic in structure. A harmonic signal is characterized by a temporal envelope which ¯uctuates periodically with a period given by the reciprocal of the fundamental frequency (f0), and a spectrum composed of frequencies which are integer multiples of f0. Any portion of the spectrum which contains at least two adjacent harmonics has the same temporal envelope period and, in a human listener, elicits the same pitch as the fundamental component alone. This percept is referred to as Ôperiodicity pitch' or, because the fundamental component is not required to elicit that pitch, the percept of the Ômissing fundamental' (Schouten 1970). The pitch of a harmonic signal can be varied, by varying its temporal envelope period without varying its spectral envelope or bandwidth ± two parameters essential for the timbre of a sound (Fig. 1). Conversely, timbre can be varied by varying the spectral envelope without varying the temporal envelope period, i.e. without varying pitch (Fig. 1). For example, the timbre of a pure tone diers from the timbre of a harmonic sound, but it elicits the same pitch provided it has the same period as the temporal envelope of the harmonic sound (Fig. 1). Likewise, dierent musical instruments (or voices) can dier in timbre when producing the same note. In that sense pitch and timbre are largely independent and may be considered as orthogonal perceptual parameters related to temporal envelope periodicity and spectral content, respectively. The independence of pitch and timbre was demonstrated in psychophysical experiments (Plomp and Steeneken 1971; Krumhansl and Iverson 1992), and pitch and timbre were shown to compete in streaming experiments (Singh 1987). 666 coincidence units are activated (Langner 1981, 1983, 1992). Moreover, as found in the midbrain of cat and chinchilla, neurons with dierent BMFs are spatially mapped in an orderly fashion, such that the gradients of BMF (periodotopy) and CF (tonotopy) are roughly orthogonal to each other (Schreiner and Langner 1988; Langner et al. 1992). Representation of frequency and periodicity in the auditory cortex Fig. 1 Pitch and timbre are largely independent percepts. Signals with the same temporal envelope period have the same pitch, but they may have dierent spectral envelopes and therefore dierent timbres. Signals with dierent periods elicit a dierent pitch, while their spectral envelopes (and their timbre) may be similar Representation of frequency and periodicity in the auditory brainstem The independence of pitch and timbre is in line with response properties of neurons in the brainstem of various animals. In addition to their frequency tuning neurons are often tuned to particular modulation frequencies of temporal envelopes (for review see Langner 1992). Neurons which are tuned to a certain characteristic frequency (CF) may prefer dierent temporal envelope modulations, e.g. they may have dierent best modulation frequency (BMF). To a ®rst approximation their spectral and temporal tuning properties are independent. In the cochlear nucleus of various animals, the tuning of neurons to temporal envelope modulations is evident mainly with respect to the synchronization of their discharges to the modulation (Frisina et al. 1990; Rhode and Greenberg 1994). In the auditory midbrain, synchronization is relatively poor, especially to high modulation frequencies (>100 Hz, corresponding to modulation periods<10 ms). However, neurons can be tuned to such high modulation frequencies with respect to their average discharge rate (Langner 1981, 1983, 1992; Rees and Mùller 1983; Rose and Capranica 1985; Langner and Schreiner 1988; Heil et al. 1995). In a theory of temporal correlation analysis neurons in the auditory midbrain are considered as coincidence detectors. Action potentials at their inputs represent delayed and undelayed neuronal responses to the envelope of periodic signals. When these dierent neuronal delays are compensated by the period of the signal the Orthogonality of neuronal representations of frequency and envelope periodicity has been described in the auditory cortex (AC) analogue of the Mynah bird (Hose et al. 1987). In contrast, Pantev et al. (1989), using magnetoencephalography (MEG) in humans, found that the locations of the magnetic ®eld generators within the supratemporal cortex that were activated with a harmonic sound of missing fundamental and with a pure tone of a frequency corresponding to that fundamental were in close register. These authors concluded that ``... the tonotopic organization of the primary auditory cortex re¯ects the pitch rather than the frequency of the stimulus''. This conclusion appears unsatisfactory, as it is dicult to reconcile such an organization with the perceptual orthogonality of pitch and timbre, as outlined above. Such an organization would also be at variance with the quasiorthogonal arrangement of periodotopy and tonotopy found in animals. However, the stimulus set used by Pantev et al. (1989), viz. only two pure tones and one harmonic sound, may have been too restricted to reveal the spatial representation of frequency and temporal envelope period. We therefore attempted to reinvestigate this issue using a more extensive set of stimuli, viz. up to six pure tones and six harmonic sounds, and a magnetometer with high spatial resolution, hoping that these conditions would be suited to resolve the discrepancy between the nearly orthogonal arrangement of tonotopy and periodotopy in the auditory midbrain and forebrain of animals and their proposed parallel arrangement in the human AC. Materials and methods Acoustic stimuli Acoustic stimuli were pure tones and harmonic sounds. Their spectra are schematically illustrated in Fig. 2. Pure-tone frequencies were 50 Hz, 100 Hz, 200 Hz, 400 Hz, and, in some experiments, also 800 Hz and 1600 Hz. In the ®gures they are referred to as s50, s100, etc. (Fig. 2, left column). Harmonic sounds (Fig. 2, centre and right columns) were composed of harmonics of 50 Hz, 100 Hz, 200 Hz, and 400 Hz and thus elicited a pitch corresponding to these fundamental frequencies. All harmonic sounds had an upper cut-o frequency of 5 kHz, but the lower cut-o frequency was either 400 Hz (Fig. 2, centre) or 800 Hz (Fig. 2, right). Signals with 400 Hz cut-o are referred to as p50, p100, etc., and those with 800 Hz cut-o as h100 and h200 (h50 and h400 were 667 Fig. 2 Schematic spectra of pure tones and harmonic signals used as stimuli. The high cut-o frequency was 5000 Hz for all harmonic sounds, while the low cut-o frequency was 400 Hz for p50, p100, p200, and p400 and 800 Hz for h100 and h200. The only harmonic stimulus of this set which includes its fundamental frequency is p400. The spectra do not show the roll-o of the sound transmission not used). Signals with the same cut-o frequency have very similar spectral envelopes. Thus, pure tones and harmonic signals labelled with the same number (signals within a given row in Fig. 2) have the same temporal envelope period and elicit the same pitch. Signals labelled with the same letter (signals within a given column in Fig. 2) have the same spectral envelope and elicit a similar timbre. All stimuli had durations of 500 ms, including linear rise and fall times of 10 ms, and were presented with interstimulus intervals of 1.2 s. Stimulus presentation Six subjects (one female, ®ve males) participated in the experiments. In early experiments, pure tones (four subjects) and harmonic sounds (six subjects) were presented in separate experimental sessions. Within each session stimuli were presented in random order. In later experiments (four subjects), each harmonic sound was followed by a pure tone with the same pitch, and stimulus pairs of dierent pitch were presented in random order. The sounds were produced by equipment located outside the magnetically shielded room inside which the subjects were seated during the measurements. Sounds were led to the subjects' ears via plastic tubes of several meters in length and specially designed ear pieces. The transfer function of this system was ¯at from 1 kHz to 2 kHz and fell o by 15 dB/octave below 1 kHz and by 20 dB/octave above 2 kHz. Before an experiment, each subject adjusted loudness levels of all stimuli to be equal (60 dB HL). Prior to magnetoencephalographic recording subjects were asked to remember the highest number of consecutive signals (or signal pairs) with the same pitch in order to keep their attention high during the recording session. One to ®ve independent measurements were performed per subject and within each session each stimulus was repeated about 200 times. The duration of a session was about 45 min. Magnetoencephalographic recording Auditory-evoked magnetic ®elds were recorded with a 122-channel magnetometer, covering the whole head (Ahonen et al. 1993; HaÈmaÈlaÈinen et al. 1993; MaÈkelaÈ et al. 1993). Location of the head with respect to the SQUID sensors was determined by measuring the magnetic ®elds produced by small currents delivered to three coils attached to the scalp. The location of the coils on the preauricular points and the nasion was measured with a 3D-digitizer. The recording bandpass was 0.03±100 Hz (3-dB points; high-pass roll-o 35 dB/decade, low-pass over 80 dB/decade) with a sampling rate of 0.4 kHz. Trials contaminated by eye blinks were rejected. Data analysis After digital low-pass ®ltering at 40 Hz equivalent current dipoles (ECDs) that best described the measured magnetic ®eld at a given latency were found by least-squares ®t in a spherical volume conductor (Kaukoranta et al. 1986). We used a one-dipole model, separately for the left and right hemisphere, with a subset of channels over each hemisphere. Such a one-dipole approximation is only a ®rst approximation, since in each hemisphere multiple sources may be expected to be active at most times (for example in dierent cytoarchitectonic areas of the supratemporal AC). However, there is no reasonable estimate of how many sources should be assumed to be active at a given time. Because it has previously been shown that the one-dipole model provides a very good approximation of the measured magnetic ®eld during short time segments (for review see Hari 1990), we concentrated our analysis of ECD locations on those short-time segments during which the measured magnetic ®eld was best approximated by the one-dipole model. In order to make our interpretation of the data as demonstrated in the ®gures more comprehensible we added ellipses (Fig. 5), polygons (Fig. 6) and arrows or dotted lines (Figs. 5±10), all of them matched by eye. Results Basic observations Typically, the computed time-course of the ECD magnitude and the corresponding goodness-of-®t of the onedipole model showed peaks around 100 ms and 200 ms after stimulus onset followed by sustained plateaus. These de¯ections are referred to as M100, M200, and SF (sustained ®eld), respectively. Figure 3 illustrates an example with additional peaks around 60 ms (M60) and 150 ms (M150) which were seen in some subjects. The precise latency of the prominent de¯ections was stimulus dependent (see also Forss et al. 1993; Roberts and Poeppel 1996). Figure 4 shows the latency of the peak of M100 from the right hemisphere of all subjects which were stimulated with harmonic sounds of 400 Hz cut-o plotted against the envelope period. The contin- 668 Fig. 3 Time-course of the magnitude of an equivalent current dipole (ECD), computed for consecutive 2-ms intervals from the magnetic ®elds measured over the left hemisphere of one subject. The stimulus was a harmonic signal (p100) with a duration of 500 ms which started at t = 10 ms. Acoustic delays have been subtracted. The goodness-of-®t (g) between the ®eld predicted by the one-dipole model and the original data is also shown. Typically, peaks of the ECD magnitude and g were observed at latencies corresponding to peak de¯ections of the averaged original MEG signal uous line without symbols in Fig. 4 represents the result of a linear regression analysis: latencyM100 = (89 3) ms + (1.38 0.24) á period (ms); r = 0.774 and P < 0.001. The location of the ECD in each hemisphere was not stable as a function of time. Two representative examples from dierent hemispheres of two subjects are shown in Fig. 5a, b. The data in this ®gure and most of the following ®gures are presented as viewed from a lateral perspective. Three-dimensional data analyses are presented in Figs. 11, 12, and 13. During certain time Fig. 4 Latency of the peak of M100 plotted as a function of temporal envelope period of harmonic signals with a cut-o frequency of 400 Hz. Results were obtained from the right hemisphere of six subjects stimulated through the contralateral ear. The results of a linear regression analysis (continuous line without symbols) are also shown Fig. 5a, b ECD location as a function of time after stimulus onset (stimulus: p400; time steps: 5 ms) and as seen from a lateral perspective. Data are from the right (a) and left auditory cortices (b) of two subjects. Note that dierent regions (elliptical boundaries) seem to be active at dierent delays with the momentary maximum of activity jumping rapidly between those regions (indicated by arrows) 669 segments after stimulus onset the ECD is located within rather restricted cortical areas (elliptical boundaries in Fig. 5) and the ECD location changes relatively little as a function of time. At other times, however, there appear to be rapid (i.e. often within <5 ms) and pronounced changes in ECD location from one restricted cortical area to another or back and forth between two such areas. However, the goodness-of-®t during such rapid changes in ECD location was very small, preventing a functional interpretation. Figure 6 shows for the right hemisphere of one subject the ECD locations during short time segments around M100 and M200 and during SF where the goodness-of-®t was maximal (80±98%), i.e. around the peaks of the ECD magnitude. Each data point represents 2 ms. The ECD trajectories are viewed from a lateral perspective and their approximate position in relation to the whole head is indicated by the inset in Fig. 6a, b Lateral view of ECD trajectories in the right hemisphere of one subject for stimulation with pure tones (a) and harmonic sounds (b). The labels next to the trajectories identify the stimuli (see Materials and methods and cf. Fig. 2). Each data point designates the location of the ECD during a 2-ms interval within time segments of maximal goodness-of-®t. The selected segments had durations of 10 ms for M100, of 20 ms for M200, and of 30 ms for SF. The arrows indicate the systematic spatial progressions of the ECD trajectories with frequency for pure tones and with fundamental frequency for harmonic sounds during M100 and during SF. A systematic progression during M200 is questionable. The positions of M100, M200, and SF relative to one another were similar in most subjects and in both hemispheres. For a better comprehension, their positions in relation to a whole head are schematically indicated in the insert in a Fig. 6a. The stimuli were four pure tones (a) and four harmonic sounds (b). Note that the ECD trajectories of M100, M200, and SF are spatially segregated. The ECD trajectories of M200 are located most anteriorly, as has been described earlier (e.g. Pantev et al. 1989; Hari 1990), and those of SF are located slightly more anterior than those of M100, corroborating more recent results of Pantev et al. (1994). Also note that for each de¯ection (M100, M200, SF) the ECDs obtained with pure tones and with harmonic sounds are localized in approximately equivalent cortical areas. However, there are systematic detailed dierences which are addressed further below. Tonotopy Previous MEG studies have demonstrated that the tonotopic gradient obtained for M100 and for SF runs mainly along the lateromedial axis, thus approximately perpendicular to the projection plane of Figs. 6±10, with more lateral ECD locations for low frequencies and more medial locations for high frequencies (e.g. Romani et al. 1982; Pantev et al. 1988, 1989, 1994, 1995; Cansino et al. 1994). However, the tonotopic gradient for M100 has a signi®cant component also in the tangential plane, so that a gradient should also be visible from a lateral perspective. Indeed, in several cases a tonotopic order of ECD trajectories obtained with pure tones of dierent frequencies was evident in a lateral view. In the case illustrated in Fig. 6a, for example, the ECD trajectories during M100 shifted with increasing frequency from a more posterior and superior location to a more anterior and inferior location (arrow in M100 box in Fig. 6a; see also Fig. 10), a shift in line with results of Pantev et al. (1994, 1995). A similar shift of ECD trajectories with frequency, in this case from posterior-inferior to anterior-superior, was seen during SF (arrow in SF box in Fig. 6a). While during M200 the positional changes of ECD locations for s100, s200, and s400 seem to indicate a gradient mainly from superior to anterior, this gradient is not corroborated by the ECD position for s50. The three-dimensional orientation of the tonotopic gradient during M100 was analysed in the following way and the results are demonstrated in Figs. 12 and 13: for each 10-ms ECD trajectory with a maximal goodness-of®t the centre of gravity was ®rst computed. Then, the distances in three-dimensional space of the centres for pure-tone stimuli of neighbouring frequencies were computed. Figure 12a shows, for all four subjects for which data were available, cumulative distances of these centres of gravity in the left hemisphere plotted against pure-tone frequency. Note that the frequency axis in Fig. 12a is logarithmic and that the cumulative distances increase roughly linearly with the logarithm of stimulus frequency. The thick solid line in Fig. 12a represents the result of a linear regression analysis between the logarithm to the base 2 of stimulus frequency and the cumulative distance. This function indicates an ap- 670 proximately equal spacing of octaves in the human AC, a result also in line with those of previous studies (e.g. Romani et al. 1982; Pantev et al. 1989, 1994, 1995; Cansino et al. 1994). The thick broken line in Fig. 12a represents the component of the average cumulative distance only along the lateromedial or depth coordinate. The proximity of this line to the individual data and their ®t reveals that the depth coordinate is a major contributor to the cumulative three-dimensional distances of the data. The thin broken line in Fig. 12a represents the component of the average cumulative distance within the tangential or lateral surface plane. This line is not quite as close to the data as the thick broken line. Nevertheless, the contribution of the tangential plane to the cumulative three-dimensional distances is signi®cant. In other words, frequency is represented along the depth coordinate, with low frequencies more lateral and high frequencies more medial, and also within the tangential dimension, with low frequencies more superior and posterior and high frequencies more anterior and inferior. The average three-dimensional orientation of the tonotopic gradient, from low to high frequencies, and the spatial frequency resolution that emerged from this analysis, are represented by the orientation and length of a vector in Fig. 13. The top and middle panels present a lateral and a top view, respectively. The lateral view can conveniently be compared with the raw data (e.g. Figs. 6a, 10). Periodotopy The ECD trajectories during M100 and during SF obtained with harmonic sounds of dierent fundamental frequency also showed a topographic order that was evident in a lateral perspective. Examples of periodotopic organizations during M100 are illustrated in Figs. 6b, 8, and 10, and during SF in Fig. 7. In each case, the approximate course of the periodotopic gradient is indicated by an arrow. The periodotopic gradients obtained in dierent subjects dier somewhat, but in general seem to run along a posterior and inferior to anterior and superior axis during M100 (Figs. 6b, 8, 10), and along a posterior and more superior to anterior and more inferior axis during SF (Fig. 7). The reproducibility of the measurements is indicated by Fig. 7a, b, which shows results from two measurements made on the same subject (PH) several months apart. Note that the obtained periodotopic gradients were quite similar, viz. were oriented from posteriorsuperior to anterior-inferior (see arrows). Furthermore, absolute coordinates obtained in dierent measurements and subjects de®ned from external head coordinates are quite similar. The average distance of the centres of gravity of ECD trajectories for corresponding stimuli in the three dierent experiments illustrated in Fig. 7 is 12.0 4.0 mm, only about twice the average distance Fig. 7a±c Lateral view of the ECD locations during SF (~380 ms). For each stimulus, the ECD trajectory represents a 20-ms segment (time interval: 2 ms). The results were obtained from the right hemisphere of two subjects with (a) being a repetition of (b) several months later. The approximate orientations of the periodotopic gradients are indicated by arrows between centres of gravity for neighbouring stimuli obtained in the same experiment (viz., 6.3 4.8 mm). Orthogonality of periodotopy and tonotopy In a given subject there are obvious dierences in the orientations of the tonotopic and periodotopic gradients for corresponding de¯ections, as seen from the lateral perspective (cf. Fig. 6a and 6b). To investigate this issue further, ®ve subjects were stimulated with harmonic signals diering in f0 (hence in pitch) and in the low cuto frequency of the spectrum. The lower cut-o frequency was 400 Hz for p50, p100, p200, and p400, and 800 Hz for h100 and h200 (Fig. 2). This dierence in 671 Fig. 9 Lateral view of ECD trajectories during M60 (10-ms segments; time intervals of 2 ms) in the left hemisphere of subject HS. Note the dierent orientations of the tonotopic and periodotopic gradients (dashed lines). Also note that the trajectories for the two harmonic stimuli with a cut-o frequency of 800 Hz (viz. h100 and h200) are displaced relative to those with a cut-o frequency of 400 Hz (viz. p50, p100, p200, p400) in the direction of the tonotopic gradient. Note the grid-like regular spacing of ECD trajectories, especially for harmonic signals Fig. 8a±c Lateral view of the ECD trajectories during M100 in the left hemisphere of two subjects (a and c) and in the right hemisphere of a third subject (b). Subjects were stimulated with six harmonic sounds diering in fundamental frequency and in low cut-o frequency which was 400 Hz for p50, p100, p200, and p400, and 800 Hz for h100 and h200 (see also Fig. 2 for key to the stimuli). For each stimulus, the ECD trajectory represents a 10-ms segment (time interval: 2 ms). The approximate orientations of the periodotopic gradients are indicated by arrows with continuous lines while the approximate displacements of h100 and h200 (owing to their higher cut-o frequency) are indicated by arrows with dotted lines the lower spectral boundary also constitutes a shift of the centre of gravity of the stimulus spectrum along the frequency axis. We therefore expected to obtain dierences in the ECD locations between p- and h-stimuli that should indicate the tonotopic gradient. In addition, stimuli with identical cut-o frequency, but with dierent fundamental frequency, should reveal the periodotopic gradient. Figure 8 shows the results obtained for M100 from the left hemisphere of two subjects (Fig. 8a, c) and from the right hemisphere of a third subject (Fig. 8b). As noted above, the periodotopic gradients, revealed by stimuli of dierent fundamental frequency and similar spectral envelope, are oriented roughly from posterior to anterior (as suggested by the arrows with continuous lines). ECD trajectories for stimuli of dierent cut-o frequency are displaced. In other words, stimuli eliciting the same pitch are not colocalized. As suggested by the arrows with dotted lines, the displacement is roughly along the superior to inferior axis and within, approximately orthogonal to the periodotopic gradient (see also Fig. 10). A similar result was obtained for M60 in subjects with a prominent de¯ection at this latency. Data from one subject (HS) are illustrated in Fig. 9. Note the gridlike arrangement of ECDs suggesting a regular representation of signal parameters along two axes as well as equal distances for ECDs elicited by harmonic stimuli separated by octaves. Note also that during M60 the periodotopic gradient and the displacement of p- and h-stimuli are reversed compared to those during M100 (cf. Figs. 8 and 9). The reliability of these measurements is again emphasized by the similarities in the absolute coordinates of ECD trajectories. The average dierences in threedimensional space between the centres of gravity of ECD trajectories for corresponding stimuli in dierent subjects was 10.3 6.3 mm, less than twice the average distance between neighbouring stimuli in the same subject (viz., 6.5 4.0 mm). 672 It may be noteworthy that in some cases the ECD trajectories for p400, the only harmonic stimulus whose spectrum included the f0, showed peculiarities. For example, in subject GL (Fig. 8a) the position of the ECD trajectory for p400 would seem more appropriate for h400, a stimulus which was not tested. Since p400 is a combination of h400 and the fundamental frequency of 400 Hz (see Materials and methods), p400 would be expected to elicit the same response as h400, provided the subject did not hear or was able to Ôignore or suppress' the fundamental component. In subject PH (Fig. 7a, b; Fig. 6b) the orientation of the ECD trajectory for p400 is markedly dierent from the orientation of the trajectories for the other stimuli. Evidence for orthogonality from experiments using harmonic and pure tones in the same session As noted above, the relative displacement of ECD trajectories for stimuli of dierent cut-o frequency was roughly orthogonal to the periodotopic gradient. More direct evidence for an approximately orthogonal organization of tonotopy and periodotopy comes from the analysis of the locations of ECD trajectories for harmonic and pure tone stimuli which were presented to four subjects as stimulus pairs in the same experimental session. Figure 9 presents such data for M60. In this lateral view the tonotopic gradient, revealed by the ECD trajectories for pure-tone stimuli of dierent frequency, runs approximately from inferior to superior, while the periodotopic gradient, revealed by the ECD trajectories for harmonic stimuli of dierent f0 runs approximately from anterior to posterior. While in this lateral view tonotopic and periodotopic gradients (dotted lines in Fig. 9) are not orthogonal to each other, it is important to stress that they are clearly not parallel. The displacement to more superior loci of ECD trajectories for harmonic stimuli of higher (h100, h200) relative to those of lower cut-o frequency (p50, p100, etc.) is approximately in the direction that is expected from the orientation of the tonotopic gradient. Note that the ECD trajectories for the harmonic stimuli with a cut-o frequency of 400 Hz (viz. p50, p100, etc.) are located between the ECD trajectories for pure-tone stimuli of 400 and 800 Hz, while harmonic stimuli with a cut-o frequency of 800 Hz (h100 and h200) are located between pure tones of 800 and 1600 Hz. For harmonic stimuli of dierent fundamental frequencies the ECD trajectories are regularly spaced with an average of 6 mm/octave and for pure tones of dierent frequencies with an average of 8 mm/octave. The cut-o frequencies of p100 and h100, as well as of p200 and h200 and h200 (400 Hz and 800 Hz, respectively) are also separated by one octave, however, in this case along the frequency axis. Accordingly, the corresponding ECDs are separated by about 10 mm in the direction of the tonotopic gradient. Corresponding data for M100, from a dierent subject and hemisphere, are illustrated in Fig. 10. In this case and in this lateral view, tonotopic and periodotopic gradients are roughly orthogonal and, as during M60, the displacement of ECD trajectories for harmonic stimuli of higher (h100, h200) relative to those of lower cut-o frequency (p50, p100, etc.) is again approximately in the direction expected from the orientation of the tonotopic gradient. Note that the tonotopic and periodotopic gradients run in directions opposite to those of the corresponding gradients during M60 (cf. Figs. 9 and 10). Three-dimensional analysis of ECD positions The most direct evidence for an orthogonal organization of tonotopy and periodotopy comes from analysis of the three-dimensional spatial relationships between ECD trajectories for pure-tone and harmonic stimuli in all four subjects which were stimulated with both types of signals in the same experimental session. The analysis was performed in the following way. We ®rst calculated the centre of gravity of the ECD trajectory for each stimulus in a given subject and hemisphere during M100 and, where possible, during M60. Next, we obtained the vector between centres of gravity for pure-tone and for harmonic stimuli that are neighbours along the dimen- Fig. 10 Lateral view of ECD trajectories during M100 (10-ms segments; time intervals of 2 ms) in the right hemisphere of subject PH. Note the quasi-orthogonal orientations of the tonotopic and periodotopic gradients (dashed lines). Also note that the trajectories for the two harmonic stimuli with a cut-o frequency of 800 Hz (viz. h100 and h200) are displaced relative to those with a cut-o frequency of 400 Hz (viz. p50, p100, p200, p400) approximately in the direction of the tonotopic gradient 673 sion of frequency and fundamental frequency, respectively, i.e. the vector between s50 and s100 (s50®s100), between s100 and s200 (s100®s200), etc., between h100 and h200 (h100®h200), between p50 and p100 (p50®p100), between p100 and p200 (p100®p200), and between p200 and p400 (p200®p400). Finally, we calculated the angles formed by all vectors between two neighbouring pure-tone stimuli with all vectors between two neighbouring harmonic stimuli (e.g. the angles formed by s200®s400 with p50®p100, with p100®p200, etc.). Figure 11 shows the frequency distributions of these angles, with a bin width of 15 degrees, for M100 in the left and right hemispheres and for M60 in the left hemisphere. The number of M60 data with sucient goodness of ®t for the right hemisphere was too limited for an evaluation. The angles were not evenly distributed. Rather, the distributions peak near 90 degrees, as identi®ed in Fig. 11. These values emphasize that, on average, tonotopic and periodotopic gradients are oriented nearly orthogonally to one other. Another independent demonstration of the orthogonality of tonotopy and periodotopy comes from a comparison of the average tonotopic and periodotopic vectors in three-dimensional space. Figure 12b plots the cumulative three-dimensional distances of the centres of gravity of ECD trajectories during M100 for neighbouring harmonic stimuli, in a fashion analogous to that described above for pure-tone stimuli (see Fig. 12a). The periodotopic data in Fig. 12b are from the same subjects and hemispheres that provided the tonotopic data in Fig. 12a and the angle distribution of Fig. 11a, and, in addition, from two subjects (SA and SL) who were stimulated with harmonic signals only. Although the cumulative distances of the centres of gravity for harmonic signals scatter more widely among subjects than those for pure tones (cf. Figs. 12a and 12b), they also increase roughly linearly with the logarithm of f0 (thick solid line in Fig. 12b). Thus, octaves of f0 are approximately equally spaced in the human AC. However, the spatial resolution for fundamental frequency (viz. 5.5 mm/octave) is less than that for pure tone frequency (viz. 7.1 mm/octave). The thick broken line in Fig. 12b represents the component of the average cumulative distance along the lateromedial or depth coordinate only. Note that this line deviates considerably from the individual data, thus the depth coordinate is not the major contributor to the cumulative three-dimensional distances of the periodotopic data. In contrast, the thin broken line, which represents the component of the average cumulative distance within the tangential or lateral surface plane, is in rather close register with the data. Thus, the tangential plane contributes most to the cumulative threedimensional distances of the ECD centres of gravity for harmonic stimuli. To allow the comparison with the tonotopic vector, the average three-dimensional orientation of the periodotopic gradient, from low to high fundamental frequencies, and the spatial frequency resolution that Fig. 11a±c Distributions of angles (binwidth: 15 degrees) formed by tonotopic and periodotopic gradients during M100 in the left (a) and right hemisphere (b) and during M60 in the left hemisphere (c). Each entry is the angle formed by a vector between the centres of gravity of ECD trajectories for two pure-tone stimuli of neighbouring frequency and a vector between the centres of gravity of ECD trajectories for two harmonic stimuli of neighbouring fundamental frequency, as described in detail in the Results. The mean angles, the standard deviations, the vector strengths, and the likelihoods of the distributions to be random, as obtained by circular statistics, are also identi®ed emerged from this analysis, are also represented by the orientation and length of a vector in Fig. 13. The two vectors form an angle of 95 degrees when viewed from the lateral perspective (Fig. 13a), and an angle of 89 degrees when viewed from the top perspective (Fig. 13b). However, to derive the average spatial organization, the two vectors should be viewed from an 674 Fig. 12a, b Cumulative three-dimensional distances in the left hemisphere between the centres of gravity of ECD trajectories during M100 as a function of frequency for pure-tone stimuli (a) and of fundamental frequency for harmonic stimuli (b). Dierent symbols represent dierent subjects. The thick continuous lines represent the results of linear regression analyses between cumulative distance (cd) and the logarithm to the base 2 of frequency and fundamental frequency (1d f). These results suggest a 30% greater spacing for tonotopy than for periodotopy (7.1 versus 5.5 mm/octave). The thick broken lines represent the components of the distances along the lateromedial or depth coordinate and the thin broken lines within the tangential or surface plane only optimal perspective, i.e. by looking down onto that plane in which both vectors have their maximal length. In this plane, the tonotopic vector has a length corresponding to a frequency resolution of 7.1 mm/octave and the periodotopic vector a length corresponding to a resolution of 5.5 mm/octave, i.e. the values derived from the cumulative three-dimensional distances of the ECD centres of gravity (cf. Fig. 12). This optimal plane is tilted relative to a horizontal plane and points downward in the anterior and medial directions. Figure 13c shows that in this optimal plane, periodotopic and tonotopic vectors form an angle of 85 degrees, i.e. an angle again close to 90 degrees. Fig. 13 Average orientation and spatial resolution of tonotopic and periodotopic gradients during M100 represented by the orientation and length of two vectors. The radius of each circle represents 7.1 mm/octave. The data are shown from a lateral view (top), the same perspective as used for Figs. 5±10, from a top view (middle), and from an optimal view (bottom), i.e. by looking down onto that plane in which both vectors have their maximal length. This optimal plane is tilted relative to a horizontal plane and points downward in the anterior and medial directions. The angle formed by the average tonotopic and periodotopic vectors is 85 degrees (bottom panel), but appears slightly dierent when viewed from a lateral or top perspective, viz. 95 (top) and 89 degrees (middle), respectively Discussion The present study has demonstrated with the MEG technique the existence of a tonotopic and a periodotopic organization of the human AC and, most importantly, an approximately orthogonal arrangement of tonotopic and periodotopic gradients. 675 The tonotopic organization, which was analysed here in most detail for the generators of M100 (Figs. 6a, 10, 12a, 13), compares well with that reported in previous studies (e.g. Romani et al. 1982; Pantev et al. 1989, 1994, 1995; LieÂgeois-Chauvel et al. 1994; Cansino et al. 1994). In agreement with our results, the tonotopic gradient during M100 has been reported to be oriented along the depth coordinate with low frequencies being represented more laterally and high frequencies more medially (Romani et al. 1982; Pantev et al. 1989, 1994, 1995). These studies also document a component of the tonotopic gradient in the tangential plane with low frequencies being represented more posteriorly and high frequencies more anteriorly. Likewise, the above studies and ours agree in that there is an approximately equal spacing of octaves (Figs. 6a, 10, 12a). Our cumulative distances suggest an average spatial resolution of 7.1 mm/octave. The values that can be derived from other studies vary considerably. The data of Cansino et al. (1994; their Fig. 9) reveal three-dimensional resolutions of about 4±7 mm/octave in dierent individuals, those of Romani et al. (1982; their Fig. 4) average resolutions of about 4 mm/octave, and those of Pantev et al. (1994; their Fig. 5) some 2±3 mm/octave. Whatever the source of this variability, it is important to note that these studies, while they disagree with respect to absolute distances, do agree with respect to the relative spatial relationships of ECD locations for dierent stimuli. Furthermore, they seem to agree with respect to the relative spatial relationships of ECD locations for dierent de¯ections of the response, e.g. the anterior location of M200 relative to M100, of SF relative to M100, etc. (see e.g. Pantev et al. 1989; Hari 1990; McEvoy et al. 1994). The periodotopic organization of the human AC, demonstrated here for M60 (Figs. 9, 13c) SF (Figs. 6a, 10, 12a), and in most detail for M100, con®rms our preliminary results reported elsewhere (Langner et al. 1994). The nearly orthogonal arrangement of the tonotopic and the periodotopic gradients, again documented here in most detail for M100 (Figs. 6, 8, 10, 12, 13), constitutes the central ®nding of the present study. This ®nding is radically opposed to the conclusion drawn by Pantev et al. (1989) from their MEG study in humans, viz. that ``... the spatial arrangement of the generators of the magnetic wave M100 re¯ects the pitch rather than the spectral contents of the stimulus'', i.e. in eect the conclusion that tonotopy and periodotopy are organized in parallel. We believe that these opposing conclusions originate from the design and selection of stimuli. Pantev et al. (1989) used only two pure-tone stimuli with frequencies of 250 and 1000 Hz and only one harmonic signal, composed of the fourth to eighth harmonic of the 250Hz f0, i.e. of 1000 Hz, 1250 Hz, 1500 Hz, and 1750 Hz. Clearly, the selection of only a single harmonic stimulus prevents the demonstration of any periodotopic gradient. Furthermore, this harmonic signal was presented together with a (continuous) narrow-band noise centred at the f0. This may explain why that the ECD evoked by this stimulus was located closer to the ECD evoked by the 250-Hz tone than to that evoked by the 1000-Hz tone. The demonstration of an orthogonal arrangement of tonotopic and periodotopic gradients in the present study was achieved by selecting stimuli that allowed the variation of temporal envelope period without varying the spectral envelope and vice versa (see Fig. 2). The orthogonality could easily be missed by the use of stimuli that are designed such that temporal envelope period and spectral envelope vary concomitantly. The orthogonal arrangement agrees well with results of single- and multi-unit mapping studies of tonotopy and periodotopy in the auditory midbrain and forebrain of mammals and birds, which have demonstrated nearly orthogonal maps of CF and BMF (Hose et al. 1987; Schreiner and Langner 1988; Langner et al. 1992). It was shown that a temporal analysis of periodicity analysis is underlying the spatial representation of periodicity orthogonal to the tonotopic map in the midbrain (Langner and Schreiner 1988). The similarity of representations in the midbrain of animals and in the human cortex may suggest also similar mechanisms of pitch perception. The conclusion would be that one dimension of the cortical map represents the result of the cochlear frequency analysis and corresponds to the perceptual quality of timbre, while the other dimension, orthogonal to the frequency axis, represents the result of the temporal periodicity analysis in the brainstem and corresponds to the perceptual quality of pitch. As a result, signals with the same timbre (e.g. two vowels [a:]) are represented at dierent locations in this map, provided they dier in pitch (e.g. when uttered by a male subject and a female subject). The orthogonality of tonotopy and periodotopy, as well as their relative spatial resolutions, obtained from our results in the human AC, are also in line with the perceptual independence of pitch and timbre as described by Plomp and Steeneken (1971). In fact, the correspondence is most remarkable. These authors presented subjects with stimuli of ®ltered periodic pulses. Stimuli diered in repetition rate (200±640 Hz), i.e. in pitch, and in the centre frequency of the 1/3-octave bandpass ®lter (2000±6400 Hz). Subjects were asked to judge and rank the similarity of signals presented in triads. The multi-dimensional scaling analysis of their data revealed an orthogonal organization of pulse repetition rate and centre frequency, i.e. a perceptual orthogonality of pitch and timbre. Furthermore, the perceptual distances of stimuli diering in centre frequency were about 1.6 times as large as those of stimuli diering in repetition rate. This compares remarkably well with the ratio of 1.3 between the spatial resolutions for frequency and fundamental frequency (viz. 7.1 mm/ octave divided by 5.5 mm/octave) obtained in the present study. This close correspondence may constitute another example of the match of behavioural discriminability of a stimulus parameter and the scaling of its 676 cortical representation, well known from the somatosensory and visual systems, as well as from the auditory system, particularly of echolocating bats (Suga et al. 1987). Acknowledgements This study was supported by the Academy of Finland, the Deutsche Forschungsgemeinschaft, and the Human Frontier Science Program Organization. We are grateful to Prof. R. Hari and many colleagues at the Low Temperature Laboratory of the Helsinki University of Technology for making these experiments possible. References Ahonen A, HaÈmaÈlaÈinen M, Kajola MJ, Knuutila JET, Laine PP, Lounasmaa OV, Parkkonen LT, Simola JT, Tesche CD (1993) 122-Channel SQUID instrument for investigating the magnetic signals from the human brain. Physica Script T49: 198±205 Cansino S, Williamson SJ, Karron D (1994) Tonotopic organization of human auditory association cortex. Brain Res 663: 38± 50 Forss N, MaÈkelaÈ JP, McEvoy L, Hari R (1993) Temporal integration and oscillatory responses of the human auditory cortex revealed by evoked magnetic ®elds to click trains. Hear Res 68: 89±96 Frisina RD, Smith RL, Chamberlain SC (1990) Encoding of amplitude modulation of the gerbil cochlear nucleus: I. A hierarchy of enhancement. Hear Res 44: 99±122 HaÈmaÈlaÈinen M, Hari R, Ilmoniemi RJ, Knuutila J, Lounasmaa OV (1993) Magnetoencephalography ± theory, instrumentation, and applications to noninvasive studies of the working human brain. Rev Mod Phys 65: 413±497 Hari R (1990) The neuromagnetic method in the study of the human auditory cortex. In: Grandori F, Hoke M, Romani GL (eds) Auditory-evoked magnetic ®elds and electric potentials. Advances in Audiology, vol 6. Karger, Basel, pp 222±282 Heil P, Schulze H, Langner G (1995) Ontogenetic development of periodicity coding in the inferior colliculus of the mongolian gerbil. Aud Neurosci 1: 363±383 Hose B, Langner G, Scheich H (1987) Topographic representation of periodicities in the forebrain of the Mynah bird: one map for pitch and rhythm? Brain Res 422: 367±373 Kaukoranta E, HaÈmaÈlaÈinen M, Sarvas J, Hari R (1986) Mixed and sensory nerve stimulation activate dierent cytoarchitectonic areas in the human primary somatosensory cortex SI. Neuromagnetic recordings and statistical considerations. Exp Brain Res 63: 60±66 Krumhansl CL, Iverson P (1992) Perceptual interactions between musical pitch and timbre. J Exp Psychol 18: 739±751 Langner G (1981) Neuronal mechanisms for pitch analysis in the time domain. Exp Brain Res 44: 450±454 Langner G (1983) Evidence for neuronal periodicity detection in the auditory system of the guinea fowl: implications for pitch analysis in the time domain. Exp Brain Res 52: 333±355 Langner G (1992) Periodicity coding in the auditory system. Hear Res 60: 115±142 Langner G, Sams M, Heil P, Schulze H, Hari R (1994) Orthogonal representation of frequency and periodicity in the human auditory cortex: evidence from magneto-encephalography. Soc Neurosci 20 (part1): 325 (abstract) Langner G, Schreiner CE (1988) Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms. J Neurophysiol 60: 1799±1822 Langner G, Schreiner CE, Albert M (1992) Tonotopy and periodotopy in the auditory midbrain of cat and guinea fowl. In: Cazals Y, Horner K, Demany L (eds) Auditory physiology and perception. Pergamon Press, Oxford, pp: 241±248 LieÂgeois-Chauvel C, Musolini A, Badier JM, Marquis P, Chauvel P (1994) Evoked potentials recorded form the auditory cortex in man: evaluation and topography of the middle latency components. Electroencephalogr Clin Neurophysiol 92: 204±214 MaÈkelaÈ JP, Ahonen A, HaÈmaÈlaÈinen M, Hari R, Ilmoniemi R, Kajola M, Knuutila J, Lounasmaa OV, McEvoy L, Salmelin R, Salonen O, Sams M, Simola J, Tesche C, Vasama J-P (1993) Functional dierences between auditory cortices of the two hemispheres revealed by whole-head neuromagnetic recordings. Human Brain Mapping 1: 48±56 McEvoy L, Hari R, Imada T, Sams M (1994) Human auditory cortical mechanisms of sound lateralization: I Interaural time dierences at sound onset. Hear Res 67: 98±109 Pantev C, Hoke M, Lehnertz K, LuÈtkenhoÈner B, Anogianakis G, Wittkowski W (1988) Tonotopic organization of the human auditory cortex revealed by transient auditory evoked magnetic ®elds. Electroencephalogr Clin Neurophysiol 69: 160±170 Pantev C, Hoke M, LuÈtkenhoÈner B, Lehnertz K (1989) Tonotopic organization of the auditory cortex: pitch versus frequency representation. Science 246: 486±488 Pantev C, Eulitz C, Elbert T, Hoke M (1994) The auditory evoked sustained ®eld: origin and frequency dependence. Electroencephalogr Clin Neurophysiol 90: 82±90 Pantev C, Bertrand O, Eulitz C, Verkindt C, Hampson S, Schuierer G, Elbert T (1995) Speci®c tonotopic organizations of dierent areas of the human auditory cortex revealed by simultaneous magnetic and electric recordings. Electroencephalogr Clin Neurophysiol 94: 26±40 Plomp R, Steeneken HJM (1971) Pitch versus timbre. Seventh International Congress on Acoustics, Budapest, pp 387±380 Rees A, Mùller AR (1983) Responses of neurons in the inferior colliculus of the rat to AM and FM tones. Hear Res 10: 301± 330 Rhode WS, Greenberg S (1994) Encoding of amplitude modulation in the cochlear nucleus of the cat. J Neurophysiol 71: 1797±1825 Roberts TPL, Poeppel D (1996) Latency of auditory evoked M100 as a function tone frequency. Neuroreport 7: 1138±1140 Romani GL, Williamson SJ, Kaufman L (1982) Tonotopic organization of the human auditory cortex. Science 216: 1339±1340 Rose GJ, Capranica RR (1985) Sensitivity to amplitude modulated sounds in the anuran auditory nervous system. J Neurophysiol 53: 446±465 Schouten JF (1970) The residue revisited. In: Plomp R, Smoorenburg GF (eds) Frequency analysis and periodicity detection in hearing. Sijthof, Leiden, pp 41±54 Schreiner CE, Langner G (1988) Periodicity coding in the inferior colliculus of the cat. II. Topographical organization. J Neurophysiol 60: 1823±1840 Singh PG (1987) Perceptual organization of complex-tone sequences ± a tradeo between pitch and timber. J Acoust Soc Am 82: 886±899 Suga N, Niwa H, Taniguchi I, Margoliash D (1987) The personalized auditory cortex of the mustached bat ± adaptation for echolocation. J Neurophysiol 58: 643±654
© Copyright 2025 Paperzz