|
|
||||||||
The Journal of Neurophysiology Vol. 87 No. 5 May 2002, pp. 2237-2261
Copyright ©2002 by the American Physiological Society
Laboratory of Auditory Neurophysiology, Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205
| |
ABSTRACT |
|---|
|
|
|---|
Liang, Li, Thomas Lu, and Xiaoqin Wang. Neural Representations of Sinusoidal Amplitude and Frequency Modulations in the Primary Auditory Cortex of Awake Primates. J. Neurophysiol. 87: 2237-2261, 2002. We investigated neural coding of sinusoidally modulated tones (sAM and sFM) in the primary auditory cortex (A1) of awake marmoset monkeys, demonstrating that there are systematic cortical representations of embedded temporal features that are based on both average discharge rate and stimulus-synchronized discharge patterns. The rate-representation appears to be coded alongside the stimulus-synchronized discharges, such that the auditory cortex has access to both rate and temporal representations of the stimulus at high and low frequencies, respectively. Furthermore, we showed that individual auditory cortical neurons, as well as populations of neurons, have common features in their responses to both sAM and sFM stimuli. These results may explain the similarities in the perception of sAM and sFM stimuli as well as the different perceptual qualities effected by different modulation frequencies. The main findings include the following. 1) Responses of cortical neurons to sAM and sFM stimuli in awake marmosets were generally much stronger than responses to unmodulated tones. Some neurons responded to sAM or sFM stimuli but not to pure tones. 2) The discharge rate-based modulation transfer function typically had a band-pass shape and was centered at a preferred modulation frequency (rBMF). Population-averaged mean firing rate peaked at 16- to 32-Hz modulation frequency, indicating that the A1 was maximally excited by this frequency range of temporal modulations. 3) Only approximately 60% of recorded units showed statistically significant discharge synchrony to the modulation waveform of sAM or sFM stimuli. The discharge synchrony-based best modulation frequency (tBMF) was typically lower than the rBMF measured from the same neuron. The distribution of rBMF over the population of neurons was approximately one octave higher than the distribution of tBMF. 4) There was a high degree of similarity between cortical responses to sAM and sFM stimuli that was reflected in both discharge rate- or synchrony-based response measures. 5) Inhibition appeared to be a contributing factor in limiting responses at modulation frequencies above the rBMF of a neuron. And 6) neurons with shorter response latencies tended to have higher tBMF and maximum discharge synchrony frequency than those with longer response latencies. rBMF was not significantly correlated with the minimum response latency.
| |
INTRODUCTION |
|---|
|
|
|---|
Human speech and musical sounds
contain prominent temporal modulations in both amplitude and frequency.
Low-frequency (<50 Hz) modulations are important for speech perception
and melody recognition, whereas modulations at higher frequencies
produce other types of sensations such as pitch and roughness
(Houtgast and Steeneken 1973
; Rosen
1992
). Amplitude and frequency modulations (AM and FM) are also
important components of communication sounds of animals and are found
in a wide range of species-specific vocalizations. The neural
representation of amplitude- and frequency-modulated sounds begins at
the auditory periphery, where auditory-nerve fibers faithfully
represent both fine and coarse temporal structures of complex sounds in
their temporal discharge patterns (Johnson 1980
;
Joris and Yin 1992
; Palmer 1982
). At
subsequent brain stem nuclei along the ascending auditory pathway, the
precision of the temporal representation degrades gradually, due to the
biophysical properties of neurons along the ascending pathway and
temporal integration of converging inputs from one station to the next (Blackburn and Sachs 1989
; Creutzfeldt et al.
1980
; de Ribaupierre et al. 1980
; Frisina
et al. 1990
; Langner and Schreiner 1988
). In a
modeling study of the transformation of temporal discharge patterns
from the auditory-nerve to the cochlear nucleus, Wang and Sachs
(1995)
showed that the reduction of phase-locking in stellate
cells can result from three mechanisms: convergence of subthreshold
inputs on the soma, inhibition, and the well-known dendritic low-pass
filtering (Rall and Agmon-Snir 1998
). These basic
mechanisms may also operate at successive nuclei leading to the
auditory cortex, progressively reducing the temporal limit of
stimulus-synchronized responses.
It has long been known that neurons in the auditory cortex have a
limited capacity to represent temporally modulated signals (Goldstein et al. 1959
; deRibaupierre et al.
1972
; Whitfield and Evans 1965
). In contrast to
subcortical neurons, neurons in the auditory cortex can only
synchronize to temporally modulated signals at modulation rates of up
to tens of Hertz (Eggermont 1991
, 1994
; Gaese and
Ostwald 1995
; Schreiner and Urbas 1988
) compared
with hundreds or thousands of Hertz subcortically (Creutzfeldt
et al. 1980
; Frisina et al. 1990
; Joris
and Yin 1992
). Because most of the studies in the past three
decades on this subject were conducted in anesthetized animals, with a
few exceptions (Bieser and Müller-Preuss 1996
;
Goldstein et al. 1959
; deRibaupierre et al.
1972
), it has been suspected that the low temporal response
rate reported in the auditory cortex might partially be caused by
anesthetics, which have been shown to alter temporal responses
properties of the auditory cortex (Gaese and Ostwald
2001
; Goldstein et al. 1959
). It is therefore
important to obtain measurements of cortical responses to temporally
modulated signals under unanesthetized conditions, which could provide
a better correlation with the perception of these signals.
While mechanisms based on stimulus-synchronized discharges have long
been assumed to be the predominant means for the cortex to represent
temporal modulations (see review by Langner 1992
), the
significance of discharge rate-based mechanisms in representing temporal features of complex sounds has gained little attention. This
is perhaps due to the fact that sustained discharges are not commonly
observed under anesthetized conditions. Under the awake condition,
however, neurons in the auditory cortex often respond with sustained
discharges throughout the entire stimulus duration (Bieser and
Müller-Preuss 1996
; Evans and Whitfield 1964
; Lu et al. 2001a
,b
; Recanzone et al.
2000
). Our recent study of the auditory cortex in awake
primates using sequential stimuli has provided clear evidence to
support a two-stage temporal processing mechanism that suggests
temporal coding for slowly changing acoustic events and rate coding for
rapidly changing acoustic events (Lu et al. 2001b
). The
present study using continuously modulated signals provides further
supporting evidence for such a mechanism.
Another important issue regarding the cortical processing of temporal
modulations is how cortical neurons represent similar temporal features
that are introduced by different means. In the spectral domain, the
notion of frequency filtering has been well established on the basis of
the response area obtained from pure tones or other types of stimuli.
It is not clear, however, whether a common temporal processing
mechanism, or a temporal filter, is applied by cortical neurons to a
variety of time-varying signals. To answer these questions, it is
necessary to comprehensively test the temporal response properties of
cortical neurons with a variety of temporally modulated signals. In
this study, we systematically characterized cortical responses to two
representative classes of temporally modulated signals, sinusoidally
amplitude-modulated (sAM) and frequency-modulated (sFM) tones, in a
large number of single-units in awake marmoset monkeys
(Callithrix jacchus), a highly vocal primate species
(Wang 2000
). Preliminary observations from the present
study were presented at two conferences (Liang et al.
1999
; Wang et al. 2001
).
| |
METHODS |
|---|
|
|
|---|
Animal preparation and recording procedures
Details on animal preparation and recording procedures were
described in a previous study (Lu et al. 2001a
) and are
only briefly described here. Marmosets were adapted to sit quietly
during recording sessions in an apparatus specially designed for this
species. The auditory cortex was accessed laterally using a single
tungsten microelectrode of impedance typically ranging from 2 to 5 M
at 1 kHz (A-M Systems) through a small hole (diameter, ~1.0 mm) in the skull. Only one opening in the skull existed at any given time
during the recording sessions. Each hole was sealed by dental cement
after several days of recordings. Necessary steps were taken to ensure
sterility during all recording sessions. Daily recording sessions (3-5
h) were carried out for several months in each animal. The advantage of
this procedure was that it only left a very small portion of the cortex
exposed, which greatly increased the recording stability, avoided
excess tissue growth and reduced the chance of infections through the
opening. All recording sessions were conducted in a double-walled,
sound-proof chamber (IAC-1024). The interior of the chamber was covered
by 3-in acoustic absorption foam (Sonex, Illbruck). The experimental procedures were approved by the Animal Care and Use Committee at The
Johns Hopkins University.
Densely positioned recording holes were made covering the auditory cortex. The data presented were mainly obtained from the primary auditory cortex (A1) and may include a few neurons from the immediately adjacent areas that responded to sAM and/or sFM stimuli. The location of A1 was determined by its tonotopic organization, its relationship to the lateral belt area (which was more responsive to noises than tones), and by its response properties (e.g., highly responsive to tonal stimuli). Electrode penetrations, perpendicular to the cortical surface, were made within each recording hole under visual guidance via an operating microscope. This gave good control and estimates of recording depths. Single-units were encountered at all cortical layers, but the majority of the recorded units was from upper layers, judging by the depths and response characteristics. On average, one to three well-isolated single-units were studied in each daily session. A representative example of raw recordings is shown in Fig. 1. Signal-to-noise ratio was typically >10:1 in our recordings. Spike waveforms were filtered, digitized, and detected using a template-matching discriminator (MSD, Alpha-Omega Engineering) and were closely and constantly monitored by an experimenter as the recording proceeded. The template matching method prevented any unwanted noises (e.g., due to the animal's movement) from triggering false spikes.
|
Acoustic stimuli
Two types of temporally modulated sounds were used as the
experimental stimuli in this study: sAM and sFM sounds. For sAM stimuli, the carrier frequency, set at a unit's characteristic frequency (CF), was held constant while its amplitude was modulated by
a sinusoid. For sFM sounds, the amplitude remained constant while the
carrier frequency, centered at a unit's CF, was sinusoidally modulated. Acoustic stimuli were delivered in free-field through a
loudspeaker located ~1 meter in front of the animal. Frequency tuning
was obtained using a series of randomly presented tone bursts (50-100
ms in duration), from which the CF of a unit was determined as the
frequency at which the strongest discharge rate was evoked. A
rate-level function was obtained at CF. Most units in the awake
auditory cortex exhibited nonmonotonic discharge rate versus sound
level functions (Pfingst and O'Connor 1981
; Wang
et al. 1999
), for which a preferred sound level can be
determined. After these initial characterizations, multiple repetitions
of sAM and sFM stimuli were delivered at the preferred sound level for
nonmonotonic units (or 30 dB above threshold if otherwise). In a subset
of units, sAM and/or sFM stimuli were tested at multiple sound levels.
Several stimulus parameters were varied to probe neural responses. For
every unit included in the analysis, modulation frequency was typically
varied between 1 and 512 Hz in a base-2 logarithmic scale; finer steps
were used in testing some units. The modulation depth of sAM stimuli
was generally set at 100%; in a subset of sampled units, a range of
depth (0-100%) was tested. The modulation depth of sFM stimuli was
usually set at an optimal (in terms of maximum firing rate) or a nearly
optimal depth centered at a unit's CF. Multiple FM depths were tested
in some units. Unlike sAM stimuli, the firing rate resulting from a sFM
stimulus was not monotonically related to modulation depth. The optimal depth, at which the maximum firing rate was achieved, varied from unit
to unit. It was therefore not feasible to choose a fixed FM modulation
depth for all units. Using the optimal FM depth for a unit allowed
measurement of modulation selectivity to be made at its maximum
discharge level, thus increasing the robustness of the measurement.
Because firing rates of a unit at a given modulation frequency
generally increased with increasing AM depth for a sAM stimulus, it was
possible to used a fixed AM depth (100%) to test all units. The
duration of each sAM and sFM stimulus was 1,000 ms. Neural activities
prior to and following stimulus presentation were also recorded to
estimate spontaneous discharges and to reveal any long-lasting effects.
Ten to 20 repetitions of a sAM or sFM stimulus were presented at each
modulation frequency at a given sound level. Stimuli of all modulation
frequencies were presented randomly. Inter-stimulus intervals were >1
s. Stimuli were synthesized at 100-kHz sampling rate and low-pass
filtered at 50 kHz. The spike times were digitized at 50-kHz sampling rate.
Data analysis
The results reported were based on 211 single-units recorded
from the left hemispheres of three awake marmoset monkeys. Responses to
sAM and sFM stimuli were recorded in 200 and 142 units, respectively. Some units responded to both types of stimuli and others responded to
either sAM or sFM stimuli. All units were recorded indiscriminately, provided they could be driven by either sAM or sFM stimuli. Given the
diversity of cortical responses in the awake preparation, we did not
expect all recorded units to respond to both types of stimuli.
Discharge rates of the majority of units varied with changing
modulation frequency. We separated sAM and sFM responses, respectively,
into two groups in the analyses according to the characteristics of the
discharge rate versus modulation frequency profile of a unit. A rate
profile was considered having a "band-pass" shape if its peak value
was higher than values at both the lower and higher frequency sides.
Some units with band-pass rate profiles also exhibited increased
discharge rates with increasing modulation frequency at frequencies
much higher than the frequency corresponding to the peak of the rate
profile. This portion of responses was not included in the analysis of
rate-based modulation selectivity because they were more likely to be
influenced by spectral effects at such high modulation frequencies.
Responses with band-pass rate profiles were further screened on the
basis of a d' value, d' = |µRd
µRs|/
Rs, where
µRd is the mean discharge rate during a
stimulus presentation, µRs is the mean
spontaneous discharge rate, and
Rs is the SD
of the spontaneous discharge rate. Values of d' were
calculated for responses to each modulation frequency tested. If a unit
had a band-pass rate profile and a maximum d' value
1.0,
it was classified into the band-pass (BP) group. The rest of the units
were referred to as the nonband-pass (non-BP) group, which included
units whose maximum d' values were <1.0 (indicating weak
responses to sAM or sFM stimuli) as well as those units whose rate
profiles did not exhibit "band-pass" shapes. A unit belonged to
either the BP or non-BP group. This classification process was
separately applied to sAM and sFM responses. For sAM responses, 146/200
(73%) units belonged to the BP group and 54/200 (27%) units belonged
to the non-BP group. For sFM responses, the numbers were 93/142 (65%,
BP group) and 49/142 (35%, non-BP group), respectively. Units in the
BP group responded to the change in modulation frequency by
their firing rates and were analyzed for discharge rate-based
modulation selectivity. Units in both BP and non-BP groups were
analyzed for discharge synchrony-based modulation selectivity.
Population averages were computed for each group of units as well as
for all the units.
Unless specified, average discharge rates were calculated over a window
including the stimulus duration and 100 ms after stimulus offset:
(tonset,
toffset + 100 ms), where
tonset and
toffset are stimulus onset and offset
times. Spontaneous discharge rates were estimated from activities prior
to stimulus onset (500 ms) and subtracted from raw discharge rates.
Several response measures, described in the following text, were used
to quantify the modulation selectivity of cortical neurons. Comparison
between response measures resulting from sAM and sFM stimuli were made
between populations of units that responded to each stimulus type as
well as on a unit-by-unit basis in individual units in which both
measures could be obtained. Statistical comparisons between
distributions of response measures were made using the Wilcoxon
rank-sum test (Rice 1988
). Unit-by-unit comparisons
between response measures recorded from the same unit were made using
the paired t-test. P < 0.01 was considered statistically
significant for these analyses.
RATE MODULATION TRANSFER FUNCTION (rMTF). The relationship between average discharge rate and modulation frequency is referred to as the discharge rate-based modulation transfer function (rMTF). The discharge rate-based best modulation frequency (rBMF) was calculated in the following steps for each unit belonging to the BP-group. For units with peaks in their rMTFs that consisted of two or more points, an estimate of rBMF was first obtained as the modulation frequency corresponding to the largest discharge rate of a rMTF. The rBMF was then calculated by weighting those modulation frequencies that were continuous and adjacent to the estimated rBMF and whose discharge rates were not significantly different from the estimated rBMF (P > 0.05, Wilcoxon rank-sum test). A geometric mean was used to average the modulation frequencies by their discharge rates. The rBMF would be equal to the estimated rBMF if discharge rates at other frequencies were all significantly different from that at the estimated rBMF. This method has an advantage over simply assigning the rBMF to one of tested modulation frequencies corresponding to the largest discharge rate, as commonly used in most previous studies. For example, a rMTF that had a broad peak centered on two modulation frequencies with similar discharge rates would have a calculated rBMF near their geometric mean but weighted closer to the modulation frequency that produced a stronger discharge.
To calculate the half-height bandwidth (BW) of a rMTF, the initial estimate of the rBMF and the corresponding discharge rate were used as the reference. Two points on the rMTF curve, each on one side of the estimated rBMF, that were nearest to half the discharge rate at the estimated rBMF were interpolated linearly from the tested modulation frequencies above and below the half-discharge rate. The distance between the modulation frequencies corresponding to these two points was the BW of a rMTF. A Q-measure, defined as rBMF/BW, was used to quantify the sharpness of tuning of a rMTF.SYNCHRONY MODULATION TRANSFER FUNCTION (tMTF).
Stimulus-synchronized discharges were characterized first by the
vector strength (VS) (Goldberg and Brown 1969
) and then
converted to the Rayleigh statistics
(2nVS2, where n is the
total number of spikes) (Mardia and Jupp 2000
) to assess
their statistical significance. VS was calculated with a time window
beginning 100 ms after stimulus onset to the end of the stimulus. The
values of the Rayleigh measure >13.8 were considered as statistically
significant (P < 0.001) (Mardia and Jupp
2000
). The relationship between Rayleigh statistics and
modulation frequency was referred to as the temporal modulation
transfer function (tMTF). In some units, tMTF had a band-pass
shape. A response measure called discharge synchrony-based best
modulation frequency (tBMF) was calculated from each tMTF. For
units whose tMTFs consisted of two or more significant values (Rayleigh
statistic > 13.8), the tBMF was obtained by weighting modulation
frequencies with significant VS that were adjacent, continuous and
surrounding the modulation frequency corresponding to the maximum VS. A
geometric mean was used to average the modulation frequencies by their
VS. To calculate maximum synchronization frequency
(fmax), we first determined the
highest modulation frequency at which significant discharge synchrony
was found. A linear interpolation was made between this frequency and
the adjacent, higher, tested modulation frequency with nonsignificant
Rayleigh statistics. The modulation frequency where the interpolated
Rayleigh statistic line crossed 13.8 was taken to be
fmax.
CALCULATION OF THE DOUBLING OF SYNCHRONIZATION FREQUENCY. A subset of units exhibited discharge patterns that were synchronized to twice the modulation frequency (2fm) under certain stimulus conditions. For such units, tMTFs based on Rayleigh statistics were calculated at fm and 2fm, respectively [i.e., 2n(VSfm)2 and 2n(VS2fm)2]. A unit was considered having the doubling of synchronization frequency if the peak of the tMTF(2fm) was higher than the magnitude of the tMTF(fm) at the corresponding modulation frequency. For this type of unit, fmax based on both tMTF(fm) and tMTF(2fm) were computed in the same manner as described in the preceding text.
MINIMUM RESPONSE LATENCY. Minimum response latency to sAM or sFM stimuli was determined, respectively, on the basis of a composite PSTH of a unit's responses at all tested modulation frequencies. A cumulative post-stimulus histogram (PSTH) was then constructed by integrating the PSTH over time. The time after stimulus onset at which the spike count in a bin of the cumulative PSTH exceeded twice the largest SD of spike counts prior to stimulus onset was calculated as the minimum response latency. The binwidth used in the calculation was 1 ms. The minimum response latency defined here could be different from the first spike latency measured from CF tones in each unit. We used this latency measure instead of the first spike latency because it was a more direct indicator of onset timing of a neuron in response to sAM or sFM stimuli. The first spike latency was not always available as some neurons did not respond well to unmodulated CF tones.
| |
RESULTS |
|---|
|
|
|---|
General observations
The majority of the sampled neurons responded to CF tones as well sAM or sFM stimuli. However, neurons generally responded more strongly to sAM and sFM stimuli, often with sustained firing, than to tones as judged by number of spikes evoked over the duration of the sAM and sFM stimuli (1 s). Some of the neurons could only be driven by sAM or sFM stimuli with proper parameters (e.g., modulation frequency and depth). Representative examples of responses to sAM and sFM stimuli are shown in Fig. 2. Overall discharge patterns can be seen from the dot raster and PSTH. Temporal discharge patterns are further illustrated by period histograms computed from the same group of units in Fig. 3. These examples show that responses of units generally varied as a function of modulation frequency. They also show that some, but not all, recorded units exhibited stimulus-synchronized discharges at low modulation frequencies that gradually disappeared with increasing modulation frequency (Fig. 2, A-C). Responses to sAM and sFM stimuli often diminished at high modulation frequencies for the frequency range tested (Fig. 2Db). Not all units responded at low modulation frequencies (Fig. 2, Da and Ea). In general, sustained discharges were limited to a narrower range of modulation frequencies than onset discharges (Fig. 2, D and E). These and other response characteristics are quantitatively analyzed in the following sections.
|
|
Discharge rate-based modulation frequency selectivity
PROPERTY OF INDIVIDUAL NEURONS. The majority of recorded units displayed selectivity for a particular modulation frequency when measured by average discharge rate. Figure 2 shows responses to both sAM (a) and sFM (b) stimuli recorded from the same units across a range of modulation frequencies. rMTF from all five units shown in Fig. 2 are plotted in Fig. 4. For example, the unit shown in Fig. 2C responded most strongly near 32-Hz modulation frequency. PSTHs in Fig. 2C show sustained firing at a modulation frequency of 32 Hz for both sAM (Fig. 2Ca) and sFM (Fig. 2Cb) stimuli. rMTFs produced by sAM and sFM stimuli had similar band-pass shapes and peaked at a modulation frequency of 32 Hz (Fig. 4C). The peak in a rMTF is conventionally referred to as the rBMF. In this study, we used a quantitative method to calculate the rBMF instead of using a particular tested modulation frequency (see METHODS). Other examples in Fig. 2 illustrate the typical range of modulation frequency selectivity observed in the recorded units. A prominent feature in these examples, representative of our large samples, is the sustained firing for the entire stimulus duration at modulation frequencies near rBMF (Fig. 2, D and E). Neurons generally responded more weakly at modulation frequencies lower than rBMF. In some cases, there were no responses or only brief onset responses at these lower modulation frequencies (e.g., Fig. 2E). The disappearance of sustained discharges at modulation frequencies higher than rBMF was commonly observed (e.g., Fig. 2, B-E). The lack of responses at high modulation frequencies appeared to result from inhibition in many cases (e.g., Fig. 2, C and D). In general, rMTFs derived from responses of a unit to sAM and sFM stimuli had similar shapes and closely matched rBMF.
|
POPULATION PROPERTIES. Using the procedures described in METHODS, we have computed rBMF from the units that responded reliably to sAM and/or sFM stimuli (see METHODS). Figure 5A shows distributions of rBMFsAM and rBMFsFM, respectively. Both distributions are centered between 16 and 32 Hz (rBMFsAM: median = 22.6 Hz, rBMFsFM: median = 18.1 Hz, see Table 1) and are statistically indistinguishable (Wilcoxon rank-sum test, P = 0.1). Among the 211 single-units that we studied, rBMFsAM could be determined in 146 units (69%), whereas as rBMFsFM could be determined in 93 units (44%). Figure 5B shows the relationship between rBMFsAM and rBMFsFM in 76 units where both were determined in the same unit. rBMFsAM and rBMFsFM were highly correlated (correlation coefficient r = 0.7, Table 1). A paired t-test showed that there was no significant difference (P = 0.02) between rBMFsAM and rBMFsFM when compared on a unit-by-unit basis. A closer examination revealed that units in the upper 50th percentile of rBMFsAM (rBMFsAM > median rBMFsAM = 20.7 Hz) appeared to have significantly higher rBMFsAM than rBMFsFM values (paired t-test, P < 0.01). There was no statistically significant difference (paired t-test, P = 0.09) between rBMFsAM and rBMFsFM for the units in the lower 50th percentile of rBMFsAM (Table 1). The distribution of the difference between rBMFsAM and rBMFsFM pairs is shown in Fig. 5C. The close match between rBMFsAM and rBMFsFM was found in a substantial proportion of this population of units. The median of the distribution was 0.46 octaves, with 50 of 76 units (66%) having closely matched rBMFsAM and rBMFsFM (differences within 1.0 octave). These data show that there was a great degree of similarity in the responses to sAM and sFM stimuli, as reflected in mean firing rate, both at the level of single neurons and the level of populations of neurons in the auditory cortex. In general, the match between the rBMFsAM and rBMFsFM was independent of whether discharges were synchronized to the modulation waveform or not.
|
|
BANDWIDTH OF rMTF. In Fig. 6 we analyzed and compared half-height BW and sharpness of tuning (Q) of rMTF for both sAM and sFM stimuli (see METHODS). The distributions of BW across populations of the neurons were similar for both types of stimuli (Fig. 6A) and were not statistically different (Wilcoxon rank-sum test, P = 0.16). The distributions of Q values (Fig. 6C) were also similar between sAM and sFM stimuli (Wilcoxon rank-sum test, P = 0.77). The medians of BW distributions were between 32 and 64 Hz (BWsAM: 53.9 Hz, BWsFM: 49.3 Hz, see Table 1), approximately one octave greater than the medians of rBMF (Fig. 5), which resulted in Q distributions centered around 0.5 octaves (QsAM: 0.45, QsFM: 0.47). In a subpopulation of units, BW and Q could be measured for both sAM and sFM stimuli in the same units. Figure 6, B and D, shows that, when compared on a unit-by-unit basis, the two measures of the tuning width of rMTF were not statistically different (paired t-test, BW: P = 0.27, Q: P = 0.1) between sAM and sFM stimuli (Table 1). These results showed that in addition to the similarity in rBMF, there was also a similarity in the sharpness of tuning of rMTF produced by sAM and sFM stimuli, both at the level of single neurons and across populations of neurons.
|
Discharge synchrony-based modulation frequency selectivity
PROPERTY OF INDIVIDUAL NEURONS.
The examples given in Figs. 2 and 3 also show that discharges of
cortical neurons in response to sAM and sFM stimuli could exhibit
stimulus-synchronized temporal patterns. Period histograms shown in
Fig. 3, A-E, corresponding to the units shown in Fig. 2,
A-E, further illustrate temporal discharge patterns evoked by the modulated sounds. Discharges synchronized to modulation waveform
should show peaks in both of the two periods plotted (Fig. 3).
Discharges registered in the first but not the second period of a
histogram indicate that they were synchronized to stimulus onset but
not to the modulation waveform. The unit in Fig. 3B
responded to sAM stimuli with well-synchronized discharges at
modulation frequencies
128 Hz as can been seen from the period histogram. The phase delay increased with increasing modulation frequency. At a 32-Hz modulation frequency, discharges from preceding period began to appear (Fig. 3Ba). The temporal discharge
patterns in response to sFM stimuli (Fig. 3Bb) in the same
unit differed markedly from those to sAM stimuli (Fig. 3Ba)
in that there were two clusters of firings within each modulation
period. This was because both the upward and downward trajectory of the
modulation waveform excited this unit. In general, when sAM stimuli
were used, discharges could be synchronized at a rate approximately equal to the modulation frequency, whereas for sFM stimuli, response synchronization could occur at a rate twice as large as the modulation frequency. Moreover, stimulus-induced synchronization was sometimes produced by one type of the modulated sounds but not by another type in
an individual unit. For example, the unit in Figs. 2D and
3D responded with synchronized discharges to sFM but not sAM stimuli.
128 Hz for sAM stimuli and
were strongest at 16 Hz (Fig. 7B). This peak in tMTF has
traditionally been referred to as tBMF. tBMF was quantitatively
determined in this study using a weighting method (see
METHODS). Because this unit apparently responded to both
the upward and downward trajectory of the sFM stimuli, the Rayleigh
statistics calculated based on the modulation frequency of the sFM
stimuli had small values (Fig. 7B). In contrast, Rayleigh
statistics had high values when calculated based on twice the
modulation frequency. This indicated that the periodicity in the sFM
stimuli was not accurately represented by temporal discharge patterns
of this type of response. Despite the different temporal firing
patterns produced by sAM and sFM stimuli, the average discharge rate of
this unit reached the maximum at the modulation frequency of 16 Hz for
both sAM and sFM stimuli (Fig. 4B). For the unit shown in
Fig. 7C, there were no significant stimulus-synchronized
discharges for either sAM or sFM stimuli at a modulation frequency of
32 Hz where mean firing rates reached the maximum for both stimuli
(Fig. 4C). The strongest stimulus-synchronized responses
were observed at 16 Hz for sAM and between 8 and 16 Hz for sFM stimuli
in this unit (Fig. 7C). Additional examples in Fig. 7,
D and E, further demonstrate the lack of
significant stimulus-synchronized discharges at modulation frequencies
where the units discharged maximally as judged by mean firing rate
(Fig. 4, D and E).
|
POPULATION PROPERTIES. Figure 8A shows distributions of tBMFsAM and tBMFsFM that were analyzed based on calculations at the modulation frequency. The two distributions were statistically indistinguishable (Wilcoxon rank-sum test, P = 0.82). An important property is that the median tBMF of the population is 9.6 Hz for sAM stimuli and 10.0 Hz for sFM stimuli, respectively, which are nearly one octave lower than their counterparts derived from rMTF (median rBMFsAM: 22.6 Hz, median rBMFsFM: 18.1 Hz, see Table 1). Direct comparison between tBMFsAM and tBMFsFM in the same unit is shown in Fig. 8, B and C. Similar to the discharge rate-based analysis, a large proportion of recorded units had closely matched tBMFs produced by sAM and sFM stimuli (Fig. 8B). The difference between tBMFsAM and tBMFsFM was smaller than the difference between rBMFsAM and rBMFsFM (tBMF: median difference 0.21 octave; rBMF: median difference 0.46 octave; see Table 1). A paired t-test showed that there was no significant difference (P = 0.95) between tBMFsAM and tBMFsFM when both were measured in the same units. These data show that both at the level of single neurons and across populations of neurons, there was a large degree of similarity in the preferred modulation frequency as measured by stimulus-synchronized discharges. It should be noted that statistically significant stimulus-synchronized discharges were not detected in a substantial number of units studied (sAM responses: 66/200, 33%; sFM response: 67/142, 47%).
|
LIMIT ON STIMULUS SYNCHRONIZED DISCHARGES. Another important measure of stimulus-synchronized discharges is the maximum synchronization frequency (fmax). This measure indicates the upper limit of stimulus-synchronized discharges in each unit, whereas tBMF defines the modulation frequency at which the strongest discharge synchronization could be induced. Figure 9 shows the distributions of fmax for both sAM and sFM stimuli, respectively. The two distributions were statistically indistinguishable (Wilcoxon rank-sum test, P = 0.97) and had medians of 34.2 Hz (sAM) and 39.4 Hz (sFM), respectively, which were much higher than their counterparts in tBMF (median tBMFsAM: 9.6 Hz, median tBMFsFM: 10.0 Hz, see Table 1). Figure 9B shows the cumulative distributions of fmax for both types of stimuli, which characterizes the upper boundary of stimulus-synchronized activities for the population of recorded units. These curves show how well the A1, as a whole, can represent temporal modulations by temporal discharge patterns. The cumulative distributions of fmax for both sAM and sFM stimuli are nearly identical (Fig. 9B), indicating the similarity in stimulus-synchronized discharges that resulted from these two classes of stimuli. The curves have low-pass shapes and begin to drop more rapidly above ~16 Hz. The medians of cumulative fmax distributions are between 32 Hz and 64 Hz for responses to both types of stimuli (Fig. 9B). There were <10% of units that were able to synchronize to modulation waveform at 256 Hz. Further comparison on a unit-by-unit basis between fmax measured from sAM and sFM responses is shown in Fig. 9, C and D. There was a high degree of correlation between fmax in individual units as well (r = 0.7, Fig. 9C). Many units had closely matched fmax (Fig. 9D). A paired t-test showed that there was no significant difference (P = 0.11) between fmax for sAM and sFM stimuli when both were measured in the same units.
|
DOUBLING OF SYNCHRONIZATION FREQUENCY. As the example in Figs. 2B and 7B showed, some units exhibited discharge patterns that were synchronized to twice the modulation frequency (2fm). This was more commonly observed in the responses of sFM stimuli when the frequency component of a stimulus shifted into and out of a unit's excitatory response area during each modulation cycle. In these cases, the synchronization index calculated at 2fm was greater than that calculated at fm (e.g., Fig. 7B). Figure 10 shows the analysis of such cases for both classes of stimuli. There were 36 units (36/93, ~39% of samples) that exhibited synchronization frequency doubling due to sFM stimuli. In contrast, only a small number of units (7/146, ~5% of samples) were found to show this property with sAM stimuli. In the latter case, the doubling was likely caused by on and off responses to each modulation cycle. For most of the units shown in Fig. 10, fmax can be measured using either fm or 2fm. A higher fmax value was obtained for most of these units when 2fm was used in the calculation (Fig. 10). In four units, fmax could only be measured using 2fm but not fm in their responses to sAM stimuli, indicating a nearly complete doubling of synchronization frequency (Fig. 10, larger pluses). The doubling of the synchronization frequency did not result in a significant shift of the fmax distribution when the entire population of neurons was considered.
|
Comparison between rate- and synchrony-based modulation frequency selectivity
As illustrated in Figs. 4 and 7, neurons were typically tuned to a higher modulation frequency when measured by average discharge rate than by synchronized discharges. In Fig. 11, we compared rate- and synchrony-based modulation frequency selectivity on a unit-by-unit basis when both rBMF and tBMF could be measured in the same units. For the vast majority of units, rBMF was greater than tBMF, for both sAM (Fig. 11A) and sFM (Fig. 11B) stimuli. On average, rBMF is more than twice higher than tBMF (Table 2). This means that the average discharge rate reaches the maximum at modulation frequencies as high as where the strongest stimulus-synchronized discharges could be observed. The difference between rBMF and tBMF is statistically significant (sAM: paired t-test, P < 0.001; sFM: paired t-test, P < 0.001; Table 2). The correlation coefficient between rBMF and tBMF was small (sAM: 0.31, sFM: 0.09; Table 2). In some units, fmax was found at frequencies higher than rBMF as shown in Fig. 11, C and D. Direct comparisons showed that fmax did not differ significantly from rBMF for sAM responses (paired t-test, P = 0.03, Table 2), although a significant difference was found for sFM responses (paired t-test, P <0.01, Table 2). Correlation between fmax and rBMF was poor (sAM: 0.20, sFM: 0.01; Table 2). These comparisons showed that rBMF, a discharge rate-based measurement, of a unit was not significantly correlated with discharge synchrony-based measurements (tBMF and fmax). In contrast, tBMF and fmax were highly correlated while they differed significantly (sAM: paired t-test, P < 0.001, r = 0.65; sFM: paired t-test, P < 0.001, r = 0.70; Table 2). fmax is more than three times higher than tBMF (Fig. 11, E and F, Table 2). The distributions in Fig. 11 again showed the similarity between responses to sAM and sFM stimuli.
|
|
Comparisons between neural populations
In Fig. 12, rate- and
synchrony-based response measures were compared between different
populations of units. The recorded units were partitioned into BP and
non-BP groups in our analyses (see METHODS). Averaged
discharge rates are plotted versus modulation frequency for both of
these groups as well as for all units in Fig. 12, A (sAM)
and B (sFM). The population-averaged discharge rate profiles
of the BP-group showed a maximum between 16 and 32 Hz of modulation
frequency (Fig. 12, A, and B, 

),
similar to that observed in the distributions of rBMF measured from
individual units (Fig. 5A). This feature can also be seen
when the responses of the entire population are averaged (Fig. 12,
A and B, - - -). These observations indicate
that not only were there more units tuned to modulation frequencies in
the range of 16-32 Hz, the A1 responded collectively more strongly to
this range of modulation frequencies than to lower or higher modulation
frequencies. The profiles of the non-BP group, however, were flat
between 4 and 64 Hz and showed an increase in discharge rate at higher
modulation frequencies (Fig. 12, A and B,
-
).
|
Figure 12, C and D, showed the proportion of units that exhibited statistically significant Rayleigh statistic at each modulation frequency for the three groups of units. The highest percentages of units with significant synchronized discharges were between 4- and 16-Hz modulation frequency and were centered near 8 Hz (sAM: 48%, sFM: 32%), consistent with the distribution of tBMF of individual units (Fig. 8A). Less than half of all sampled units showed stimulus-synchronized discharges at any tested modulation frequency. These profiles reflect the overall strength of stimulus-synchronized discharges across modulation frequency that are evoked by sAM and sFM stimuli. The low percentages of units with synchronized discharges in the non-BP group were partially explained by the relatively low response magnitudes of these units (Fig. 12, A and B). Data in Fig. 12 demonstrate that the units belonging to the BP group carry far more information than units of the non-BP group in terms of both rate- and synchrony based representations of modulation frequency.
Dependency of modulation-frequency selectivity on stimulus parameters
DEPENDENCE OF MTF ON SOUND LEVEL.
Rate-level functions were nonmonotonic for narrowband stimuli (e.g.,
pure and modulated tones) in most units that we studied in awake
marmosets (Wang et al. 1999
). We tested in a subset of units the dependence of modulation selectivity on sound level and found
that the shape of rMTF and, to a lesser extent, the shape of tBMF was
relatively invariant across supra-threshold sound levels. Figure
13, A and B,
shows examples of sAM responses from two units. In each case, while
changing sound level resulted in changes in firing rates, both rBMF and
tBMF remained largely unchanged across a wide range of sound levels
(Fig. 13, A and B). Notice that the rate-level
relationship was nonmonotonic for the units shown in Fig. 13,
A and B. Figure 13C shows an analysis
of sound-level dependence of rBMF (Fig. 13Ca) and tBMF (Fig.
13Cb) over a population of units. For most of the units
tested, changing sound level did not result in large changes in rBMF or
tBMF. Similar observations were obtained with sFM responses as shown in
Fig. 14, in individual examples (Fig.
14, A and B) as well as in a population analysis
(Fig.14C). The unit shown in Fig. 14B had large
changes in tMTF but much smaller changes in rMTF across sound levels. In general, the shape of rMTF, and consequently rBMF, tended to be more
resistant to changes in sound level than did tMTF and tBMF. These data
showed that although changing sound level may change the peak firing
rate and the sharpness of tuning to the preferred modulation frequency,
it usually did not result in significant shift of MTF. There appeared
to be, however, greater changes due to sound level in sFM responses
than in sAM responses.
|
|
DEPENDENCE OF MTF ON MODULATION DEPTH. Modulation depth was another important parameter that affected the responsiveness of a unit to modulated sounds. In general, increasing modulation depth of a sAM stimulus at or near rBMF always led to a monotonic change in discharge rate (increased or saturated). For the unit shown in Fig. 15A, the maximum firing rate of the rMTF increased from ~13 to ~38 spikes/s as modulation depth of the sAM stimuli was increased from 50 to 100%. However, the rBMF remained unchanged near 32 Hz. This was also true for the synchronization measure, with tBMF near 16 Hz (Fig. 15Ab). Similar properties can be seen in another example in Fig. 15B. Figure 15C shows a population analysis that further strengthens these observations regarding responses to sAM stimuli.
|
|
Temporal modulation is essential to drive many cortical neurons
Temporal modulation (in amplitude or frequency) was
essential to drive many units that were unresponsive or only weakly
responsive to pure tones. Figure 17,
A and B, shows two representative examples of a
class of units that required sufficient AM to fire. The unit in Fig.
17A gave only onset discharges at zero modulation depth (equivalent to a CF tone) but gave sustained discharges when the modulation depth of the sAM stimulus was raised to
50%. Figure 17B shows another example that had offset discharges at
modulation depths <70-80% but fired continuously throughout the
stimulus duration at greater modulation depths. Discharge rate versus
modulation depth functions from a group of such units is shown in Fig.
17C. These units had weak or no responses to sAM stimuli at
zero modulation depth, i.e., unmodulated CF tones. Some units did not
respond to pure tones until they were sufficiently modulated in
frequency, as shown by examples in Fig. 17, D-F. In
contrast to sAM stimuli, sFM stimuli generally produced maximum firing
rate at a particular modulation depth in each unit. Increasing
modulation depth beyond this optimal depth could result in a reduction
in firing rate (Fig. 17F), presumably due to the recruitment
of flanking inhibitions by expanded spectral side bands.
|
Evidence of inhibition in shaping modulation frequency selectivity
Inhibition appeared to play a role in limiting the range of modulation frequency that a unit would respond and thus contributed to the observed modulation frequency selectivity. In the examples shown in Fig. 2, C and D, the responses of the units were clearly inhibited at some modulation frequencies higher than rBMF for both sAM and sFM stimuli. In some cases, inhibitory effects were observed at modulation frequencies below rBMF (e.g., Figs. 2Da and 4D). In general, inhibition was usually observed at both extremes of a rMTF and was often stronger at the high-frequency side. However, inhibition could only be detected in units that had spontaneous discharges. For the units that had nearly zero spontaneous rate (e.g., Fig. 2B) one cannot be sure that the disappearance of discharges after onset responses at a modulation frequency of 512 Hz was due to inhibition or not. In Fig. 18A, we analyzed the percentage of units that showed significant inhibition below spontaneous discharge rate in their sAM responses at each tested modulation frequency. The percentage of units with inhibition was smallest (~2%) near 32-Hz modulation frequency, which was consistent with the observation that there were more units with rBMF near this modulation frequency than anywhere else (Fig. 5A). The largest percentage of units with inhibition was at the highest modulation frequency tested, 512 Hz (~13%, Fig. 18A). There were fewer units with inhibition at the lowest modulation frequency tested, 4 Hz (~6%). Figure 18B shows the population-averaged mean firing rate, in response to sAM stimuli, as a function of modulation frequency, based on rMTF from individual units that were aligned by their rBMF. At the highest and lowest modulation frequencies tested, both the mean rate and 1 SD above the mean rate fell below the spontaneous activity level (Fig. 18B). This indicated that there was a stronger presence of inhibition at these modulation frequencies. The shapes of the distributions shown in Fig. 18 are reminiscent of the shape of rBMF distributions in Fig. 5A, suggesting that inhibition may play a role in shaping overall modulation frequency selectivity observed in A1.
|
Relationship between modulation-frequency selectivity and response latency and CF
In Fig. 19, all three response measures are plotted, respectively, versus minimum response latency for both sAM and sFM stimuli. The minimum response latency was calculated from a unit's responses to modulated tones across all modulation frequencies tested (see METHODS). There was no clear correlation between rBMF and the latency for both sAM and sFM stimuli (Fig. 19A). rBMF did not differ significantly between units with shorter latencies and those with longer latencies (Wilcoxon rank-sum test, P = 0.03, Table 3). However, there appeared to be a weak inverse relationship between tBMF and the latency (Fig. 19B). Units with higher tBMF tended to have shorter latency, and those with the longest latency (~100 ms) mostly had the lowest tBMF (Fig. 19B). The median tBMF of units with latencies shorter than the median latency was greater than that of the units with latencies longer than the median latency (sAM: 12.5 vs. 7.9 ms, P < 0.001; sFM: 12.0 vs. 8.6 ms, P = 0.018; see Table 3). fmax showed the strongest dependency on the latency (Fig. 19C, sAM: 60.7 vs. 26.7 ms, P < 0.001; sFM: 60.7 vs. 23.3 ms, P < 0.01; see Table 3). The inverse relationship between fmax and the latency was clearer than that observed with tBMF. In contrast, there appeared to be no dependency between the three response measures and the CF (Fig. 20). The medians calculated from units with CF less than the median CF were not statistically different from those from units with CF greater than the median CF (Table 3), suggesting that the observed modulation selectivity is a likely fundamental property of the A1 across the tonotopic axis.
|
|
|
| |
DISCUSSION |
|---|
|
|
|---|
Common temporal processing mechanism
In the reported experiments, temporal modulations were introduced
in the amplitude or frequency domain. While these two classes of
stimuli differed significantly in their spectral contents, they shared
the same modulation waveform: a sinusoid with a particular temporal
modulation frequency that gave rise to the perceived time-varying
property of these sounds. Results presented in this report clearly
demonstrated that there was a high degree of similarity between
cortical responses to sAM and sFM stimuli. Auditory cortical neurons
could be selective to particular modulation frequencies assessed by
their mean firing rate or by discharge synchrony. The selectivity was
similar regardless of whether the temporal modulation was created in
the amplitude or frequency domain, suggesting that it was the temporal
modulation frequency per se, and not the spectral contents, to which
the neurons were selective. Because amplitude and frequency modulations
are produced along different stimulus dimensions, the match between
major aspects of sAM and sFM responses (e.g., rBMF, BW, tBMF, and
fmax) suggests a shared representation of temporal modulations by cortical neurons. Our findings support the notion that there exists a common mechanism in
cortical neurons for extracting temporal profiles from a variety of
complex sounds with different spectral contents (Wang et al. 2001
). In fact, our recent studies further showed that auditory cortical neurons processed the temporal modulation in a similar manner
regardless of whether it was based on tone or noise carriers (Wang et al. 2002
). Together, the evidence suggests that
it is the "temporal modulation," and not the amplitude or frequency modulation per se, that most auditory cortical neurons appear to
extract from a complex acoustic environment.
The similarity between cortical responses to sAM and sFM stimuli has
been noted in some previous studies. Both the studies of
Eggermont (1994)
and that of Gaese and Ostwald
(1995)
found that there was no significant difference between
tBMFsAM and tBMFsFM. Similar mean tBMFs have been reported when nonsinusoidal envelopes were
used to modulate tone carriers (Eggermont 1994
;
Schreiner and Urbas 1988
). Our findings are consistent
with these results in that the BMF of most cortical neurons is
relatively insensitive to the spectral contents with which temporal
modulations are created. However, the range of tBMF, over which the
match between sAM and sFM responses was observed, was higher and more
spread out in A1 under the awake condition as reported here than those
observed in anesthetized animals in previous reports. Furthermore, we
showed, on the basis of well-isolated single-units, that the similarity between cortical responses to sAM and sFM stimuli was also reflected in
other response measures such as rBMF, BW, Q, and
fmax.
Rate vs. temporal coding
While temporally modulated signals have been used in a number of
previous studies of the auditory cortex in various species, some
fundamental questions remain largely unanswered. An important issue is
the role of rate coding in neural representation of temporally modulated signals in the auditory cortex. Previous studies have focused
mainly on analysis of stimulus-synchronized temporal discharge patterns
that are found throughout the auditory system. Under commonly used
anesthetized conditions, there are very limited stimulus-driven
sustained discharges in the auditory cortex to support rate-coding
schemes. As the results of this study and other studies (e.g.,
Bieser and Müller-Preuss 1996
) showed, the unanesthetized auditory cortex was highly responsive, and mean firing
rate clearly demonstrated feature selectivity. The selectivity for a
particular temporal modulation frequency in terms of average discharge
rate in the absence of stimulus-synchronized discharge patterns was
often observed in our recordings from the unanesthetized auditory
cortex and represented a temporal-to-rate transformation by cortical
neurons. The results from the present study suggest that the discharge
rate-based temporal modulation selectivity may be involved in
processing the finer temporal structures of complex sounds represented
by higher temporal modulation frequencies that are not adequately
encoded by stimulus-synchronized discharge patterns of cortical
neurons. These findings provide further support for a two-stage
temporal processing model we have proposed on the basis of cortical
responses to sequential stimuli (Lu et al. 2001b
).
Bieser and Müller-Preuss (1996)
suggested that
high AM frequencies may be encoded in a rate code. We have further
extended this notion by showing that the rate-coding is likely needed
for encoding temporal modulations at high frequencies, in
both the amplitude and frequency domain. In both the present study and that of Bieser and Müller-Preuss (1996)
, the
carrier frequency of a sAM stimulus was set at a unit's CF. The notion
of rate coding supported by these studies is conceptually different
from the one suggested by Schulze and Langner (1997)
. In
that study, it was shown that the firing rate of a cortical neuron
could be influenced by high-frequency AM introduced at a carrier
frequency far away from a unit's CF (Schulze and Langner
1997
), a phenomenon that is generally interpreted as a spectral
rather than temporal effect.
Modulation selectivity and onset timing
The findings in this study also bear some relationship to previous
studies of onset response timing to acoustic stimuli as the ability of
cortical neurons to represent periodic signals may be partially
correlated with their minimum spike latency (e.g., Fig. 19). Although
cortical neurons can show almost the same preciseness in their first
spike latency, albeit with longer mean latencies, compared with the
auditory-nerve (Heil 1997
; Heil and Irvine
1997
; Phillips and Hall 1990
), they have a much
diminished capacity to synchronize to rapidly occurring sequential
acoustic events, such as the modulation cycles in the sAM and sFM
stimuli. There is an important difference between the response to
stimulus onset and that of a sustained periodic signal. In the latter
case, the neuron may not have time to "reset" before the next
acoustic event. Thus any long-lasting effects could contribute to the
next response. It is possible that the peak in the rMTF or tMTF could
be partially a result of a limited window of integration of responses
to successive stimulation. Our data indicated that there were major
differences between onset and sustained portions of the response (e.g.,
Figs. 2 and 17). We found that the bandwidth of rMTFs was generally
narrower when considering mainly the sustained part of the response
(e.g., Fig. 2E). The majority of the existing
electrophysiological studies that investigated onset responses
(including Heil and Irvine 1997
; Phillips and
Hall 1990
) were typically conducted in anesthetized conditions
where the responses were largely described as phasic. The onset
responses characterized under such experimental conditions may not
fully account for the variety of responses
especially rate-based,
nonstimulus-synchronized responses
under unanesthetized conditions
such as those reported here. For example, in the auditory cortex of
awake animals, some cortical neurons do not exhibit well-timed onset
responses (e.g., Fig. 2C).
Comparison with previous studies
Most of the previous single- or multi-unit studies of cortical
representations of sinusoidal amplitude and frequency modulations were
conducted using anesthetized animals with a few exceptions (Bieser 1998
; Bieser and Müller-Preuss
1996
; Creutzfeldt et al. 1980
). Under
anesthetized conditions, phasic responses are the typical discharge
patterns in the auditory cortex, even for responses to complex
vocalizations. A distinct characteristic of neural responses in
unanesthetized auditory cortex is sustained firing, a property that
stands in sharp contrast to barbiturate-anesthetized cortex. It has
long been recognized that anesthetics have profound side effects on the
response properties of the auditory cortex, more so than other sensory
cortices, possibly because of longer pathways leading to the cortex
from the auditory periphery. Goldstein et al. (1959)
showed that evoked cortical potentials could synchronize to higher
repetition frequencies of clicks in the unanesthetized condition than
in the anesthetized condition. Our recent studies showed that
stimulus-following rates in A1 were higher in the awake marmosets than
in anesthetized cats for click train stimuli (Lu and Wang
2000
; Lu et al. 2001b
).
The tBMFs observed in our study are higher than what has been observed
in most studies of anesthetized animals and are comparable to other
studies of A1 neurons in awake animals. Mean tBMFs obtained with sAM
and sFM stimuli were 5.4 ± 3.4 and 5.9 ± 5.2 Hz,
respectively, in the A1 of ketamine-anesthetized cats (Eggermont
1994
) and ~10 Hz in the A1 of rats anesthetized by
pentobarbital-based Equithesin (Gaese and Ostwald 1995
).
In both studies, tungsten microelectrodes were used and statistical
significance of synchronized responses was assessed by Rayleigh test at
P < 0.001, the same significance level used in the
present study. A higher mean tBMF (14.2 ± 8.8 Hz) in A1 was
reported by Schreiner and Urbas (1988)
using sAM and
rectangular AM stimuli. Their recordings, mostly multi-units, were made
using low-impedance carbon fiber microelectrodes in paralyzed cats
infused with pentobarbital anesthetics. The synchronization measure
used in Schreiner and Urbas (1988)
study included
contributions of both the first and second harmonics of the modulation
frequency from the Fourier spectrum. The spectral component
corresponding to the first harmonic of the modulation frequency in
their analysis is equivalent to the vector strength and similar
measures used in the two studies cited above (Eggermont
1994
; Gaese and Ostwald 1995
). Bieser and
Müller-Preuss (1996)
studied cortical responses to sAM
sounds in awake squirrel monkeys and reported a mean tBMF of 17.8 Hz in
A1 with the distribution peaked between 8 and 16 Hz. The response
synchronization in their study was quantified by cross-correlating the
PSTH with the amplitude envelope of the AM stimuli. Multi-unit
activities were recorded using low-impedance tungsten microelectrodes
in this study. The Rayleigh test was not used to assess the statistical
significance of response synchronization in the studies by
Schreiner and Urbas (1988)
and Bieser and
Müller-Preuss (1996)
. The mean tBMF reported in the
present study from awake marmosets is 15.6 ± 21.4 Hz for sAM and
14.2 ± 17.6 Hz for sFM stimuli, respectively (Table 1),
calculated on a linear scale as has been done in the preceding referred studies.
In general, tBMF measured in awake animals has a greater spread (i.e.,
larger SD) than that measured in anesthetized animals, reflecting more
diverse, stimulus-driven temporal discharge patterns. We did not
observe in our data significant spontaneous oscillations as reported by
other studies of anesthetized cortex (e.g., Eggermont 1992
; Gaese and Ostwald 1995
; Lu and Wang
2000
). The BMFs derived from tMTF or rMTF reported in the
present study clearly resulted from stimulus-driven activities. The
lower values of tBMFs observed in anesthetized auditory cortex (<10
Hz) suggested that they may possibly be correlated with cortical
spindle frequencies under anesthesia (Kenmochi and Eggermont
1997
). In fact, Gaese and Ostwald (1995)
reported that tBMFs were significantly correlated with oscillation
frequencies. Our recordings were based on well-isolated single-units
(Fig. 1), obtained using high-impedance tungsten microelectrodes and
were mostly from neurons in upper cortical layers that represent the
output of A1. Most of the recordings in A1 of anesthetized animals were
obtained using low-impedance electrode from middle cortical layers that
receive the bulk of thalamocortical inputs. It is known that neurons in
the medial geniculate body have much higher stimulus-following response
rates (Creutzfeldt et al. 1980
; de Ribaupierre et
al. 1980
).
We observed in the present study that rBMF was higher than tBMF in both
individual neurons and across populations of cortical neurons (Fig.
11). This is consistent with observations made earlier by
Schreiner and Urbas (1988)
in cats and Bieser and
Müller-Preuss (1996)
in squirrel monkeys. Gaese
and Ostwald (1995)
found that most neurons in anesthetized rats
did not display tuned rBMF. The distribution of rBMF (Fig. 5) obtained
in the present study is similar to that reported by Bieser and
Müller-Preuss (1996)
(for sAM stimuli); both are centered
at higher modulation frequencies and are more widely distributed across
modulation frequency than those observed in anesthetized animals. There
is also a significant difference between population-averaged rMTFs
observed in A1 of anesthetized and awake animals. In the study of
Eggermont (1994)
, the population-averaged rMTF has a
peak near ~10 Hz, whereas our data show peaks between 16 and 32 Hz,
about one octave higher (Fig. 12, A and B). This
clearly indicates that the A1 is maximally excited at distinctly
different modulation frequencies under the anesthetized and awake conditions.
Correlation with psychophysics
It has been shown that both humans (Viemeister
1979
) and nonhuman primates (Moody 1994
) are
able to detect AM of a tone or noise carrier. However, like all other
sensory systems, the auditory system has limited time resolution and is
unable to detect amplitude changes that occur too rapidly. The temporal
modulation transfer function is usually measured psychophysically based
on the detection threshold, with respect to modulation depth, as a
function of modulation frequency. The shape of a psychophysical tMTF is
generally low-pass for low-frequency carriers up to ~60 Hz (for 1 kHz
carriers) (Viemeister 1979
), above which the tMTF begins
to increase because sidebands in the frequency spectrum of the stimuli
can then be resolved by the auditory system. As carrier frequency
increases, the cutoff frequency of tMTF increases as well until
100-130 Hz (for >2 kHz carrier frequency) (Kohlrausch et al.
2000
) before spectral effects confound the measurement.
Increased cutoff frequency of tMTF at higher carrier frequencies
reflects increased width of the critical band (Zwicker and Fastl
1999
). The majority of the neurons we studied had their CFs >1
kHz (Fig. 20), with the CF distribution centered near 7 kHz. The
low-pass portion of psychophysical tMTF is considered a consequence of
temporal processing, whereas the high-frequency portion is due to
spectral effects. The cortical responses analyzed in the present study
are likely correlated with the temporal aspect of the psychophysical
tMTF. We did observe neurons, in particular those in the non-BP group,
whose discharges increased with increasing modulation frequency above
~100 Hz (Fig. 12, A and B).
The two measures of discharge synchrony used in this study are tBMF and fmax. The fmax measure should be more closely related to psychophysical tMTF because it reflects a threshold value of discharge synchrony rather than a maximum value as does the tBMF. The presence of discharge synchrony can be a basis for the auditory cortex to temporally discriminate modulated tones at low modulation frequencies versus unmodulated tones with the same carrier frequency. The cumulative distributions of fmax have a low-pass shape for both sAM and sFM stimuli (Fig. 9B) and begin to drop at ~4-Hz modulation frequency. There are <10% of neurons with significant discharge synchrony at modulation frequencies >128 Hz in our samples (Figs. 9B, 12 C and D). These characteristics mirror those of the psychophysical tMTF described in the preceding text. Furthermore, the psychophysical tMTF has been shown to be independent of sound level except at very low intensities. The tBMFs measured in cortical neurons are largely independent of sound level as well (Figs. 13 and 14).
In nearly all of the analyzed response measures, we found similarities
between those obtained using sAM and sFM stimuli. This includes rBMF
(Fig. 5), tBMF (Fig. 8), and fmax
(Fig. 9) as well as BW and Q (Fig. 6). Such a similarity in
cortical coding of these two classes of modulated sounds may underlie
the similarity in the perception of these sounds (Edwards and
Viemeister 1994
; Moore and Sek 1992
). It has
been proposed by a number of psychophysical studies that FM signals are
converted into an AM-equivalent representation in the auditory system.
While both the auditory periphery and brain stem have been suggested as
potential sites of such conversions on the basis of cochlear filters
(Zwicker and Fastl 1999
) and binaural processing
(Saberi and Hafter 1995
), respectively, it is unclear
whether sAM and sFM stimuli like those used in the present study are
similarly represented in these subcortical processing stations in the
manner reported here. The findings of the present study show that the
modulation frequencies embedded in sAM and sFM sounds are extracted and
represented in a strikingly similar way by populations of auditory
cortical neurons. They suggest that there might be a shared temporal
processing mechanism at the level of both single and populations of
cortical neurons.
Another possible psychophysical correlate of
fmax is the lower limit of pitch,
which is defined as the lowest repetition rate that evokes a sensation
of pitch. Krumbholz et al. (2000)
showed a lower limit of
pitch near 30 Hz as did a companion study by Pressnitzer et al.
(2001)
using melody sequences. The distributions of
fmax are centered close to 30 Hz for
the cortical neurons we studied (Fig. 9, Table 1). For modulation
frequencies below fmax, modulation
periods can be resolved temporally because there are significant
discharges synchronized to the modulation periods. Higher modulation
frequencies that correspond psychophysically to the sensation of
roughness (Zwicker and Fastl 1999
), for example, are
likely represented by a rate code instead of by a temporal code because
the rBMF can be as high as 256 Hz in individual neurons (Fig. 5). Our
recent study using click train stimuli suggested rate-coding for even
higher repetition frequencies (Lu et al. 2001b
).
Finally, it would be interesting to know whether the preferred temporal modulation frequencies in the A1 of awake marmoset monkeys are comparable to those of other primate species including humans under unanesthetized conditions.
| |
ACKNOWLEDGMENTS |
|---|
We thank Dr. Ross Snider for contributions to the data acquisition system used in our study, S. Eliades for assistance with stimulus design, A. Pistorio for assistance with animal training and the preparation of the manuscript, and Dr. E. Bartlett and C. DiMattina for comments on the manuscript. Dr. L. Liang was a visiting scientist in the Department of Biomedical Engineering at The Johns Hopkins University School of Medicine during 1998-2000.
This research was supported by National Institute on Deafness and Other Communication Disorders Grant DC-03180 and by a Presidential Early Career Award for Scientists and Engineers (X. Wang).
Present address of L. Liang: Hearing Center, Pear River Hospital of First Medical University, Guangzhou 510282, Guangdong Province, P. R. China.
| |
FOOTNOTES |
|---|
Address for reprint requests: X. Wang, Dept. of Biomedical Engineering, Johns Hopkins University School of Medicine, 720 Rutland Ave., Ross 424, Baltimore, MD 21205 (E-mail: xwang{at}bme.jhu.edu).
Received 11 October 2001; accepted in final form 7 January 2002.
| |
REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
N. Ding and J. Z. Simon Neural Representations of Complex Temporal Modulations in the Human Auditory Cortex J Neurophysiol, November 1, 2009; 102(5): 2731 - 2743. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. F. Brugge, K. V. Nourski, H. Oya, R. A. Reale, H. Kawasaki, M. Steinschneider, and M. A. Howard III Coding of Repetitive Transients by Auditory Cortex on Heschl's Gyrus J Neurophysiol, October 1, 2009; 102(4): 2358 - 2374. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Sadagopan and X. Wang Nonlinear Spectrotemporal Interactions Underlying Selectivity for Complex Sounds in Auditory Cortex J. Neurosci., September 9, 2009; 29(36): 11192 - 11202. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Itatani and G. M. Klump Auditory Streaming of Amplitude-Modulated Sounds in the Songbird Forebrain J Neurophysiol, June 1, 2009; 101(6): 3212 - 3225. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. H. Scott, B. J. Malone, and M. N. Semple Representation of Dynamic Interaural Phase Difference in Auditory Cortex of Awake Rhesus Macaques J Neurophysiol, April 1, 2009; 101(4): 1781 - 1799. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-R. Geis and J. G. G. Borst Intracellular Responses of Neurons in the Mouse Inferior Colliculus to Sinusoidal Amplitude-Modulated Tones J Neurophysiol, April 1, 2009; 101(4): 2002 - 2016. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Remedios, N. K. Logothetis, and C. Kayser An Auditory Region in the Primate Insular Cortex Responding Preferentially to Vocal Communication Sounds J. Neurosci., January 28, 2009; 29(4): 1034 - 1045. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. D. Washington and J. S. Kanwal DSCF Neurons Within the Primary Auditory Cortex of the Mustached Bat Process Frequency Modulations Present Within Social Calls J Neurophysiol, December 1, 2008; 100(6): 3285 - 3304. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Bendor and X. Wang Neural Response Properties of Primary, Rostral, and Rostrotemporal Core Fields in the Auditory Cortex of Marmoset Monkeys J Neurophysiol, August 1, 2008; 100(2): 888 - 906. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Middlebrooks Auditory Cortex Phase Locking to Amplitude-Modulated Cochlear Implant Pulse Trains J Neurophysiol, July 1, 2008; 100(1): 76 - 91. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Kayser, C. I. Petkov, and N. K. Logothetis Visual Modulation of Neurons in Auditory Cortex Cereb Cortex, July 1, 2008; 18(7): 1560 - 1574. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Caspary, L. Ling, J. G. Turner, and L. F. Hughes Inhibitory neurotransmission, plasticity and aging in the mammalian central auditory system J. Exp. Biol., June 1, 2008; 211(11): 1781 - 1791. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Zhou and M. M. Merzenich Enduring effects of early structured noise exposure on temporal modulation in the primary auditory cortex PNAS, March 18, 2008; 105(11): 4423 - 4428. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. K. Pandya, D. L. Rathbun, R. Moucha, N. D. Engineer, and M. P. Kilgard Spectral and Temporal Processing in Rat Posterior Auditory Cortex Cereb Cortex, February 1, 2008; 18(2): 301 - 314. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Genzel and L. Wiegrebe Time-variant spectral peak and notch detection in echolocation-call sequences in bats J. Exp. Biol., January 1, 2008; 211(1): 9 - 14. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Luo, Y. Wang, D. Poeppel, and J. Z. Simon Concurrent Encoding of Frequency and Amplitude Modulation in Human Auditory Cortex: Encoding Transition J Neurophysiol, December 1, 2007; 98(6): 3473 - 3485. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. J. Malone, B. H. Scott, and M. N. Semple Dynamic Amplitude Coding in the Auditory Cortex of Awake Rhesus Macaques J Neurophysiol, September 1, 2007; 98(3): 1451 - 1474. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Xu, V. C. Kotak, and D. H. Sanes Conductive Hearing Loss Disrupts Synaptic and Spike Adaptation in Developing Auditory Cortex J. Neurosci., August 29, 2007; 27(35): 9417 - 9426. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Ter-Mikaelian, D. H. Sanes, and M. N. Semple Transformation of Temporal Properties between Auditory Midbrain and Cortex in the Awake Mongolian Gerbil J. Neurosci., June 6, 2007; 27(23): 6091 - 6102. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Poulsen, T. W. Picton, and T. Paus Age-Related Changes in Transient and Oscillatory Brain Responses to Auditory Stimulation in Healthy Adults 19-45 Years Old Cereb Cortex, June 1, 2007; 17(6): 1454 - 1467. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. L. Bartlett and X. Wang Neural Representations of Temporally Modulated Signals in the Auditory Thalamus of Awake Primates J Neurophysiol, February 1, 2007; 97(2): 1005 - 1017. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. C. Nelson and L. H. Carney Neural Rate and Timing Cues for Detection and Discrimination of Amplitude-Modulated Tones in the Awake Rabbit Inferior Colliculus J Neurophysiol, January 1, 2007; 97(1): 522 - 539. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Luo, Y. Wang, D. Poeppel, and J. Z. Simon Concurrent Encoding of Frequency and Amplitude Modulation in Human Auditory Cortex: MEG Evidence J Neurophysiol, November 1, 2006; 96(5): 2712 - 2723. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Middleton, A. Longtin, J. Benda, and L. Maler The cellular basis for parallel neural transmission of a high-frequency stimulus and its low-frequency envelope PNAS, September 26, 2006; 103(39): 14596 - 14601. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Firzlaff, S. Schornich, S. Hoffmann, G. Schuller, and L. Wiegrebe A Neural Correlate of Stochastic Echo Imaging J. Neurosci., January 18, 2006; 26(3): 785 - 791. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. N. O'Connor, C. I. Petkov, and M. L. Sutter Adaptive Stimulus Optimization for Auditory Cortical Neurons J Neurophysiol, December 1, 2005; 94(6): 4051 - 4067. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. R. Percaccio, N. D. Engineer, A. L. Pruette, P. K. Pandya, R. Moucha, D. L. Rathbun, and M. P. Kilgard Environmental Enrichment Increases Paired-Pulse Depression in Rat Auditory Cortex J Neurophysiol, November 1, 2005; 94(5): 3590 - 3600. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Eliades and X. Wang Dynamics of Auditory-Vocal Interaction in Monkey Auditory Cortex Cereb Cortex, October 1, 2005; 15(10): 1510 - 1523. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Qin, M. Sakai, S. Chimoto, and Y. Sato Interaction of Excitatory and Inhibitory Frequency-receptive Fields in Determining Fundamental Frequency Sensitivity of Primary Auditory Cortex Neurons in Awake Cats Cereb Cortex, September 1, 2005; 15(9): 1371 - 1383. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. M. N. Woolley and J. H. Casseday Processing of Modulated Sounds in the Zebra Finch Auditory Midbrain: Responses to Noise, Frequency Sweeps, and Sinusoidal Amplitude Modulations J Neurophysiol, August 1, 2005; 94(2): 1143 - 1157. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. L. Bartlett and X. Wang Long-Lasting Modulation by Stimulus Context in Primate Auditory Cortex J Neurophysiol, July 1, 2005; 94(1): 83 - 104. [Abstract] [Full Text] [PDF] |
||||
![]() |
F.-G. Zeng, K. Nie, G. S. Stickney, Y.-Y. Kong, M. Vongphoe, A. Bhargave, C. Wei, and K. Cao Speech recognition with amplitude and frequency modulations PNAS, February 15, 2005; 102(7): 2293 - 2298. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Kajikawa, L. de La Mothe, S. Blumell, and T. A. Hackett A Comparison of Neuron Response Properties in Areas A1 and CM of the Marmoset Monkey Auditory Cortex: Tones and Broadband Noise J Neurophysiol, January 1, 2005; 93(1): 22 - 34. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. X. JORIS, C. E. SCHREINER, and A. REES Neural Processing of Amplitude-Modulated Sounds Physiol Rev, April 1, 2004; 84(2): 541 - 577. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Lu and X. Wang Information Content of Auditory Cortical Responses to Time-Varying Acoustic Stimuli J Neurophysiol, January 1, 2004; 91(1): 301 - 313. [Abstract] [Full Text] |
||||
![]() |
R. E. Beitel, C. E. Schreiner, S. W. Cheung, X. Wang, and M. M. Merzenich Reward-dependent plasticity in the primary auditory cortex of adult monkeys trained to discriminate temporally modulated signals PNAS, September 16, 2003; 100(19): 11070 - 11075. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Barbour and X. Wang Auditory Cortical Responses Elicited in Awake Primates by Random Spectrum Stimuli J. Neurosci., August 6, 2003; 23(18): 7194 - 7206. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. C. Hart, A. R. Palmer, and D. A. Hall Amplitude and Frequency-modulated Stimuli Activate Common Regions of Human Auditory Cortex Cereb Cortex, July 1, 2003; 13(7): 773 - 781. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Eliades and X. Wang Sensory-Motor Interaction in the Primate Auditory Cortex During Self-Initiated Vocalizations J Neurophysiol, April 1, 2003; 89(4): 2194 - 2207. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Barbour and X. Wang Temporal Coherence Sensitivity in Auditory Cortex J Neurophysiol, November 1, 2002; 88(5): 2684 - 2699. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |