|
|
||||||||
The Journal of Neurophysiology Vol. 86 No. 5 November 2001, pp. 2616-2620
Copyright ©2001 by the American Physiological Society
RAPID COMMUNICATION
Laboratory of Auditory Neurophysiology, Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205
| |
ABSTRACT |
|---|
|
|
|---|
Wang, Xiaoqin and Siddhartha C. Kadia. Differential Representation of Species-Specific Primate Vocalizations in the Auditory Cortices of Marmoset and Cat. J. Neurophysiol. 86: 2616-2620, 2001. A number of studies in various species have demonstrated that natural vocalizations generally produce stronger neural responses than do their time-reversed versions. The majority of neurons in the primary auditory cortex (A1) of marmoset monkeys responds more strongly to natural marmoset vocalizations than to the time-reversed vocalizations. However, it was unclear whether such differences in neural responses were simply due to the difference between the acoustic structures of natural and time-reversed vocalizations or whether they also resulted from the difference in behavioral relevance of both types of the stimuli. To address this issue, we have compared neural responses to natural and time-reversed marmoset twitter calls in A1 of cats with those obtained from A1 of marmosets using identical stimuli. It was found that the preference for natural marmoset twitter calls demonstrated in marmoset A1 was absent in cat A1. While both cortices responded approximately equally to time-reversed twitter calls, marmoset A1 responded much more strongly to natural twitter calls than did cat A1. This differential representation of marmoset vocalizations in two cortices suggests that experience-dependent and possibly species-specific mechanisms are involved in cortical processing of communication sounds.
| |
INTRODUCTION |
|---|
|
|
|---|
Species-specific vocalizations
are communication sounds that many species rely on for their survival
and social interactions. Communication sounds differ from other types
of acoustic signals in that they are behaviorally relevant to a
species. Although the biological importance of these acoustic signals
is well recognized (Snowdon et al. 1982
), their neural
representation in the cerebral cortex has remained elusive (Wang
2000
). A fundamental issue in understanding how the auditory
system processes communication sounds is whether such sounds are
processed differently from behaviorally irrelevant sounds. A long line
of studies of cortical plasticity, both in development and adulthood
(see reviews by Buonomano and Merzenich 1998
;
Schmidt et al. 1999
), suggest that cortical
representation of communication sounds should differ from that of
behaviorally irrelevant sounds.
Preference of neural responses to natural vocalizations over
time-reversed vocalizations has also been reported in various species
such as bats (Esser et al. 1997
), songbirds
(Doupe and Konishi 1991
; Margoliash
1983
), and cats (Gehr et al. 2000
). Wang et al. (1995a)
studied responses to both natural and synthetic vocalizations in populations of neurons in the primary auditory cortex
(A1) of a highly vocal primate, the common marmoset (Callithrix jacchus jacchus). It was found that the majority of neurons showed stronger responses to natural marmoset twitter calls than to
time-reversed twitter calls. Time-reversed calls do not bear the
behavioral meaning associated with natural twitter calls but have the
same spectral contents and similar acoustic complexity as the natural calls. While this finding has implications for the role of behavioral relevance and species-specificity underlying cortical responses, it
alone does not rule out the possibility that differences in cortical
responses may simply be due to differences in the acoustic structures
between the natural and time-reversed calls. One way to address this
issue is to study neural responses to marmoset twitter calls, in both
natural and time-reversed forms, in the auditory cortex of another
species that does not encounter marmoset vocalizations in its acoustic
environment. We have performed this comparative analysis in domestic
cats, a mammalian species whose A1 shares a number of similar
anatomical and physiological properties with A1 of primates. We
reasoned that the differences between responses of cat A1 to natural
and time-reversed marmoset twitter calls should solely be due to
differences in the acoustic structures of these two types of
sounds because neither bears any behavioral relevance to the cats used
in our study. The results of this study showed that neurons in cat A1
did not exhibit preference to marmoset natural twitter calls as
observed in marmoset A1 and therefore suggest experience-dependent and
possibly species-specific mechanisms underlying cortical processing of
behaviorally relevant vocalizations in the auditory cortex of marmosets.
| |
METHODS |
|---|
|
|
|---|
Animal preparation and recording procedures
Surgical procedures were described in details in a previous
study (Lu and Wang 2000
). Anesthesia was maintained
throughout an experiment by intravenous injection of pentobarbital
sodium. The location of A1 was confirmed electrophysiologically.
Recording experiments were conducted with the animal placed within a
double-wall soundproof chamber (IAC-1024) whose interior was covered by
3-in acoustic absorption foam (Sonex, Illbruck). Multi-unit
extracellular recordings were made using tungsten microelectrodes
(Microprobe, 1-2 M
impedance at 1 kHz) from the middle cortical
layers (depths of 600-900 µm). The characteristic frequency (CF) and
threshold were identified using a manually controlled oscillator and
attenuator. Neural activity was amplified and filtered at 0.3-7 kHz.
Action potentials were detected by a window discriminator. A1 was
systematically sampled from dorsal to ventral regions. These
experimental conditions and recordings procedures were comparable to
those used in earlier marmoset experiments (Wang et al.
1995a
) from which the comparison data were obtained.
Vocalization stimuli
Three pairs of natural and time-reversed twitter calls
(identical to those used in earlier marmoset experiments) were studied in the cat experiments. Figure 1 displays
the three natural twitter calls. Details on acquisition of these
vocalizations were given in Wang et al. (1995a)
. A
time-reversed twitter call (hereafter referred to as reversed
call) was generated by reversing the time course of a natural
twitter call. Acoustic stimuli were delivered under free-field
conditions by a speaker located ~1 m in front of the animal. The
speaker (XTS-35, Radio Shack) had a flat (±5 dB) frequency response
from 100 Hz to 20 kHz. Vocalization stimuli were generated through a
16-bit D/A converter at 48-kHz sampling rate and delivered at 60-70 dB
SPL. All stimuli, 20 repetitions each, were presented in random
order. Inter-stimulus intervals were >1 s.
|
Data analysis
The results presented in this report were based on 70 units
recorded from A1 of two cats with CFs ranging from 4 to 9 kHz. Responses of marmoset A1 neurons were based on Wang et al.
(1995a)
and subsequent electrophysiological experiments
performed under identical conditions. Eighty-nine units recorded from
A1 of two marmosets with CFs between 4 and 9 kHz were analyzed for
direct comparison with the data obtained from the cat experiments.
Responses of these units to the three pairs of natural and reversed
calls as used in the cat experiments were analyzed for the present
study. The multi-unit responses from marmoset A1 were recorded with the same brand of electrode (Microprobe, 1-2 M
), at similar recording depths (middle cortical layers) and under the same anesthetic condition
(barbiturate anesthesia) as in the cat experiments. The vocalization
stimuli were delivered at 65 dB SPL, 10-20 repetitions per stimulus.
Mean firing rate, calculated over the stimulus duration, was used to
measure responses to natural and reversed calls for both marmoset and
cat data. We chose to use this simple measure because it reflects the
overall responsiveness of the neurons studied in both species and does
not involve other assumptions. Spontaneous discharge rate was estimated
from recording intervals prior to stimulus presentations and removed
from the mean firing rate. A selectivity index
(d) was used to quantify the difference between responses to
a pair of natural and reversed calls on a unit by unit basis and is
defined as follows
|
|
1.0) indicates that a neuron responded only to the natural
(reversed) call. For each stimulus pair, only the units with the
minimum mean firing rate (either RNat
or RRev) > 3 spikes/s were
included in the analyses. Statistical significance between response
measures was evaluated using a t-test. P < 0.001 is considered statistically significant (Tables
1 and 2).
|
|
| |
RESULTS |
|---|
|
|
|---|
Natural twitter calls are composed of a series of "phrases,"
each of which is made of several upward FM sweeps and their harmonics (Fig. 1). The spectral components of the first harmonic are centered near 7 kHz and spread to as low as 4-5 kHz in samples across different animals (Agamaite and Wang 1997
; Epple
1968
). A reversed call has spectral contents occupying the same
frequency range as does the natural call but has downward FM sweeps
instead and reversed time courses of their amplitudes (as reflected in
its envelope). The overall energy of a reversed call is identical to
that of a natural twitter call.
In Fig. 2A, mean firing rates of responses to a pair of natural and reversed calls are compared for the neurons studied in cat A1. Overall, responses of 69 units are distributed around the dashed line that indicates equal discharge rates to both stimuli. The difference between responses to the two stimuli is further quantified by a selectivity index (see METHODS). The distribution of the selectivity index (Fig. 2B) has a mean of 0.047 (equivalent to a 1.1 ratio of RNat/RRev), indicating that natural and reversed calls produced approximately equal responses over the population of the neurons studied. This is in contrast to the response characteristics of neurons in marmoset A1 where the same natural twitter call resulted in greater mean firing rates than did the corresponding reversed call in most of the 89 units studied (Fig. 2C). The distribution of the selectivity index of marmoset A1 units (Fig. 2D) has a mean of 0.479 (equivalent to a 2.8 ratio of RNat/RRev), which is far greater than that of cat A1 neurons. The distributions of the selectivity indices of two species are statistically different (P < 0.0001, Table 1).
|
Quantitative analyses of responses to all three pairs of natural and reversed calls are shown in Fig. 3. As is seen in Fig. 2, the disparity in neural selectivity between cat A1 and marmoset A1 is evident in all three cases. In each case, neurons in marmoset A1 showed significantly higher selectivity index than did neurons in cat A1 (Fig. 3A, Table 1). The average selectivity index over three cases is 0.068 in cats and 0.403 in marmosets (P < 0.0001, Table 1). The greater selectivity index in marmoset A1 resulted from stronger responses to the natural twitter calls (Fig. 3B). On average, mean firing rate of marmoset A1 neurons is about twice as high as that of cat A1 neurons in response to natural twitter calls (12.08 vs. 5.96 spikes/s, P < 0.0001, Table 1). Responses to reversed calls (Fig. 3C), on the other hand, have similar magnitudes in both species. Mean firing rate to reversed calls averaged over three cases shows a small but nonsignificant difference between marmosets and cats (6.32 vs. 5.27 spikes/s, P = 0.042, Table 1). Furthermore, responses to natural twitter calls are significantly larger than responses to reversed calls in marmosets for all stimulus pairs tested (Tables 1 and 2). In cats, only a small difference is observed in the same comparison (Tables 1 and 2). In summary, these results clearly showed that, on average, neurons in marmoset A1 responded more strongly to natural twitter calls than to their time-reversed versions, whereas neurons in cat A1 responded to both types of sounds approximately equally.
|
| |
DISCUSSION |
|---|
|
|
|---|
The A1 of cats shares some similarities with the A1 of primates,
including marmosets. In both species, A1 receives thalamic inputs
predominantly from the ventral division of the medial geniculate body
(Aitkin and Park 1993
; Middlebrooks and Zook
1983
; Morel and Kaas 1992
; Winer
1992
). A tonotopic map exists in A1 of both species
(Aitkin et al. 1986
; Imig et al. 1977
;
Merzenich and Brugge 1973
; Merzenich et al.
1975
). Although the CF progresses from low frequency to high
frequency rostrocaudally in marmoset A1, whereas it progresses in the
opposite direction in cat A1, there have been no reports of major
differences between the two species in basic functional properties of
A1 as determined by pure tone responses. The hearing ranges of
marmosets and cats overlap at ~1-20 kHz (Fay 1988
).
The frequency range of marmoset twitter calls is well within sensitive
hearing frequencies of cats. The similar response magnitudes for the
reversed calls between marmoset and cat (Fig. 3C) are an
assuring indication that auditory cortices of both species are capable
of responding to sounds with the acoustic complexity such as that found
in marmoset twitter calls. It further indicates that there was no
systematic bias in these two sets of data obtained from separate experiments.
The natural twitter calls used in this study are biologically important
to marmosets. Twitters are social calls in the marmoset's vocal
repertoire (Epple 1975
) and are frequently used in vocal exchanges between members of a colony (Wang 2000
). They
bear, however, no behavioral relevance to the cats under study, which were never exposed to marmoset vocalizations. The time-reversed twitter
calls are not behaviorally meaningful to both marmosets and cats. The
differential responses due to natural and reversed calls observed in
marmosets, but absent in cats, suggest that neural mechanisms other
than those responsible for encoding acoustic structures of complex
sounds are involved in cortical processing of behaviorally important
communication sounds. These mechanisms may include experience-based and
developmental plasticity (functionally and/or structurally) as well as
predisposed and specialized circuitry in the marmoset's auditory
system that does not exist in cats. Such mechanisms represent the
adaptation of the auditory system to sensory environment over different
time scales (from days, years, to generations). It has been
demonstrated that learning spatial-temporal sensory input patterns
resulted in reorganization of response properties accordingly in the
sensory cortices of primates (Recanzone et al. 1992
;
Wang et al. 1995b
). There was also evidence in songbirds
that neurons selective to bird's own song emerged from development
(Doupe 1997
). A recent study showed that plasticity of
the cochleotopic map has different characteristics in the auditory
cortices of mustached bats and Mongolian gerbils (Sakai and Suga
2001
), suggesting species-specific mechanisms operating at A1.
While it is unclear which or all of these mechanisms have contributed
to the observed differential responses, the findings of this study
suggest that experience-dependent and possibly species-specific mechanisms are involved in cortical processing of behaviorally important communication sounds. It remains unclear whether the observed
disparity in cortical responses was created in A1 or might have been
contributed by subcortical processing. Our result has direct
implications for interpreting neural responses to natural and
artificial sounds. It argues strongly that the exploration of cortical
mechanisms responsible for encoding communication sounds must be based
on biologically meaningful models.
| |
ACKNOWLEDGMENTS |
|---|
We thank Drs. Michael Merzenich and Christoph Schreiner for supporting the marmoset experiments referred to in this study as well as Drs. Ralph Beitel and Steven Cheung, who participated in those experiments. We thank Dr. Edward Bartlett and T. Lu for helpful comments on the manuscript and A. Pistorio for proofreading the manuscript.
This research was supported by National Institute on Deafness and Other Communication Disorders Grant DC-03180 and by a Presidential Early Career Award for Scientists and Engineers (X. Wang).
| |
FOOTNOTES |
|---|
Address for reprint requests: X. Wang, Dept. of Biomedical Engineering, Johns Hopkins University School of Medicine, 720 Rutland Ave., Ross 424, Baltimore, MD 21205 (E-mail: xwang{at}bme.jhu.edu).
Received 26 March 2001; accepted in final form 25 June 2001.
| |
REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
K. M. M. Walker, B. Ahmed, and J. W. H. Schnupp Linking cortical spike pattern codes to auditory perception. J. Cogn. Neurosci., January 1, 2008; 20(1): 135 - 152. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. C. Puckett, P. K. Pandya, R. Moucha, W. Dai, and M. P. Kilgard Plasticity in the Rat Posterior Auditory Field Following Nucleus Basalis Stimulation J Neurophysiol, July 1, 2007; 98(1): 253 - 265. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Gourevitch and J. J. Eggermont Evaluating Information Transfer Between Auditory Cortical Neurons J Neurophysiol, March 1, 2007; 97(3): 2533 - 2543. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Gourevitch and J. J. Eggermont Spatial Representation of Neural Responses to Natural and Altered Conspecific Vocalizations in Cat Auditory Cortex J Neurophysiol, January 1, 2007; 97(1): 144 - 158. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. H. Schnupp, T. M. Hall, R. F. Kokelaar, and B. Ahmed Plasticity of temporal pattern codes for vocalization stimuli in primary auditory cortex. J. Neurosci., May 3, 2006; 26(18): 4785 - 4795. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Sander and H. Scheich Left Auditory Cortex and Amygdala, but Right Insula Dominance for Human Laughing and Crying J. Cogn. Neurosci., October 1, 2005; 17(10): 1519. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Tian and J. P. Rauschecker Processing of Frequency-Modulated Sounds in the Lateral Auditory Belt Cortex of the Rhesus Monkey J Neurophysiol, November 1, 2004; 92(5): 2993 - 3013. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. Rauschecker and B. Tian Processing of Band-Passed Noise in the Lateral Auditory Belt Cortex of the Rhesus Monkey J Neurophysiol, June 1, 2004; 91(6): 2578 - 2589. [Abstract] [Full Text] [PDF] |
||||
![]() |
T Q GENTNER Neural Systems for Individual Song Recognition in Adult Birds Ann. N.Y. Acad. Sci., June 1, 2004; 1016(1): 282 - 302. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Suta, E. Kvasnak, J. Popelar, and J. Syka Representation of Species-Specific Vocalizations in the Inferior Colliculus of the Guinea Pig J Neurophysiol, December 1, 2003; 90(6): 3794 - 3808. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |