JN Miami Valley Hospital
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Neurophysiol 92: 52-65, 2004; doi:10.1152/jn.01253.2003
0022-3077/04 $5.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (5)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Medvedev, A. V.
Right arrow Articles by Kanwal, J. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Medvedev, A. V.
Right arrow Articles by Kanwal, J. S.

Local Field Potentials and Spiking Activity in the Primary Auditory Cortex in Response to Social Calls

Andrei V. Medvedev and Jagmeet S. Kanwal

Department of Physiology and Biophysics, Georgetown University, Washington, DC 20057

Submitted 23 December 2003; accepted in final form 9 February 2004


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
The mustached bat, Pteronotus parnellii, uses complex communication sounds ("calls") for social interactions. We recorded both event-related local field potentials (LFPs) and single/few-unit (SU) spike activity from the same electrode in the posterior region of the primary auditory cortex (AIp) during presentation of simple syllabic calls to awake bats. Temporal properties of the LFPs, which reflect activity within local neuronal clusters, and spike discharges from SUs were studied at 138 recording sites in six bats using seven variants each of 14 simple syllables presented at intensity levels of 40–90 dB SPL. There was no clear spatial selectivity to different call types within the AIp area. Rather, as shown previously, single units responded to multiple call types with similar values of the peak response rate in the peri-stimulus time histogram (PSTH). The LFPs and SUs, however, showed a rich temporal structure that was unique for each call type. Multidimensional scaling (MDS) of the averaged waveforms of call-evoked LFPs and PSTHs revealed that calls were better segregated in the two-dimensional space based on the LFP compared with the PSTH data. A representation within the "LFP-space" revealed that one of the dimensions correlated with the predominant and fundamental frequency of a call. The other dimension showed a high correlation with "harmonic complexity" ("fine" spectral structure of a call). We suggest that the temporal pattern of LFP and spiking activity reflects call-specific dynamics at any locus within the AIp area. This dynamic contributes to a distributed (population-based) representation of calls. Alternatively stated, the fundamental frequency and harmonic structure of calls, and not the recording location within the AIp, determines the temporal structure of the call-evoked LFP.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Auditory signals are important for social interactions in many species because they provide a fast and effective means of communication between conspecifics. Frogs, dolphins, whales, bats, some nonhuman primates, and many species of birds, such as songbirds, are only a few examples of those species that use complex sounds for social communication (Brenowitz et al. 1984Go; Busnel 1963Go; Fenton 1977Go; Green 1974Go; Kanwal et al. 1994Go; Margoliash 1983Go; Margoliash and Fortune 1992Go; Owren 1990Go; Payne and McVay 1971Go; Rado et al. 1998Go; Rauschecker et al. 1995Go; Rowell and Hinde 1961Go). Neural mechanisms and coding schemes, which underlie effective transmission and processing of auditory information in animals and humans are poorly understood. While it is virtually impossible to directly study neural mechanisms for processing of communication sounds, e.g., speech in humans, animal models may provide invaluable information about common mechanisms for perception of communication sounds.

The auditory system of bats has been extensively studied by several groups of researchers, both to understand their echolocation behavior (Casseday et al. 1994Go; Grinnell 1963Go; Suga 1965Go; Suga and Taniguchi 1983Go) and their ability to use complex communication sounds for social interactions (Fenton 1977Go; Kanwal 1999Go; Kanwal et al. 1994Go, 2000Go; Leippert 1994Go; Porter 1979Go). Even though many well-structured communication sounds (or simply "calls") in different species of bats have been described in terms of their behavioral context, the neural mechanisms of their processing are not well understood. A detailed classification of the acoustic structure of simple syllables and composites in the mustached bat, Pteronotus parnellii, has been described previously (Kanwal et al. 1994Go). Single-unit (SU) studies of neural processing of the simple syllables and composites reveal that neurons in the Doppler-shifted constant frequency (DSCF) and FM-FM areas of the primary auditory cortex are dually specialized for both, processing of echolocation pulse and its echo (Suga 1965Go) and calls (Kanwal 1999Go; Kanwal et al. 1994Go). The stimulus-locked response of these neurons is combination-sensitive, i.e., facilitated when presented with the combination of constant frequency (CF) tones at their two best frequencies (BFlow and BFhigh).

In contrast to neurons in the DSCF area, neurons in the anterior (AIa) and the posterior (AIp) parts of the primary auditory cortex of the mustached bat are considered as relatively unspecialized and arranged tonotopically along the antero-posterior axis. Our recent study has revealed that AIp neurons also show multipeaked excitatory frequency tuning to harmonic frequencies (Peng and Kanwal 2001Go). The best frequencies of AIp neurons, however, do not correspond to combinations of pulse and echo frequencies, as is the case for the DSCF neurons, but are harmonically related (Peng and Kanwal 2001Go). Distribution of the best frequencies over many AIp neurons shows prominent peaks at BF1 = 24 kHz and BF2 = 48 kHz. These frequencies are commonly present in the power spectra of many calls. Accordingly, single neurons in the AIp area respond to several different types of calls and their variants. Here we examine call specificity as revealed not in the peak response rate of single neurons, but in the temporal pattern of activity of both local field potentials (LFPs) and SU activity in the AIp area in response to the presentation of best variants of different call types. The unspecialized AIp area in mustached bats can be easily homologized to the primary auditory cortical areas of other mammalian species on the basis of topological, hodological, and functional criteria; therefore the results obtained from this study should be generally useful to understand call processing in the primary auditory cortex of many other highly vocal mammals, including humans.


    METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
The surgery, acoustic stimulation methods, and recording of electrophysiological activity for these experiments are similar to those described previously (Kanwal et al. 1999Go) and therefore are only briefly presented here.

Surgery and recording of neural activity

Mustached bats, P. parnellii, were caught in Trinidad and transported to the Animal Care Facility at Georgetown University. Under isoflurane/air mixture (medical grade, Anaquest) anesthesia, a skin incision was made at the mid-line on the head, and a 2-mm-diameter metal post was affixed just behind the intersection of the sagittal and coronal sutures with cyanoacrylic glue (Loctite 411). Bats were allowed to recover for ≥3 days before the first recording session.

During electrophysiological recordings, the head of the bat was restrained by clamping the metal post, and the bat's body was suspended in a Styrofoam mold by elastic bands in a heated (31°C), sound-proofed, and echo-attenuated chamber (IAC 400A). The bat was monitored throughout the experiment with a video camera. The electrophysiological activity was recorded with sharpened, vinyl-coated tungsten-microelectrodes with tip diameters of ~10 µm and impedance of ~1 M{Omega} inserted into the cortex perpendicular to the skull to a depth of 300–600 µm through a small (50 µm) hole. An indifferent tungsten-wire electrode was placed on the dura mater near the recording electrode. To record neuronal action potentials, the signal was amplified by an AC preamplifier and band-pass filtered between 600 Hz and 4 kHz. LFPs were obtained from the same electrode with a low band-pass filter (1–300 Hz). Single units were isolated using graphical overlay of up to four time-amplitude acceptance/rejection windows on the recorded waveforms, allowing selection of one specific waveform. All surgeries and animal experiments were performed with approval of the Georgetown University Animal Care and Use Committee.

Sound stimuli used in this study were 30-ms-long CF tones and digitized calls of the same species. CF tones were presented from two condenser loudspeakers to determine the BFs of neurons at the recording site and the corresponding auditory thresholds as described elsewhere (Kanwal et al., 1999). Tone bursts A and B corresponded to a low BF (20–29 kHz) and a high BF (40–58 kHz) of a neuron, respectively, and were presented at a rate of 4/s singly as well as simultaneously to determine whether a neuron shows nonlinear facilitation ("combination sensitivity") for tonal combinations. The values of BFs, the approximate harmonic relationship between the BFs (within 1–2 kHz), and a lack of combination sensitivity were used to distinguish AIp neurons from DSCF neurons. To deliver the natural calls and their variants (ranging in duration from 4 to 89 ms) at a rate of 1/s, another computer (Pentium PC) with an A/D-D/A board (DT 2821G) and a sound generation program SIGNAL were used. The main set of 14 call types consisted of calls emitted by a small, captive population of bats (Kanwal et al. 1994Go). Calls were assigned numbers from 1 to 14 based on their acoustic classification via multidimensional scaling (MDS) (Kanwal et al. 1994Go). Frequency variants of these "standard" calls were synthesized on the computer by shifting the spectrum of the mean waveform up and down by 1–3 SD of the mean fundamental frequency as measured across the bat population (Kanwal et al. 1994Go). Mean waveforms represent an average out of several hundred examples of each call type obtained from a colony of bats. CF tone bursts were used to identify the BF, and a "call-scan" was performed to identify the best calls at each recording site. During the first stage of this scan, a 15-s epoch of activity (no stimulus control plus 14 calls presented at a rate of 1 call/s) was repeatedly recorded 30 times. After every 10 consecutive presentations of the same call at the same level of loudness, the sound intensity was automatically decreased by 25 dB. Thus three average levels of intensity (95, 70, and 45 dB) were tested initially. Three-step intensity screening was done using all seven sets of calls (the "standard" call and its six variants with spectra shifted by ±1, ±2, and ±3 SD of a call's fundamental frequency). The SU data from this initial screening were analyzed on-line to reveal the best variant for each call type. At the second stage of the screening process, a newly configured set of the best variants for each of the 14 call types was presented at six different levels of intensity (attenuation steps of 10 dB). As a result of the two-step screening procedure, call preference was determined for every recording site; this allowed us to further analyze the response of three to five best call variants at their optimal level of sound intensity. Each call was also presented to the animal in the reverse order, and the corresponding LFPs as well as SU responses were recorded. Background activity was also recorded for 1 s before each repetition of the auditory stimulus set.

Data analysis

Peri-stimulus time histograms (PSTHs) were calculated for 200 or 100 presentations of acoustic stimuli using a bin width of 1 ms. These histograms were used to measure the neuronal response that is traditionally defined as a stimulus-locked change in the peak response magnitude, or firing rate, and which can be detected by summation of spike trains over repeated stimulus presentations. For each call type, the LFPs were first analyzed within each animal to assess the degree of variability across animals as well as at different depths within the cortex (100–600 µm). After establishing a similar profile of the call-evoked LFP for all the recorded AIp sites in different bats, both types of the stimulus-locked responses (LFPs and PSTHs) were averaged across recording sites and all animals for each call type. The call-evoked averaged responses were compared with each other using t-tests (P < 0.01). To obtain an estimate of the duration of LFPs and response histograms, the "center of gravity," or median point, tmed, for each averaged LFP and PSTH was determined as a time point where the area under the LFP waveform or PSTH from response onset to that point equals the area to the right of that point: LFP/PSTHarea(t < tmed) = LFP/PSTHarea(t > tmed). Duration of the averaged LFPs or PSTH was taken as twice the median point. Pair-wise correlations between the averaged call-evoked LFPs were calculated within a time window 0–250 ms, the time period of the early and middle phases of a classical middle-latency auditory evoked potential (Barth and Di 1990Go). Those cross-correlation coefficients were analyzed through the MDS algorithm (SPSS software) to establish a relationship between the temporal structure of the LFP waveforms and the acoustic structure of calls.

To quantify harmonic structure of calls regardless of their frequency range, we calculated normalized spectra of call waveforms (sn) using a normalized frequency scale (fn), such that sn(fn) = s(f)/smax, fn = f/fmax, where f is frequency in kilohertz, s(f) is a power spectrum of a call waveform, and smax and fmax are the maximum power and the corresponding predominant frequency within s(f). Normalized call spectra sn(fn) thus show relative distribution of power over frequency values that are also relative to the predominant frequency. For each normalized call spectrum sn(fn), the following parameters were calculated: the "effective" frequency, feff = {Sigma}fn x s(fn)/{Sigma}s(fn) and its "deviation," dev(feff) = {surd}[{Sigma}(fnfavg)2 x s(fn)/{Sigma}s(fn)] that characterizes dispersion of frequencies around feff within the normalized spectrum sn(fn).

In total, 138 cortical sites in the left hemisphere of six bats were studied. Figure 1 shows locations of the recorded sites within the AIp area of all bats. As many as 10–12 penetrations were made in a bat within an ~1.5 x 1-mm rectangular area in the central part (50–75%) of the AIp area. For each penetration, the LFP and SU responses from one to three recording sites were used in the analysis.



View larger version (34K):
[in this window]
[in a new window]
 
FIG. 1. Dorsolateral view of the left hemisphere of the mustached bat. The Doppler-shifted constant frequency (DSCF) area is shown in gray. Symbols in the posterior portion of the primary auditory cortex (AIp) to the right of the DSCF area represent penetration sites for all animals. Anatomical landmarks (blood vessels shown by thick lines) and tuning properties of neuronal responses were used to identify the AIp area in each bat. Dorsomedial (DM), constant frequency (CF/CF), and frequency-modulated (FM-FM) areas of the auditory cortex are also shown. In the CF/CF and FM-FM areas, neurons show facilitation to combinations of CF or FM sounds, respectively.

 

    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Call-evoked LFPs and SU activity within the AIp area

Responses of local neuronal populations at each recording site were obtained by the same stimulus set of simple syllabic calls and their variants. As an example, Fig. 2 A shows the LFPs and the corresponding PSTHs for SU activity for two neurons at the same recording site in response to presentation of calls 3–10. SU responses of the two neurons to several different types of calls and their variants were recorded simultaneously. These neurons responded to many calls and showed only a weak preference for call types 4–9 when scanning the response to 14 different call types (calls 1–14). Typically, the overall temporal pattern of SU responses showed a correlation to the temporal pattern of the LFP waveform. Thus the PSTHs show increases or decreases of neuronal firing coinciding with specific components of the LFP. Nevertheless, because SU activity is usually sparse, not every peak or wave in the LFP is accompanied by a distinctive change in spike activity in the individual examples of SU activity. Note that the LFPs have different waveforms for different call types. Also, responses of the SUs recorded from the same electrode showed slightly different latencies, peak rates, and durations. Thus the major feature of single unit responses and LFP's in the AIp area is their ability to respond to many calls showing preference to more than one call. Sound intensity had a marginal effect on the response profile of AIp neurons (Fig. 3 C). The best call variants and their best amplitude levels were determined from the peak response in the PSTHs via the two-step call scanning procedure. Typically, the best amplitudes for responses to calls ranged between 50 and 90 dB SPL. Bar graphs (Fig. 2B) show the best responses (call preference) for the same two units presented in Fig. 2A.



View larger version (29K):
[in this window]
[in a new window]
 
FIG. 2. A: examples of the local field potentials (LFPs; top) and the corresponding single/few unit (SU) activity (bottom) for 2 AIp neurons isolated at 1 site and recorded during presentation of 8 calls labeled by a number and an abbreviated name (see Kanwal et al. 1994Go). Call waveforms and their spectrograms are shown below the electrophysiological response on the same time scale. B: bar graph of peak response values vs. call number (1–14) for the same 2 AIp neurons shown in A to the "best" statistical variants of 14 simple syllabic calls (i.e., variants eliciting greatest peak response; see METHODS for a detailed description of call variants and call scan procedure). Only responses exceeding the average background activity plus 1 SD are shown. In this example, 2 neurons respond to many variants with some preference for calls 4–9.

 


View larger version (19K):
[in this window]
[in a new window]
 
FIG. 3. A: LFP profile at cortical depths from 100 to 600 µm evoked by presentation of call 3 (syllable TCFs). Note the reversal of LFP polarity at depth between 300 and 400 µm. B: similarity of the LFP waveform across many cortical sites within the AIp area and different animals, shown for call 4 (syllable sAFM). Several traces of the LFP recorded from different cortical sites in 3 bats are displayed on the same graph. Numbers to the right of LFP traces indicate recording sites within the AIp area shown in C. D: LFP recorded at site 8 (shown in C) in response to call 11 (syllable fSFM) presented at 3 different levels of intensity. Note the moderate effect of sound intensity on the LFP waveform. Call waveforms are shown below LFP traces.

 
Recording of electrical activity at depths from 100 to 600 µm within the AIp area revealed that the call-evoked LFPs change polarity at depths between 300 and 400 µm (Fig. 3A). From 400 to 600 µm of cortical depth, polarity did not change, and changes in the LFP were largely restricted to variations in amplitude of some positive and negative peaks, whereas the duration of those peaks remained approximately the same (Fig. 3A). Only the responses obtained at depths below 400 µm were taken into further analysis.

A comparison of the recordings from various AIp sites in the same bat versus from different bats showed that the generic temporal structure of the LFP for each call type within the AIp area was similar across different recording sites and different animals (Fig. 3B). Individual differences in LFPs between recording sites and animals were once again largely restricted to the amplitude of individual LFP components, whereas the temporal pattern of those components was similar (Fig. 3B). To detect some type of spatial gradient within neuronal responses over the AIp area (if any), we used a general linear model within the three-way ANOVA test. First, the most rostral recording site in each animal was taken as a reference. The responses (LFP or PSTH) from all other recording sites were correlated with the reference response, and those correlation coefficients were analyzed using the rostrocaudal and dorsoventral coordinates of recording sites as covariates and the animal number as a factor. The analysis performed for each call separately did not reveal any spatial gradient or differences between animals. For example, for call 4 (syllable sAFM): F = 1.74, P = 0.19, n = 19; for call 7 (syllable dRFM): F = 2.69, P = 0.09, n = 14. This allowed us to justify averaging of the LFPs recorded for each call type over all sites and all bats.

A statistical pair-wise comparison between averaged call-evoked LFPs is shown by two examples in Fig. 4. The first example shows LFPs for calls 3 and 8, which are the most different from each other as revealed by their lowest correlation coefficient (0.03) from all pair-wise comparisons between 14 call-evoked LFPs. The second example shows two LFPs with the highest correlation (0.86). All four averaged LFP waveforms have a relatively low variance (as determined by their SE), and the t-values calculated for each time point within the stimulus-locked response show significant differences (P < 0.01) between the LFP waveforms. Although the high-amplitude components are not observed later than 250 ms after the stimulus onset, the significant differences for pair-wise comparisons among 14 averaged call-evoked LFPs were observed for as long as 500–600 ms after the stimulus onset.



View larger version (35K):
[in this window]
[in a new window]
 
FIG. 4. Statistical comparison of call-evoked averaged LFPs. LFPs have the lowest correlation coefficient of 0.03 (n = 25) for calls 3 and 8 (A) and the highest correlation coefficient of 0.86 (n = 21) for calls 9 and 11 (B) from all pair-wise comparisons among 14 call-evoked LFPs. Averaged LFP (thick line) and the same LFP offset by 1 SE (thin line) are shown. t values calculated for each time point are plotted on the top graphs, and the "significance corridor" for t values is indicated by dashed lines (t-test, P < 0.01).

 
The averaged LFPs and the corresponding averaged PSTHs for all 14 call types are presented in Fig. 5, along with the amplitude envelops of calls. In this and other figures, calls are arranged according to their acoustic structure from CF sounds (calls 1–3) through FM sounds (calls 4–12) to "noise-burst" (NB) sounds (calls 13 and 14), as determined by pooling various parameters of the acoustic structure of calls and scaling them down to a single dimension (Kanwal et al. 1994Go). All LFPs have a distinctive temporal structure and contain a number of negative and positive peaks. The polarity, amplitude, and duration of these peaks are different for different call types. Nevertheless, some common features of the call-induced LFPs can be mentioned here. As a rule, the LFP starts with a small positive deflection 10–20 ms after the stimulus onset (P1). This initial positive peak is followed by a series of other negative and positive components that are higher in amplitude and longer in duration. In some cases, a relatively slow peak/wave or a multipeak complex occurs 20–30 ms after the end of a call, which is negative in some calls (1, 7, 8, and 14) and positive in other calls (4, 12, and 13). Interestingly, this "off" component is often accompanied by an increase in neuronal firing (e.g., see calls 1, 4, 5, 7, 12, 13, and 14). The LFP is usually terminated by a slow "positive-negative wave" complex. The amplitude, duration, and time course of this late component are different for different calls. During the positive slow wave, there is a reduction in neuronal firing best seen in calls 3, 6, 7, 11, and 12. Depending on the duration of a call, the late positive-negative complex wears off at 300–500 ms after the stimulus onset. For calls with a periodic temporal structure, the LFPs show oscillations with a frequency corresponding to the rate of AM within a call (for calls 6 and 8, the AM rate is ~40 Hz; for call 10, the AM rate is 18 Hz). The periodic structure of the LFP for call 9 with a high AM rate (120 Hz), however, is obscured because of the small amplitude oscillations superimposed on other slower components of the LFP. Periodicity in the evoked activity corresponding to the periodic structure of a call is less obvious in the SU activity.



View larger version (27K):
[in this window]
[in a new window]
 
FIG. 5. LFPs (thick line), peri-stimulus time histograms (PSTHs; bar graphs), and smoothed PSTHs (thin line) averaged over all recording sites/units for each call type (n = 19, 8, 19, 39, 19, 15, 32, 9, 21, 11, 13, 7, 7, and 18 for calls 1–14, respectively). Call waveforms are shown below the corresponding electrophysiological response. Note the unique temporal profile of evoked activity for each call type, as well as a correspondence between the LFP and smoothed PSTHs. Calls are ordered and referred to by specific names as previously described (Kanwal et al. 1994Go).

 
Although the LFP typically lasts longer than the corresponding call, the LFP duration does show a correlation with call duration, i.e., correlation coefficient between these values equals 0.83 ± 0.26 (SD) and is significantly different from zero (P < 0.01; Fig. 6 A), whereas the PSTH duration does not correlate with call duration (Fig. 6B). Nevertheless, it is important to note that call duration is not the only factor determining duration of the evoked activity because the LFP duration varies significantly within groups of calls of similar duration. For example, calls 1, 5, and 13 are similarly very short (<20 ms), whereas the corresponding LFPs vary from 130 ms for call 5 to ~170 ms for calls 1 and 13. Call 6, which has the same duration as calls 12 and 14, has much shorter LFPs (170 vs. 260 ms; Fig. 6A).



View larger version (16K):
[in this window]
[in a new window]
 
FIG. 6. Scatter plot of duration for averaged call-evoked LFPs (A) and PSTHs (B) vs. duration of the corresponding call. Regression line in A shows a significant correlation between LFP duration and call duration (line slope equals 0.83 ± 0.26 and is significantly >0, P < 0.01), while there is no correlation between PSTH duration and call duration (regression line slope in B equals 0.20 ± 0.26, P = 0.47).

 
Effect of call reversal on the LFP

A comparison of the LFPs to the forward and time-reversed versions of the same call show that the call-evoked LFPs in the AIp area are not significantly affected by call reversal (Fig. 7). Although the amplitudes of some individual LFP components changed in a few cases, all 14 reversed calls evoked temporal patterns of activity very similar to the corresponding patterns of activity evoked by natural calls. Correlation coefficients between the corresponding LFPs in response to a natural call and a reversed call ranged between the lowest value of 0.57 (call 14) to the highest value of 0.94 (call 9). We also calculated cross-correlation functions between the call envelopes and the LFPs at time lags from –50 to 200 ms. Because time lags giving the highest "envelope–LFP" correlation vary for different call types, we took the absolute value of the highest correlation within a 0 to 100 ms time shift for each call type. Those peak values of the "envelope–LFP" correlations appeared to be preferentially in the range 0.2–0.4 for all calls and were similar for the following four comparisons, i.e., "normal call–LFP to normal call," "reversed call–LFP to normal call," "normal call–LFP to reversed call," and "reversed call–LFP to reversed call." The corresponding mean (±SD) values for four comparisons are 0.27 ± 0.08, 0.25 ± 0.09, 0.27 ± 0.08, and 0.25 ± 0.07 and are not significantly different from each other (1-way ANOVA; F = 0.56, P = 0.64, n = 56). Thus this analysis shows that the LFPs in the AIp area are relatively insensitive to call reversal. Whereas this result is not surprising for the CF calls whose spectral structure is invariant to time reversal, the same result for the FM calls with a well-defined and specific temporal structure was unexpected.



View larger version (26K):
[in this window]
[in a new window]
 
FIG. 7. Line plots of the averaged LFPs recorded during presentation of natural calls (thick lines), and the same calls reversed in time (thin lines). For each call type, LFPs are averaged over those recording sites for which corresponding reversed calls were also presented (n = 5, 4, 10, 15, 8, 6, 16, 5, 10, 4, 4, 4, 2, and 2 for calls 1–14, respectively). Note the similar waveform and temporal profile of the LFPs to the normal and reversed calls. Amplitude is in millivolts.

 
MDS of call responses

To establish a relationship between the unique temporal structure of the LFP evoked by a call and the acoustic structure of that call, we performed MDS of 14 call-evoked LFPs as well as 14 PSTHs averaged over many units/recording sites. Pair-wise correlations between the LFPs or PSTHs were calculated within a time window of 0 to 250 ms and were converted into "distances" by standard transformation as described by Mardia et al. (1979)Go.

Two-dimensional scaling of the LFPs and PSTHs, based on their pair-wise correlations, captured 70 and 88% of variation, respectively. Again, this difference is due to the more uniform values of the PSTH correlations. We therefore limit our MDS analysis to two dimensions. This is equivalent to the usage of the first two principal components (with the highest amplitudes) to capture the major features of the data pattern (Mardia et al. 1979Go). Segregation of call-evoked LFPs and PSTHs in the abstract representational space is shown in Fig. 8. As a result of two-dimensional scaling, all 14 call-evoked LFPs were well separated and arranged roughly along a circle (Fig. 8A), whereas the PSTHs corresponding to the same set of calls are placed closer to each other with two outliers (calls 2 and 14; Fig. 8B). It should be noted, however, that segregation of variables in the abstract space by the MDS analysis is relative and invariant to linear transformations, such as shift and rotation (Mardia et al. 1979Go). The LFP representation shows a better segregation of calls than that due to a smoothed PSTH representation. To quantify this result, we plotted a distribution of normalized pair-wise distances (ranging from 0 to 3) for all calls (points) from the coordinates provided by each MDS plot. An analysis of these distributions revealed a peak corresponding to a distance of >2 for the LFP representation and a peak corresponding to a distance of <1 for the PSTH representation. This provided an index of spatial segregation of calls for each representation and revealed a higher sensitivity of the LFPs compared with PSTHs to variations in the acoustic structure of different calls. Therefore a further analysis of the relationship between acoustic parameters of a call and its position in the representational (MDS) space is presented only for the LFP data.



View larger version (14K):
[in this window]
[in a new window]
 
FIG. 8. Multidimensional scaling (MDS) representation of averaged call-evoked LFP waveforms (A) and averaged call-evoked PSTHs (B) based on their pair-wise correlations. Note a better segregation of calls in the "LFP-based" space.

 
Figure 9 shows the LFP-based MDS plot and the corresponding LFP waveforms placed along the perimeter of the MDS plot. On this plot, the call-evoked LFPs are arranged according to the similarity in their waveforms. For example, LFPs for calls 3, 4, 9, and 11 have a similar positive slow wave component at ~100 ms. This leads to a relatively high mutual correlation of those LFPs and their "proximity" within the MDS plot. LFPs for calls 7 and 13 also have a similar temporal profile of the slow component at 50–250 ms and thus those LFPs are placed close to each other. The LFPs for calls 1, 12, 6, and 10 in the top row of the MDS plot have high-amplitude multiple peaks at 10–100 ms, whereas the late slow-wave components are not prominent in this group of LFPs. When the LFP-based segregation of calls is analyzed in terms of the acoustic structure of calls (their spectrograms are shown in Fig. 10 A), the following relationships can be delineated.



View larger version (27K):
[in this window]
[in a new window]
 
FIG. 9. MDS plot of the averaged LFP waveforms based on their pair-wise correlations. LFP waveforms normalized to their maximum amplitude are shown around the perimeter of the MDS plot using the time scale indicated on the bottom graphs.

 


View larger version (31K):
[in this window]
[in a new window]
 
FIG. 10. Segregation of calls based on MDS of call-evoked LFPs. Spectrograms (A) and call normalized spectra sn (B) are shown around the perimeter of the MDS plot. Normalized spectra (sn) are calculated using the normalized frequency scale (fn), such as sn(fn) = s(f)/smax, fn = f/fmax, where f is frequency in kilohertz, s(f) is a power spectrum of a syllable waveform, and smax and fmax are the maximum power and the corresponding frequency within s(f). Note that calls are arranged along a circular configuration, and a relative position of a syllable along this circle reflects its harmonic structure. Calls placed on the right-bottom portion of the circle have a relatively simple harmonic structure with the main peak at fn = 1 and 2 additional smaller peaks at fn = 0.5 and fn = 1.5 (calls 9, 4, 3, 7, and 13). In a clockwise direction along the circle from this group, calls with a few broad spectral peaks are placed (calls 13, 5, 2, and 8). Finally, calls with a rich harmonic structure (with a high number of harmonics and relatively broad spectral peaks; calls 1, 12, 6, 10, and 11) are placed on the top of the circle. The rectangular broadband noise-burst (rBNB; call 14) is placed in the middle of the circle.

 
To quantify a relationship between the spectral content of a call and the placement of the corresponding LFP on the x-axis of the representational space, we determined two characteristic frequencies of calls. The first one is the frequency of the highest peak within the call spectrum (predominant frequency). The second frequency is the lowest harmonic within a call spectrum (fundamental frequency). By analogy with psychophysical studies, we refer to the predominant frequency as "formant" frequency and to the fundamental frequency as "pitch." It should be noted that both frequencies are the same for several call types because the fundamental frequencies of these calls are also the highest in energy. In other calls, namely 3, 4, 7, 8, 9, and 11, the highest spectral peak occurs not at the fundamental frequency (by convention, called the "1st harmonic") but at the next higher ("2nd") harmonic. The first dimension (x values) of the averaged LFPs in the representational space correlate equally well with both the "formant" and the "pitch" frequencies of the corresponding calls (the slopes of the regression lines are significantly different from 0 by t-test, t = 3.0, P = 0.01 for both frequencies; Fig. 11, A and B).



View larger version (25K):
[in this window]
[in a new window]
 
FIG. 11. Regression line plots for MDS dimensions and call parameters. x values (dimension 1) of calls in the 2-dimensional representational space correlate with the call predominant ("formant") frequency (frequency with the highest energy in the call spectrum; A) as well as the call "pitch" (the fundamental frequency in the call spectrum; B). y values (dimension 2) of calls correlate with 2 parameters used to quantify "harmonic complexity" of calls, namely, the "effective" normalized frequency feff (C) as well as its "deviation" dev (feff) (D).

 
Second, the acoustic structure of calls changes along the y-axis (dimension 2) in the representational space. Calls in the lower part of Fig. 10A (low values along the y-axis) are of the CF type or have relatively weak FM components with a low number of harmonics (1–3) in their spectra (calls 2, 5, 13, 7, 3, and 4). At the midlevel along the y-axis, there are two calls with strong FM (calls 8 and 9) and a NB call 14. Finally, calls with a rich harmonic structure and periodic FMs have a high value for the second dimension represented along the y-axis (calls 1, 12, 6, 10, and 11). The majority of these calls have a high number of harmonics in their spectra ("harmonic stack"). Thus these changes in the acoustic structure of calls along the y-axis can be characterized as an increase in "harmonic complexity." For example, CF calls with a few number of harmonics have a simpler harmonic structure compared with FM calls with a few harmonics but larger frequency variation in their spectra due to FM sweeps. Finally, calls with a high number of harmonics and/or periodic FM have the most complex harmonic structure. To quantify "harmonic complexity," we use the normalized spectra of call waveforms sn shown in Fig. 10B. By definition, the maximum occurs at fn = 1 in all spectra sn(fn). If the predominant frequency equals the fundamental frequency ("formant" = "pitch"), there is no spectral peak below fn = 1 (calls 1, 2, 6, 12, 13, and 14); otherwise there is an additional peak at fn = 0.5. "Harmonic stack" results in a "tail" of multiple small peaks at integer values of fn = 2, 3, 4, . . . (if "formant" equals "pitch" as it is the case for calls 1 and 6) or at values of fn = 1.5, 2, 2.5, 3, . . . otherwise (calls 10 and 11 in the top row of Fig. 10B). If a normalized call spectrum has only a few harmonics, the "effective" frequency feff is close to one and its dispersion dev (feff) is relatively low (also ~1). If a spectrum has a "harmonic stack," the "effective" frequency feff and its dispersion increase because of the contribution of multiple spectral peaks in the normalized spectrum. Therefore parameters feff and dev(feff) can be used to quantify "harmonic complexity." The y value (dimension 2) in the representational space of the averaged LFP is significantly correlated with both parameters feff and dev(feff) (t-test for regression slopes; for feff: t = 2.9, P = 0.013; for dev (feff): t = 3.1, P < 0.01; Fig. 11, C and D).

Some calls with apparently different spectral-temporal structure have similar normalized spectra, and they are placed close to each other in the two-dimensional space. An example includes calls 3, 4, 7, and 9 (group 1). Call 3 is a CF type of simple syllable, whereas calls 4, 7, and 9 are FM syllables. Moreover, call 7 has a periodic FM and call 9 has a series of FM sweeps and call 4 has a single FM sweep. Despite these differences in the spectral-temporal structure, the normalized spectra of these calls are similar and have a narrow spectral peak at the first harmonic (fn = 1) and two additional small peaks at fn = 0.5 and fn = 1.5 (Fig. 10B). Peak at fn = 0.5 means that the predominant frequency within these calls is higher than the fundamental frequency. Moving clockwise along the circle in the representational space from this group of calls, we find calls with a low number of harmonics but with broader spectral peaks (calls 13, 5, 2, and 8; group 2; Fig. 10B). The circular contour line in the representation space concludes with calls having a rich harmonic structure (relatively broad and multiple spectral peaks, such as in calls 1, 6, 10, 11, and 12; call 12 does not have multiple harmonics, although it has a relatively broad spectral peak at the fundamental frequency; group 3). In the majority of calls within the second and the third group, the peak at fn = 0.5 is absent because these calls have the highest energy at a relatively low fundamental frequency. The rectangular broadband noise-burst (call 14) is placed at the center of the circle. This call has a spectral "tail" at high frequencies because of its broadband spectrum (a characteristic feature of calls with a "harmonic stack") but, unlike the majority of other calls, it lacks a discrete harmonic structure. Thus two-dimensional scaling of call-evoked responses allowed us to segregate calls according to their harmonic structure.


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
The auditory system of the mustached bat is highly developed because of the unique auditory specialization of these animals. Previous studies on the neural mechanisms for echolocation and auditory communication in bats are based on an analysis of SU activity. Different properties of single neurons, including their combination-sensitivity to specific spectral and temporal parameters, in the functionally specialized areas, such as the DSCF, CF/CF, FM-FM, etc., in the primary and secondary auditory cortices of the bat are well studied (Casseday et al. 1994Go; Grinnell 1963Go; Ohlemiller et al. 1996Go; Suga 1965Go; Suga and Taniguchi 1983Go; Suga et al. 1978Go, 1979Go). We describe event-related LFPs in response to natural sounds produced by conspecifics and show that the LFPs are highly specific for simple calls that compose the acoustic "alphabet" of the vocal repertoire for communication in the mustached bat (Kanwal 1999Go; Kanwal et al. 1994Go). This suggests that the full spectrum of response dynamics at a recording locus is important for understanding call representation and processing within the cortex. The present analysis is based on pair-wise correlations that are sensitive to the temporal structure of responses ignoring their amplitudes. Our analysis shows that the PSTHs have a more similar time course and therefore the lower "call-specificity" compared with the LFPs (Fig. 8). A possible interpretation of this result is that the PSTHs are more sensitive to similarities present within the calls, whereas the LFPs are more sensitive to their differences. Thus peak response rates of single neurons in the AIp area are inadequate for discerning different call types and therefore provide a relatively poor resolution of the neural response space into which calls can be mapped, but could be used for distinguishing species-specific calls from noncall stimuli.

Temporal characteristics of call-evoked LFPs and SU activity

The call-evoked LFP is shown here to have a number of relatively high-amplitude positive and negative components of various durations that last from 10 to ~250 ms after stimulus onset (the early and middle phases of the evoked response). During the late phase of the response, there are low-amplitude slow waves with superimposed oscillations of different frequencies that gradually attenuate over 250–600 ms time window. Thus the generic temporal structure of call-evoked LFP resembles the "classical" middle-latency auditory evoked potential (MAEP) recorded in response to clicks and pure tones in the auditory cortex of many mammalian species, including rats, cats, monkeys, and humans (Eggermont and Smith 1995Go; Franowicz and Barth 1995Go; Lee et al. 1984Go; Makela et al. 1990Go; Musiek et al. 1984Go). The click-evoked MAEP complex has a uniform spatial distribution within the primary auditory cortex of the anesthetized rat and represents the archetypal response sequence reflecting activation of distinct subpopulations of cortical cells (Barth and Di 1990Go; Franowicz and Barth 1995Go). Similar to that, each call-evoked LFP has an archetypal waveform within the AIp area of the awake mustached bat. Nevertheless, the temporal structure of a call-evoked LFP seems to be more complex than the click-evoked or single tone-evoked MAEP complex because it contains multiple positive and negative peaks as well as slow waves and/or oscillatory components depending on the specific spectrotemporal structure of a corresponding call.

LFPs are commonly viewed as being associated with relatively slow (compared with the duration of an action potential) changes in the membrane potential. The major contributing factors to an LFP recorded anywhere in the cortex are 1) synaptic potential changes resulting from thalamocortical, cortico-cortical and other inputs at the recording locus; 2) slow membrane changes such as hyperpolarization and intrinsic oscillatory activity associated with action potentials and opening and closing of ion channels; 3) asynchronous activity related to nonspiking, subthreshold activity; and finally 4) global and/or persistent slow-wave activity across the cortex. Thus LFPs represent the summed event-driven activity at specific loci within the cortex (Eggermont and Smith 1995Go; Norena and Eggermont 2002Go). The fact that synaptic activity on multiple neurons within the AIp area contributes to a cortical representation of a complex sound in the LFP does not exclude a possibility that a subset of these neurons may converge on to single neurons at other (higher) levels of processing, such as the secondary auditory areas and/or the frontal cortex auditory field. Recordings from those levels could yield SU data that would show more specialized responses to social calls than what is observed within the AIp.

Response invariance to time reversal

Calls used in this study represent the basic elements (simple syllables) of the mustached bat's vocal repertoire and are referred to as "syllables" by analogy with speech. Some of these syllables, especially the FM type, have a distinctive temporal structure that changes significantly with call reversal. For example, an upward FM sound becomes a downward FM sound with call reversal. An unexpected result of this study is that the LFP profile does not change significantly with call reversal. Nevertheless, it should not be concluded that neuronal clusters within the AIp area are completely insensitive to the temporal features of calls. The LFP profile remains similar when presented with the normal and reversed calls, but it is not identical in the two conditions. Small changes in the LFP waveform can be observed and they are likely to be related to the low-amplitude high-frequency components of the LFP. Because the main focus of this study is a relationship between spectral content of calls and the temporal structure of the LFP waveform, a detailed description of changes in the frequency-specific components of the LFP will be described elsewhere (unpublished data).

MDS of call responses

Calls of the mustached bat have been classified into three major types (CF, FM, and NB) and categorized on the basis of principle components and MDS analyses in the two- and three-dimensional representational space (Kanwal et al. 1994Go). The two-dimensional scaling of the evoked LFP and SU activity performed in this study leads to similar, although not identical, segregation of calls. Similarity in call classification between this study and the previous study is shown by the fact that the call grouping based on the LFP is also sensitive to the acoustic parameters of calls. For example, CF calls and calls with a relatively weak FM (e.g., calls 3 and 4) are segregated from calls with a pronounced FM component (e.g., calls 5 and 8) in both the LFP and PSTH representations. NB type of calls (e.g., call 14) are placed out of any cluster in either representation (Fig. 8). Our analysis of call responses based on temporal characteristics of the LFP waveforms reveals that certain aspects of the acoustic structure of a stimulus may be extracted by LFPs and SUs. Our data show that the temporal shape of an LFP evoked by a call is closely related to the spectral content of that call, in particular its overall duration and harmonic complexity, i.e., the fundamental frequency, the absolute and/or relative value of the predominant frequency, spectral discreteness, and stability.

When represented in the two-dimensional abstract space, the first dimension (x value) of the LFPs correlates both with the predominant ("formant") frequency and the fundamental frequency of a call ("pitch"). Thus these data suggest that one of the acoustic parameters, which may be represented in the AIp area, is simply a predominant and/or fundamental frequency of a call. Another spectral parameter that shapes the LFP waveform is related to the harmonic structure of a call, namely, its "harmonic complexity." We describe "harmonic complexity" of a call by two parameters that increase with the number of harmonics in the call spectrum and the spectral width. Those parameters are the "effective" frequency and its dispersion in the normalized spectrum of a call. The second dimension (y values) of LFPs in the two-dimensional space correlates well with both of those parameters (Fig. 11, C and D). These data suggest that the functional role of the AIp area may also include a harmonic analysis of calls.

Harmonic analysis of calls within the AIp area

Many naturally produced sounds have a clear harmonic structure. Harmonics play an important role in our ability to distinguish different sounds. Harmonics give different sounds their quality and may help identify a sound source. Accordingly, spectral elements, such as harmonic frequencies, within calls can function as important information-bearing parameters for perception of a call type and in the detection of fine changes in the pitch of any one type of a call (Medvedev et al. 2002Go). Harmonic analysis of calls may also allow segregation of sounds with different fundamental frequencies when heard at the same time. Behavioral experiments show that the presence of two or three harmonics significantly improves the recognition of wriggling calls that release maternal behaviors in house mice (Ehret and Riecke 2002Go). Excitatory frequency tuning data in the mustached bat show that many neurons in the AIp area are tuned to harmonic frequencies with a low harmonic in the range of ~24 kHz and a high harmonic in the range of ~48 kHz (Peng and Kanwal 2001Go). Multipeaked tuning is also observed in the primary auditory cortex of cats, although in this case, frequencies are not as well separated and have not been shown to have a harmonic relationship (Sutter and Schreiner 1991Go). A recent study suggests that spectral harmonic templates emerge early in the auditory system (Shamma and Klein 2000Go). The model proposed by these authors demonstrates that harmonic templates can be derived from any broadband stimulus as a result of cochlear filtering followed by coincidence detection. Therefore the central auditory system is obviously exposed to sets of harmonic frequencies within complex sounds.

The harmonic structure of environmental and communication sounds is intimately related to one of the major psychophysical characteristics of auditory perception, namely, the perception of pitch. Pitch is a function of the fundamental frequency of a complex sound and can be perceived even when the fundamental frequency itself is absent in the spectrum of a sound (case of the "missing fundamental") (Schouten et al. 1962Go; Shamma and Klein 2000Go). The phenomenon of perception of the missing fundamental has been shown in bats as well (Preisler and Schmidt 1995Go). One of the most influential concepts regarding pitch perception is a theory for the central formation of pitch (Goldstein 1973Go). This theory is based on a hypothetical optimum processor performing statistical estimation of the best matching template from the periodic spectrum of a complex sound. Using an associative neural network to model perception of pitch, we have recently suggested that spectral templates in the central auditory system may be formed as a result of Hebbian-like associations between harmonically related frequency channels (Medvedev et al. 2002Go). Multi-frequency tuning of neurons in the AIp area to harmonically related frequencies may provide the neuronal substrate for such a mechanism. Our data on correlation of the LFP in the AIp area with the fundamental frequency of a social call are also suggestive that this part of the auditory cortex may be involved in the recognition of the fundamental frequency ("pitch") of calls. This is not surprising in light of the recent finding of a topographic map of periodicity pitch in the brain of the Mongolian gerbil (Schulze et al. 2002Go).

Another acoustic parameter of calls defined here as "harmonic complexity" depends on the number of harmonics within the call spectrum and the width of the individual spectral peaks. "Harmonic complexity" can be related to the "fine" spectral structure of sounds, or timbre (Hartmann 1997Go). As evident from our MDS analysis, the LFP waveform shows a high correlation with the "harmonic complexity" of a call. This finding indicates that synaptic activity within the AIp area is most likely also involved in the computation of timbre of complex sounds.

Population coding of calls?

Single neurons within the unspecialized AIp area fire scarcely and probabilistically in coincidence with specific components of the LFP. In the absence of call-specialized neurons, it seems logical to look for neural representation of calls in the synaptic activity within small populations of neurons. The ability of AIp neurons to respond to many calls is probably related not to their individual properties but rather to their collective properties within a network. In this case, a network is able to respond to many calls. The "call-specificity" of the LFP waveform, however, is based on the network's response to the unique spectral-temporal structure of a call. Thus in the AIp are, calls may be encoded in the activity of small populations of neurons and are relatively independent of the topographic location of this activity. This result corresponds well with the established sensitivity of the scalp-recorded evoked activity to spectrotemporal characteristics of speech sounds in humans (for a review, see Eggermont and Ponton 2002Go). In the visual domain, sparse population coding was demonstrated using MDS analysis of SU responses to faces in the inferotemporal cortex of monkeys (Young and Yamane 1992Go). This elegant result was obtained by summing the SU response of a population of neurons in the inferotemporal cortex. In our study, we show a similar coding of species-specific calls based on LFP data, which represents the response of a sparse population of neurons in the primary auditory cortex. This is exciting because it implicates a common model for processing complex visual and acoustic stimuli by cortical areas (Kanwal et al. 2004).

Previously proposed schemes for the representation of auditory information include rate coding and temporal coding at the level of single neurons and spatial representation (mapping) of various stimulus parameters (Cariani 1999Go; deCharms and Zador 2000Go). An important example of spatial mapping is tonotopic representation of frequencies and pitch by anatomically distinct channels within the auditory system (Lutkenhoner 2003Go; Pantev et al. 1989Go). Our data with complex sounds invoke population activity that suggests that stimulus qualities such as pitch and timbre can be represented within the same or highly overlapping local networks of neurons and therefore do not require absolute spatial segregation of the encoded parameters. Instead, these parameters are encoded within specific temporal patterns of synaptic activity that can be extracted most effectively by the LFP waveform. Thus an LFP represents a rapid succession of synaptic activity (within 1–200 ms) in response to a call and leads to less robust, stochastic spiking activity within single neurons at the recording site. A mechanism for spatiotemporal integration of complex sounds based on transient synchrony within neural clusters has been suggested recently (Hopfield and Brody 2000Go, 2001Go). Transient synchronization of neuronal firing patterns in a set of neurons is achieved through temporal convergence of synaptic currents across those neurons as determined by the acoustic structure of a call. Our results showing a "call-specific" temporal structure of the LFP are consistent with this mechanism.

In conclusion, the results of this study suggest that call processing within the AIp area involves determination of the "fine" spectral structure of a call. This information may be used to distinguish calls from noncall stimuli as well as to discriminate between different simple syllabic calls. Most importantly, these functions appear to be carried out in the AIp area of the mustached bat without invoking specialized neural mechanisms that are so important for processing echolocation information (Suga 1965Go; Suga and Taniguchi 1983Go). Further studies are needed to elucidate the different roles of the highly specialized versus unspecialized cortical mechanisms for call processing in the mammalian brain.


    GRANTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
This research was supported by National Institute on Deafness and Other Communication Disorders Grant DC-02054 to J. S. Kanwal.


    ACKNOWLEDGMENTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
We thank the Ministry of Agriculture, Land and Marine Resources in Trinidad for permitting us to export mustached bats and F. Muradali for assistance with the collection and exportation procedures.


    FOOTNOTES
 
The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

Address for reprint requests and other correspondence: J. S. Kanwal, Dept. of Physiology and Biophysics, Georgetown Univ. Medical Center, 3900 Reservoir Rd., NW, Washington, DC 20057-1460 (E-mail: kanwalj{at}georgetown.edu).


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Barth DS and Di S. Three-dimensional analysis of auditory-evoked potentials in rat neocortex. J Neurophysiol 64: 1527–1536, 1990.[Abstract/Free Full Text]

Brenowitz EA, Wilczynski W, and Zakon HH. Acoustic communication in spring peepers. J Comp Physiol [A] 155: 585–592, 1984.[CrossRef]

Busnel RG. Acoustic Behaviour of Animals. Amsterdam, The Netherlands: Elsevier, 1963.

Cariani P. Temporal coding of periodicity pitch in the auditory system: an overview. Neural Plast 6: 147–172, 1999.[Medline]

Casseday JH, Ehrlich D, and Covey E. Neural tuning for sound duration: role of inhibitory mechanisms in the inferior colliculus. Science 264: 847–850, 1994.[Abstract/Free Full Text]

deCharms RC and Zador A. Neural representation and the cortical code. Annu Rev Neurosci 23: 613–647, 2000.[CrossRef][ISI][Medline]

Eggermont JJ and Ponton CW. The neurophysiology of auditory perception: from single units to evoked potentials. Audiol Neurootol 7: 71–99, 2002.[CrossRef][Medline]

Eggermont JJ and Smith GM. Synchrony between single-unit activity and local field potentials in relation to periodicity coding in primary auditory cortex. J Neurophysiol 73: 227–245, 1995.[Abstract/Free Full Text]

Ehret G and Riecke S. Mice and humans perceive multiharmonic communication sounds in the same way. Proc Natl Acad Sci USA 99: 479–482, 2002.[Abstract/Free Full Text]

Fenton MB. Variation in the social calls of little brown bats (Myotis lucifugus). Can J Zoolog 55: 1151–1157, 1977.

Fitzpatrick DC, Kanwal JS, Butman JA, and Suga N. Combination-sensitive neurons in the primary auditory cortex of the mustached bat. J Neurosci 13: 931–940, 1993.[Abstract]

Franowicz MN and Barth DS. Comparison of evoked potentials and high-frequency (gamma-band) oscillating potentials in rat auditory cortex. J Neurophysiol 74: 96–112, 1995.[Abstract/Free Full Text]

Fuzessery ZM and Feng AS. Mating call selectivity in the thalamus of the leopard frog, (Rana pipiens): single and multiunit analyses. J Comp Physiol [B] 150: 333–344, 1983.[CrossRef]

Goldstein JL. An optimum processor theory for the central formation of the pitch of complex tones. J Acoust Soc Am 54: 1496–1516, 1973.[CrossRef][ISI][Medline]

Green S. Dialects in Japanese monkeys: vocal learning and cultural transmission of locale-specific vocal behavior? Z Tierpsychol 38: 304–314, 1974.

Grinnell AD. The neurophysiology of audition in bats: intensity and frequency parameters. J Physiol 167: 38–66, 1963.[Free Full Text]

Hartmann WM. Signals, Sound, and Sensation. Woodbury, NY: AIP Press, 1997.

Hopfield JJ and Brody CD. What is a moment? "Cortical" sensory integration over a brief interval. Proc Natl Acad Sci USA 97: 13919–13924, 2000.[Abstract/Free Full Text]

Hopfield JJ and Brody CD. What is a moment? Transient synchrony as a collective mechanism for spatiotemporal integration. Proc Natl Acad Sci USA 98: 1282–1287, 2001.[Abstract/Free Full Text]

Kanwal JS. Processing species-specific calls by combination-sensitive neurons in an echolocating bat. In: The Design of Animal Communication, edited by Hauser MD and Konishi M. Cambridge, MA: MIT Press, 1999, p. 134–157.

Kanwal JS, Fitzpatrick DC, and Suga N. Facilitatory and inhibitory frequency tuning of combination-sensitive neurons in the primary auditory cortex of mustached bats. J Neurophysiol 82: 2327–2345, 1999.[Abstract/Free Full Text]

Kanwal JS, Gordon M, Peng JP, and Heinz-Esser K. Auditory responses from the frontal cortex in the mustached bat, Pteronotus parnellii. Neuroreport 11: 367–372, 2000.[ISI][Medline]

Kanwal JS, Matsumura S, Ohlemiller K, and Suga N. Analysis of acoustic elements and syntax in communication sounds emitted by mustached bats. J Acoust Soc Am 96: 1229–1254, 1994.[CrossRef][ISI][Medline]

Kanwal JS, Medvedev AV, and Peng JP. Complex sound perception: response coherence in the activity of neural ensembles (Abstract 167). Assoc Res Otolaryngol 25: 43, 2002.

Kanwal JS, Peng JP, and Esser K-H. Auditory communication and echolocation in the mustached bat: computing for dual functions within single neurons. In: Echolocation in Bats and Dolphins, edited by Thomas JA, Moss MD, and Vater M. Chicago, IL: University of Chicago Press, 2004, p. 201–208.

Kiang NY and Moxon EC. Physiological considerations in artificial stimulation of the inner ear. Ann Otol Rhinol Laryngol 81: 714–730, 1972.[ISI][Medline]

Langner G, Bonke D, and Scheich H. Neuronal discrimination of natural and synthetic vowels in field L of trained mynah birds. Exp Brain Res 43: 11–24, 1981.[ISI][Medline]

Lee YS, Lueders H, Dinner DS, Lesser RP, Hahn J, and Klem G. Recording of auditory evoked potentials in man using chronic subdural electrodes. Brain 107:115–131, 1984.[Abstract/Free Full Text]

Leippert D. Social behaviour on the wing in the false vampire, Megaderma lyra. Ethology 98: 111–127, 1994.

Leppelsack HJ and Vogt M. Responses of auditory neurons in the forebrain of a songbird to stimulation with species-specific sounds. J Comp Physiol [B] 107: 263–274, 1976.[CrossRef]

Lutkenhoner B.