|
|
||||||||
Department of Physiology and Biophysics, Georgetown University School of Medicine, Washington, DC 20057
Submitted 16 May 2003; accepted in final form 10 May 2004
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Multiple cortical representations of the auditory world are thought to be part of a parallel processing hierarchy, in which each area processes preferentially a certain aspect of an auditory scene. "Preferentially" means that the parameter domains represented in nonprimary auditory cortex are by no means processed exclusively in one area, but significant differences between areas are expected in the distribution of response preferences within any such domain. Similar ideas have long been proposed for the role of multiple cortical representations in vision (Felleman and van Essen 1991
; Ungerleider and Mishkin 1982
; Van Essen and Maunsell 1983
; Zeki 1978
). The proportion of neurons selective for the color or direction of motion of a visual stimulus, for instance, varies substantially between extrastriate visual areas and has led to the characterization of certain areas as "color" (V4) or "motion" (V5/MT) areas. Astonishingly little progress has been made to date in terms of characterizing in a similar fashion any functional differences between higher auditory cortical areas.
Using tone bursts, various parameter domains of auditory processing have been analyzed in the past, but most of these analyses were restricted to primary auditory cortex (A1), especially of cats. This has included the analysis of sharpness of frequency tuning, level sensitivity, and binaural interactions (Imig and Adrián 1977
; Phillips and Irvine 1981
; Phillips et al. 1994
; Schreiner and Cynader 1984
; Schreiner and Mendelson 1990
; Sutter and Schreiner 1991
, 1995
). A few of these studies have gone beyond A1, such as the anterior auditory field (AAF) (Knight 1977
; Phillips and Irvine 1982
) or the posterior auditory field (PAF) (Loftus and Sutter 2001
; Phillips and Orman 1984
; Phillips et al. 1995
). AAF is similar to A1 in many respects, although there is some indication that frequency-tuning curves are broader than those in A1 and more often display multiple peaks (Knight 1977
; Tian and Rauschecker 1994
). In ferrets, the excitatory bandwidth in AAF neurons is twice as large as that in A1 (Kowalski et al. 1995
). In addition, a higher proportion of nonmonotonic, intensity-tuned cells has been found in PAF than in A1 (Phillips et al. 1995
). Nevertheless, none of these parameters has been sufficient to differentiate the various areas functionally.
Clearly, what is needed is an analysis with more complex sounds comparing neuronal responses in the different cortical areas. Suga was one of the first to successfully differentiate auditory cortical areas on the basis of their response preferences to various complex sounds (for reviews see Suga 1988
, 1992
). In the mustached bat, the use of behaviorally relevant sounds markedly facilitated the success of this approach, and the choice of complex stimuli was narrowed down considerably by modeling them after species-specific sounds used during echolocation. In other mammals, such an approach is made more difficult by their less specialized behavioral repertoire and a bewildering number of natural complex sounds that may be expected to trigger responses in nonprimary cortical neurons. One group of species-specific sounds that has been used with some success in nonhuman primates in the past (Newman and Symmes 1974
; Symmes 1981
; Winter and Funkenstein 1973
; Wollberg and Newman 1973
) and again more recently (Eliades and Wang 2003
; Rauschecker and Tian 2000
; Rauschecker et al. 1995
; Tian et al. 2001
; Wang 2000
; Wang and Kadia 2001
; Wang et al. 1995
) is that of vocalizations. Although their spectra are known, however, they are still too complex to be used as a first- or second-order stimulus with any hope of finding stimulus-specific characteristics across different cortical areas.
It thus appears most promising to go back initially to the use of stimuli of intermediate complexity, such as amplitude-modulated (AM) or frequency-modulated (FM) sounds. Both types of stimuli have been used quite extensively for the study of A1 in cats (Eggermont 1994
; Heil et al. 1992a, b
; Mendelson and Cynader 1985
; Mendelson and Grasse 1992
; Mendelson et al. 1993
; Phillips et al. 1985
; Schreiner and Urbas 1986
; Whitfield and Evans 1965
), ferrets (Kowalski et al. 1995
; Shamma et al. 1993
), and squirrel monkeys (Bieser and Müller-Preuss 1996
). Again, however, few attempts have been made to analyze nonprimary auditory areas: AM sounds have been tested in AAF, PAF, and VPAF (ventro-posterior auditory field) of the cat (Schreiner and Urbas 1986
, 1988
), FM sounds in AAF and PAF of cats (Tian and Rauschecker 1994
, 1998
), and AAF of ferrets (Kowalski et al. 1995
).
In our previous cat studies we found that neurons in AAF prefer high FM rates (Tian and Rauschecker 1994
), whereas neurons in PAF prefer, on average, lower FM rates (Tian and Rauschecker 1998
). At the same time, a high prevalence of spatially tuned neurons in AAF and in the anterior ectosylvian sulcus (AES) was found (Korte and Rauschecker 1993
; Rauschecker and Korte 1993
). These findings led to the hypothesis that PAF may be more suitable for the analysis of communication sounds with their slower FM rates (Brown et al. 1978
; Capranica 1972
; Leppelsack 1983
; Liberman et al. 1967
; Rauschecker 1997
, 1998
), whereas AAF and AES may be more specialized for the analysis of sound location. However, the conclusions from the cat work remained incomplete and purely hypothetical because no tests of call selectivity, for instance, were conducted and no direct comparison between areas was performed in the same animals.
A similar functional dichotomoy has been proposed for the auditory cortical system of nonhuman primates, based on a more direct analysis of neurons in the lateral belt areas with species-specific vocalizations presented at varying spatial locations (Rauschecker and Tian 2000
; Tian et al. 2001
). Area AL (anterolateral) was found to have neurons with greater vocalization specificity, whereas area CL (caudolateral) had significantly more neurons with greater spatial selectivity. This suggests that AL is more closely involved in the analysis and identification of specific complex sound patterns (or auditory "objects"), including those used in acoustic communication, whereas CL is more specialized for the analysis of sound location. If the above species comparisons hold up, the functional gradient in the rostro-caudal direction would appear to be reversed in comparison with the cat. This could be explained by a disproportionate growth of the temporal lobe in monkeys during phylogeny, which would lead to a reversal of tonotopic gradients along the rostro-caudal axis (Jones 1985
).
Based on these considerations, it will be instructive to perform a detailed analysis with FM sweeps of the lateral belt areas in rhesus monkeys. A prediction can be made as to the FM rate preferences of neurons in AL versus CL: AL with its higher selectivity for communication calls should prefer lower FM rates in the range of these calls, whereas CL should prefer higher rates. As pointed out previously, the use of linear FM sweeps is interesting for a second reason: they can be seen as analogous to light stimuli moving in a particular speed and direction (Mendelson and Cynader 1985
; Tian and Rauschecker 1994
, 1998
), which are highly effective in driving neurons of extrastriate visual cortex (Hubel and Wiesel 1959
). A comparison of FM tuning and visual direction tuning could lead to a more general understanding of cortical processing algorithms regardless of sensory modality.
Preliminary accounts of this work were previously presented (Rauschecker 1997
, 1998
; Tian and Rauschecker 1995
, 1996
).
| METHODS |
|---|
|
|
|---|
The experimental procedures were generally the same as those used in our previous studies of FM selectivity in cats (Tian and Rauschecker 1994
, 1998
). Four adult rhesus monkeys (Macaca mulatta) with no signs of middle ear infections were used. A total of 216 cells were collected, 190 from the lateral belt areas [AL, ML (mediolateral), and CL], 14 cells from area CM, and 12 cells from A1. Recordings were made exclusively from the left hemisphere.
For the experiment, animals were treated with atropine sulfate (0.05 mg/kg, subcutaneous) and were initially anesthetized with ketamine (10 mg/kg, intramuscular). A venous catheter was placed in one of the hind leg veins, and a tracheal tube was inserted. The monkey was then placed in a specially designed monkey head holder and artificially respirated with a ventilator (Harvard) at an intrapulmonary pressure of 500 to 1,000 Pa. Expiratory CO2 content was monitored continuously (Beckman, Medical Gas Analyzer LB-2) and kept at about 3.8% by varying ventilation stroke volume and frequency. Anesthesia was maintained with isoflurane (12%) in a mixture of 50% nitrous oxide and 50% oxygen. Isoflurane creates a light state of anesthesia that can be easily controlled. EKG was monitored to ensure adequate anesthesia and stable physical condition of the animal. If the animal showed any acceleration of its heart rate in response to noxious stimuli, the concentration of anesthetic gases was increased, or a short-acting barbiturate (Biotal) was given through the venous catheter. The animal's core temperature was maintained at 38°C with a heating pad (Gaymar). Fluid (0.5% dextrose in 0.9% saline) was administered through the venous catheter by an infusion pump at a rate of 1230 ml/h.
A craniotomy was performed over the parietal cortex in aseptic surgery. A temporary well was built with bone cement. The dura was left intact to minimize brain pulsation and prevent the cortex from possible mechanical damage during surgery and transportation. After the monkey was transferred to the recording room, the dura was dissected and retracted to expose the superior temporal gyrus (STG). The well was filled with saline to prevent the cortex from drying.
Acoustic stimulation
STIMULUS GENERATION. Pure-tone (PT) stimuli were for the most part generated with a stimulus generator (Wavetek 148A). The tone bursts were gated by a control unit (HI-MED, HG 300G) to 50-ms duration with 5-ms rise/fall time. The amplitude was monitored on an oscilloscope.
To determine the extent of the lateral belt areas, the supratemporal plane was first mapped with band-passed noise (BPN) (Rauschecker et al. 1995
). BPN bursts were generated with the SIGNAL software (Engineering Design) on an IBM-compatible personal computer. The SIGNAL program generated a random noise, which was band-pass filtered with different cutoff frequencies. By varying upper and lower cutoff frequencies, BPN bursts could be generated with different bandwidths and center frequencies. The sampling rate was chosen as 100 kHz to avoid significant quantization steps in the signal. The stimuli had a rise/fall time of 5 ms to reduce the effect of transients. Standard stimulus duration was 50 ms, but duration could be varied if there was any indication of a neuron being tuned to this parameter. The root-mean-square (RMS) values of BPNs were normalized to the standard output for PT stimuli, which was 1 Vpeak.
The major stimulus category tested in the present study consisted of linear FM sweeps. These stimuli were also generated with SIGNAL at the same sampling rate of 100 kHz. The frequency range of the FM sweeps was chosen large enough to exceed the excitatory PT tuning range of the neuron by one octave on either side, unless this exceeded the frequency range of the sound delivery system. While keeping the frequency range constant, the duration of the FM sweep was systematically varied from 50 to 1,600 ms in logarithmic steps, to test the responses to different FM rates (FMR). The total range of FMR tested reached from 6.25 to 640 Hz/ms. Like PT stimuli, the FM sweeps also had a rise/fall time of 5 ms to reduce the effect of transients. The amplitude of the FM sweeps generated by SIGNAL was 1 Vpeak, the same as those of PT stimuli, which could also be generated with the SIGNAL program, so that a direct comparison of response strength between FM, BPN, and PT was possible. Stimulus amplitude remained constant during each trial but was varied between trials through an attenuator (HP 350D). The interstimulus interval was
1 s. Each stimulus was repeated 20 times for each neuron. All stimuli were amplified with a power amplifier (Hafler, SE 120) and played back with a high-fidelity loudspeaker (Infinity 5 Kappa) in free field. The loudspeaker was positioned 1.14 m in front of the monkey at the height of the ears.
Electrophysiological recording experiments were carried out in a large dimly lit laboratory room (4.7 x 7.6 x 2.6 m), which was kept as quiet as possible. The sound pressure level (SPL in dB, re 20 µPa) of the noise in the recording room was measured with a Brüel and Kjaer (B & K) -in. condenser microphone (#4133, free-field) and a B & K Precision Sound Level Meter (#2235; A-weighting scale). The constant background noise had its peak level (35 dB) at 0.5 kHz (i.e., outside the effective range of most neurons). The amplitude of the sound stimuli was set
1020 dB above the background level. The standard SPL for pure tones and FM sweeps was 6085 dB, as measured at the monkey's head, which was well above the background noise level but still within the linear range of our sound delivery system.
STIMULUS CALIBRATION AND SOUND FIELD.
The stimulus delivery system was calibrated with the same B & K equipment. Between 0.4 and 24 kHz, the output varied by ±6 dB. Above and below this range a roll-off existed, partially because the signal was outside the measuring range of the microphone. The fidelity of the system in producing rapid FM sweeps was also tested, and near free field conditions were ensured. Details of calibration were reported in previous papers (Tian and Rauschecker 1994
, 1998
).
Electrophysiological recording
For extracellular recording of neuronal spike activity, a lacquer-coated tungsten electrode (F. Haer, impedance about 1 M
) was advanced into the brain by a hydraulic micropositioner with remote-controlled stepping motor (Model 650, David Kopf). Each penetration position was recorded with the National Institutes of Health (NIH) Image program on a Macintosh computer and a CCD camera (Panasonic) mounted on a surgical microscope (Zeiss), so that the penetration sites could be reconstructed. The electric signals were band-pass filtered (0.320 kHz) and amplified in 2 stages (A-M Systems preamplifier, Model 1800; Tektronix, AM 502). A "slicer" module (A.B. Bonds, Vanderbilt University) was used to reliably separate spikes from more than one neuron and filter out background noise. The output of the slicer was monitored with an audio monitor (Grass Instruments, AM 8), and a window discriminator was used to convert spikes with different amplitude levels into TTL signals. In addition, the signal at each step was monitored on an oscilloscope (Tektronix, 5113 Dual Beam Storage), so that we are quite confident that only isolated single-unit activity was recorded. The TTL signals were then registered on an IBM-compatible PC with a data collection program (HIST, Spikes Systems), which produced peristimulus time histograms (PSTHs) and raster displays with a bin width of 1 ms for on- or off-line evaluation.
Standard sets of digitized complex sounds were used as search stimuli while the electrode was lowered. When a unit was isolated, we first attempted to determine the best frequency (BF) of the neuron and its lowest excitation threshold at the BF with tone bursts. In this case, threshold was defined as the amplitude of a BF tone at which an increase of activity above the spontaneous level was just noticeable. The frequency-tuning range of the neuron was then determined at a set sound pressure level 620 dB above threshold. If a neuron was not, or only slightly, responsive to pure tones, the best center frequency (BFc) of the neuron was determined by varying the center frequency of BPN bursts, generally with a bandwidth of 1/3 or 1 octave. The center frequency of a BPN at which the response was strongest was taken as the BFc of the neuron. FM sweeps were played back at the same or lower amplitudes as PT or BPN stimuli. PSTHs and raster displays were recorded during presentation of all sound stimuli.
At the end of each penetration, electrolytic microlesions (7 µA, 7 s) were made to mark the electrode tracks and specific recording sites.
Data analysis
To quantify the response to stimulation with PT, BPN bursts, and FM sweeps in the lateral belt areas, the same "peak firing rate" was determined from the PSTHs as described in previous studies (Tian and Rauschecker 1994
, 1998
). In brief, a 10-ms window was slid at 1-ms steps across the PSTH, the number of spikes in this 10-ms interval was counted at each step until the maximum was found, and the average firing rate in the peak interval was calculated after subtracting spontaneous activity. The ratefrequency curves thus obtained were smoothed with a 3-point smoothing routine. Sometimes several local maxima were detected in these ratefrequency curves. A secondary or tertiary peak was defined if the firing rate between 2 local maxima dropped by
50% compared with either maximum.
FM responses were analyzed in an analogous fashion by creating "ratefrequency" curves from instantaneous frequency (IF). Because linear FM sweeps were used, the time axis in the PSTH can be easily converted into a frequency axis, as follows. The duration of the FM stimulus was divided into 50 equal intervals (the shortest FM stimulus was 50 ms), after which the peak firing rate in each interval was determined. Thus instead of looking for the peak response within the 10-ms window for the whole PSTH, the local peak response was determined in each interval. In the case of an FM duration of 1,600 ms, for example, the duration of each interval was 32 ms, but the width of the sliding window was kept at 10 ms. The obtained values were smoothed and plotted for both upward and downward FM sweeps against the IF at the center of the peak interval. Best instantaneous frequency (BIF) was defined as the IF at the global maximum in the ratefrequency curve. The BIFs at different FMRs were averaged separately for each direction (upward and downward). These averaged values were compared with the BFs determined from PT stimulation. The appropriate transformation into the frequency domain required an estimation of response latency, which was obtained from the shortest onset latency in response to any of the stimuli tested. Because it was sometimes difficult to estimate the minimum latency due to the relatively high spontaneous activity, the calculated ratefrequency curve was compared with the peak response for the whole FM duration. If the peak was not included in the ratefrequency curve, then the latency was reassessed until this was the case. Multiple maxima in the response to FM stimuli were defined by the 50% criterion (as for PT responses) and by their occurrence at corresponding IFs for both upward and downward sweep at least for one FMR.
To assess the neuron's response to different FMR, the peak firing rates at each FMR were determined and plotted as FMR tuning curves. According to the neuron's responses to the different FMRs, FMR tuning was categorized as high-pass (HP), low-pass (LP), band-pass (BP), all-pass (AP), or band-rejection (BR) behavior. A neuron was classified as HP or LP if the response dropped below 75% of the maximum response at lower or higher FMRs, respectively. A BP neuron had a clear optimum in the middle of the FMR tuning range, whereas the response of an AP neuron was never <75% of the maximum. A BR neuron had 2 optimal FMRs with <5% difference in firing rates separated by FMRs with firing rates <75% of the maximum (Tian and Rauschecker 1994
, 1998
). The preferred FMR (PFMR) was defined as the FMR at which the response was maximal in a given FM direction. For statistical evaluation, the PFMR in the preferred direction was used if the neuron was FM direction-selective (see following text); otherwise, the PFMR at which the response was maximal was used. Because linear FMs were used in our study but the cochleotopic scale is rather logarithmic, the actual FMR slows down at higher frequencies with regard to the cochleotopic organization. In other words, a neuron with a higher BFc could have the same instantaneous PFMR as that of another neuron with a lower BFc, although their overall PFMRs measured on a linear scale (Hz/ms) were different. To compensate for this effect, one has to determine the instantaneous PFMR on a logarithmic scale (octaves/s) at the BFc of a neuron. When
t is made small enough, the instantaneous FMR at the BFc of a neuron can be approximated for the upward sweep by the equation
![]() |
![]() |
Another way to assess a neuron's sensitivity to FMR is to calculate the centroid (Kowalski et al. 1995
). Because the steps of FMR were logarithmic, the original formula had to be modified to reflect the logarithmic steps
![]() |
To assess FM direction selectivity (DS), a quantitative index was calculated using the equation
![]() |
In case a neuron was tested at more than one intensity level, the best response was used for population analysis.
Histology
At the end of an experiment, the animal was deeply anesthetized with sodium pentobarbital (Nembutal, 60 mg/kg, intravenous) and perfused transcardially with 0.9% saline followed by 4% paraformaldehyde in 0.1 M phosphate buffer. The brain was removed from the skull, blocked stereotaxically, and stored in fixative at 4°C. The part of the brain containing the auditory cortex was sunk in increasing sucrose gradients and cut in 50-µm-thick frontal sections on a freezing microtome. The sections were mounted on slides and stained with cresyl violet or thionin. Electrolytic microlesions were identified under a microscope (Wild) and the sections drawn with a camera lucida (Leitz). The lesion sites were compared with the stereotaxic measurements taken during the recording session to verify the location of the recording sites.
| RESULTS |
|---|
|
|
|---|
Recording sites
Parallel electrode penetrations were made into the superior temporal gyrus (STG) along the lateral sulcus (LS) under visual guidance. Figure 1A shows surface electrode penetration sites through the left hemisphere in one monkey (CR696). In most cases, we aimed to angle the penetrations orthogonally to the cortical surface. However, because of the curvature of the STG and the goal to keep penetrations parallel to one another, some electrode tracks ended up being slightly oblique to the cortical surface (Fig. 1B). In some cases (with more medial coordinates), penetrations into the supratemporal plane (STP) were also made through overlying parietal cortex.
|
Responses to FM sweeps: general properties
When stimulated with linear FM sweeps, 165 of 175 lateral belt neurons (94%) responded in at least one FM direction (Fig. 2). Only in 10 units (6%), largely from CL, a response was not detectable at any FM rate or direction. It cannot be excluded that the PFMR in these cases was outside the range of FMR tested.
|
Selectivity for FM direction
To characterize FM direction selectivity quantitatively, a direction selectivity (DS) index was calculated at each FMR, as described in METHODS, and was plotted against FMR (Fig. 3, bottom panels). A neuron was considered direction selective when the response in one FM direction for one or more FMR was at least twice as large as that in the other direction (Mendelson and Cynader 1985
). This corresponds to an absolute value of the DS index of 0.33, indicated by the dashed lines in Fig. 3. If there were points above the dashed line (i.e., the DS value was >0.33), then the neuron was classified as preferring upward sweeps (Fig. 3, A and B). Conversely, if there were points below the lower dashed line (i.e., the absolute value of DS was >0.33), then the neuron was classified as preferring downward sweeps (Fig. 3C). Sometimes, the direction preference depended on the FMR, i.e., preferring one direction at one FMR, but the opposite direction at another FMR (not shown in Fig. 3). These neurons were classified as preferring up- and downward sweeps. If the DS value was not greater than +0.33 or smaller than 0.33 at any FMR tested, then the neuron was classified as not direction-selective.
|
|
Tuning to FM rate
The example in Fig. 2 showed that lateral belt neurons responded differentially to different FMRs. When the spike rate was plotted against the FMR in a particular FM sweep direction, FMR tuning curves could be derived (Fig. 3). Often there was a maximum in the middle range of FMRs tested (Fig. 3A, up- and downward; C, downward); in other cases, the maximum was at the lower (Fig. 3C, upward) or higher extreme (Fig. 3B, downward). Occasionally, 2 peaks could be detected in the FMR tuning curve (Fig. 3B, upward). Additional examples of FMR tuning curves are shown in Fig. 5. The responses here at different FMRs were normalized to the maximum response. As defined in METHODS, FMR tuning was categorized as high-pass (HP) (Fig. 5A), low-pass (LP) (Fig. 5C), band-pass (BP) (Fig. 5B), all-pass (AP) (not shown here), or band-rejection (BR) (Fig. 5D).
|
|
|
The preferred FM rate (PFMR) was defined as the FMR at which the response was maximal in a given FM direction (see METHODS). BR neurons were excluded here because a defined PFMR could not be determined. When the distributions of the PFMRs for the 3 areas were plotted, there was a clear difference between the 3 areas (Fig. 8). AL neurons preferred lower FMRs. More than half of the neurons had their PFMRs below 64 Hz/ms (24/31 = 77% and 17/31 = 55% for upward and downward sweeps, respectively), with their medians at 25 and 50 Hz/ms, respectively. CL neurons, in contrast, preferred higher FMRs. About 70% of CL neurons preferred FMRs above 64 Hz/ms (23/31 = 72% and 22/31 = 69% for upward and downward sweeps, respectively), with the medians at 160 Hz/ms for both directions. ML neurons preferred FMRs in the midrange. About half of the ML neurons (52/95 = 55% and 37/97 = 38% for upward and downward sweeps, respectively) had their PFMR below 64 Hz/ms, with their medians at 60 and 125 Hz/ms, respectively. The difference between the 3 areas was significant (P < 0.05, df = 2, KruskalWallis test). No significant difference was found in PFMRs between upward and downward directions: AL (P = 0.0513, Wilcoxon signed-rank test), ML (P = 0.0731), and CL (P = 0.4552).
|
Another way to compensate for the "slow-down " effect at higher frequencies is to compute the instantaneous FMR on a logarithmic scale (octaves/s) at the neuron's BFc. Because the best instantaneous frequency (BIF) differed widely from the BFc in many neurons, it is more appropriate to use the BIF rather than the BFc to determine the instantaneous FMR in octaves/s. Data for each lateral belt area are plotted in Fig. 9 for both the upward and downward directions. Again, AL neurons were distributed at lower values with a median of 5.2 and 6.3 octaves/s for upward and downward directions, respectively; ML neurons were in the midrange with medians of 21.5 and 16.9 octaves/s; and CL neurons had higher values with medians of 31.0 and 30.6 octaves/s. The difference between areas was significant for both directions (P < 0.0385 and P = 0.0055 for upward and downward sweeps, respectively; df = 2, KruskalWallis test). When AL and CL neurons were compared directly, the difference was also significant for both directions (P < 0.0205 and P = 0.0013 for upward and downward sweeps, respectively; MannWhitney U test).
|
|
Another way to assess a neuron's FMR tuning preference is to compute the centroid, which is thought to be more stable than the mere PFMR, given that it takes into account the responses at all FMRs tested (Kowalski et al. 1995
; Shamma et al. 1993
). BR neurons were again excluded from this analysis because they would yield a centroid somewhere in the middle where the real response was low. Using this type of analysis, the same preference was seen for each area in a given FM direction (Fig. 11). AL had lower centroids with medians at 69 and 76 Hz/ms for upward and downward sweeps, respectively, ML in the midrange with medians at 86 and 101 Hz/ms, and CL had higher centroids with medians at 98 and 104 Hz/ms, respectively (P < 0.005, df = 2, KruskalWallis test). When the centroids of upward and downward sweeps were compared within each area, ML neurons had higher centroids for downward sweeps than for upward sweeps (P = 0.002, Wilcoxon signed-rank test), but there was no significant difference in the other 2 areas between upward and downward sweeps (P = 0.0897 and P = 0.6951 for AL and CL, respectively, Wilcoxon signed-rank test). As for the PFMRs, similar results were obtained for centroids in the preferred direction. AL had lower centroids, ML had centroids in the mid-range, and CL had higher centroids, with medians at 73, 97, and 101 Hz/ms for each area, respectively (P = 0.0008, df = 2, KruskalWallis test).
|
To investigate whether FMR preference is correlated with the BFc, both PFMR and centroid of a neuron were plotted against its BFc. Such an analysis is useful to exclude the possibility that the differences in FMR tuning between areas are merely the result of a sampling bias in terms of best frequencies. In a regression analysis, there was a weakly positive correlation between PFMR and BFc in ML for downward sweeps (r = 0.34, n = 93, P = 0.0040, ANOVA), but not for upward sweeps (r = 0.25, n = 91, P = 0.1065, ANOVA) (Fig. 12). However, there was no correlation in AL for upward sweeps (r = 0.02, n = 31, P = 0.9518, ANOVA) and even a negative correlation for downward sweeps (r = 0.14, n = 31, P = 0.9413, ANOVA). In CL, no significant correlations were found in the regression analysis either (r = 0.21, n = 29, P = 0.3631, ANOVA and r = 0.16, n = 29, P = 0.1097, ANOVA for upward and downward sweeps, respectively). The same regression analysis was also performed between BFc and the preferred instantaneous FMR in octaves/s, and except for a negative correlation in AL and CL neurons for downward sweeps (r = 0.41, n = 31, P = 0.024, ANOVA and r = 0.38, n = 30, P = 0.039, ANOVA for AL and CL, respectively; Fig. 13) no significant correlations were found between best frequency and PFMR, meaning that the difference in PFMR between belt areas cannot be attributed to a sampling bias.
|
|
FM versus PT responses
DIRECT COMPARISON.
It has been reported that neurons in the lateral belt areas respond poorly to PT stimuli (Merzenich and Brugge 1973
; Morel et al. 1993
; Rauschecker et al. 1995
). This was also true in the present study. Whenever possible, however, PT responses were also assessed on the basis of PSTHs. Therefore a direct comparison of PT and FM responses could be made in 54 of the lateral belt neurons (Fig. 14). Overall, and in most individual cases, the neuronal response to FM was significantly stronger than to PT (P < 0.001, paired t-test). The same was found in the small sample of neurons from area CM, in which a direct comparison was performed.
|
In a linear model, a response of a neuron to an FM sweep will be triggered when the instantaneous frequency (IF) of the FM sweep enters the excitatory tuning range of the neuron; the response reaches its maximum when the IF approaches the BF of the neuron; and the neuron will cease to fire when the IF leaves the excitatory tuning range of the neuron, just like a PT probe stimulus being used to explore the excitatory tuning range. Under linear conditions, this sequence of events should always hold regardless of the time the IF enters or leaves the excitatory tuning range (i.e., independent of FM duration or FMR). Thus in a linear system the response to FM sweeps is directly related to the PT responses; in other words, the response to an FM sweep can be derived from the PT responses. To probe whether the responses to FM sweeps in the lateral belt were indeed linear, rateIF curves were created, similar to the ratefrequency curves in response to PT stimuli. The PSTHs in response to FM were transformed into rateIF curves by substituting frequency for time, and the locations of local and global maxima in these curves were determined. Best instantaneous frequency (BIF) was defined as the IF at the global maximum in the rateIF curve. Both procedures are described in detail in METHODS.
Figure 15 shows examples of rateIF curves at 6 different FMRs from the same neuron shown in Fig. 3. The BIFs were in general around 3 and 4 kHz, but varied with the FMR. Exceptions were found in the rateIF curves at 290 Hz/ms for upward sweeps and 36.25 Hz/ms for downward sweeps, where the BIFs were considerably higher (>1 octave). The BIFs also deviated from the BFc obtained with BPN bursts, which was 2 kHz.
|
|
As mentioned above, a linear model also predicts that the BIFs of a neuron should be close to its BF (when tested with PT) or its BFc (when tested with BPN). Use of BPN bursts is customary in the lateral belt because neurons are not very responsive to PT. In 159 of the 165 lateral belt neurons tested with FM sweeps it was possible to determine the BFc, so a direct comparison between BFc and BIF could be performed. In Fig. 15, the BFc was at 2 kHz, significantly removed from the BIFs at most FMRs. There were cases when the BIFs were close to the BFc over a substantial portion of the FMR range tested (Fig. 16A, upward sweeps at middle FMRs; C, both upward and downward sweeps at lower FMRs; and D, downward at the 2 extreme FMRs), but in many instances BIFs differed from BFc by a wide margin over most of the FMR range tested (Fig. 16B).
When the difference between BIF and BFc was averaged over the 6 different FMRs for each direction, the result was greater than 1/3 octave in 75% (24/32) and 78% (25/32) of AL neurons for upward and downward sweeps, respectively. The corresponding numbers were 78% (76/97) and 77% (75/97) for ML neurons, and 77% (23/30) and 77% (23/30) for CL neurons (Fig. 17, left panels). The distribution for upward and downward sweeps was almost identical in all 3 areas. More than 90% of lateral belt neurons had a SD larger than 1/3 octave (Fig. 17, right panels), indicating that the BIF varied to a large extent in lateral belt neurons. These data suggest that FM responses in lateral belt neurons are not based on linear mechanisms.
|
| DISCUSSION |
|---|
|
|
|---|
Neurons in all 3 lateral belt areas, AL, ML, and CL, were found to respond well to FM sweeps of various rates and direction. The peak firing rate in response to FM sweeps was significantly higher in most cases than to PT bursts of any frequency. This finding alone confirms previous reports that FM sounds are more adequate stimuli than PT bursts for neurons beyond the primary core areas (Tian and Rauschecker 1994
, 1998
). FM glides occur very commonly in many natural sounds, including those used for communication by many animal species as well as humans. The finding is also consistent with the notion that neurons respond to increasingly complex stimuli as one moves away from the primary sensory areas. FM sweeps with their frequency changing over time certainly have a more complex spectrum than stationary tone bursts.
Because our electrode penetrations were made into the exposed surface of the STG under visual guidance and we used histological track reconstruction, the penetrations were guaranteed not to go into primary auditory core areas, which are situated on the STP inside the LS (Merzenich and Brugge 1973
; Morel et al. 1993
; Rauschecker et al. 1997
). At the same time, we are fairly confident that our recording locations were in the belt and not in the "parabelt " (Hackett et al. 1998
; Morel et al. 1993
) because they were placed as closely as possible to the lateral sulcus. However, this can be decided with more certainty only on the basis of combining electrophysiological recording with histochemical staining (Hackett et al. 1998
). The belt/parabelt border varies between studies (and perhaps individual monkeys) (Hackett et al. 1998
; Morel et al. 1993
). Future work will have to determine the exact borderline between these two regions by systematically exploring neuronal response properties along the mediolateral extent of the STG.
Nonlinearity of FM responses in the lateral belt
Neurons in the lateral belt areas responded to FM sweeps usually with a peak in the PSTH that was much shorter than the FM stimuli, and the onset of the peak changed clearly with FMR. This indicates that the response was triggered by a particular portion of the FM sweep. To determine which portion was responsible for the peak, rateIF tuning curves were derived from the PSTHs and the BIF values were determined for each FMR. In lateral belt neurons, BIF values varied considerably with FMR, but no systematic relationship could be established between BIF and FMR. BIF values also differed significantly from the BF or BFc. These analyses suggest that responses to FM sweeps in lateral belt neurons are based on nonlinear mechanisms. Linear responses were found in A1 and AAF (Kowalski et al. 1995
; Tian and Rauschecker 1994
), but not in PAF (Tian and Rauschecker 1998
), indicating a hierarchy of information processing in the cat's auditory cortex. Similarly, the nonlinear responses in lateral belt neurons confirm that the lateral belt represents a higher stage than A1 in the information processing hierarchy of the rhesus monkey.
FM rate and direction
Most neurons in the lateral belt areas were highly selective for the rate and direction of an FM sweep. In terms of direction selectivity, the proportions of neurons preferring upward or downward sweeps were roughly equal in ML and CL. More neurons in AL preferred downward than upward sweeps, but this might be attributable to the limited sample size. A higher proportion of neurons preferring downward sweeps was also found in cat A1 (Heil et al. 1992a
; Mendelson and Cynader 1985
), whereas more neurons in AAF and PAF preferred upward sweeps (Tian and Rauschecker 1994
, 1998
). The possible behavioral significance of these asymmetries has yet to be explained.
In terms of rate selectivity, lateral belt neurons can display low-, high-, or band-pass behavior. In addition, band-rejection (BR) behavior was found in some lateral belt neurons. Band-rejection is complementary to band-pass, but the function of BR neurons may be quite similar to BP neurons, as they filter out FM sounds at particular rates. In cats, BR was found in PAF but not in AAF neurons.
Specificity for different FM rates in AL and CL
The central finding of the present study is the clear difference in preferred FM rates between areas of the lateral belt. Neurons in AL generally responded better to slower FM sweeps (in the range of tens of Hz/ms), whereas neurons in CL responded best to very fast FM sweeps (in the range of hundreds of Hz/ms). ML neurons included all FM rates, which is consistent with the lower processing level of ML compared with AL and CL (Hackett et al. 1998
; Rauschecker and Tian 2000
). We performed various control analyses to assure ourselves that the difference in preferred FM rates between areas was not the result of a sampling bias, whereby more neurons with low and high BFc would have been sampled in AL and CL, respectively. This was clearly not the case, nor was there a consistent cor