|
|
||||||||
Kresge Hearing Research Institute, University of Michigan, Ann Arbor, Michigan
Submitted 27 June 2004; accepted in final form 2 April 2005
| ABSTRACT |
|---|
|
|
|---|
10 ms. In the present study, neurons typically responded to the leading stimulus with a brief burst of spikes, followed by suppression lasting 100200 ms. At an ISD of 20 ms, at which listeners report a distinct lagging sound, only 12% of units showed discrete lagging responses. Long-lasting suppression was found in all sampled cortical fields, for all leading and lagging locations, and at all sound levels. Recordings from awake cats confirmed this long-lasting suppression in the absence of anesthesia, although recovery from suppression was faster in the awake state. Despite the lack of discrete lagging responses at delays of 120 ms, the spike patterns of 40% of units varied systematically with ISD, suggesting that many neurons represent lagging sounds implicitly in their temporal firing patterns rather than explicitly in discrete responses. We estimated the amount of location-related information transmitted by spike patterns at delays of 116 ms under conditions in which we varied only the leading location or only the lagging location. Consistent with human psychophysical results, transmission of information about the leading location was high at all ISDs. Unlike listeners, however, transmission of information about the lagging location remained low, even at ISDs of 1216 ms. | INTRODUCTION |
|---|
|
|
|---|
The precedence effect often is studied by presenting 2 brief sounds from different directions with a variable interstimulus delay (ISD) (reviewed by Litovsky et al. 1999
). When the ISD is set below the echo threshold (typically about 5 ms), the pair of stimuli is perceived as a single fused sound. This failure to hear the lagging stimulus as a separate sound is called echo suppression. At these short ISDs, the location of the leading sound source dominates the perceived location of the soundan observation called localization dominance. As the ISD is increased just above the echo threshold, the lagging sound is heard separately, but spatial attributes of the lagging stimulus are poorly discriminated. This reduced spatial sensitivity for the lagging sound, called lag localization suppression, is strongest at ISDs <510 ms (depending on the listener, type of stimulus, and psychophysical task). At ISDs >10 ms, the lagging sound is heard separately and is easily localized.
Lesion studies and physiological recordings indicate that the auditory cortex is important for sound localization and for the precedence effect. Lesions of the auditory cortex disrupt normal sound localization behavior in humans and cats (Jenkins and Masterton 1982
; Jenkins and Merzenich 1984
; Zatorre and Penhune 2001
). Studies in humans and cats also suggest that at least one aspect of the precedence effectlocalization dominancedepends on an intact auditory cortex (Cornelisse and Kelly 1987
; Cranford and Oberholtzer 1976
). Neurons in the cats auditory cortex are known to be sensitive to the locations of single sound sources (reviewed by Middlebrooks et al. 2001
).
Recent studies have begun to examine the cortical representation of sounds that elicit the precedence effect. Recording from field A1 of awake rabbits, Fitzpatrick and colleagues found that the responses of most units to lagging sounds were strongly suppressed at ISDs of 120 ms, although a minority of units did show discrete lagging responses at delays <20 ms (Fitzpatrick et al. 1999
). In a study of field A1 of anesthetized cats, (Reale and Brugge 2000
) only rarely observed discrete lagging responses at ISDs below 20 ms, but they did find a subset of units whose spike rates were sensitive to the presence of the lagging stimulus. In a study of fields A1 and A2 of anesthetized cats that focused on ISDs below echo threshold, we found that cortical units generally responded to paired sounds with a single burst of spikes, and that the locations signaled by spike patterns generally agreed with localization dominance and summing localization reported by listeners (Mickey and Middlebrooks 2001
).
In the current study, we addressed several outstanding questions about the cortical representation of sounds that elicit the precedence effect. The long-lasting suppression of lagging responses observed in field A1 appears to conflict with observations of the precedence effect. We therefore investigated whether this suppression depends on the cortical field, the locations of leading and lagging sounds, the sound level, or the use of general anesthesia. Recordings were obtained from 3 distinct cortical fields [A1, A2, and posterior auditory field (PAF)] of anesthetized cats and from field A1 of awake cats. A range of sound levels and several combinations of leading and lagging locations were tested. We found that, regardless of these factors, discrete lagging responses were uncommon at ISDs of 1020 ms, at which listeners report a distinct lagging sound. Even in the absence of a discrete lagging response, however, we found that the spike patterns of many cortical neurons were influenced by lagging stimuli. Finally, we tested the predictionbased on psychophysical measurements of lag localization suppressionthat neurons would transmit information about the leading location regardless of the ISD, but that they would transmit information about the lagging location only as the ISD is increased
10 ms. In agreement with the prediction, transmission of information about the leading location was high across ISDs. With the exception of a few notable units, however, information transmission about the lagging location failed to rise as the ISD was increased to as great as 16 ms.
| METHODS |
|---|
|
|
|---|
This report describes data from 14 anesthetized cats and 5 unanesthetized cats. All procedures complied with guidelines of the University of Michigan Committee on Use and Care of Animals. All animals were purpose-bred young adults (Harlan, Indianapolis, IN). Cats studied with anesthesia were males with body weights ranging from 3.5 to 5.5 kg. The anesthetized preparation was previously detailed (Middlebrooks et al. 1998
) and is only summarized here. Isoflurane anesthesia was used during surgery and intravenous
-chloralose anesthesia was used during unit recording. A craniotomy about 1 cm in diameter exposed the middle ectosylvian gyrus of the right hemisphere. A plastic retainer was cemented to the ventral margin of the opening to create a recording chamber. The animal was positioned with its head in the center of the sound chamber, its body supported in a sling with a heating pad, and its head supported by a bar attached to a skull fixture. A recording probe was inserted into active layers of the cortex. The laminar locations of recorded neurons were unknown, but we presume that most neurons were pyramidal cells in the middle layers. Probes were silicon-substrate 16-channel electrodes (Anderson et al. 1989
) with inter-site spacing of 100 or 150 µm, designed for acute extracellular recording, and provided by the University of Michigan Center for Neural Communication Technology. After completely studying a cortical location, the probe was moved so that a new set of cortical units could be studied.
Cats studied without anesthesia were females with body weights ranging from 2.9 to 5.2 kg. The awake preparation was previously detailed (Mickey and Middlebrooks 2003
) and is only summarized here. Each cat was trained to discriminate a 200-Hz click train (which was associated with a food reward) from noise bursts or clicks presented from random azimuthal locations. Then, using aseptic techniques under isoflurane anesthesia, a craniotomy about 1 cm in diameter was created over the middle ectosylvian gyrus of the right hemisphere and a metallic head fixture was implanted over the craniotomy. After recovery from surgery, a recording probe was placed in the auditory cortex with an external connector housed within the head fixture. Probes were silicon-substrate 16-channel electrodes (Anderson et al. 1989
) with intersite spacing of 100 or 150 µm, designed for chronic extracellular recording, and provided by the University of Michigan Center for Neural Communication Technology. In the following days, unit activity was recorded in once- or twice-daily recording sessions while the cat performed the behavioral task or while it sat idly. Responses did not depend strongly on the animals behavior (performing vs. idle), as described previously (Mickey and Middlebrooks 2003
). The cats head was unrestrained, so the position and orientation of the head was recorded with an electromagnetic tracking system (Polhemus fastrak, Colchester, VT). All stimulus locations are reported in head-centered coordinates (Mickey and Middlebrooks 2001
). Figure 3 E shows the distribution of head orientation for data described in the current report; the central 90% of measurements of head azimuth, elevation, and roll fell within the ranges 51 to +45, 19 to +31, and 14 to +28°, respectively. After a cortical location was completely studied or responses deteriorated (typically after a few days to a few weeks), the probe was removed and a new probe was placed so that a new set of units could be studied.
|
Sounds were presented under free-field conditions with an apparatus that was described previously (Middlebrooks et al. 1998
). In brief, a sound-attenuating chamber (dimensions 2.6 x 2.6 x 2.5 m) was lined with sound-absorbing foam to suppress reflections. Fifteen 2-way coaxial loudspeakers were positioned on a horizontal circular hoop from 60 to +60° in 10° steps, and at ±80°. Loudspeakers were located 1.2 m from the cats head at various azimuths. The location directly ahead of the animal was assigned an azimuth of 0°, negative azimuths were to the left (contralateral to the recorded hemisphere), and positive azimuths were to the right. Experiments were controlled by custom MATLAB software (The MathWorks, Natick, MA) running on a Pentium-based personal computer with signal processing instruments from Tucker-Davis Technologies (Gainesville, FL). Computer-controlled 2-channel D/A converters and multiplexers allowed sounds to be presented from single loudspeakers or from pairs of loudspeakers simultaneously.
Physiological experiments used clicks, noise bursts, pure-tone bursts, and click trains. The stimulus passband was 0.530 or 0.230 kHz. Because transfer functions generally differed between loudspeakers, each loudspeaker was individually calibrated by obtaining an impulse response (Zhou et al. 1992
). Stimuli were created by convolution of the intended signal with the inverse impulse response of the intended loudspeaker. Click stimuli were 100-µs rectangular pulses. Noise bursts were samples of Gaussian noise with abrupt onsets and offsets; a different random sample was presented on each trial. Pure-tone bursts were ramped on and off with 5-ms raised-cosine functions. Stimulus waveforms were generated with 16-bit precision at a sampling rate of 100 kHz.
Sounds were delivered every 12 s in pseudorandom order such that all stimulus conditions were tested once before repeating all stimuli again in a different pseudorandom order. During experiments with anesthetized animals, we typically presented a block of 1020 repetitions of each single-stimulus condition and a block of 10 repetitions of each paired-stimulus condition, and then repeated each block at least once more. The blocks were interleaved to reduce the potential effects of any variation of neuronal responsiveness during the 2- to 4-h stimulus set. Stimuli were presented at 2 or 3 levels in steps of 10 dB, about 2040 dB above unit threshold. Reported sound levels refer to the sound field at the location of the cats head; the sound levels at the 2 tympanic membranes generally differed because of directional filtering by the pinnae. Unlike in our previous report (Mickey and Middlebrooks 2001
), in this report sound levels for paired stimuli refer to the level of each member of the pair (rather than their sum) relative to unit threshold for a single stimulus, and the levels of leading and lagging stimuli were equal.
Data acquisition and spike sorting
During recording sessions, the activity at 16 probe sites was recorded simultaneously and written to the computer hard disk as previously described (Mickey and Middlebrooks 2003
, 2001
). Spike times were expressed relative to the onset of D/A conversion, so latencies include 3.5 ms of acoustic travel time. Spike sorting was performed off-line using custom software based on principal-component analysis of spike shape, as described previously (Furukawa et al. 2000
; Mickey and Middlebrooks 2003
). The quality of unit isolation was characterized based on scatterplots of the first 2 or 3 principal-component weights and on histograms of interspike intervals (Mickey and Middlebrooks 2003
). Single units were characterized by discrete clustering in principal-component space and a lack of interspike intervals <1 ms. In the majority of cases, we recorded multiunit clusters. In those cases, spikes either could not be ascribed reliably to a single neuron or clearly originated from 2 or more neurons. In the present study, we use the term "unit" to refer to a single unit or a multiunit cluster. From anesthetized animals we recorded 65 single units and 805 multiunit clusters from 65 probe placements across 14 animals. From awake animals we recorded 6 single units and 38 multiunit clusters from 8 probe placements across 5 animals. Most of those units were included in a previous study that described spatial sensitivity for single noise burst stimuli (Mickey and Middlebrooks 2003
).
Data analysis
The assignment of each unit to a cortical field was based on pure-tone frequency-tuning curves and the location of recording sites relative to gross anatomical landmarks. Units in field A1 were distinguished by narrow frequency tuning (half-maximal bandwidth
1.33 octaves at about 40 dB above threshold) and by locations on the dorsal part of the middle ectosylvian gyrus. Units in the dorsal zone of A1 (field DZ) were dorsal and caudal to A1 proper and were distinguished by complex frequency tuning (band-pass, low-pass, highpass, or multipeaked tuning curves). Units in field A2 were located ventral to A1 and exhibited complex frequency tuning. Units in the posterior auditory field (PAF) were located within or near the anterior bank of the posterior ectosylvian sulcus, they exhibited complex frequency tuning, and they typically responded with median first-spike latencies >25 ms. Of 870 units from anesthetized animals, 271 (31%) were from A1, 13 (1.5%) were from DZ, 466 (54%) were from A2, and 103 (12%) were from PAF. For 17 units (2%), the cortical field was unassigned. Units recorded from awake animals were located in A1 (Mickey and Middlebrooks 2003
).
To quantify the suppression of unit responses to lagging sounds, we calculated recovery times for each unit. First, we determined the average number of spikes per trial obtained when an isolated click was delivered from the lagging loudspeaker (i.e., in the absence of a leading stimulus). This number was estimated by counting spikes in a 30-ms window centered on the time of peak response to that loudspeaker. Second, for each ISD tested with paired clicks, we determined the average number of spikes per trial in a 30-ms window centered at the expected time of a lagging response (i.e., the window was delayed by a time equal to the ISD). The normalized spike count arising from the lagging click at each ISD was computed by dividing by the number of spikes recorded after an isolated click. Third, we found the smallest tested ISD at which the normalized spike count was
0.10, 0.50, and 0.90. Recovery times t10, t50, and t90 were calculated by linearly interpolating between this ISD and the next smallest tested ISD and finding the ISD at which the normalized spike count crossed 0.10, 0.50, and 0.90. In some cases, recovery times were undefined because the response to the isolated click was too small (the mean spike count across trials was <2 SEs above zero).
To estimate the amount of information transmitted by unit responses about source location or ISD, we used a pattern-recognition algorithm and information-theoretic analysis. The procedure is described elsewhere in detail (Mickey and Middlebrooks 2003
) and summarized here. The pattern-recognition approach was favored because it is sensitive to stimulus-related information carried by neuronal spike times. An alternative would have been to reduce multidimensional temporal patterns to one-dimensional spike counts. We tested a spike-count-only analysis, but found that elimination of spike-timing information reduced transmitted information by about 36%; similarly, our previous work has demonstrated that spike counts transmit 3756% less information than do full spike patterns (Furukawa and Middlebrooks 2002
; Mickey and Middlebrooks 2003
; Middlebrooks et al. 1998
). Because the issue of information loss is not central to the current experimental questions, we illustrate only the results of the analysis that used full spike patterns.
The spike patterns of each unit were analyzed in 3 steps. In the first step, we randomly divided trials into 2 sets of equal size and computed average response patterns RA(s) and RB(s) for each set, where s is the set of unique combinations of stimulus parameters. Response patterns consisted of spike density functions (1-ms time bins, 1-ms Gaussian convolution) over the range of time 1060 ms after stimulus onset for each location or ISD.
In the second step, we performed pattern recognition on RA and RB using a probabilistic neural network constructed using the MATLAB Neural Network Toolbox (Demuth and Beale 2000
). Network weights and biases were calculated such that the input of each RA resulted in an output YAA that corresponded to the true source location or ISD. To characterize how consistent RB was with RA, RB was presented to the network and the output YAB was recorded. RA and RB were then interchanged and an output YBA was obtained in a similar manner. The greater the similarity of stimulus-related responses in RA and RB, the more closely YAB and YBA estimated the true stimulus values. The entire procedure was repeated a total of 50 times (i.e., 50 different randomly derived responses, RA and RB, were analyzed), yielding 100 outputs for each location or ISD.
In the third step, the amount of stimulus-related information transmitted by the pattern recognition was computed. A confusion matrix was constructed from the 100 network outputs YAB and YBA. Element i, j of the confusion matrix consisted of the number of network outputs at stimulus value i for the true stimulus value j. The accuracy of network estimates was characterized by computing the transmitted information (T), i.e., the mutual information (Rieke et al. 1997
). The more closely that network outputs estimated the true stimulus locations, the greater the value of T. In general, T was greatest when average unit responses for various stimulus locations were easily discriminable; T was near zero when responses varied little between different stimuli. Chance variation of spike patterns with location or ISD led to overestimation of T, so we ultimately used the corrected transmitted information, Tcor = T T0, where T0 was determined by a control analysis. In the control analysis, the trial-by-trial association of stimulus location with unit responses was reassigned randomly. The subsequent 3-step analysis was identical to that used to determine T. Rather than computing T0 only once as previously described (Mickey and Middlebrooks 2003
), we computed T0 520 times and took the mean value to further reduce noise originating from randomization.
To determine whether a unit transmitted a statistically significant amount of stimulus-related information, we used a distribution-free test: if T was greater than the first 20 independently calculated values of T0, then Tcor was considered significant (P < 0.05). Throughout this report we use the relative information, Trel = Tcor/Tmax, where Tmax is the maximum theoretical information (i.e., the entropy of the stimulus set,
, where pi is the probability of stimulus type i). The relative information reflects the total amount of information transmitted by a particular unit during the recording session.
| RESULTS |
|---|
|
|
|---|
Suppression of lagging responses
Although listeners report a distinct lagging sound at ISDs
5 ms, unit responses typically lacked a discrete component that corresponded to the lagging sound as the ISD was increased to >5 ms. Figure 1, AC shows a raster display for a representative unit. In response to single clicks, this unit responded strongly to contralateral clicks (50°) and weakly to ipsilateral clicks (+50°) (Fig. 1A). Although an ipsilateral click failed to reliably evoke a response, it nonetheless produced complete suppression of the response to a contralateral click presented 125 ms later (Fig. 1B). This suppression was long lasting: the unit responded to the lagging sound only when the ISD was
150 ms (Fig. 1C). Figure 1, DF shows one of the few units in our population that responded strongly and discretely to the lagging click at ISDs <25 ms (Fig. 1E). We classified units as showing a discrete lagging response if the number of spikes elicited within the expected range of latency was significantly greater when a lagging click was present than when it was absent (P < 0.01, Wilcoxon rank-sum test). Only 34 of 290 units (12%) showed a discrete lagging response at an ISD of 20 ms
a delay at which human and cat listeners report a distinct lagging sound. Thus for the vast majority of units, the suppression that followed the leading sound persisted beyond ISDs at which listeners report hearing the lagging sound.
|
|
Responses in awake animals
The findings of previous studies conducted with or without anesthesia suggest that the use of general anesthesia may augment the suppression of lagging responses (Fitzpatrick et al. 1999
; Reale and Brugge 2000
). We addressed this question by recording from field A1 using an awake cat preparation (Mickey and Middlebrooks 2003
). Forty-four click-responsive units were recorded from 5 animals. Figure 3 shows peristimulus time histograms for the ipsilateralcontralateral configuration for 3 representative units. The unit shown in Fig. 3A was unusual among our population in that it exhibited clear lagging responses even at ISDs <20 ms. Note that this unit showed little or no response to the leading click (best illustrated at an ISD of 100 ms). The example in Fig. 3B was more typical in that this unit showed discrete lagging responses at ISDs
50 ms, but not at ISDs
20 ms.
As with units in anesthetized cats, we classified units as showing a lagging response if the number of spikes elicited by paired clicks within the expected range of latencies was significantly greater than the background firing rate (P < 0.01, Wilcoxon rank-sum test). Figure 3D compares responses in the awake preparation to those in the anesthetized preparation for the ipsilateralcontralateral loudspeaker configuration. Filled circles represent data from awake animals (1141 units from 27 probe placements in 14 animals, depending on the ISD tested). Open circles represent comparable data from anesthetized animals (290446 units from 2031 probe placements in 79 animals, depending on the ISD tested). It should be noted that there was a significant amount of heterogeneity between probe placements and between animals in the anesthetized preparation (P < 0.05, chi-square test for homogeneity among dichotomous populations). Nonetheless, the overall proportion of units that showed a lagging response at a given ISD was greater in awake animals than in anesthetized animals. Thus suppression was not as long-lasting in awake animals. Because spontaneous rates also tended to be greater in the awake preparation, we sometimes observed suppressive responses. Figure 3C shows a unit that responded to leading and lagging clicks with a decrease in firing below the baseline rate. In summary, suppression was shorter lasting in awake animals, but the great majority of units nonetheless failed to respond to lagging sounds at ISDs above behavioral echo thresholds.
Modulation of spike patterns by lagging sounds
As described above, the vast majority of units failed to exhibit discrete responses to lagging sounds at ISDs <20 ms. Nonetheless, many of these units exhibited spike rates or spike timing that depended on the ISD. That is, the response to the leading sound was modulated by the presence of a lagging sound. Two examples are shown in Fig. 4. The unit in Fig. 4, AC mainly exhibited variation in spike timing (Fig. 4B), but also showed some variation in spike rate (Fig. 4C), as a function of ISD. The unit in Fig. 4, DF showed some variation in spike timing (Fig. 4E) and a strong dependency of spike rate (Fig. 4F). Units such as these indicated that, in contrast to the few units that explicitly represented the ISD with a discrete lagging response (for example, Fig. 1, DF), many units might represent lagging sounds implicitly by way of their spike rates or spike patterns. We reasoned that, if units show implicit sensitivity to lagging sounds, then they should be considered part of the cortical representation of paired sounds.
|
Figure 5 shows transmitted information for our unit population for various loudspeaker configurations. The greatest information transmission was noted when the leading sound was ipsilateral and the lagging sound was contralateral to the recorded hemisphere (ipsilateralcontralateral configuration, Fig. 5A). For this configuration, the mean relative information across the population was 0.08, with 40% of units transmitting an amount of information about ISD that was statistically significant (P < 0.05, gray bars in Fig. 5). In contrast, the contralateralipsilateral configuration (Fig. 5B) resulted in low amounts of transmitted information (mean relative information 0.01, 10% significant). Similar to the above analysis of recovery times (Suppression of lagging responses), information transmitted about the delay of the lagging stimulus was less when the leading location was more effective (compare Fig. 5, A to C, B to E, and D to F). The information transmitted by units in cortical fields A1 and A2 was not significantly different under any stimulus configuration (P > 0.05, Wilcoxon rank-sum tests). Compared with A1 and A2, information transmission by PAF units (n = 20) tended to be greater for the midlinecontralateral configuration (PAF median 0.047; A1 median 0.010, P = 0.03; A2 median 0.013, P = 0.09) and less for the ipsilateralmidline configuration (PAF median 0.008; A1 median 0.014, P = 0.03; A2 median 0.042, P = 0.0004).
|
The previous analyses showed that many units were sensitive to the presence of a lagging sound. We were specifically interested in whether the spike patterns of units could also accurately signal the location of the lagging sound. Furthermore, for units that were sensitive to the lagging location, we wanted to know how that sensitivity depended on the ISD. To address those questions, we presented paired sounds under 2 conditions: with a variable leading location and fixed lagging location; or with a fixed leading location and variable lagging location. In each case, the fixed location was always 0° azimuth and the variable location ranged from 60 to +60° azimuth, in 20° steps. In addition, the ISD of paired sounds was varied among 6 values within the range 116 ms. Psychophysical experiments in humans have shown that localization of leading sounds is accurate at all ISDs, whereas localization of lagging sounds is accurate only at ISDs >510 ms. Therefore we predicted that under conditions in which the leading location varied, location-related information transmission would be high regardless of the ISD. In contrast, under conditions in which the lagging location varied, information transmission was predicted to be low at small ISDs (18 ms) and higher at large ISDs (1216 ms).
Figure 6 shows an example of a unit whose responses agreed with those predictions. In response to single clicks, the unit responded more strongly to contralateral sounds than to ipsilateral sounds (Fig. 6A). Figure 6B shows responses when the leading location was varied and the lagging location was fixed at 0°. The unit responded with a single burst of spikes at an ISD of 1 ms (Fig. 6B, top), but a discrete lagging response emerged as the ISD was increased to 16 ms (Fig. 6B, bottom). Although the particular location-dependent temporal pattern of spikes varied with ISD, spike patterns were modulated by the leading location at every ISD tested. Notably, at ISDs of 1216 ms, when the leading location was varied and the lagging location fixed, both leading and lagging components of spike patterns showed sensitivity to the leading location (Fig. 6B, bottom 2 panels). As described above for longer ISDs (Suppression of lagging responses), this units lagging responses were suppressed to a greater extent when the leading stimulus was presented from more effective locations (contralateral). Figure 6C shows responses when the leading location was fixed at 0° and the lagging location was varied. Responses appeared to depend on the lagging location when the ISD was 816 ms but not when the ISD was 14 ms (Fig. 6C). Inspection of these spike patterns suggested that only the lagging component of the response was location sensitive at ISDs of 1216 ms (Fig. 6C, bottom 2 panels). It is notable that, under both leading-variable and lagging-variable conditions, lagging responses were diminished relative to responses to single stimuli from the same locations.
|
Units such as this one were uncommon. Only 3 of 134 units (2%) showed a dependency of transmitted information on ISD like that shown in Fig. 6, D and E (i.e., a significant amount of information at ISDs of 1216 ms but not 14 ms). More often we found that transmission of location-related information was high when the leading location varied and low when the lagging location varied, regardless of ISD. Distributions of transmitted information across our unit population are shown in Fig. 7. Among 205 units, the information transmitted about the location of single clicks ranged from about 0 to nearly 2 bits (Fig. 7A), with 129 units (63%) showing a statistically significant amount of information (P < 0.05, gray bars in Fig. 7A). [The entropy for this set of 7 stimulus locations is log2 (7), about 2.8 bits, so 2 bits is equivalent to a relative information of 0.71.] These 129 location-sensitive units were analyzed further to determine the amount of information transmitted under paired-stimulus conditions in which either the leading location (Fig. 7B) or the lagging location (Fig. 7C) was variable. When the leading location was varied, units showed relatively high transmission of location-related information: across ISDs of 116 ms, the median relative information was 0.100.18, with 5578% of units transmitting a statistically significant amount of information (Fig. 7B). In contrast, when the lagging location was varied, units transmitted little location-related information: across ISDs of 116 ms, the median relative information was 00.02, with only 223% of units transmitting a statistically significant amount of information (Fig. 7C). Figure 7, D and E shows transmitted information as a function of ISD for the population: open circles represent the mean and solid curves indicate 10th, 50th, and 90th percentiles. In summary, units transmitted a high amount of information when the leading location varied and a low amount when the lagging location varied, with only a weak dependency on the ISD.
|
| DISCUSSION |
|---|
|
|
|---|
Suppression in auditory cortex
The suppression of lagging responses that we observed is qualitatively similar to previous reports. Fitzpatrick and colleagues (1999)
presented dichotic click stimuli while recording from field A1 in awake rabbits and found that the median recovery time for their unit population was 22 ms. We found somewhat more suppression, with only 14% of units showing discrete lagging responses at ISDs of 1620 ms in A1 of awake cats. Reale and Brugge (2000)
used virtual acoustic spatial stimuli while recording from A1 of anesthetized cats and found a median recovery time of 73 ms when the lagging sound was from the neurons preferred location and the leading sound was from a nonpreferred location. By comparison, we found that only 16% of units in A1 of anesthetized cats had a t50 <73 ms with the ipsilateralcontralateral configuration. These differences in recovery times may be related to the method of stimulus presentation. It is not known how echo thresholds differ when using virtual acoustic spatial stimuli, but echo thresholds are known to be smaller for dichotic stimuli than for free-field sounds (Litovsky et al. 1999
). Aspects of the animal preparations such as species or type of anesthetic may also have contributed to the variation of results between studies. Nonetheless, all 3 studies agree in that the great majority of cortical neurons are suppressed at ISDs beyond behavioral echo thresholds, even in the absence of anesthesia. It will be of interest for future studies to determine whether different attentional states or different behavioral contexts result in differing amounts of suppression of lagging responses. Although we have not found a behavioral dependency (Mickey and Middlebrooks 2003
), the responses of auditory cortical neurons can vary with the animals behavior (e.g., sound localization; Benson et al. 1981
).
The paucity of lagging responses that we observed at short ISDs is consistent with previous cortical studies that used repetitive stimuli. In those studies, each repetition of the stimulus possesses the same spatial or binaural characteristics, so in terms of the precedence effect, such stimuli correspond to an unrealistic situation in which a direct sound and its echos arise from the same direction. Nonetheless, the comparison may be useful for the temporal aspects of the precedence effect. In general, for repetition rates
50 Hz (periods
20 ms), cortical neurons do not show entrainment (i.e., a response to every stimulus repetition), although many neurons do show phase locking at high rates (Bieser and Muller-Preuss 1996
; de Ribaupierre et al. 1972
; Lu and Wang 2000
; Lu et al. 2001
; Phillips et al. 1989
; Schreiner and Urbas 1986
, 1988
; Steinschneider et al. 1998
). Our results also agree with previous studies that found that phase locking extends to higher repetitions rates in awake animals compared with anesthetized animals (Bieser and Muller-Preuss 1996
; Lu et al. 2001
; Steinschneider et al. 1998
) and in A1 compared with A2 or PAF (Schreiner and Urbas 1988
).
What is the source of the long-lasting suppression that we and others have observed in the auditory cortex? The failure to respond to lagging sounds is likely to be inherited, to some extent, from earlier auditory centers. The characteristics of suppression in the auditory thalamus are unknown, but neurons in the inferior colliculus show suppression similar to that in the cortex, although with a shorter duration (Fitzpatrick et al. 1999
; Litovsky and Yin 1998
; Yin 1994
). The additional suppression found in the cortex seems likely to arise from inhibitory inputs to cortical neurons. Intracellular recordings suggest that the typically phasic responses of cortical neurons may result from an excitatory postsynaptic potential followed a few milliseconds later by a more prolonged inhibitory postsynaptic potential (de Ribaupierre et al. 1972
; Wehr and Zador 2003
). Whereas the suppression is likely to arise (at least in part) by inhibition, it does not appear to be spatial lateral inhibition. That conclusion is based on the finding that, as described here and in another study (Reale and Brugge 2000
), suppression of lagging clicks is generally longer lasting when the leading sound is delivered from a more effective location than when it is delivered from a less-effective location. The suppression may therefore be better characterized as adaptation. Consistent with the idea of adaptation, the suppression is long lasting regardless of whether leading and lagging sounds are presented from the same location or from different locations (Fitzpatrick et al. 1999
; Reale and Brugge 2000
). A definitive answer to this issue will require measurement of postsynaptic potentials while presenting click pairs.
Cortical representation of lagging sounds
The results of physiological studies thus indicate that cortical neurons rarely respond to lagging sounds at ISDs near the behavioral echo thresholds of humans and cats. Presumably, the responses of cortical neurons underlie spatial aspects of the precedence effect, so the question arises: How does the cortex represent lagging sounds and their spatial attributes? We consider 2 possibilities here. If we assume an explicit representation of lagging sounds, that is, a representation based on discrete lagging responses, then we are forced to place the burden of this representation on a small minority of units (Fitzpatrick et al. 1999
; Yin 1994
). In other words, psychophysical performance is attributed to a more-or-less functionally distinct subpopulation of neurons. In support of this view, we found a small number of units (a few percent of our population) that responded to lagging sounds at ISDs near the behavioral echo thresholds of humans and cats and a similar small number that showed spatial sensitivity for lagging sounds.
Alternatively, the representation of lagging sounds may be implicit. In this view, the neuron lacks a discrete lagging response, yet its spike rate or temporal pattern of spikes is sensitive to attributes of the lagging sound. In support of this view, we found that the responses of up to 40% of units showed some sensitivity to the ISD of the stimulus. This observation has a parallel in psychophysics: human listeners are able to discriminate between stimuli with different ISDs even when the ISD is below echo threshold (Blauert 1997
). We found that implicit ISD sensitivity is fairly common, so the implicit representation of sounds would not require one to posit a small minority of specialized units. Similar implicit representations have been proposed in other contexts. Work from our laboratory has shown that the temporal spike patterns of cortical neurons can accurately represent the locations of single sounds (Furukawa and Middlebrooks 2002
; Mickey and Middlebrooks 2003
; Middlebrooks et al. 1998
). An implicit rate-based code has been proposed for the representation of click trains with small interstimulus intervals in the auditory cortex (Lu et al. 2001
). Of course, implicit and explicit representations of lagging sounds are not mutually exclusive: some neurons might represent lagging sounds implicitly whereas others represent them explicitly.
We measured information transmission for the locations of leading and lagging sounds with the goal of comparing the results with human psychophysical measurements. In general, localization (or discrimination between locations) of leading sounds is relatively accurate, whereas localization of lagging sounds is accurate only at longer ISDs (Litovsky et al. 1999
). Thresholds for lag-discrimination suppression have been estimated at 510 ms (Yang and Grantham 1997
). Our results agree with psychophysical results in that substantially more information was transmitted about leading sounds than about lagging sounds, but disagree in that the vast majority of units showed no significant transmission of information about the lagging location at ISDs of 816 ms. Our method for estimating information transmission is independent of the type of coding used by the neuron (e.g., implicit or explicit). Therefore regardless of the coding strategy, it appears that a small subpopulation of specialized neurons must be invoked to account for localization of lagging sounds.
| GRANTS |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
|
|
|---|
Present address of B. J. Mickey: Department of Psychiatry, University of Michigan, Ann Arbor, MI 48109-0118 (E-mail: bmickey{at}umich.edu).
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: J. C. Middlebrooks, Kresge Hearing Research Institute, University of Michigan, 1301 East Ann Street, Ann Arbor, MI 48109-0506 (E-mail: jmidd{at}umich.edu)
| REFERENCES |
|---|
|
|
|---|
Benson DA, Hienz RD, and Goldstein MH Jr. Single-unit activity in the auditory cortex of monkeys actively localizing sound sources: spatial tuning and behavioral dependency. Brain Res 219: 249267, 1981.[CrossRef][ISI][Medline]
Bieser A and Muller-Preuss P. Auditory responsive cortex in the squirrel monkey: neural responses to amplitude-modulated sounds. Exp Brain Res 108: 273284, 1996.[ISI][Medline]
Blauert J. Spatial Hearing: The Psychophysics of Sound Localization. Cambridge, MA: MIT Press, 1997.
Cornelisse LE and Kelly JB. The effect of cerebrovascular accident on the ability to localize sounds under conditions of the precedence effect. Neuropsychologia 25: 449452, 1987.[CrossRef][ISI][Medline]
Cranford JL. Localization of paired sound sources in cats: effects of variable arrival times. J Acoust Soc Am 72: 13091311, 1982.[CrossRef][ISI][Medline]
Cranford JL and Oberholtzer M. Role of neocortex in binaural hearing in the cat. II. The "precedence effect " in sound localization. Brain Res 111: 225239, 1976.[CrossRef][ISI][Medline]
Demuth H and Beale M. Neural Network Toolbox Users Guide, Version 4. Natick, MA: The MathWorks, 2000.
de Ribaupierre F, Goldstein MH Jr, and Yeni-Komshian G. Cortical coding of repetitive acoustic pulses. Brain Res 48: 205225, 1972.[CrossRef][ISI][Medline]
Fitzpatrick DC, Kuwada S, Kim DO, Parham K, and Batra R. Responses of neurons to click-pairs as simulated echoes: auditory nerve to auditory cortex. J Acoust Soc Am 106: 34603472, 1999.[CrossRef][ISI][Medline]
Freyman RL, Balakrishnan U, and Helfer KS. Spatial release from informational masking in speech recognition. J Acoust Soc Am 109: 21122122, 2001.[CrossRef][ISI][Medline]
Furukawa S and Middlebrooks JC. Cortical representation of auditory space: information-bearing features of spike patterns. J Neurophysiol 87: 17491762, 2002.
Furukawa S, Xu L, and Middlebrooks JC. Coding of sound-source location by ensembles of cortical neurons. J Neurosci 20: 12161228, 2000.
Haas H. The influence of a single echo on the audibility of speech. J Audio Eng Soc 20: 145159, 1972.
Imig TJ, Irons WA, and Samson FR. Single-unit selectivity to azimuthal direction and sound pressure level of noise bursts in cat high-frequency primary auditory cortex. J Neurophysiol 63: 14481466, 1990.
Jenkins WM and Masterton RB. Sound localization: effects of unilateral lesions in central auditory system. J Neurophysiol 47: 9871016, 1982.
Jenkins WM and Merzenich MM. Role of cat primary auditory cortex for sound-localization behavior. J Neurophysiol 52: 819847, 1984.
Keller CH and Takahashi TT. Responses to simulated echoes by neurons in the barn owls auditory space map. J Comp Physiol A Sens Neural Behav Physiol 178: 499512, 1996.[Medline]
Kelly JB. Localization of paired sound sources in the rat: small time differences. J Acoust Soc Am 55: 12771284, 1974.[CrossRef][ISI][Medline]
Litovsky RY, Colburn HS, Yost WA, and Guzman SJ. The precedence effect. J Acoust Soc Am 106: 16331654, 1999.[CrossRef][ISI][Medline]
Litovsky RY and Yin TC. Physiological studies of the precedence effect in the inferior colliculus of the cat. I. Correlates of psychophysics. J Neurophysiol 80: 12851301, 1998.
Lu T, Liang L, and Wang X. Temporal and rate representations of time-varying signals in the auditory cortex of awake primates. Nat Neurosci 4: 11311138, 2001.[CrossRef][ISI][Medline]
Lu T and Wang X. Temporal discharge patterns evoked by rapid sequences of wide- and narrowband clicks in the primary auditory cortex of cat. J Neurophysiol 84: 236246, 2000.
May BJ and Huang AY. Sound orientation behavior in cats. I. Localization of broadband noise. J Acoust Soc Am 100: 10591069, 1996.[CrossRef][ISI][Medline]
Mickey BJ and Middlebrooks JC. Responses of auditory cortical neurons to pairs of sounds: correlates of fusion and localization. J Neurophysiol 86: 13331350, 2001.
Mickey BJ and Middlebrooks JC. Representation of auditory space by cortical neurons in awake cats. J Neurosci 23: 86498663, 2003.
Middlebrooks JC and Green DM. Sound localization by human listeners. Annu Rev Psychol 42: 135159, 1991.[CrossRef][ISI][Medline]
Middlebrooks JC and Pettigrew JD. Functional classes of neurons in primary auditory cortex of the cat distinguished by sensitivity to sound location. J Neurosci 1: 107120, 1981.[ISI]