|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1Computation and Neural Systems, California Institute of Technology, Pasadena, California; and 2Department of Neuroscience, Albert Einstein College of Medicine, Bronx, New York
Submitted 31 October 2006; accepted in final form 13 February 2007
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
It is known that the owl is capable of discriminating between nearby frequencies with ease (Quine and Konishi 1974
) and between complex noises that differ by as little as 2.5 dB in a single 1/3 octave band (Konishi and Kenuk 1975
), but the stimuli used in these studies were not characterized in terms of their instantaneous frequency structure. Nonetheless, behavior suggests that the owl is capable of subtle tasks of recognition: for example, the owl has been reported to be able to distinguish between the sounds of a mouse walking through dry leaves versus a mouse walking through chopped-up paper (Konishi and Kenuk 1975
). Combined with the psychophysical work in humans on the relevant stimulus parameters for sound recognition, it is likely that the owl is capable of detecting spectrotemporal information on a relatively short timescale.
The brain stem auditory nuclei of the barn owl have been primarily characterized in terms of their responses to the two major binaural cues for sound localization: the interaural time difference (ITD) and the interaural level difference (ILD). However, all auditory information, including that needed for sound recognition and discrimination, ascends through these nuclei. Keller and Takahashi (2000)
characterized the spectrotemporal tuningi.e., the joint distribution over instantaneous frequency and time of the stimuli preceding spikes (Klein et al. 2000
)of the neurons in the lateral shell of the central nucleus of the inferior colliculus (ICls). ICls receives inputs from both the ITD pathway [by the core of the central nucleus of the inferior colliculus (ICcc); Takahashi and Konishi 1988
] and the ILD pathway [by nucleus angularis (NA) and the nucleus dorsal lemnisci lateralis pars posterior (LLDp); Takahashi et al. 1989
].
Neurons in the initial stages of the ITD processing pathway of the barn owl, including the auditory nerve, nucleus magnocellularis (NM), and nucleus laminaris (NL), are known to phase lock to and thus to convey spectral information for frequencies as high as 8 kHz (Carr and Konishi 1990
; Köppl 1997
; Peña et al. 1996
; Sullivan and Konishi 1984
); phase locking was also shown in ITD-related neurons in mammalian systems (de Boer and de Jongh 1978
; Eggermont 1993
; Kim and Young 1994
; Lewis et al. 2002
; Louage et al. 2004
). In previous work (Christianson and Peña 2006
) we showed that the transition from NL to ICcc includes a convergence across units with similar tuning to optimize the information on ITD conveyed by the firing rates of single neurons. Additionally, the time required for an owl to estimate the ITD of a signal (Konishi 1973
) and the temporal resolution with which human subjects can follow moving auditory stimuli (Grantham and Wightman 1978
) are both on the order of tens of milliseconds. Taken together, this suggests that the computation of ITD involves sufficient pooling across time or population that the neurons of ICcc may show a temporal resolution only on the order of tens of milliseconds; consistent with this, there are no reports of phase locking in the ICcc of owls. This would suggest that the observed envelope coding in the ICls (Keller and Takahashi 2000
) would come from the ILD pathway.
In this study, we analyze the spectrotemporal receptive properties of NL, the locus of the computation of ITD, and ICcc, the terminus of the ITD pathway. We show that throughout this pathway the reliability in response to the time-varying spectrum of the stimulus is maintained despite the loss of phase information for high frequencies. Thus the reduction of noise from NL to ICcc previously described (Christianson and Peña 2006
) and the loss of phase locking do not come at the expense of the representation of spectrotemporal cues and, in fact, maintain a millisecond-resolution envelope code of the stimulus spectrum.
| METHODS |
|---|
|
|
|---|
Electrophysiology
We positioned the recording electrodes by known stereotaxic coordinates and by the units response properties, using white noise and tones of manually set ITD and frequency as search stimuli. NL is easily identified by the large field potentials (called neurophonics) that vary in amplitude with ITD; ICcc neurons were distinguished by the presence of ITD sensitivity without side-peak suppression and by an absence of tuning to interaural level differences. We isolated and maintained NL single neurons by a loose patch method (Peña et al. 1996
, 2001
) in which the electrode served as a suction electrode, allowing us to hold neurons for a long time. Neural signals were serially amplified by an Axoclamp-2A (Molecular Devices, Palo Alto, CA) and a custom-made AC amplifier (µM-200; B.E.S., California Institute of Technology, CA). ICcc neurons were recorded using tungsten electrodes (A-M Systems, Carlsborg, WA). A spike discriminator (SD1, Tucker-Davis Technologies, Gainesville, FL) converted neural impulses into TTL pulses for an event timer (ET1, Tucker-Davis Technologies), which recorded the timing of the pulses. A computer was used for stimulus synthesis and online data analysis.
It is possible that the differences in the recording techniques used for ICC and NL contributed to the changes observed in response properties. The "loose patch" recording technique was developed to overcome the traditional difficulty in obtaining stable and well-isolated recordings of coincidence detector neurons in NL and in its mammalian equivalent, the medial nucleus of the superior olive (MSO). We consider that the recorded NL neurons were in reasonably good health because we obtained stable recordings lasting for >1 h. During these periods, the neurons showed a high degree of phase locking and stable tuning to ITD. Also, we previously showed that NL neurons recorded by loose patch have tuning to ITD that is tolerant to a broad range of sound intensity (Peña et al. 1996
) and precisely matches the values expected by conduction delays of afferent fibers published in previous studies (Carr and Konishi 1990
; Peña et al. 2001
). We reasoned that if either recording technique was affecting the response properties, the most likely one would be the loose patch because the electrode is presumably attached or very close to the cell's soma. Therefore a decline in the ability of phase locking should be observed in NL neurons, rather than in ICC. On the other hand, the loose patch technique is not only significantly more time consuming than regular metal electrodes but also more invasive because glass pipettes suitable for patch clamping are used. For these reasons, we refrained from using loose patch for recording ICC units. Although we consider it unlikely, we cannot completely rule out that the use of metal electrodes biased our sample of ICC neurons toward only those that did not show phase locking. An additional bias may have been introduced by our requirement for recording stability. Our full protocol (to collect the data presented both here and elsewhere) required holding a cell for a minimum of about 1 h; this constraint contributed to our relatively low cell counts.
Acoustic stimulation
An earphone assembly consisting of a Knowles 1914 receiver, a Knowles 1743 damping device, and a Knowles 1939 microphone delivered sound stimuli. These components are encased in an aluminum cylinder that fits into the owl's ear canal. The gaps between the cylinder and the ear canal were filled with silicon impression material (Gold Velvet II, Earmold and Research Laboratory, Wichita, KS). At the beginning of each experimental session, the earphone assemblies were automatically calibrated (Arthur 2004
). The computer was programmed to equalize sound pressure level and phase for all frequencies within the frequency range relevant to the experiment for both tonal and broadband stimulation. Noise was designed by specifying the desired amplitude and phase spectrum, applying the calibration, and computing the inverse Fourier transform. Initial phase was randomized while preserving the desired interaural phase difference for each trial.
Tonal and broadband stimuli 100 ms in duration with 5-ms rise/fall times were presented once per second. We used PA4 digital attenuators (Tucker-Davis Technologies) to vary stimulus sound levels. Stimuli were presented at an intensity of 50 dB/SPL, which was found to be consistently above the saturation threshold of rate-intensity responses in both NL and ICcc.
Data collection
Data on the variability and reproducibility of neuronal responses were obtained using a frozen noise protocol, in which a single white noise stimulus (with power between 1,000 Hz and 12 kHz) was repeatedly presented at a sampling rate of 48,077 Hz. To minimize any effects of habituation, two separate and unique frozen noises with different ITDs were randomly interleaved. Rate-ITD functions were obtained by scanning in 30-µs steps from ±2,000 to ±3,000 µs.
The occurrence of a spike is presumably related to the occurrence of a stimulus feature to which the neuron is sensitive. To determine the stimulus features to which the neuron is sensitive we used reverse correlation (de Boer and de Jongh 1978
). In this method, we first compute the pre-event stimulus ensemble (PESE): a matrix in which row n contains the segment of the stimulus that preceded spike n. By examining the statistical properties of this matrix it is possible to determine the stimulus features that precede, and presumably elicit, spikes.
Data for reverse correlation were obtained by presenting 100-ms de novosynthesized broadband signals at the estimated characteristic delay (CD) of each neuron, randomly interleaved with stimuli at a different ITD; stimuli were presented with an interstimulus interval of 500 ms and we attempted to collect between 400 and 800 trials for each ITD condition. Because of the higher reliability in the response to ITD of ICcc neurons (Christianson and Peña 2006
), we used the best ITD. In NL, we determined the CD by collecting rate-ITD curves at different frequencies and finding the peak at which all of them overlapped. The window of the reverse correlation (i.e., the amount of stimulus preceding each spike that was considered in the analysis) was 15 ms; manual verification did not indicate that any neurons in our population had a response function whose temporal extent exceeded that limit. To ensure that segments of the interstimulus interval and the rise period of the stimulus were not included in the reverse correlation analysis (or, in other words, to guarantee that the reverse correlation was done on a signal with stationary statistics), spikes that occurred in the 20 ms immediately after stimulus onset were excluded; this exclusion was also sufficient to guarantee that the onset transient was excluded and that the neuron had reached a stable firing rate and we encountered no neurons that had onset-only responses. After this exclusion, a large number of spikes were still left for consideration (average number of spikes used in the reverse correlation analysis per neuron for NL: 4,158 ± 1,818; ICcc: 3,915 ± 1,788).
Analysis
The spike-triggered average (STA) is given by averaging the PESE matrix across columns (spikes), to give the average waveform preceding each spike. When computed with Gaussian white noise, the STA is the best estimate of the linear impulse response function of the neuron (de Boer and de Jongh 1978
). However, the STA incorporates the response of the neuron to phase as well as spectral power and, for neurons that do not exhibit phase-locking, the STA will not generate meaningful structure (Eggermont et al. 1983b
). The response of a neuron can also be characterized using the spectrotemporal receptive field (STRF), which gives the average spectotemporal distribution of the stimulus preceding each spike (Eggermont et al. 1983b
).
An estimate of the reproducibility of response was derived using the shuffled autocorrelogram (SAC) as presented by Joris (2003)
. We have N spike trains arising from presentations of the same broadband stimulus to a particular neuron and began by choosing the first spike train. For each spike in that train, we computed the forward time intervals (that is, the difference in spike times between that spike and all spikes occurring after it relative to stimulus onset) between that spike and all spikes in the other spike trains, and then this procedure was repeated until all spikes in all spike trains have been used as the reference. Because there are no intra-train comparisons, effects of refractory period are eliminated. The SAC is guaranteed to be symmetric about the delay of zero (each forward time interval will reoccur as a backward time interval), and hence we need only consider the forward time intervals, though reverse intervals are also plotted for visual clarity. A normalizing factor N(N 1)r2
D, where r is the mean firing rate, 
is the bin width of the correlogram, and D is stimulus duration, results in a unity baseline, where a spike train with Poisson statistics will have a flat SAC of height 1. We used a histogram of 50-µs bin width, as in Louage et al. (2004)
.
To compute the STRF, we used the same procedure as Keller and Takahashi (2000)
. Each 15-ms-long stimulus segment in the PESE was passed through a gammatone filter bank, with 80 channels spaced linearly between 1 and 10 kHz. The envelope of the stimulus in each channel was extracted using the Hilbert transform, after which the average envelope of the channels across the PESE was computed, giving a phase-insensitive estimate of the average power in each frequency channel preceding spikes. The same process (up to averaging across stimulus samples) was used to compute a spectrogram (spectrotemporal distribution).
Because STRFs are not in general square (they can have different numbers of time and frequency bins), we used the singular value decomposition (SVD) for principal component analysis. In the SVD, an m x n matrix M is represented by M = U
VT, where U is m x m, V is n x n, and
is an m x n diagonal matrix whose entries are the singular values
i of M. The fractional energy of a singular value
i, given by
i2/
j
j2, is a measure of the relative contribution of the associated singular vector pair to the reconstruction of the overall matrix. The fractional energy of
1 is equal to 1
SVD, where
SVD is the degree of inseparability defined by Depireux et al. (2001)
.
The spectral tuning was estimated in the STRF using the vector produced by summing the STRF along the time axis. The vector was normalized to have a maximum positive value of 1, with a possibly negative minimum, and was quantified using the frequency corresponding to the maximum positive value in the vector (PF), the width of the peak of the vector at a height of 0.5 (BW0.5), and the frequency on which BW0.5 was centered (CF0.5). The peak time (PT), 0.5 temporal width (TW0.5), and center time (CT0.5) were defined analogously but over the time axis instead of the frequency axis.
The STRF was used to predict the response of a neuron to a novel stimulus by convolving each frequency channel in the spectrogram of the stimulus by the corresponding channel in the STRF and then averaging across frequency channels. Comparison was done between the predicted response and the actual poststimulus time histogram (PSTH) of the neuron for frozen noise by computing the correlation coefficient over 80 ms of stimulus. Both PSTH and predicted firing rate were first convolved by a Gaussian kernel with 0.5 ms SD.
In analyzing data that are not normal in distribution, we use the median (denoted by Q2), which in boxplots is given by the center line; Q1 and Q3 are the lower and upper quartiles, respectively, and give the edges of the box. Whiskers give the extent of the data, up to a maximum of 1.5-fold the interquartile range; data points outside this range are treated as outliers and marked individually.
| RESULTS |
|---|
|
|
|---|
Phase sensitivity
We began by establishing whether phase sensitivity, which is known to be present in NL neurons in the form of phase-locking (Carr and Konishi 1990
; Peña et al. 1996
), is preserved in ICcc. The STA is computed by averaging the PESE across spikes and gives the average stimulus preceding a spike and therefore an estimate of the spectral, temporal, and phasic aspects of the tuning of the neuron. For nonfrozen broadband stimuli, the STA shows an organized structure only in the case where there is locking to the fine structure of the stimulus (Eggermont 1993
); intuitively, this is because unless the spikes are synchronized to a particular phase of the stimulus, the averaging across the PESE will result in destructive interference comparable to averaging across time. Consistent with reports of phase locking in NL, we saw coherent STAs for all NL neurons examined (Fig. 1, A and B). In contrast, a meaningful STA was never observed in our population of ICcc neurons (Fig. 1, C and D), which is consistent with an elimination of fine structure locking from NL to ICcc. The presence of a coherent white-noise STA is not a direct indication of phase locking to pure tones; in particular, this observation cannot be directly related to the most common measure of phase lockingvector strengthalthough we suggest that the fact that even our highest-frequency NL neurons (CF of 6.4 kHz) showed a coherent STA sets a ceiling on possible vector strength values in ICcc. On the other hand, because there is evidence that vector strength from tone response does not correctly estimate a neuron's response to broadband noise (Yin and Chan 1990
), the STA is more meaningful in the context of a study using broadband signals. Mammalian data reports a continued phase locking in IC, although at reduced maximum frequencies compared with the earlier nuclei (Liu et al. 2006
). The neuron in our data set with the lowest CF10 (1,170 Hz) is shown in Fig. 1D, and did not display any difference compared with higher-frequency units (Fig. 1C has a CF10 of 3,500 Hz). Although we cannot rule out phase locking in ICcc at frequencies <1 kHz, this is a dramatic reduction compared with the ability of NL neurons to phase lock at frequencies >7 kHz.
|
Phase locking in NL throughout the hearing range of the barn owl would predict that the timing of spikes would convey information about the fine structure of the signal. Consistent with this, when the same noise stimulus is repeatedly presented to an NL neuron, a clear pattern emerges in a raster plot of spike times (Fig. 2A). A similar patterned response is also visible in ICcc neurons (Fig. 2B), despite the lack of phase locking described earlier. To quantify the reproducibility of the response, we used the shuffled autocorrelogram (SAC) method (Joris 2003
; Louage et al. 2004
). The height of the SAC gives the likelihood that given a spike at time t in one presentation of a stimulus that another presentation of the same stimulus will elicit a spike at the same time; the width of the peak of the SAC gives a measure of how much jitter in spike timing is present (Fig. 2, C and D). We note that the SACs of ICcc neurons are in general smooth, indicating a relatively normal distribution of jitter (Fig. 2D). In contrast, NL neurons tended to demonstrate peaks (Fig. 1C), suggesting that jitter in NL neurons tends to occur preferentially at multiples of the period of the preferred frequency of that neuron.
|
|
As described earlier, the STA requires synchrony to the phase of the stimulus. However, there is ample evidence that the higher auditory centers of both mammals (Batra et al. 1989
; Joris 2003
; Yin et al. 1984
) and owls (Keller and Takahashi 2000
) can lock to the power transients of particular frequency bands within the stimulus regardless of phase. This synchrony can be thought of as a locking to the amplitude envelope imposed on the signal by the effective band-pass of the neuron and is thus referred to as envelope coding. To explore this possibility, we also considered the spectrotemporal receptive field (STRF) (see METHODS), which gives the power in each frequency channel as a function of time, independent from the phase of the stimulus. In all neurons recorded in both NL and in ICcc, STRFs were present, which is consistent with envelope coding being present in the neurons of both nuclei (Fig. 4). The shapes of the STRFs were qualitatively similar in all cases and for both nuclei. In all neurons examined, the STRF had an excitatory peak at a latency range predicted by previous physiological data, followed by an inhibitory trough. Because we saw no evidence for an inhibitory mechanism in the spike-triggered average for NL neurons, the trough most likely arises from refractory period effects (Eggermont et al. 1983b
). In other words, because we expect the neuron is likely to spike near the onset of an occurrence of a spike-triggering feature; this leads statistically to a greater likelihood that a spike-triggering feature is preceded by an absence of spike-triggering features. The strength of this likelihood is dependent on the refractory period, with shorter refractory periods causing a less-prominent effect.
|
|
There is reason to believe that neurons in NL possess very short membrane time constants and thus relatively short temporal integration windows (Gertsner et al. 1996
). Thus it might be that ICcc neurons have STRFs that are extended in temporal extent arising from averaging over greater time periods to effect the reduction of noise in ITD encoding (Christianson and Peña 2006
); if this were the case, it would be expected that the values of the stimulus over a greater period of time could influence response and thus that the TW0.5 would be larger in ICcc neurons. In ICcc, there was no correlation between CF0.5 and TW0.5 (Fig. 6A; P > 0.4). A weak correlation was present in NL (P < 0.01), but this is likely the result of a small sample size at higher frequencies; removing the two neurons with CF0.5 >6,500 Hz removed the correlation (P > 0.05). Pooling the observed TW0.5 across frequency, there was no statistically significant difference between the neurons of the two nuclei (P > 0.7). These results are based on a definition of TW0.5 that neglected the following inhibitory trough (that is, it considers only the temporal extent of the excitatory peak), but including the trough did not affect the conclusions presented here. Thus there is no evidence that ICcc neurons integrate their responses over a longer period of time than NL neurons and thus that noise reduction for ITD tuning is mainly the result of a mechanism that pools across inputs rather than one that pools across time.
|
Although NL and ICcc STRFs have similar spectral and temporal extent, it may be that there is some difference in shape that reflects a change in spectral processing from NL and ICcc. One way to quantify this is the degree of separability of the STRFs: that is, can the STRF be considered the product of individual time and frequency tuning functions? Alternatively, there may be nontrivial timefrequency interactions in the receptive field, which could be consistent with a tuning to frequency-modulated tone sweeps, for instance. Taking the SVD of the STRF and then observing the fractional energy of the singular value components can address this question. For a separable STRF, the fractional energy of the first singular value will dominate over that of the other values. When we perform this analysis on our data set (Fig. 6), we see that for both nuclei the fractional energy of the first component dominates (fractional energy of the first singular value for NL: 77 ± 8%; for ICcc: 65 ± 17%), whereas the others lay along a characteristic exponential line, which suggests that the spectral and temporal components of the STRFs are relatively separable: that is, that the STRF can be well approximated as the product of a spectral and a temporal tuning vector, although as Fig. 7 illustrates there was a trend toward a small downward sweep in the STRFs. There is also no strong evidence that either nucleus is more separable than the other (P > 0.01), which argues against any dramatic changes in the shape of the STRFs. The fact that the fractional energies are different at the P < 0.05 level leaves open the possibility that there is some subtle change in the STRF shape between the two nuclei, perhaps reflecting the beginning of stimulus-specific spectrotemporal tuning.
|
In principle, descriptions of the receptive fields of a neuron should be able to predict the neuron's response to novel stimuli (Eggermont et al. 1983a
; Linden et al. 2003
; Theunissen et al. 2000
). The degree to which the prediction matches the true neuronal response then provides a measure of the sufficiency of the model: in this case, to what degree the STRF is a complete description of the behavior of the neuron. Because the STRF is a linear model, if some nonlinear processing of spectral cues was introduced from NL to ICcc, it would not be evident in the preceding analysis, but we would expect that the ability of the STRF to predict responses in ICcc to be degraded compared with the performance in NL.
We were able to test this in the neurons for which we collected frozen noise data. In NL, it was possible to get a good prediction of the firing rate in response to a novel stimulus (because the frozen noise data were not included in the computation of the PESE), as seen in Fig. 8A. In our data set, a linear model was able to account for 39 ± 12% of the variance of NL neurons (Fig. 8B); this is comparable to the predictive ability of STRFs in ICls (Keller and Takahashi 2000
), auditory cortex (Linden et al. 2003
), and in the auditory midbrain of the grass frog (Eggermont et al. 1983a
). ICcc neurons were also well estimated by the STRF (Fig. 8C) and the performance across the ICcc population (31 ± 12% of variance explained) was not significantly different from the performance in NL (ANOVA, P > 0.1). Thus although we cannot rule out the possibility that there is some nonlinear difference in spectrotemporal encoding between NL and ICcc neurons, our data do not suggest such a conclusion.
|
Neurons in the cat IC were also previously identified that appear to synchronize to the cochlea-induced envelope pattern (Chase and Young 2006
; Joris 2003
). However, synchronization to amplitude modulations, both of the sound's intrinsic waveform (Joris and Yin 1992
, 1998
) and of the cochlear output waveform (Louage et al. 2004
), was described in lower auditory centers, leaving open the possibility that phase-locking and envelope-locking neurons represent two distinct populations throughout the ascending auditory pathway. Although we did not observe any NL neurons that did not phase lock, we would like to rule out the possibility that the ICcc neurons were driven by a subset of purely envelope-locking NL neurons.
A simple test for this comes from polarity tolerance. Normally, the rate-ITD function obtained with anticorrelated broadband noise presented to the two ears is the inverse of the rate-ITD function obtained with perfectly correlated noise (Yin et al. 1987
). However, some rate-ITD functions do not invert under anticorrelated stimulation (Joris 2003
). Because the amplitude envelope of a signal is invariant with respect to inversion (Hartmann 1997
), the absence of inversion with anticorrelated noise is consistent with a computation of ITD based on the cross-correlation of signals synchronized to the envelope. Therefore we collected rate-ITD functions in both correlated and anticorrelated cases for 17 ICcc neurons. In all cases, we observed a significant negative correlation between the two rate-ITD functions (P < 104), indicating that the inputs to these neurons had been locked to ongoing phase, and not the envelope, when computing the ITD.
| DISCUSSION |
|---|
|
|
|---|
Previous studies in the barn owl (Keller and Takahashi 2000
), as well as amphibian (Hermes et al. 1981
) and mammalian models (Carney and Yin 1989
), demonstrated patterned (that is, stimulus-locked) responses in the midbrain using broadband noise. However, because of the segregation of sound localization cues in the barn owl, it is not immediately clear whether these responses emerge from the ITD pathway or the ILD pathway. Taken all together, our results suggest that the population activity of the nuclei of the ITD pathway, and specifically ICcc, can be treated as a dynamic spectrum analyzer with a temporal resolution of approximately a millisecond. Although this does not rule out any role of the ILD-sensitive neurons in the generation of spectral tuning in higher auditory centers, it does argue against the possibility that the spectrotemporal tuning in ICls neurons reported by Keller and Takahashi (2000)
is attributed solely to the inputs from the ILD-sensitive neurons.
We did not examine the effect on the STRF of varying ITD because the rectification and large dynamic range of the rate-ITD functions of ICcc neurons (Christianson and Peña 2006
) made collecting a sufficient number of spikes for nonpeak ITDs prohibitive. However, raster plots from the work of Carney and Yin (1989)
indicate that the timing of spikes, if not their number, is relatively invariant to changes in the ITD in the cat IC; this is confirmed by the work demonstrating that spike timing does not appear to convey information related to the ITD (Chase and Young 2006
). Previous work in owls (Keller and Takahashi 2000
) demonstrated that STRFs of neurons in the ICls, to which ICcc projects, are also invariant under changes in ITD. Thus any dependency on ITD of STRFs for NL and ICcc neurons is likely to be explainable by a modulation in gain resulting from the dependency of mean firing rate on ITD.
In previous work, we showed that the transition from NL to ICcc acts as a noise-elimination phase in the representation of ITD (Christianson and Peña 2006
). It might be expected that this improvement in the encoding of a spatial location cue might come at the expense of the representation of the spectral content; this is especially true in the barn owl, where it is established that there is a frequency convergence to resolve ITD coding ambiguity (Mazer 1998
; Saberi et al. 1999
; Takahashi and Konishi 1986
). Here, we demonstrate that this is not the case. The conclusion is thus that the pooling that produces the reduction in ITD encoding noise must occur over a population of NL neurons with extremely similar spectrotemporal tuning. This may not be as difficult to arrange as it may appear. Theoretical work suggests that the neurons that compute the ITD should behave as cross-correlators (Licklider 1959
), which experimental work supports (Yin and Chan 1990
). One of the properties of cross-correlation is that the shape of the cross-correlation function (in this case, the rate-ITD function) should be determined by the spectral tuning of the cross-correlator, known as the WienerKhinchin relationship. Although the experimental evidence in support of this property is weak (Yin and Chan 1990
), it suggests that with an assumption of a moderate degree of homogeneity in the properties of NL neurons, the expectation would be that similar spectral tuning should accompany similar ITD tuning.
At first glance, the observation from Christianson and Peña (2006)
that NL neurons are noisy compared with ICcc neurons would seem to be in conflict with our observation that the variability in response to frozen noise stimulation was the same in both nuclei. In fact, the noise in the NL rate-ITD functions is likely to be the result of the spectrotemporal tuning itself. Because white noise has a uniform spectrum only when averaged over time, we would expect the firing rate over small time windows to be influenced to a large degree by the instantaneous power spectrum of the signal. The result that the STRFs and the SACs of both NL and ICcc neurons are similar suggests that the variability in response arising from spectral properties of the signal is the same in both nuclei, and we have observed that the overall variability in firing rate as a function of mean firing rate was similar in both NL and ICcc (Christianson and Peña 2006
). Thus it is likely that a primary source of "noise" in the rate-ITD function is a result of the variations in spike timing linked to spectrotemporal attributes of the sound, which is consistent with work done in cats indicating that ITD information is not carried by precise spike timing (Chase and Young 2006
). However, in ICcc the increase in overall dynamic range in the rate-ITD function (Christianson and Peña 2006
) allows that the variation in mean firing rate resulting from the firing patterns related to the spectral properties of different signals with the same ITD be small compared with the variation in mean firing rate resulting from changes in the ITD. As such, this constitutes an elegant implementation of simultaneous rate- and timing-based coding strategies to represent multiple stimulus parameters within the limited language of neuronal spiking.
| GRANTS |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: G. B. Christianson, UCL Ear Institute, 332336 Gray's Inn Road, London, UK WC1X 8EE (E-mail: g.christianson{at}ucl.ac.uk)
| REFERENCES |
|---|
|
|
|---|
Batra R, Kuwada S, Stanford TR. Temporal coding of envelopes and their interaural delays in the inferior colliculus of the unanesthetized rabbit. J Neurophysiol 61: 257268, 1989.
Brenowitz EA. The contribution of temporal song cues to species recognition in the red-winged blackbird. Anim Behav 31: 11161127, 1983.[CrossRef][ISI]
Carney LH, Yin TC. Response of low-frequency cells in the inferior colliculus to interaural time differences of clicks: excitatory and inhibitory components. J Neurophysiol 62: 144161, 1989.
Carr CE. Delay line models of sound localization in the barn owl. Am Zool 33: 7985, 1993.[ISI]
Carr CE, Konishi M. A circuit for detection of interaural time differences in the brain stem of the barn owl. J Neurosci 10: 32773246, 1990.
Chase SM, Young ED. Spike-timing codes enhance the representation of simultaneous sound-localization cues in the inferior colliculus. J Neurosci 26: 38893898, 2006.
Christianson GB, Peña JL. Noise reduction of coincidence detector outputs by the inferior colliculus of the barn owl. J Neurosci 26: 59485954, 2006.
de Boer E, de Jongh HR. On cochlear encoding: potentialities and limitations of the reverse-correlation technique. J Acoust Soc Am 63: 115135, 1978.[CrossRef][ISI][Medline]
Depireux DA, Simon JZ, Klein DJ, Shamma SA. Spectro-temporal response field characterizations with dynamic ripples in ferret primary auditory cortex. J Neurophysiol 85: 12201234, 2001.
Drullman R, Festen HM, Plomp R. Effect of temporal envelope smearing on speech recognition. J Acoust Soc Am 95: 10531064, 1994.[CrossRef][ISI][Medline]
Eggermont J. Wiener and Volterra analyses applied to the auditory system. Hear Res 10: 191202, 1993.
Eggermont JJ, Aertsen AM, Johannesma PI. Prediction of the responses of auditory neurons in the midbrain of the grass frog based on the spectro-temporal receptive field. Hear Res 10: 191202, 1983a.[CrossRef][ISI][Medline]
Eggermont JJ, Johannesma PIM, Aertsen AMHJ.Reverse-correlation methods in auditory research. Q Rev Biophys 16: 341414, 1983b.[ISI][Medline]
Gertsner W, Kempter R, van Hemmen JL, Wagner H. A neuronal learning rule for sub-millisecond temporal coding. Nature 383: 7678, 1996.[CrossRef][Medline]
Grantham DW, Wightman FL. Detectability of varying interaural temporal differences. J Acoust Soc Am 63: 511523, 1978.[CrossRef][ISI][Medline]
Hartmann WM. Signals, Sound, and Sensation. New York: Springer-Verlag, 1997, p. 416.
Hermes DJ, Aertsen AM, Johannesma PIM, Eggermont JJ. Spectrotemporal characteristics of single units in the auditory midbrain of the lightly anaesthetized grass frog (Rana temporaria L) investigated with noise stimuli. Hear Res 5: 147178, 1981.[CrossRef][ISI][Medline]
Joris PX. Interaural time sensitivity dominated by cochlea-induced envelope patterns. J Neurosci 23: 63456350, 2003.
Joris PX, Yin TCT. Responses to amplitude-modulated tones in the auditory nerve of the cat. J Acoust Soc Am 91: 215232, 1992.[CrossRef][ISI][Medline]
Joris PX, Yin TCT. Envelope coding in the lateral superior olive. III. Comparison with afferent pathways. J Neurophysiol 73: 253269, 1998.
Keller CH, Takahashi TT. Representation of temporal features of complex sounds by the discharge patterns of neurons in the owl's inferior colliculus. J Neurophysiol 84: 26382650, 2000.
Kim PJ, Young ED. Comparative analysis of spectro-temporal receptive fields, reverse correlation functions, and frequency tuning curves of auditory-nerve fibers. J Acoust Soc Am 95: 410422, 1994.[CrossRef][ISI][Medline]
Klein DJ, Depireux DA, Simon JZ, Shamma SA. Robust spectrotemporal reverse correlation for the auditory system: optimizing stimulus design. J Comput Neurosci 9: 85111, 2000.[CrossRef][ISI][Medline]
Konishi M. How the owl tracks its prey. Am Scientist 61: 414424, 1973.[ISI]
Konishi M, Kenuk AS. Discrimination of noise spectra by memory in the barn owl. J Comp Physiol 97: 5558, 1975.[CrossRef]
Köppl C. Phase locking to high frequencies in the auditory nerve and cochlear nucleus magnocellularis of the barn owl, Tyto alba. J Neurosci 17: 33123321, 1997.
Lewis ER, Henry KR, Yamada WM. Tuning and timing of excitation and inhibition in primary auditory nerve fibers. Hear Res 171: 1331, 2002.[CrossRef][ISI][Medline]
Licklider JCR. Three auditory theories. In: Psychology: A Study of a Science, edited by Koch S. New York: McGraw-Hill, 1959.
Linden JF, Liu RC, Sahani M, Schreiner CE, Merzenich MM. Spectrotemporal structure of receptive fields in areas AI and AAF of mouse auditory cortex. J Neurophysiol 90: 26602675, 2003.
Liu L, Palmer AR, Wallace MN. Phase-locked responses to pure tones in the inferior colliculus. J Neurophysiol 95: 19261935, 2006.
Louage DHG, van der Heijden M, Joris PX. Temporal properties of responses to broadband noise in the auditory nerve. J Neurophysiol 91: 20512065, 2004.
Mazer J. How the owl resolves auditory coding ambiguity. Proc Natl Acad Sci USA 95: 1093210937, 1998.
Peña JL, Viete S, Albeck Y, Konishi M. Tolerance to sound intensity of binaural coincidence detection in the nucleus laminaris of the owl. J Neurosci 16: 70467054, 1996.
Peña JL, Viete S, Funabiki K, Saberi K, Konishi M. Cochlear and neural delays for coincidence detection in owls. J Neurosci 21: 94559459, 2001.
Quine DB, Konishi M. Absolute frequency discrimination in the barn owl. J Comp Physiol 93: 347360, 1974.[CrossRef]
Ross S. A First Course in Probability (4th ed.). New York; Macmillan College Publishing, 1994.
Rust NC, Schwartz O, Movshon JA, Simoncelli E. Spike-triggered characterizations of excitatory and suppressive stimulus dimensions in monkey V1. Neurocomputing 58- 60: 793799, 2004.[CrossRef]
Saberi K, Takahashi Y, Farahbod H, Konishi M. Neural basis of an auditory illusion and its elimination in owls. Nat Neurosci 2: 656659, 1999.[CrossRef][ISI][Medline]
Shannon RV, Zeng FG, Kamath V, Wygonski J, Ekelid M. Speech recognition with primarily temporal cues. Science 270: 303304, 1995.
Sullivan WE, Konishi M. Segregation of stimulus phase and intensity coding in the cochlear nucleus of the barn owl. J Neurosci 4: 17871799, 1984.[Abstract]
Takahashi TT, Konishi M. Selectivity for interaural time difference in the owl's midbrain. J Neurosci 6: 34133422, 1986.[Abstract]
Takahashi TT, Konishi M. Projections of the cochlear nuclei and nucleus laminaris to the inferior colliculus of the barn owl. J Comp Neurol 274: 190211, 1988.[CrossRef][ISI][Medline]
Takahashi TT, Wagner H, Konishi M. Role of commissural projections in the representation of bilateral auditory space in the barn owl's inferior colliculus. J Comp Neurol 281: 545554, 1989.[CrossRef][ISI][Medline]
Theunissen FE, Sen K, Doupe AJ. Spectral-temporal receptive fields of nonlinear auditory neurons obtained using natural sounds. J Neurosci 20: 23152331, 2000.
Trussell LO. Synaptic mechanisms for coding timing in auditory neurons. Annu Rev Physiol 61: 477496, 1999.[CrossRef][ISI][Medline]
Wallace MN, Shackleton TM, Palmer AR. Phase-locked responses to pure tones in the primary auditory cortex. Hear Res 172: 160171, 2002.[CrossRef][ISI][Medline]
Winter IM, Palmer AR. Responses of single units in the anteroventral cochlear nucleus of the guinea pig. Hear Res 44: 161178, 1990.[CrossRef][ISI][Medline]
Wright BA, Lombardino LJ, King WM, Puranik CS, Leonard CM, Merzenich MM. Deficits in auditory temporal and spectral resolution in language-impaired children. Nature 387: 176178, 1997.[CrossRef][Medline]
Yamada WM, Lewis ER. Predicting the temporal responses of non-phase-locking bullfrog auditory units to complex auditory waveforms. Hear Res 130: 155170, 1999.[CrossRef][ISI][Medline]
Yin TCT, Chan JCK. Interaural time sensitivity in the medial superior olive of the cat. J Neurophysiol 64: 465488, 1990.
Yin TCT, Chan JCK, Carney LH. Effects of interaural time delays of noise stimuli on low-frequency cells in the cat's inferior colliculus. III. Evidence for cross-correlation. J Neurophysiol 58: 562583, 1987.
Yin TCT, Kuwada S, Sujaku Y. Interaural time sensitivity of high-frequency neurons in the inferior colliculus. J Acoust Soc Am 76: 14011410, 1984.[CrossRef][ISI][Medline]
Zhang XD, Heinz MG, Bruce IC, Carney LH. A phenomenological model for the responses of auditory-nerve fibers: I. Nonlinear tuning with compression and suppression. J Acoust Soc Am 109: 648670, 2001.[CrossRef][ISI][Medline]
| ||||||||||||||||||||||||||||