|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1Department of Neurobiology, University of Pittsburgh and Department of Statistics, Carnegie Mellon University, Pittsburgh, Pennsylvania; and 2Center for Hearing and Balance and Department of Biomedical Engineering, Johns Hopkins University; Baltimore, Maryland
Submitted 11 June 2007; accepted in final form 27 January 2008
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
We address the general question of information integration in the context of sound localization coding. The central nucleus of the inferior colliculus (IC) serves as a major integration point of the auditory system, receiving convergent input from nearly every brain stem auditory nucleus (for reviews, see Winer and Schreiner 2005
). Among other inputs, neurons in the IC receive information about the three main cues used for sound localization: interaural level differences (ILDs), interaural timing differences (ITDs), and monaural spectral cues (SNs, for spectral notches). IC neurons are usually sensitive to more than one localization cue (Benevento and Coleman 1970
; Caird and Klinke 1987
; Chase and Young 2005
; Delgutte et al. 1995
, 1999
). The representation of localization cues has usually been analyzed using discharge rate as the response variable (Palmer and Kuwada 2005
). However, other features of the response, like first-spike latency and the temporal patterning of spike trains, are also known to encode stimuli in the auditory system (Furukawa and Middlebrooks 2002
; Narayan et al. 2006
; Nelken et al. 2005
).
We have previously used virtual-space stimuli to investigate the simultaneous representation of two localization cues using either a rate code (Chase and Young 2005
) or a spike distance metric (SDM) code (Chase and Young 2006
; Victor and Purpura 1997
). SDM codes are sensitive to spike timing distributed throughout the neural response. However, when given an analysis window consisting of the entire response duration, we found that the SDM code is not very sensitive to changes in the timing of single spikes, such as first-spike latency differences (cf. Fig. 7 of Chase and Young 2006
). The present work expands on our previous work by investigating the coding of localization cues in first-spike latency and comparing the relative efficacy of each code in representing the different cues. Latency provides information about all three cues; as is the case for rate responses, the latency information is largest for ILDs, but there is also latency information about ITDs and SNs. Because of the differential coding of the localization cues in different aspects of spike trains, our results show how cues can be multiplexed onto the activity of single neurons.
| METHODS |
|---|
|
|
|---|
Surgical procedure
Acute recording experiments were performed on adult cats with clean external ears, obtained from Liberty Labs (Waverly, NY). Animals were anesthetized for surgery with xylazine [1 mg/kg, administered intramuscularly (im)] and ketamine (40 mg/kg, im). The cats were decerebrated by transecting the brain between the superior colliculus and the thalamus. After decerebration, anesthesia was discontinued. Throughout the experiment, the cat's temperature was maintained between 37.5 and 38.5°C with a feedback-controlled heating pad.
The IC was approached dorsally by aspirating occipital cortex and, when necessary, removing part of the bony tentorium. The ear canals were exposed and fitted with ear tubes for sound delivery, and the bullae on both sides were vented with 30 cm of PE 90 tubing. At the end of the experiment, the cat was killed with an overdose of barbiturate anesthetic. All procedures were performed in accordance with the guidelines of the Institutional Animal Care and Use Committee of the Johns Hopkins University.
Electrodes were advanced dorsoventrally through the IC to sample neurons with various best frequencies (BFs). All neurons used for the analyses had narrowly tuned nonhabituating responses recorded in a dorsoventral BF sequence (as in Ramachandran et al. 1999
); these results and the placement of the electrode suggest that the data are from the central nucleus of the IC.
Recording protocol
All recordings were made in a sound-attenuating chamber. Sounds were presented via speakers placed on hollow ear bars inserted into the ear canals. In situ speaker calibrations showed responses that are uniform (±4.6 dB SPL) between 40 Hz and 35 kHz. Platinum/iridium microelectrodes were used for single-neuron recording; neurons were isolated with a Schmitt trigger or a template-matching program (Alpha-Omega Engineering, Nazareth, Israel).
Stimulus design
Two sets of virtual space stimuli were used in this study, based on a single 330-ms token of broadband noise sampled at 100 kHz (interstimulus interval: 1 s). Each set was manipulated to vary independently in two parameters and each parameter was adjusted in five steps, for a total of 25 stimuli per set. To build up statistics sufficient for information theoretic analyses, each stimulus set was repeated multiple times (20–200, depending on how long the neuron was held; median: 90 repetitions). Stimuli were presented with 10-ms linear onset and offset ramps, interleaved in fixed order, such that all 25 stimuli were presented once, then the entire stimulus set was presented again, and so forth.
In the first stimulus set, ITD and ILD were manipulated. The frozen noise token was filtered through a spatially averaged head-related transfer function (HRTF; obtained from the cat data of Rice et al. 1992
), which imparts to the stimulus the spectral characteristics of the head and ear canal, independent of spatial location. The stimulus was then split into two streams (one for each ear) that were delayed relative to one another to impart an ITD and attenuated relative to one another to impart an ILD. ITDs and ILDs were chosen to correspond roughly to spatial locations in the horizontal plane of –60, –30, 0, 30, and 60° azimuth, where negative values refer to locations in the ipsilateral hemifield (Kuhn 1977
; Rice et al. 1992
; Roth et al. 1980
). For ITDs, the five cue values were –160, –80, 0, 80, and 160 µs; for ILDs, they were –13.8, –8.4, 0, 8.4, and 13.8 dB. The ILDs were imparted by setting the levels in the ipsilateral ear to 0, 0, 0, –8.4, and –13.8 dB (re the reference level) and the levels in the contralateral ear to –13.8, –8.4, 0, 0, and 0 dB. The ILDs were arranged in this way to help disambiguate ILD sensitivity from monaural or binaural sound-level sensitivity, as explained previously (Chase and Young 2005
).
In the second stimulus set, ILD and SN were varied. The SN cue was imparted by filtering the frozen noise token through one of five midline HRTFs containing a prominent spectral notch, representing elevations ranging from 0 to 30° in 7.5 ° steps (see Fig. 2D of Chase and Young 2005
). The stimulus was then split into two streams and an ILD cue was imparted as in the ITD/ILD stimulus set. Interaural spectral differences were not considered in this work because the same stimulus spectrum was sent to each ear. Before presentation, the stimuli in this set were resampled (resample command in MATLAB, The MathWorks, Natick, MA) such that the five SNs surrounded the BF of the neuron under study with the notch frequency of the third HRTF at BF. A by-product of this resampling is a BF-dependent change in stimulus length. For presentation, stimuli >400 ms were truncated at 400 ms, whereas stimuli <200 ms were repeated (including the 10-ms linear ON–OFF ramps) to be
200 ms long. This latter step was necessary only for neurons with BFs >18.5 kHz (n = 10).
|
Data were analyzed by computing the mutual information (MI) between stimulus (S) and response (R) under the assumption of several different response variables. The MI is defined by (Cover and Thomas 1991
)
![]() | (1) |
The stimulus sets (described earlier) consist of 25 stimuli containing variation in two stimulus parameters. To compute the MI between the neural responses and just one cue, a 5-stimulus set was defined by combining the stimuli across the other parameter. Thus the MI for individual cues quantifies the information carried about one cue in the presence of variation in the other cue. For all codes, data were analyzed over a window starting at stimulus onset and extending for 200 ms.
For spike rate, the MI was computed directly from empirical distributions of spike counts; that is, p(s, r) was the probability of getting a certain spike count r for a stimulus s. Bias was estimated (and removed) using a bootstrap procedure (Efron and Tibshirani 1998
). This method has been fully explained, including the debiasing techniques, in a previous publication (Chase and Young 2005
). The data processing here was slightly different from that in our previous papers. Here we analyzed spikes from a fixed interval of 0–200 ms, regardless of the duration of the stimulus, whereas previously we used the full stimulus duration. Also here we analyzed all neurons, regardless of response type, whereas previously we analyzed only response types V, I, and O in the scheme of Ramachandran and colleagues (1999)
. These changes result in small differences between comparable figures (Fig. 5, D–F of this paper vs. Fig. 5, A, B, and D in Chase and Young 2006
).
|
t|). Here,
t is the time difference between a spike in one train and the nearest spike in the other train and q is a variable cost parameter with units of s–1 that effectively sets the temporal resolution of the code. For a given q, there exists a minimum cost solution for the distance between any two spike trains. The distances are used to construct a confusion matrix for the responses as follows. To estimate the localization cues most likely to have elicited a particular spike train, the average distance between that train and the spike trains elicited by each of the 25 stimuli is calculated. The estimated stimulus is the stimulus eliciting spike trains with the minimum average distance to the spike train under consideration. The MISDM is computed directly from the confusion matrix by estimating the probabilities in Eq. 1 from the confusion matrix; for this calculation, s is the actual stimulus and r is the estimated stimulus. This calculation was repeated at several values of cost q: in addition to q = 0, the cost was sampled logarithmically from 10 to 15,850 s–1 at 5 costs per decade. For each neuron, the maximum value of MISDM across all costs was used (corresponding to median costs of 39.8 s–1 for ILD cues, 100 s–1 for SN cues, and 0 s–1 for ITD cues; see Chase and Young 2006
Although the SDM method extracts information in spike timing beyond that conveyed by spike rate, it is relatively insensitive to small changes in temporal patterns (see Fig. 7 of Chase and Young 2006
). Because latency variations usually occur only in the first ten or so milliseconds of a neuron's response to a stimulus (e.g., Fig. 2A), the SDM decoder can underestimate the information carried by first-spike latency.
|
![]() | (2) |
![]() | (3) |
j is the minimum Euclidean distance (here, the difference in spike times) between spike j and any other first spike, regardless of stimulus, and
*j is the minimum Euclidean distance between spike j and any other first spike elicited by the same stimulus. The number of stimuli S is 25 for the full stimulus set or 5 when considering an individual localization cue. Nk is the number of times the stimulus k elicited a spike, whereas N is the total number of first-spikes observed across all stimuli. To help control for errors due to small sample sizes, if a stimulus did not elicit a spike on at least eight repetitions, its contribution to the information was set to zero by deleting the corresponding terms in the second summation in Eq. 3. Spikes were recorded with 10-µs precision. To avoid zero distances, a random number uniformly distributed over [–5, 5] µs was added to each spike time.
In practice, we find the nearest-neighbor method of MIFSL computation of Victor (2002)
to be nearly bias free. To test this, we took neurons from which
100 repetitions of each stimulus were collected and computed the information contained in randomly selected subsets of the spike trains. Figure 1 A shows an example of this calculation for three randomly selected neurons from which 100 stimulus repetitions were collected. Here MItim is plotted as a function of Nreps, the size of the subset in repetitions. Each calculation was repeated four times, with a different random subset each time. Although there is a fair amount of scatter in the data points for the smaller subsets, for
20 repetitions the mean values are relatively stable. The variation in the results at Nreps = 100 is due to the random jitter added to the spike times. Figure 1B shows the average MIs measured using 100 repetitions of each stimulus plotted against the average MIs measured using only 20 randomly selected repetitions of each stimulus. There is good agreement between the two measures and no tendency for the 20-repetition estimates to be either smaller or larger than the 100-repetition estimates (P = 0.69, signed-rank test). For this reason, we did not attempt to debias these MI measurements.
|
![]() | (4) |
| RESULTS |
|---|
|
|
|---|
Figure 2 shows two examples of spike rasters recorded in response to the SN/ILD stimulus set. Each box represents the responses to multiple presentations of one stimulus; ILD parameters are given at the left, SN parameters at the right. With the use of frozen noise, spike times are very consistent across multiple repetitions of the same stimulus. A common feature of the responses of IC neurons to these frozen noise stimuli is the appearance of "clusters" of spikes in the rasters. These clusters correspond to the alignment of single spikes across multiple repetitions of the same stimulus with some small jitter (and not to bursts of spikes in response to one stimulus). In Fig. 2A, there is a clear ILD-dependent change in the latency of these spike clusters that disappears within 10 ms. This is typical of the responses of IC neurons in this data set; the timing of the first cluster of spikes is often more strongly dependent on the localization cues than the timing of subsequent spike clusters (Chase and Young 2007
). The strong variation in first-spike timing exhibited by this neuron makes first-spike latency an obvious candidate for a neural code. Figure 2B shows an example in which spike rate and first-spike latency show correlated changes to both ILD and SN.
We computed the first-spike latency information (MIFSL) between the localization cues and the first-spike latency variations exemplified by Fig. 2. Again, with this stimulus design, information can be computed about the entire set of 25 stimulus cues or, by combining rows of the probability distributions, the information between responses and the individual cues can be computed.
Figure 3 A shows the MI between first-spike latency and the full set of stimulus cues for the population of neurons studied, plotted as a function of BF. On average, first-spike latency carries 0.68 ± 0.35 (SD) bits of information about the stimuli, and there are no obvious trends with BF. Although it is not shown, there are also no trends with the IC neuron response classes, as defined by Ramachandran et al. (1999)
. The components of the first-spike latency information are shown in Fig. 3, B and C. A substantial number of neurons (48/118) have no information in MIrsp because each repetition of each stimulus elicited at least one spike. Of those neurons with MIrsp >0, roughly 23% of MIFSL stems from MIrsp, with the remainder coming from actual changes in spike timing. There are no obvious differences in MIrsp or MItim across the two stimulus sets, with the possible exception that for very low frequency neurons (BFs <1 kHz), MItim for the SN/ILD stimulus set is larger than for the ITD/ILD stimulus set.
|
|
|
Figure 5, A–C shows MIJOINT plotted against MIRATE for the three localization cues. Vertical deviations from the diagonal represent the FSL gain, information in first-spike latency that is independent of the information in rate. Points below the diagonal represent cases where estimation noise makes MIRATE appear greater than the other cue. For the ITD cue (Fig. 5A), the points cluster close to the diagonal (mean deviation: 0.06 ± 0.10 bits); there is little information conveyed by first-spike latency about ITD that is not conveyed by spike rate. Both ILD and SN show larger deviations from the diagonal (ILD: 0.17 ± 0.17 bits; SN: 0.16 ± 0.13 bits), indicating extra information available in first-spike latency over that available in rate. The independent information conveyed by first-spike latency is significantly smaller for ITD cues than for either ILD or SN cues (P < 10–4 in both cases, rank sum). This significant difference remains if the range of MIRATE is constrained (to be 0–0.5 or 0–1 bits) so that the comparison is done for cases with comparable MIRATE.
Figure 5, D–F shows a similar analysis for the information coded in the timing of spikes throughout the response, as computed using the SDM. The SDM method incorporates information from both spike rate and spike timing, so it provides a lower bound to the total information available in the spike train. The vertical distance between data points and the diagonal lines in Fig. 5, D–F is the additional information conveyed by spike timing over that available in discharge rate.
For ITD coding (Fig. 5D), there is little extra information conveyed by the SDM code, as was the case for latency (Fig. 5A) (mean SDM gain: 0.02 ± 0.06 bits). This result again suggests that ITD information is primarily carried by spike rate. For ILD coding there is an approximately equivalent amount of extra information available in the SDM code (0.12 ± 0.18 bits; Fig. 5E) as in the latency code (Fig. 5B). SN information is also provided by both latency (Fig. 5C) and spike timing throughout the response (0.31 ± 0.41 bits; Fig. 5F), although in this case there is more information in the SDM code than there is in the joint rate/first-spike latency code (P = 0.02, signed-rank test).
To further dissect the coding of the localization cues, Fig. 6 shows the information conveyed about each cue as a function of BF. For reference, the top row shows the information contained in spike rate. The middle row shows the FSL gain, or the extra information contained in first-spike latency that is not available in rate. This corresponds to MIJOINT – MIRATE, the vertical deviations of the points in Fig. 5, A–C. Unlike the rate information in the top row, the FSL gain does not show any obvious BF dependence for any of the cues. The SDM gain, corresponding to the vertical deviations shown in Fig. 5, D–F, is shown on the bottom row. Here there are differences among the three localization cues. The most striking feature of Fig. 6 is the difference in the BF dependence of the SN information carried by the three possible codes (right column). Whereas the rate information is highest in the mid to high BF range, the FSL gain is relatively independent of BF, and the SDM gain is highest at low BFs.
In discussing Fig. 5, we emphasized that the timing gains for ITD are smaller than for ILD and SN. Figure 6 shows this effect clearly, in that the scatter of points is more restricted near a timing gain of 0 for ITD than for ILD or SN. However, it is also true that MI values in general are smaller for ITD than for ILD or SN (Fig. 4 and the top row of Fig. 6); this begs the question of whether the timing gains as a fraction of MIRATE are smaller for ITDs. Unfortunately, the scatter in the data do not allow a firm answer to this question because most of the timing gains for ITD are near the values of the bias and scatter in the data.
It has been shown in experiments on the auditory cortex of anesthetized cats that a joint code consisting of spike count and first-spike latency (Furukawa and Middlebrooks 2002
) or mean spike latency (Nelken et al. 2005
) is sufficient to capture the majority of information transferred by auditory neurons. In Fig. 7, A–C, we compare the information contained in a joint rate/first-spike latency code to the maximum information contained in an SDM code by plotting MIJOINT against MISDM. Points that lie above the diagonal indicate instances where the SDM code underestimates the total information available in the spike trains. These are cases in which latency is particularly informative, but the SDM analysis does not capture that information. Points below the diagonal, however, indicate information available in spike timing beyond that available in the first spike.
For the ITD cues (Fig. 7A), the points lie close to the diagonal, although they have a (nonsignificant) tendency to cluster above it (22 of 35, P = 0.16, signed-rank test). ILD cues (Fig. 7B) show a significant tendency to lie above the diagonal (79 of 118, P = 0.002, signed-rank test). Thus a joint rate/first-spike latency code usually captures slightly more ILD information than the SDM code can recover. For SN cues (Fig. 7C), many points fall below the diagonal, indicating that spike timing carries extra information about SNs not captured by the joint code. Previously we showed that this information is in the form of responses to envelope fluctuations over the duration of the stimulus (Chase and Young 2006
). The difference between MISDM and MIJOINT is plotted as a function of BF for the SN cues in Fig. 7D. Low BF units are almost entirely dominated by SDM coding. In the physiological range of SN cues (6 to 20 kHz), there are neurons where MISDM dominates MIJOINT and neurons where the opposite holds true. In the highest BF neurons, MIJOINT and MISDM are roughly equivalent.
| DISCUSSION |
|---|
|
|
|---|
Temporal representation of SN cues
In a previous study (Chase and Young 2006
), responses to the frozen noise SN/ILD stimuli were compared with responses to a random noise set filtered in the same manner. Under these conditions, it was found that although FSL variations were still evident, the extra information recovered by the SDM decoder disappeared. Thus the information carried in temporal patterns has to do with the reproducible nature of the temporal envelope. This fact limits the utility of the information identified by SDM in natural conditions where the stimulus envelope is unknown and not repeatable.
Nevertheless, there are situations in which the temporal coding of SN might prove useful, as for moving sound sources, for comparing the locations of two sources, or for dealing with self-motion of the head or pinnae. As an example, take a case where the task is to judge whether a sound source is moving laterally (such that the distance of the source does not change, and therefore there is no overall frequency blue or red shift). In this case, slight movement of the source will cause subtle differences in the spectrum of the sound reaching the ear due to pinna filtering. These differences will be translated into differences in the temporal responses of some neurons (examples are shown in Fig. 8, discussed in the following text). Whether the brain can use this information is a matter of speculation at present because it seems to require a serial comparison of responses to successive stimuli. However, those changes in the temporal arrangements of spikes do not have to be processed strictly as temporal patterns. Temporal patterns can be converted to rate changes, as occurs in responses to amplitude modulation between cochlear nucleus, where there is little rate coding of modulation frequency, and IC (where there is a rate code; Joris et al. 2004
). As a test of this possibility, one can predict that the just-noticeable difference in the location of a frozen noise sound source will be smaller than that of a random noise source.
|
Multiplexing of information within a single spike train
It should be stressed that the three coding mechanisms investigated here—rate, first-spike latency, and ongoing temporal pattern changes—can occur simultaneously in the same neuron. Figure 8 shows the responses of one neuron to both the ITD/ILD and SN/ILD stimulus sets. In Fig. 8, A and B, the spike raster responses of the neuron to the two stimulus sets are shown. The most striking difference between the responses to the two stimulus sets is the variability from stimulus to stimulus of the rasters in Fig. 8A compared with the regularity of the rasters in Fig. 8B: the locations of the spike clusters in this neuron change as a function of SN, but not ITD. Using the SDM method, the extra information uncovered by spike timing in this neuron is 0.84 bits for SN, compared with 0.00 bits for ITD and 0.21 bits for ILD (averaged across the two stimulus sets).
Although ILD does not cause the global spike cluster rearrangement of SN, there is a clear ILD-dependent latency change in Fig. 8, both A and B. The first-spike latency information is 0.54 bits for ILD, compared with 0.13 bits for ITD and 0.57 bits for SN. The rate responses of this neuron are shown in the surface plots of Fig. 8, C and D. It is clear that all three cues cause changes in mean firing rates, with the greatest extent of information about ITD: 0.4 bits, compared with 0.28 bits for SN and 0.28 bits for ILD.
The fact that rate, latency, and temporal spike patterns all contain information about different cues makes it possible for neurons to multiplex multiple aspects of stimuli in their spike trains. When only discharge rate codes are considered (Chase and Young 2005
), there is substantial confounded information, meaning that the information about a full 25-stimulus set is less than the sum of the mutual information about the two stimulus cues that vary within the set, considered separately. Of course this is expected when one response variable is used to represent two independent aspects of the stimulus. When rate and temporal spike pattern (SDM) are considered simultaneously, it is possible to have full multiplexing of the two stimulus cues, with no confounded information (see Figs. 8 and 9 in Chase and Young 2006
). Here we show that latency provides additional information that should further improve the multiplexing of localization cues on spike trains in the IC.
This type of information multiplexing has also been found in the visual system. Sakai et al. (1997)
found that red and green inputs could be simultaneously encoded in ganglion cell responses. Gawne (2000)
and Gawne et al. (1996)
found that first-spike latency and spike rate differentially code for the contrast and orientation of visual line stimuli, respectively, whereas Victor and Purpura (1996)
found that visual contrast is coded at a higher temporal resolution than visual texture. Meanwhile, Oram (2005)
used a model of visual processing based on the differential coding of contrast and orientation to successfully predict reaction time distributions in visual recognition experiments. It is possible that the multiplexing of information in different codes within single neuron spike trains is a general mechanism of increasing information capacity in the brain.
One versus two stimulus variables
In this work we used stimuli that varied simultaneously in two parameters. The question arises of how this stimulus design affects the question considered in this paper: how multiple stimulus parameters can be represented by multiple aspects of a spike train. In particular, would it make a difference to our results if we had used stimuli that vary in only one parameter? Insight into this question can be gained using the idea of confounded information, introduced previously (Chase and Young 2005
). Define the mutual information between a stimulus parameter X and the response R as MI(X; R) as it was computed here by combining responses across all values of the second stimulus parameter Y. Mutual information could also be computed as MI(X; R|Y = y), by varying only one stimulus parameter (X) and keeping the second one (Y) constant at some reasonable value y. Usually there is not a clear "best" choice for y, so this calculation could be averaged across several values of y, which yields MI(X; R|Y). These two MI values are different and are related as
![]() | (5) |
Note also that if the response R is multidimensional, as discussed in the previous section, then the confounded information can be small. In this case the stimulus design matters less, i.e., the result will be the same if one stimulus parameter is changed versus two. For the results shown herein, this situation would hold to varying degrees for stimuli varying in ITD and either of the other two parameters, as long as temporal codes are considered. It might hold in cases for simultaneous variation of ILD or SN, but only if envelope locking could be used.
Implications for auditory processing
The differential coding of localization cues is useful only if downstream neurons are able to decode these cues independently. To compute the position of a sound source, information from the three cues must eventually be integrated together. However, the accuracy with which each cue can contribute to the overall location percept changes depending on sound conditions, such as the frequency content of the stimulus, temporal characteristics of the stimulus, or the reverberant qualities of the environment (Faller and Merimaa 2004
; MacPherson and Middlebrooks 2002
; Wightman and Kistler 1992
). In the cat, there is the additional problem that the spatial maps of ITD, ILD, and SN can vary differently as the ears are moved (Young et al. 1996
). Thus it is useful for the auditory system to preserve information about the cues separately so that the relative weight with which each cue contributes to the location percept can be changed depending on sound conditions.
In a recent study, Bizley et al. (2007)
reported the existence of visually responsive neurons within auditory cortical areas. They found that many neurons responded to both auditory and visual cues, often with unique temporal response components. In a subset of neurons, they studied the visual spatial receptive fields of these cells and noted a significant modulation of this visual spatial receptive field during auditory stimulation. In light of these results, it is possible not only for information from multiple auditory spatial cues to be optimally weighted for the computation of sound source location, but for information from different sensory modalities to be combined, as well. Although this type of spatial location processing is largely speculative at this time, investigations of sensory integration for localization processing may prove fruitful.
Regardless of how the final localization percept is calculated, the results reported here support the hypothesis that the analysis of sound location information above the inferior colliculus must allow for the possibility of temporal codes and simultaneous rate and temporal codes (e.g., Furukawa and Middlebrooks 2002
). The question of the means by which temporal codes such as latency could be used by neurons has been considered elsewhere (e.g., Chase and Young 2007
; Stecker and Middlebrooks 2003
), but remains a challenge for studies of neural coding.
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: S. M. Chase, McGowan Institute for Regenerative Medicine, 3025 East Carson Street, Pittsburgh, PA 15203 (E-mail: schase{at}andrew.cmu.edu)
| REFERENCES |
|---|
|
|
|---|
Bandyopadhyay S, Young ED. Discrimination of voiced stop consonants based on auditory nerve discharges. J Neurosci 24: 531–541, 2004.
Benevento LA, Coleman PD. Responses of single cells in cat inferior colliculus to binaural click stimuli: combinations of intensity levels, time differences, and intensity differences. Brain Res 17: 387–405, 1970.[CrossRef][Web of Science][Medline]
Bernstein LR. Auditory processing of interaural timing information: new insights. J Neurosci Res 66: 1035–1046, 2001.[CrossRef][Web of Science][Medline]
Bizley JK, Nodal FR, Bajo VM, Nelken I, King AJ. Physiological and anatomical evidence for multisensory interactions in auditory cortex. Cereb Cortex 17: 2172–2189, 2007.
Caird D, Klinke R. Processing of interaural time and intensity differences in the cat inferior colliculus. Exp Brain Res 68: 379–392, 1987.[Web of Science][Medline]
Chase SM, Young ED. Limited segregation of different types of sound localization information among classes of units in the inferior colliculus. J Neurosci 25: 7575–7585, 2005.
Chase SM, Young ED. Spike-timing codes enhance the representation of multiple simultaneous sound-localization cues in the inferior colliculus. J Neurosci 26: 3889–3898, 2006.
Chase SM, Young ED. First-spike latency information in single neurons increases when referenced to population onset. Proc Natl Acad Sci USA 104: 5175–5180, 2007.
Cover TM, Thomas JA. Elements of Information Theory. New York: Wiley, 1991.
Delgutte B, Joris PX, Litovski RY, Yin TCT. Relative importance of different acoustic cues to the directional sensitivity of inferior-colliculus neurons. In: Advances in Hearing Research, edited by Manley GA, Klump GM, Köppl C, Fastl H, Oeckinghaus H. River Edge, NJ: World Scientific, 1995, p. 288–299.
Delgutte B, Joris PX, Litovski RY, Yin TCT. Receptive fields and binaural interactions for virtual-space stimuli in the cat inferior colliculus. J Neurophysiol 81: 2833–2851, 1999.
Dreyer A, Delgutte B. Phase locking of auditory-nerve fibers to the envelopes of high-frequency sounds: implications for sound localization. J Neurophysiol 96: 2327–2341, 2006.
Efron B, Tibshirani RJ. Introduction to the Bootstrap. Boca Raton, FL: CRC Press, 1998.
Faller C, Merimaa J. Source localization in complex listening situations: selection of binaural cues based on interaural coherence. J Acoust Soc Am 116: 3075–3089, 2004.[CrossRef][Web of Science][Medline]
Furukawa S, Middlebrooks JC. Cortical representation of auditory space: information-bearing features of spike patterns. J Neurophysiol 87: 1749–1762, 2002.
Gawne TJ. The simultaneous coding of orientation and contrast in the responses of V1 complex cells. Exp Brain Res 133: 293–302, 2000.[CrossRef][Web of Science][Medline]
Gawne TJ, Kjaer TW, Richmond BJ. Latency: another potential code for feature binding in striate cortex. J Neurophysiol 76: 1356–1360, 1996.
Joris PX, Schreiner CE, Rees A. Neural processing of amplitude-modulated sounds. Physiol Rev 84: 541–577, 2004.
Kuhn GF. Model for the interaural time differences in the azimuthal plane. J Acoust Soc Am 62: 157–167, 1977.[CrossRef][Web of Science]
Macpherson EA, Middlebrooks JC. Listener weighting of cues for lateral angle: the duplex theory of sound localization revisited. J Acoust Soc Am 111: 2219–2236, 2002.[CrossRef][Web of Science][Medline]
Musicant AD, Chan JCK, Hind JE. Direction-dependent spectral properties of cat external ear: new data and cross-species comparisons. J Acoust Soc Am 30: 237–246, 1990.
Narayan R, Graña GD, Sen K. Distinct time scales in cortical discrimination of natural sounds in songbirds. J Neurophysiol 96: 252–258, 2006.
Nelken I, Chechik G, Mrsic-Flogel TD, King AJ, Schnupp JWH. Encoding stimulus information by spike numbers and mean response time in primary auditory cortex. J Comput Neurosci 19: 199–221, 2005.[CrossRef][Web of Science][Medline]
Oram MW. Integrating neuronal coding into cognitive models: predicting reaction time distributions. Network Comput Neural Syst 16: 377–400, 2005.[CrossRef]
Palmer AR, Kuwada S. Binaural and spatial coding in the inferior colliculus. In: The Inferior Colliculus, edited by Winer JA, Schreiner CE. New York: Springer-Verlag, 2005, p. 377–410.
Ramachandran R, Davis KA, May BJ. Single-unit responses in the inferior colliculus of decerebrate cats I. Classification based on frequency response maps. J Neurophysiol 82: 152–163, 1999.
Rayleigh Lord(StruttJW 3rd, Baron of Rayleigh). On our perception of sound direction. Philos Mag 13: 214–232, 1907.
Rice JJ, May BJ, Spirou GA, Young ED. Pinna-based spectral cues for sound localization in cat. Hear Res 58: 132–152, 1992.[CrossRef][Web of Science][Medline]
Roth GL, Kochar RK, Hind JE. Interaural time differences: implications regarding the neurophysiology of sound localization. J Acoust Soc Am 68: 1643–1651, 1980.[CrossRef][Web of Science][Medline]
Sakai HM, Machuca H, Korenberg MJ, Naka KI. Processing of color- and noncolor-coded signals in the gourami retina. III. Ganglion cells. J Neurophysiol 78: 2034–2047, 1997.
Simon HJ, Divenyi PL, Lotze A. Lateralization of narrow-band noise by blind and sighted listeners. Perception 31: 855–873, 2002.[CrossRef][Web of Science][Medline]
Stecker GC, Mickey BJ, Macpherson EA, Middlebrooks JC. Spatial sensitivity in field PAF of cat auditory cortex. J Neurophysiol 89: 2889–2903, 2003.
Sterbing SJ, Hartung K, Hoffmann KP. Spatial tuning to virtual sounds in the inferior colliculus of the guinea pig. J Neurophysiol 90: 2648–2659, 2003.
Victor JD. Binless strategies for estimation of information from neural data. Phys Rev E Stat Nonlin Soft Matter Phys 66: 051903, 2002.[Medline]
Victor JD, Purpura KP. Nature and precision of temporal coding in visual cortex: a metric-space analysis. J Neurophysiol 76: 1310–1326, 1996.
Victor JD, Purpura KP. Metric-space analysis of spike trains: theory, algorithms and application. Network Comput Neural Syst 8: 127–164, 1997.[CrossRef]
Wightman FL, Kistler DJ. The dominant role of low-frequency interaural time differences in sound localization. J Acoust Soc Am 91: 1648–1661, 1992.[CrossRef][Web of Science][Medline]
Winer JA, Schreiner CE. The Inferior Colliculus. New York: Springer-Verlag, 2005.
Young ED, Rice JJ, Tong SC. Effects of pinna position on head-related transfer functions in the cat. J Acoust Soc Am 99: 3064–3076, 1996.[CrossRef][Web of Science][Medline]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |