|
|
||||||||
1Kresge Hearing Research Institute, University of Michigan, Ann Arbor, Michigan; and 2Human Cognitive Neurophysiology Laboratory, Veterans Affairs Northern California Health Care System, Martinez, California
Submitted 28 January 2005; accepted in final form 21 April 2005
| ABSTRACT |
|---|
|
|
|---|
-chloralose-anesthetized cats. We examined the dependence of spike counts and response latencies on stimulus location as well as the information transmission by neural spike patterns. Compared with units in A1, DZ units exhibited more complex frequency tuning, longer-latency responses, increased prevalence and degree of nonmonotonic rate-level functions, and weaker responses to noise than to tonal stimulation. DZ responses also showed sharper tuning for stimulus azimuth, stronger azimuthal modulation of first-spike latency, and enhanced spatial information transmission by spike patterns, compared with A1. Each of these findings was similar to differences observed between PAF and A1. Compared with PAF, DZ responses were of shorter overall latency, and more DZ units preferred stimulation from ipsilateral azimuths, but the majority of analyses suggest strong similarity between PAF and DZ responses. These results suggest that DZ and A1 are physiologically distinct cortical fields and that fields like PAF and DZ might constitute a "belt" region of auditory cortex exhibiting enhanced spatial sensitivity and temporal coding of stimulus features. | INTRODUCTION |
|---|
|
|
|---|
It is also not clear whether particular regions of the cortex are specialized for the processing of auditory spatial information. A recently popularized view of monkey auditory cortex proposes (by analogy with monkey visual cortex) the existence of separate processing "streams" specialized for the processing of spatial and spectral information (Rauschecker 1998
). That view is partly supported by physiological data that reveal some between-area differences in selectivity for the spatial locations or spectro-temporal content of acoustic stimuli (Recanzone et al. 2000
; Tian et al. 2001
). Studies of cortical neurophysiology in the cat, however, have revealed only minor differences in spatial sensitivity between various cortical areas. Rather, it appears that neurons throughout the auditory cortex are able to represent sound-source locations with similar accuracy. Furthermore, the responses of neurons in most areas of auditory cortex are modulated by stimulus location in similar ways, exhibiting broad spatial tuning that affects both the magnitude (spike count) and latency of neural responses (Imig et al. 1990
; Middlebrooks and Pettigrew 1981
; Rajan et al. 1990b
). The emerging view has been that the auditory cortex is equipotent in terms of spatial coding. That view is consistent with the general notion of broadly distributed spatial representations, but an alternative possibility is that specialization for the processing of auditory space exists in regions of the cortex in which spatial sensitivity has not yet been studied.
In an effort to address the potential for spatial specialization in unstudied regions of the auditory cortex, we have begun to describe the spatial sensitivity of neurons in fields beyond primary auditory cortex (A1) in the cat, most recently the posterior auditory field (PAF) (Stecker et al. 2003
). There we identified a number of important features of PAF responses suggestive of a role in spatial processing. Compared with neurons in A1, those in PAF are more sharply tuned for sound-source locations, their spatial tuning is less affected by increases in stimulus level, their preferred locations sample space more uniformly, and their spike patterns are more informative about stimulus location. Perhaps most significantly, response latencies of PAF neurons (which are longer than those of A1 neurons by tens of milliseconds) are strongly modulated by stimulus location, providing a robust temporal code for auditory space. Overall, these differences suggest that PAF is a strong candidate for a site of spatial processing. Aside from the differences in response latency, however, the effects appear as quantitative differences between neural populations that each exhibit significant variation between individual neurons. As such, one cannot conclude from physiological results alone that PAF is specialized or even necessary for the processing of auditory space, although recent behavioral evidence from cortical inactivation studies has suggested a critical role of PAF, along with A1 and the field of the anterior ectosylvian sulcus (fAES), in sound localization (Malhotra et al. 2004a
).
Another promising region of cat auditory cortex, in terms of spatial sensitivity, is the "dorsal zone" (DZ) of auditory cortex (He and Hashikawa 1998
; Middlebrooks and Zook 1983
; Sutter and Schreiner 1991
), which shares a number of physiological attributes with PAF. DZ extends dorsally from A1 into the ventral bank of the suprasylvian sulcus (SSS). Neurons in DZ can be distinguished from those in neighboring A1 on the basis of spectral tuning and response latency, and DZ is further set apart by a distinctive set of thalamocortical projections. Whereas A1 receives its strongest input from the ventral division of the medial geniculate (MGv), DZ receives projections mainly from the dorsal division (MGd), the dorsal cap of MGv, and the posterior group (PO) of thalamic nuclei (He and Hashikawa 1998
; Huang and Winer 2000
; Middlebrooks and Zook 1983
). A number of features of DZ responses are suggestive of a role in spatial processing. First, as in PAF, many DZ neurons exhibit complex frequency tuningoften extending to high frequencies (>12 kHz)that involves multiple excitatory and inhibitory domains (Sutter and Schreiner 1991
). As we have argued previously, such tuning might play a role in the processing of monaural spectral cues to sound-source location (particularly in elevation). Second, initial investigations of DZ (Middlebrooks and Zook 1983
) revealed a large number of "predominantly binaural" neurons that showed no response to monaural stimulation in contrast to the neurons more common to A1 that responded well to contralateral monaural stimulation and were either facilitated or inhibited by simultaneous ipsilateral stimulation. This pattern of binaural sensitivity suggests that DZ might contain a wider variety of spatial tuning beyond the forms commonly observed in regions of the cortex more strongly dominated by contralateral input (e.g., A1 and PAF). Third, DZ neurons have been shown to exhibit long response latencies (He et al. 1997
; Mendelson et al. 1997
). By analogy with the responses of PAF neurons, this raises the possibility that DZ units might exhibit spatial modulation of response timing as a robust form of spatial coding.
In this study, we recorded primarily from locations in the ventral bank of SSS, varying the caudorostral position of penetrations to sample posterior (dorsal to tip of the posterior ectosylvial sulcus (PES), near PAF, and the dorso-posterior field EPd), middle (dorsal to central A1), and anterior [dorsal to tip of the anterior ectosylvial sulcus (AES), near the anterior auditory field (AAF)] regions of DZ. We delineated the posterior, anterior, and dorsal borders of DZ based on sulcal pattern and its ventral border based on marked differences between DZ and A1 responses. It should be noted that the boundaries of DZ are not well characterized in the literature. Sutter and Schreiner (1991)
reported no physiological border between dorsal and ventral auditory fields, whereas He and Hashikawa (1998)
reported clear physiological differences between DZ and A1. It also is possible that our DZ recording sites strayed into neighboring fields. Posterior penetrations may have involved units that could be alternatively labeled as EPd or dorsal PAF. Similarly, anterior penetrations could have extended into AAF. In addition, DZ itself might contain discrete subregions. Without clear physiological markers for field borders (e.g., tonotopic reversal), their identification is difficult. We attempted to address these concerns in two ways. First, recordings in DZ were largely confined to the ventral bank of SSS, well away from the presumed border with A1 and/or PAF, while anterior recordings were monitored for indications (e.g., short-latency responses) of contamination by AAF units. Second, a series of recordings were made in the border region between A1 and DZ to characterize the sharpness of the transition of response properties between A1 and DZ.
| METHODS |
|---|
|
|
|---|
Animal preparation
Sixteen purpose-bred male (10) and female (6) cats, weighing between 2.8 and 7.0 kg were used in this study. Five of the female cats were previously trained to detect acoustic stimuli in a chronic behavioral study. The remaining cats participated only in the acute experiments. Data from six of the cats were included in the samples of A1 and PAF units reported by Stecker et al. (2003)
. The DZ data from those cats and all data from the remaining 10 cats are new to this report (see Table 1). Surgical anesthesia was induced and maintained with isofluorane (23%) in nitrous oxide (2 l/min) and oxygen (1 l/min). After surgery, cats were transferred to intravenous
-chloralose (1.5 mg/ml) in Ringer solution for unit recording. Dosage was
3 mg · kg1 · h1 and adjusted to maintain an areflexive state. Atropine sulfate (0.10.2 ml im) was administered at regular intervals throughout the experiment to suppress mucosal secretions. After partial removal of the scalp and right temporalis muscle, a craniotomy of 1-cm diam exposed the right middle ectosylvian gyrus, PES, and SSS. The animal was positioned in the center of a sound chamber with its head held by a bar attached to a skull fixture and its body suspended in a fabric sling. Thin wire supports maintained symmetric pinna placement throughout the experiment. A warm-water heating pad maintained body temperature at 37°C. Core temperature was monitored using an electronic esophageal or rectal thermometer. Heart and respiration rates were monitored using an electronic stethoscope placed in the esophagus or under the fore-limb. Experiments lasted from 2 to 5 days, after which the cats were killed. The right cortical hemisphere was then removed and immersed in buffered formalin for later visual confirmation of the region of cortex recorded.
|
Recordings were made in a 2.6x 2.6 x 2.5-m sound-attenuating chamber, the surfaces of which were lined with sound-absorbing foam (Illbruck) to suppress reflections. Sounds were presented one at a time from calibrated loudspeakers located 1.2 m from the cat's head and spaced 20° apart in the ear-level horizontal plane (for assessment of azimuth sensitivity) or in the vertical median plane (for assessment of elevation sensitivity). Loudspeaker locations are expressed in degrees azimuth or elevation, relative to the loudspeaker directly in front of the cat (0°). Positive azimuths correspond to the cat's right side (ipsilateral to the recording site); positive elevations increase upward and to the rear (90° is directly overhead). The loudspeaker placed directly behind the cat corresponds to 180° (azimuth or elevation). Loudspeakers were placed at all 20° multiples of azimuth including 0°, and all 20° multiples of elevation from 60° (60° below the frontal horizon) to +200° (20° below the rear horizon). Experiments were controlled by a personal computer, and acoustic stimuli were synthesized digitally using equipment from Tucker-Davis Technologies (TDT; Gainesville, FL). All stimuli were generated with 16- or 24-bit precision at a 100-kHz sampling rate. A computer-controlled multiplexer permitted any one loudspeaker to be activated at a time. Stimuli were either 80-ms Gaussian noise bursts with abrupt onsets and offsets or 80-ms pure tones with 5-ms raised-cosine onset/offset ramps.
Data acquisition and spike sorting
Extracellular unit activity was recorded using multi-site silicon-substrate microprobes. These probes, provided by the University of Michigan Center for Neural Communication Technology (Anderson et al. 1989
), permitted simultaneous recording from
16 cortical sites, and are fabricated in several formats. The data presented here were obtained using primarily single-shanked probes with linear arrays of either 16 recording sites spaced every 100 or 150 µm or (less commonly) 8 sites spaced every 200 µm. Impedances were between 1 and 4 M
on 16-site linear probes (site area: 177 µm2) and 340360 k
on 8-site probes (site area: 1,250 µm2). Seven penetrations spanned the presumed border between A1 and DZ; of these, three used 16-site single-shank probes penetrating the cortical surface tangentially and four used 4-shank probes oriented orthogonally to the cortical surface (see Fig. 12). Four-shank probes contained four 1,250-µm2 recording sites, spaced by 200 µm, on each of four parallel shanks 3.75 mm long and separated by 200µm. Impedances on four-shank probes ranged from 300 to 400 k
. In general, two probes were placed simultaneously in different cortical areas (DZ, PAF, and/or A1), and we recorded from up to a total of 32 sites. Activity at each site was amplified, digitized (TDT RA16, 25-kHz sampling rate), band-pass filtered (0.24 kHz), resampled at 12.5 kHz, and stored on a computer disk for off-line analysis. On-line monitoring of spikes allowed estimation of thresholds and frequency tuning prior to the collection of spatial data.
|
In contrast to previous studies (e.g., Furukawa and Middlebrooks 2002
), we chose to record from as many sites as possible per penetration rather than to obtain recordings from clearly isolated single neurons. The spike-sorting procedure described in the preceding text was used to obtain the best possible isolation of neural signals; in general, however, we conservatively consider the recordings to be from multi-unit clusters rather than single isolated neurons. In past studiesincluding that of Stecker et al. (2003)
, whose procedures were identical to those of the current studywe have not observed significant differences between tuning properties estimated from such recordings and those that could be reliably identified as single isolated neurons. Thus we do not distinguish between them in this report; the term "unit" is used in reference to both.
Units that responded with <1 spike per trial, on average, to their most effective stimulus were rejected from further analysis as were units the average response of which across all stimuli varied by more than a factor of two between the first and second halves of blocks of trials in a recording session. This screening procedure was carried out independently for responses to stimuli varying in azimuth and elevation (see Experimental procedure). A number of PAF and A1 units included in this analysis were included in the sample of Stecker et al. (2003)
. Table 1 indicates the number of units in each area that appeared in that sample or are new to the current study.
Experimental procedure
Recordings in this study focused on cortical areas DZ, PAF, and A1, which were identified initially by the cortical sulcal pattern and secondarily by their responsiveness to pure-tone stimulation, tonotopic organization, and response latencies. Penetrations in DZ proceeded in the lateromedial direction into the ventral bank of the SSS. Penetrations in area PAF proceeded in the dorsoventral or lateromedial direction along the caudal bank of the PES. These cortical regions were additionally confirmed by examination of response latencies (median latencies were 22 ms in DZ, 29 ms in PAF, and 17 ms in A1) and the identification of broad or complex (multi-peaked) tuning to pure-tone frequency. Penetrations in A1 passed obliquely into the middle ectosylvian gyrus, generally proceeding in a rostrocaudal direction. Search stimuli, consisting of broadband noise bursts and 0.5- to 30-kHz pure tones, were presented from loudspeakers located at 0 or 40° azimuth in the horizontal plane or +80° elevation in the median plane (10° forward of overhead). Penetration depths were adjusted to maximize the number of active recording sites, with typically 1014 sites per probe showing unit responses.
Study of the units in each penetration began with estimates of their thresholds to noise bursts, tested in 5-dB increments of SPL. The stimuli were presented from a location at which units responded reliably, most often from loudspeakers at azimuths of 0 or 40° in the horizontal plane or in the mid-sagittal plane at +80° elevation. Typically, unit thresholds varied by <10 dB across sites in a single penetration, and the modal threshold was adopted as the representative threshold for the penetration. Responses to pure-tone stimuli were tested using tone frequencies varying in 1/3- or 1/6-octave steps from 1 to 30 kHz; tone levels varied in 10-dB steps, typically from 0 to 50 dB SPL. Pure tones were always presented from 80° elevation; this overhead location was chosen because the spectrum of the cats' directional transfer function tended to be flattest there, minimizing the effects of filtering by the pinna on the units' responses (Xu and Middlebrooks 2000
). Next, we measured the units' spatial sensitivities using 80-ms noise bursts 20, 30, and 40 dB above threshold, presented from 18 locations in the horizontal plane (180 to +160° in 20° steps) and 14 locations in the mid-sagittal plane (60 to +200°). Stimuli were presented in pseudorandom order such that each combination of SPL and location was presented once before all combinations were repeated in a different random order; 40 repetitions were completed for each penetration. Neural activity was recorded from 2050 ms before to 80200 ms after the stimulus onset. Measurement of spatial sensitivity was often followed by presentations of additional stimuli related to other research questions, so that study at each penetration or pair of penetrations lasted from 2 to 14 h. Experiments yielded data from 2 to 18 (median = 5.5) penetrations per animal in A1, PAF, and DZ. In some cases, additional penetrations were made outside these areas; those recordings are not included in this report.
Data analysis
SPATIAL SENSITIVITY ASSESSED BY ANALYSIS OF SPIKE COUNT AND RESPONSE LATENCY.
After spike sorting, spike times were stored as latencies relative to the onset of sound at the loudspeaker. Arrival of sound at the cat's head followed a delay of
3.5 ms due to acoustical travel time. Spatial sensitivity was assessed by analyzing spike rates, response latencies, and the amount of stimulus-related information conveyed by spike patterns. From these, we computed statistics of response modulation, spatial tuning width, and preferred location. These are summarized in Table 2 and briefly described in the following text; for mathematical definitions, see (Stecker et al. 2003
).
|
).
The depth (or range) of response modulation characterized the degree to which response latencies or spike counts varied across space. It was computed as the range of geometric mean latency or arithmetic mean spike count (normalized to 1 at max count) across location.
L has units of milliseconds and
C is a proportion of maximum spike count, ranging from 0 (no modulation) to 1 (100% modulation).
SPATIAL TUNING WIDTH (W).
Spatial tuning width characterized the range of locations that were effective in eliciting a strong or rapid response from a given unit. Tuning width WC or WL was defined as the range of locations (not necessarily contiguous) associated with spike counts of
50% of maximum or latencies within the shortest 25% of the latency range across location. W has units of degrees.
SPATIAL CENTROID (
).
Following Middlebrooks et al. (1998)
, we characterized the preferred stimulus locations of individual units by calculating the spatial centroid, or spatial center of mass of the units' peak responses. The peak was defined as the contiguous set of locations eliciting responses within 25% of maximum spike count or within the shortest 25% of the latency range and including the location eliciting the maximum (or shortest latency) response overall. We then computed a vector sum of angles to stimulus locations included in the peak, each weighted by spike count or inverse latency; the angle of the resultant vector gave the spatial centroid
C or
L.
SPATIAL INFORMATION TRANSMITTED BY SPIKE PATTERNS (TSR).
As in previous work (Furukawa and Middlebrooks 2002
; Furukawa et al. 2000
; Mickey and Middlebrooks 2003
; Middlebrooks et al. 1998
; Stecker et al. 2003
), we estimated the spatial information transmitted by temporal patterns of neural response using a statistical pattern-recognition algorithm implemented using a customized version of the MATLAB Neural Network Toolbox (The Mathworks, Natick MA). The approach used here was described in detail by Stecker et al. (2003)
. Briefly, it involved the classification of neural spike patterns by the stimulus locations most likely to have elicited them. For this analysis, different types of spike patterns were computed in each of three separate conditions. In the first, single-unit spike patterns were compiled by computing bootstrapped spike-density functions (SDFs) for each unit. These were spike times recorded on eight randomly selected (with replacement, see Efron and Tibshirani 1991
) trials corresponding to a particular stimulus location (stimulus levels 2040 dB above threshold were included), convolved with a Gaussian impulse (
= 1 ms) and resampled to produce a histogram of spike count per 2-ms bin. The motivation for bootstrapping in this case was to obtain reliable estimates of stimulus-related spike patterns while also preserving a measure of trial-by-trial variability in patterns. Because bootstrapped SDFs pool data across trials, however, transmitted-information estimates based on them cannot be interpreted in terms of information per trial. In the other two conditions, we assessed the specific information-bearing features of neural responses in each cortical area by generating "reduced" spike patterns that contained only normalized spike count or response latency averaged across the set of eight selected trials and expressed as a scalar value.
Regardless of the type of spike pattern, 20 patterns per stimulus type were generated from one half of trials (the "training" set) and used to construct a pattern-recognition template for each stimulus location. Twenty additional spike patterns were generated from the remaining trials (the "test" set), and each of these was classified according to the most similar (smallest vector Euclidean distance) template obtained in the previous step, thus estimating the most likely stimulus location given the observed neural response. Estimates of stimulus locations were expressed as joint stimulus-response probability matrices (confusion matrices, see Fig. 1), from which we calculated total stimulus-related (TSR) transmitted information (the average of partial information across stimuli; Furukawa and Middlebrooks 2002
). Transmitted information (mutual information) reflects the reduction in uncertainty about stimulus location given the network responses, and has units of bits. One bit of transmitted information implies perfect discrimination of two regions of space (e.g., left vs. right) or more continuous discrimination with some error. Perfect identification of 18 locations corresponds to 4.17 bits. For the present study, we calculated the transmitted information from classifications based on single-unit spike patterns (TSRS) and reduced spike patterns consisting of only spike counts (TSRC) or response latencies (TSRL) obtained from single-unit responses.
|
The linear Euclidean distance metric of the pattern-recognition algorithm raises another consideration: the algorithm cannot recognize disjunctions in the input space (e.g., as would occur if a single stimulus elicited two different types of neural responses each dissimilar to their combined mean) and thus may not have detected information in spike patterns optimally. The frequency and degree of such effects could not be known without pursuing more complex information-theoretic analyses, but visual inspection of spike patterns did not reveal any obvious examples of such effects. Following Stecker et al. (2003)
, we consider the current TSR estimates to represent lower bounds on transmitted information in the case where complex or context-dependent responses might appear. Most importantly, our focus in this report is on comparing information rates between cortical fields rather than accurately estimating them in absolute terms. Because all information estimates were based on the same set of methods, and assuming that neither bias nor algorithm performance differed between the neural populations being compared, these effects should have no effect on the interpretation of the current results.
IDENTIFICATION OF FREQUENCY-TUNING PEAKS FROM FREQUENCY RESPONSE AREAS.
Pure-tone responses were analyzed by computing the frequency response area (FRA, a contour plot of spike count as a function of pure-tone frequency and level: Fig. 1, left) for each unit. Characteristic frequency (CF) for each unit was defined as the frequency of the lowest-level stimulus that elicited a response exceeding the (averaged prestimulus) spontaneous rate by
40% of maximum spike rate (measured across stimuli). The CF also defined the primary frequency-tuning "peak." Secondary frequency-tuning peaks, when present, were defined similarly, but only where the FRA indicated a reduction of
50% in response at frequencies intermediate of adjacent peaks.
TESTS OF STATISTICAL HYPOTHESES.
Following Stecker et al. (2003)
, we used nonparametric permutation tests to compare distributions of spatial statistics between cortical fields, stimulus levels, etc. Tests recomputed sampling distributions of interest (generally the difference between medians) under 5,000 different permutations of variable labels. The proportion exceeding (or falling below) the actual computed value gives the probability of type I error, or "P value." Unless otherwise noted, P values given in the text refer to this method. They are stated with one significant digit, although we adopted a fixed criterion for statistical significance of P < 0.05. Note that the sensitivity of a 5,000-permutation test is limited to 0.0002, so "P < 0.0002" indicates a difference more extreme than any obtained by random permutation. Other standard statistical tests (e.g., linear regression) used the MATLAB Statistics Toolbox (The Mathworks). Except as otherwise noted, tests between cortical areas compared the full population of units recorded in each area, i.e., 319 units in A1, 472 units in PAF, and 337 units in DZ, as given in Table 1.
| RESULTS |
|---|
|
|
|---|
General observations suggest broad similarities between DZ physiology and the responses of units in PAF, which together differed from A1 in the complexity of their frequency tuning, latency of their responses, prevalence of nonmonotonic rate-level functions, and general preference for stimulation by pure tones rather than by broadband noise. Figure 1 summarizes the physiological responses of example units recorded in DZ, PAF, and A1. Note the prevalence, among DZ units, of multi-peaked and complex (nonmonotonic and patchy) frequency tuning. Typically, A1 units exhibited sharp tuning to a single well-defined characteristic frequency at low levels that broadened with increasing stimulus intensity. In contrast, PAF units often exhibited complex multi-peaked frequency tuning. As depicted by the examples (which are typical), units in DZ resembled PAF units in this respect. Figure 2 (left) plots distributions of FRA complexity (described by the number of frequency-tuning peaks in each unit's FRA) in all three fields. The majority of A1 units exhibited a single, well-defined peak of frequency response, whereas the majority of DZ units exhibited multipeaked (
2) frequency tuning. PAF, noted for complex tuning (Loftus and Sutter 2001
), was intermediate. These results are consistent with those of Sutter and Schreiner (1991)
, who reported larger numbers of multipeaked neurons in dorsal than in ventral regions of auditory cortex.
|
Long, stimulus-sensitive response latency in PAF and DZ
A second feature of PAF responses shared by DZ units was long and stimulus-dependent response latency. Rasters of spike times recorded for noise stimuli varying in azimuth are plotted in Fig. 1, middle left. Whereas A1 units overwhelmingly responded with short latency (<20 ms) that was relatively insensitive to changes in stimulus azimuth, PAF units responded with longer latency (>20 ms) that was strongly modulated by location. DZ units generally exhibited latency modulation similar to PAF units although their latencies were intermediate between those of PAF and A1 units. Median overall latency in DZ (22.04 ms) was significantly longer than that in A1 (17.64 ms, P < 0.0002) and shorter than that in PAF (28.75 ms, P < 0.0002). This difference is quantified across neural populations in Fig. 3, which reveals significant differences in both the overall latency (left) and range of latency modulation (right) observed among PAF and A1 units. Although latencies were generally shorter in DZ than PAF, a number of units in DZ responded with latencies of
60 ms, consistent with previous reports of very long-latency responses in the area (He and Hashikawa 1998
). Like those of PAF units, the response latencies of DZ units were strongly modulated by sound-source location. The range of latency across azimuth was significantly smaller in A1 (median: 3.11 ms) than in either PAF (10.62 ms, P < 0.0002) or DZ (8.38 ms, P < 0.0002), which did not differ significantly from one another. It is interesting to note that latencies of many DZ units appeared to follow a different pattern than those of PAF units. The DZ units responded with one fixed latency across a wide range of contiguous azimuths, shifting to a different fixed latency at other locations. Latency shifts often occurred near 0 and 180°. The pattern of abrupt latency shift across azimuth is seen clearly for three DZ neurons in Fig. 1 and can be contrasted with the more gradual latency modulation of the depicted PAF units.
|
We calculated the monotonicity ratio, defined as the ratio of the response at the highest level tested to the maximum observed response (Sutter and Schreiner 1991
), for each unit. A monotonicity ratio near 1 indicates that a unit's spike count increased monotonically with stimulus level, 0 indicates a complete failure to respond at the highest tested level, and intermediate values indicate weakened responses to high-level stimuli. We adopted a criterion value of 0.5 to define units with nonmonotonic rate-level functions. The proportion of these was significantly greater in DZ (69/337 = 20%) than in A1 (36/319 = 11%, P < 0.0002) and less than in PAF (134/472 = 28%, P < 0.0002). Median monotonicity ratios were significantly lower in DZ (0.780) than in A1 (0.839, P < 0.01) and higher than in PAF (0.688, P < 0.005), consistent with past results showing stronger nonmonotonicity in dorsal than in ventral auditory cortex (Sutter and Schreiner 1995
). Figure 4 plots distributions of monotonicity ratio across cortical fields. Distributions in all three fields (left) were clearly nonunimodal. Rather modes were observed at values of 0 (completely nonmonotonic), 1 (completely monotonic), and at some intermediate values. Within the intermediate region, distributions did not differ greatly between fields except for a slight elevation in the proportion of moderately nonmonotonic DZ and PAF units with ratios <0.3. The largest differences between areas were instead found in the proportions of units with ratios near 0 and 1 (right).
|
A fourth similarity between units in DZ and PAF was their preference for tonal stimulation over noise. For units whose tone responses were recorded, we computed the ratio of best noise response (across all tested stimulus locations and levels) to best tone response (across all tested frequencies and levels). The results indicate that 258/337 (77%) of DZ units preferred tones to noises (noise/tone ratio <1), compared with 59% (188/319) of A1 units and 74% (351/472) of PAF units. Mean noise/tone ratios were 0.781 in PAF, 0.724 in DZ, and 0.986 in A1. Consistent with results in PAF (Stecker et al. 2003
), noise/tone ratios were significantly correlated with monotonicity ratios in PAF (r = 0.2073, P < 0.0001) and DZ (r = 0.2938, P < 0.0001) but not in A1 (r = 0.0272, P < 0.7). That is, on average, units in DZ and PAF that were nonmonotonic were somewhat more likely to prefer tones than were monotonic units. While similar to results obtained in PAF (Phillips et al. 1995
; Stecker et al. 2003
), DZ units' preference for tones over noise runs counter to reports that neurons exhibiting multi-peaked FRAs (which are more prevalent in DZ than A1) respond more strongly to noise than to tonal stimulation (Sutter and Schreiner 1991
).
Preferred locations of DZ units sample contralateral and ipsilateral space more completely than those of PAF or A1 units
Figure 5 plots distributions of preferred azimuths and elevations in A1, PAF, and DZ. Overall, the majority of units in all three fields preferred contralateral (negative azimuths) over ipsilateral (positive azimuths) stimulation. DZ, however, contained a higher proportion of units preferring ipsilateral azimuths than A1 or PAF. Omitting untuned units and those with centroids that fell within 10° of the interaural midline, significantly more DZ units (26%) possessed ipsilateral centroids than did PAF units (15%, P < 0.0002) or A1 units (10%, P < 0.0002). In terms of elevation, the majority of tuned units in all three areas preferred stimulus locations aligned with the acoustic axis of the pinnae, 2060° above the frontal horizon. A number of units in PAF and DZ, however, preferred low or rearward elevations. This may reflect the increased numbers of nonmonotonic units in PAF and DZ; such units may have responded most strongly when stimuli were subject to acoustic attenuation (e.g., due to interference by the cat's body) that occurred when stimuli were presented from the rear.
|
More units in PAF and DZ were tuned to sound-source location than in A1, as evidenced by the relatively lower proportions of units (indicated by "NC" in Fig. 5) for which no centroid could be computed. That pattern is reiterated in the spatial tuning widths plotted in Fig. 6. Azimuth tuning widths were consistently and significantly narrower in DZ (median WC,az: 205.2°) than in A1 (259.0°, P < 0.0002) or PAF (238.6°, P < 0.03) when stimuli were presented 20 dB above unit threshold. At higher stimulus levels (40 dB above unit threshold), a large number of units in each cortical area responded throughout 360° of azimuth, although the level-dependent increase in tuning width was less in PAF (21.0°) than in A1 (36.0°, P < 0.008). The tuning of DZ units broadened by an intermediate amount that did not differ significantly from PAF or A1 (28.6°, P < 0.1). As a result of this broadening, median tuning widths measured at the higher level (340.6° in A1, 301.4° in PAF, and 306.3° in DZ) remained larger in A1 than in DZ (P < 0.0004) or PAF (P < 0.0002) but became similar between DZ and PAF (P < 0.4), suggesting that DZ units were not as level-independent as PAF units (Stecker et al. 2003
). Figure 6, right, plots distributions of elevation tuning width WC,el in each cortical area. Elevation tuning was very broad overall, and did not differ significantly between areas (P > 0.05).
|
C in the three areas. Spike-count modulation by azimuth was similar in the three areas for low-level sounds (median
Caz = 0.73, 0.73, 0.75 in A1, PAF, and DZ, respectively), but was significantly weaker in A1 (median
Caz = 0.55) than in PAF (0.63, P < 0.0002) or DZ (0.61, P < 0.0002) for stimuli 40 dB above unit threshold. Similarly, modulation by elevation
Cel was significantly stronger in PAF and DZ (median
Cel = 0.50 in both) than in A1 (0.43, P < 0.0002) at 40 dB, but similar across areas at 20 dB above threshold (median
Cel = 0.68, 0.62, 0.64 in A1, PAF, and DZ).
|
We used statistical pattern recognition to assess the ability of changes in neural response patterns to signal changes in stimulus location. Summarized as bits of stimulus-related information TSRS, distributions of algorithm performance are plotted in Fig. 8. Spike patterns of DZ and PAF units transmitted a median TSRS of 0.68 and 0.70 bits, respectively, and did not differ significantly (P < 0.3). Both transmitted significantly greater azimuth-related information than did spike patterns of A1 units (median TSRS: 0.62 bits vs. DZ, P < 0.004; vs. PAF, P < 0.002). The proportion of units transmitting >1 bit, however, was greater in DZ (20.0%) than in either A1 (10.5%, P < 0.0002) or PAF (14.8%, P < 0.0002), suggesting a sizeable population of more-informative units there. This proportion also differed significantly between PAF and A1 (P < 0.04). Elevation-related information rates were more similar between the areas, slightly higher in DZ and PAF (with medians of 0.44 and 0.46 bits, respectively) than in A1 (0.42 bits). Note, however, that elevation sensitivity was tested on the vertical median plane, where interaural differences are minimized. Testing at each unit's best azimuth might have revealed different (better) elevation sensitivity for many units (e.g., those with circumscribed spatial receptive fields).
|
Because area DZ, like PAF, encoded sound-source locations more accurately than primary auditory cortex and contained a large number of neurons whose response latencies were strongly modulated by changes in stimulus location, we hypothesized thatlike PAF units (Stecker et al. 2003
)DZ units would be more effective at encoding space by response latency than spike counts. Here, we examined the relative contribution of response latency and spike count by assessing classification performance based on "reduced" spike patterns containing only latency or count information (see METHODS). Distributions of the resulting count and latency transmitted-information estimates, TSRCand TSRL, are plotted in Fig. 9. As expected, spatial information transmitted by latency was greater in DZ (median TSRL: 0.36 bits) and PAF (0.38 bits) than A1 (0.33 bits, P < 0.02), but did not significantly differ between DZ and PAF (P < 0.1). Information carried by spike count, however, was greater in DZ (median TSRC: 0.29 bits) and A1 (0.28 bits)which did not differ significantly (P < 0.3)than in PAF (0.24 bits, P < 0.03).
|
|
Arrangement of spatial tuning across the cortex
As in previous studies (e.g., Furukawa and Middlebrooks 2002
; Stecker et al. 2003
), we commonly observed that units recorded from nearby sites on a single recording probe exhibited similar response properties. When probes were oriented lateral-to-medial within the ventral bank of SSS (ventral-to-dorsal along the cortical surface), we observed groups of similarly tuned units (i.e., units that preferred contralateral or ipsilateral azimuths) that were demarcated by one or more units with an opposite lateral preference. Five such sequences are illustrated in Fig. 11. Such groups covered between four and nine adjacent recording sites, corresponding to patches 4501,200 µm in width. We hypothesize that these patches correspond to a dorsal extension of the system of "binaural bands" (23 rostrocaudally elongated regions of units with similar binaural sensitivities, interdigitating with regions of different sensitivity) described in A1 (Imig and Adrian 1977
; Middlebrooks et al. 1980
; Nakamoto et al. 2004
). In A1, binaural bands appear to correlate with regions of commissural input from contralateral auditory cortex (Imig and Brugge 1978
). Similar patches of contralateral input occur within the ventral bank of SSS, consistent with the present observation of patchy spatial tuning in DZ.
|
The preceding results indicate a clear distinction between the physiology of A1 and DZ neurons, in terms of complexity of frequency-tuning, latency of response, and pattern of spatial sensitivity. Nevertheless, DZ has been considered a subfield of A1 in some previous studies (Middlebrooks and Zook 1983
), raising the question of whether a definitive boundary can be detected between the fields. In the current study, most DZ recordings were confined to dorsal regions of DZ (within the ventral bank of SSS), presumably well dorsal of any such border. To describe the physiology of units surrounding the border, we made penetrations using four-shank probes (see METHODS) in the expected vicinity of the A1/DZ border. These revealed separate groups of units with DZ-like and A1-like responses in close proximity. Figure 12 illustrates one such recording, made with two four-shank electrodes. Rasters and frequency response areas show temporally compact and sharply tuned responses, respectively, of units recorded on the ventral shanks of both probes. Such responses are consistent with A1 physiology, whereas the responses of units recorded on the dorsal shanks demonstrated DZ-like features including complex frequency tuning and late patterned temporal responses. The transition between response types across the cortical surface was abrupt (narrower than the spacing between shanks, which was 200 µm) rather than gradual.
| DISCUSSION |
|---|
|
|
|---|
In summary, the current results reveal a distinct pattern of physiological response in DZ than in A1. The differences include, in DZ, more complex frequency tuning, longer-latency responses, increased prevalence and degree of nonmonotonic rate-level functions, and a weaker response to broadband relative to tonal stimulation. Each of these factors is consistent with a larger role for inhibition in shaping DZ responses than A1 responses (Sutter and Loftus 2003
). With respect to spatial sensitivity (and perhaps partly reflecting such inhibitory processes), DZ units are more sharply tuned to free-field azimuth and their response latencies are more strongly modulated by stimulus location than are units in A1. As a result, the spike patterns of DZ units are generally more informative of sound-source locations than those of A1 units. Furthermore, the population of DZ units samples space more uniformly than A1 in that it contains significant numbers of units preferring ipsilateral stimulation. Although several of these differences seem relatively clear (e.g., latency modulation and complex frequency tuning), many are more subtle despite their statistical significance (e.g., differences in spatial tuning width and transmitted information). While such minor differences can be useful in distinguishing cortical fields on the basis of physiological characteristics, their functional relevance may be questionable at best. Indeed, the paucity of clear (qualitative) physiological differences among cortical fields seems to argue against a strong view of functional specialization in the auditory cortex. Nevertheless, the overall pattern of results suggests that DZ is physiologically distinct from A1, and moreover that it might play an important role in sound-localization behavior.
Middlebrooks and Zook (1983)
treated DZ, conservatively, as a region of primary auditory cortex (A1), albeit a region receiving a unique pattern of thalamocortical input. He and Hashikawa (1998)
, in contrast, found a distinct pattern of neuronal physiology in DZ inconsistent with the primary-like responses observed in A1. The results of the present study corroborate that observation and lend support to the view that A1 and DZ represent physiologically distinct cortical fields. Additional support comes from the observation that the ventral-to-dorsal transition of response properties is not gradual (as expected if these differences reflect continuous variation within a field), but abrupt (consistent with an inter-field border). The existence of a border between A1 and DZ is further supported by anatomical studies using SMI-32 antibody staining as a marker of areal divisions in cortex (Mellott et al. 2005
). The border-like transition observed in our physiological results, however, might also coincide with transitions between binaural bands (Middlebrooks et al. 1980
) in A1 and/or DZ. It is not currently understood whether these bands represent subfields within one or more functionally homogeneous cortical fields, interdigitating extensions of two or more distinct fields or individually distinct cortical fields. This question confuses the relationship of A1 with its neighboring fields, and must be addressed in the future by high-resolution physiological mapping of auditory cortex.
The argument that DZ is distinct from A1 is based on large differences between the behavior of neurons in the two fields. While the characteristics of DZ neurons set them apart from A1 neurons, they arein nearly every respectshared with PAF neurons. In both fields, as compared with A1, we observe sharper spatial tuning, enhanced coding of sound-source locations, complex frequency tuning, nonmonotonic rate-level functions, stronger responses to tones than to broadband noise, and elongated temporal responses with spatially modulated first-spike latency. The differences between DZ and PAF are fewer than the similarities but include overall shorter first-spike latencies in DZ (values are intermediate between PAF and A1), better coding of sound-source locations by spike counts in absence of temporal information in DZ, and larger numbers of ipsilaterally tuned neurons in DZ (possibly related to the appearance of "binaural bands"). One might argue, based on the results, that PAF and DZ correspond to a system of "belt" fieldscharacterized by complex, nonlinear, and long-latency responsesthat surrounds the primary "core" fields of A1 and AAF (Harrington et al. 2005
), which are characterized by simpler, linear, short-latency responses. We note, however, that there are prominent anatomical differences between DZ and PAF; anatomical tracer injections in DZ produce retrograde label in compact clusters of cells in the dorsal cap of the lateral part of the lateral division of the MGB (Middlebrooks and Zook 1983
) whereas that label has not been demonstrated following PAF injections.
Are fields like DZ and PAF specialized for spatial processing?
As discussed in our report on spatial sensitivity in PAF (Stecker et al. 2003
), the physiological characteristics of neurons in that field are better suited for encoding information about sound-source location than are neurons in A1. To the extent that such characteristics are shared by DZ neurons, a similar argument holds for DZ. The differences between fields, however, are quantitative (e.g., sharper spatial tuning, stronger modulation of spike count or latency); there are no clear qualitative differences in the manner of spatial coding between fields. Furthermore, our pattern-recognition analyses suggest that localization based on the responses of A1 neurons, although inferior to that based on PAF or DZ responses, should be reasonably accurate. We have argued that in the absence of qualitative differences between spatial sensitivity in various cortical fields, identification of each field's functional role in sound localization requires behavioral evidence (e.g., from lesion studies). Such evidence, however, has not provided a simple answer to this question. Chronic lesion studies in monkeys, for example, have demonstrated severe contralesional sound-localization deficits following extensive lesions of auditory cortex, but only minimal effects following restricted lesions of various regions within auditory cortex (Harrington and Heffner 2002
; Heffner 2005
). Studies employing temporary "inactivation" of auditory cortex (e.g., by cooling cortical tissue), however, have revealed profound deficits following inactivation of particular restricted regionsnotably cat A1 (including DZ), PAF, and fAESbut not others (e.g., cat AAF and A2). Inactivation of either DZ or ventral A1 alone produces only partial deficits, suggesting that the two fields make up a single functional unit for sound localization (Malhotra et al. 2004b
).
How are we to make sense of these cortical-inactivation results? Although PAF and DZ appear (in physiological terms) about equally specialized for spatial processing, inactivation of one (PAF) results in profound localization deficits, whereas inactivation of the other (DZ) results in only partial deficits. Further, with the exception of A1, surgical removal of other auditory fields in the cat results in only partial localization deficits, if any, regardless of the fields removed (Stroming