|
|
||||||||
1Bioengineering Graduate Group, University of California, San Francisco, University of California, Berkeley; 2W. M. Keck Foundation Center for Integrative Neuroscience; 3Coleman Memorial Laboratory, Department of Otolaryngology–Head and Neck Surgery, University of California, San Francisco, California; and 4Center for Synapses and Cognitive Neuroscience, Medical College of Georgia, Augusta, Georgia
Submitted 5 April 2007; accepted in final form 11 August 2007
|
|
ABSTRACT |
|---|
|
|
|
INTRODUCTION |
|---|
|
Recently, we examined the topographic organization of responses in the primary auditory cortex (AI) of anesthetized rats and squirrel monkeys to FM sweeps (Godey et al. 2005
; Zhang et al. 2003
). Both studies used high-density topographic mapping across AI and showed the functional organization of FM sweep responses. Characteristic frequency (CF) could be predicted from FM sweep responses. Direction selectivity was correlated with CF, indicating that sweep direction was topographically represented in AI, with upward FM sweep selective sites present in the low-frequency portion of AI and downward selective sites present in the high-frequency portion.
In this study we extend our experimental technique and examine the responses of neurons in the awake owl monkey to FM sweep stimuli. A main question is whether the coding of dynamic stimuli, such as FM sweeps, is fundamentally different in the awake animal because there has been evidence of potential differences in the response pattern to stationary stimuli in anesthetized and awake preparations (Lu et al. 2001
). The owl monkeys used in this study were chronically implanted with a multielectrode recording array using previously established methods (Blake and Merzenich 2002
; deCharms et al. 1998
, 1999
). The use of the implanted awake owl monkey allows auditory function to be studied under more natural, and behaviorally controlled, conditions. Because nonhuman primates are an attractive model for human perception and neural processing of complex sounds, this preparation is advantageous when studying neural coding in AI.
In addition to FM sweep responses, we computed spectrotemporal receptive fields (STRFs) using random tone-pip stimuli (Blake and Merzenich 2002
; deCharms et al. 1998
; Rutkowski et al. 2002
). Theoretically, for a neuron that processes stimuli in a quasi-linear fashion, the STRF provides a sufficient description of the spectrotemporal preferences of the neuron (Blake and Merzenich 2002
; Theunissen et al. 2000
). We used the STRFs to predict the preferences of AI neurons for several FM sweep parameters. By predicting direction selectivity and best speed from the STRFs and comparing them to observed FM responses, we determined the adequacy of simple linear prediction in describing the dynamic behavior of cortical neurons (Blake and Merzenich 2002
). We conclude this report by discussing our findings in view of FM sweep studies in other species and their significance for the neural processing of vocalizations.
|
|
METHODS |
|---|
|
Animal welfare was monitored by the IACUC at University of California, San Francisco, and conformed to National Institutes of Health guidelines. All data were obtained using an approved IACUC protocol. We obtained the data from two chronically implanted owl monkeys, Aotus nancymaae. All methods were reported previously and are only briefly described here (deCharms et al. 1998
, 1999
). The AI was targeted 2–3 mm anterior and just lateral to the temporal–frontal fissure in the lateral bank of the lateral sulcus. Transcranial recording through burr holes confirmed AI response characteristics and expected tonotopy (Imig et al. 1977
; Recanzone et al. 1999
). Recording array implants were then placed over AI. Surgery was performed under areflexic barbiturate anesthesia and histological confirmation of AI was performed in one animal. Recordings were made from chronically implanted arrays of
49 individually placed ultrafine parylene-coated iridium microelectrodes (Micro Probe, Potomac, MD). Electrodes had exposed tip lengths of 5 to 7 µm, yielding impedance values of about 2 M
and good isolation of signals from individual neurons. These signals were filtered from 0.3 to 8 kHz, sampled at 20 kHz, digitized, stored to disk, and spike-sorted to yield well-resolved single neurons. Stimuli were presented free field and frontally, approximately 24 in. from the center of the head of each monkey, in a double-walled acoustic isolation chamber while the animal sat in a primate chair without head restraint. Recordings were obtained before behavioral sessions. Animals had been trained to wait throughout these data collection periods and were monitored either live or by a closed-circuit camera. Animals were never observed to be drowsy throughout these periods, and this protocol was surprisingly effective at having the animals remain calm and alert. In practice, gaze was within a 15–20° radius and motion during recordings was minimal. Recordings during substantial head movements were discarded. Spike trains were simultaneously collected from multiple sites in the cortex during the presentation of the stimulus sets. Implant best frequencies spanned the range of frequencies found on the superior temporal gyrus, ranging from 0.1 to 7 kHz (Imig et al. 1977
).
Sound presentation
Sound levels at the position of the external ears were calibrated with a Brüel & Kjær sound level meter. Frequency modulated sweeps were created using Matlab (The MathWorks, Natick, MA), written to compact disk using standard software, and delivered using a McIntosh audio amplifier. The sweeps traversed the frequency range in the upward (50 to 21,000 Hz) or downward (21,000 to 50 Hz) direction. This range covers the frequency response area normally observed for AI neurons (Recanzone et al. 1999
). FM sweeps in New World monkey vocalizations often have bandwidths (BWs) between 5 and 10 kHz, and thus they effectively cover the frequency response of AI neurons in a manner similar to the synthetic sweeps (Atencio et al. 2005
; Bieser 1998
; Godey et al. 2005
). The sweep speeds were logarithmically spaced: 10, 17, 30, 52, and 90 octaves per second for a total of 10 different stimulus conditions. The sweep speeds were chosen so that they covered the normal range of frequency transitions present in owl monkey vocalizations. All sweeps had 100-ms rise/fall cosine-squared ramps, where either 50 or 21,000 Hz was ramped up or down to the starting amplitude at which point the FM began. The durations of the individual FM sweep stimuli were: 1,071, 713, 490, 368, and 297 ms for the 10, 17, 30, 52, and 90 oct/s sweeps, respectively. The interstimulus time between sweep stimuli was 2 s. Each sweep was nonconsecutively presented 16 times in random order.
Trains of tone pips were used to construct spectrotemporal receptive fields (STRFs) as described in previous reports (Blake and Merzenich 2002
; deCharms et al. 1998
). The tone-pip trains were synthesized by randomly selecting frequencies from 84 possible values spanning 7 octaves from 110 to 14,080 Hz in 1/12th-octave steps. For all stimuli, every 1/12th-octave frequency band contained an independent Poisson train of tone pips with the same mean rate of tone-pip presentation. Each individual tone pip was 20 ms in duration, with 5-ms cosine-squared onset/offset ramps. The random tone-pip stimulus was presented continuously for 5 min, with 1 s of silence between stimuli. Tone-pip and FM sweep stimuli were presented at the same sound level, between 50 and 70 dB SPL.
Analysis
FM sweep stimuli responses were analyzed in a manner similar to that in previously published work (Godey et al. 2005
). Responses to FM sweep stimuli were quantified by first computing the poststimulus time histogram (PSTH) for each stimulus condition using 10-ms bins. For each of the 10 PSTHs a Gaussian was fit to the response in the PSTH. Each Gaussian had the form
![]() |
2 test, P < 0.05).
Next, we related the parameters from the Gaussian fits to the corresponding pure-tone responses. We pursued this because as an upward FM sweep travels from 50 to 21,000 Hz it will encounter the low-frequency boundary of a neuron's frequency tuning curve. The same process occurs for downward sweeps, although the high-frequency boundary of the frequency tuning curve is encountered initially in this case. The frequencies at which the sweep encounters the tuning curve are termed trigger frequencies, and may be calculated using the latencies of the PSTH peaks. From these trigger frequencies we can estimate the BW and CF of a particular neuron. The procedure that was used to compute the trigger frequencies, illustrated in Fig. 1, was similar to that in previous studies (Heil 1997
; Heil and Irvine 1998
; Heil et al. 1992b
; Nelken and Versnel 2000
). First, latencies from the Gaussian fits were plotted against the inverse of the sweep speeds. We then computed a best-fit line to these data points. We considered only best-fit lines that showed a significant nonzero slope (P < 0.05, two-sided t-test). For every neuron the correlation between the values was >0.975, and thus this measure of latency is robust to changes in sweep speed because all data points, including those for the two slowest speeds, fall on the diagonal line.
|
We quantified the direction and speed properties of the FM sweep responses by calculating the area under the Gaussian fit to each PSTH. We used this area as the measure of responsiveness for each stimulus. After we calculated the areas under the Gaussian fits, we then computed a direction selectivity index (DSI). The DSI for each site is the sum of all areas for upward direction sweep responses, minus the sum of all the areas calculated for downward sweep responses, divided by the sum of all areas in both
![]() |
![]() |
![]() |
|
|
RESULTS |
|---|
|
FM sweep poststimulus time histograms
The responses of a representative owl monkey AI neuron to FM sweep stimuli are shown in Fig. 2, where column A shows the responses to upward sweeps and column B the responses to downward sweeps. Each poststimulus time histogram (PSTH) represents the response of the cell to 16 presentations of the same FM sweep stimulus. The bar under each PSTH represents the duration of the FM sweep. As can be seen from the PSTHs, this cell responded most vigorously with a prominent activity peak to upward low-speed sweeps, showing a maximal response to the lowest sweep speed presented, 10 oct/s.
|
To quantify the nature of the response profiles, Gaussian curves were fit to each of the 10 PSTHs obtained for each FM sweep stimulus. In Fig. 2 these are seen as the black outlines superimposed on each PSTH. Two main measures were used to quantify the FM response. The first was the latency of the Gaussian curve, which provided an estimate of the time to the peak response, as shown in each PSTH. The second was the area under each Gaussian fit, a metric for response strength. The results of these computations for the responses in Fig. 2, A and B are shown in Fig. 2, C and D. In Fig. 2C plots of latency versus the inverse of FM sweep speed are shown for upward and downward directions. These data contain a significant nonzero correlation (Upward: r = 0.99, P < 0.001; Downward: r = 0.99, P < 0.001, t-test). Ordinary least-squares regression on this data gave the slope of the lines and allowed, using a simple formula (see METHODS), calculation of the frequency at which the neuron began to respond to the FM sweep (Bain and Engelhardt 1992
; Draper and Smith 1981
). For the cell in Fig. 2 the frequency calculated from the upward sweeps is 1,016 Hz and from the downward sweeps this value is 1,201 Hz. Because these two frequencies represent the trigger frequencies of the cell to sweep stimuli, each may be taken as an estimate of the effective low- and high-frequency boundary edges of the cell's frequency–response area. From these values CF is estimated as the average of the low and high frequencies, or 1,109 Hz, which is in excellent agreement with the pure-tone CF estimate of 1,100 Hz. The bandwidth of the cell is estimated as the difference between the two trigger frequencies, which in this case is 185 Hz, whereas the response bandwidth calculated from pure tones was 1,453 Hz.
Plots of normalized area versus sweep speed are shown in Fig. 2D. These plots show response strength versus FM sweep speed. The normalized areas are calculated by dividing all responses, in both sweep directions, by the largest area value, determined from both directions. These curves show a cell's selectivity for a particular sweep speed in either the upward or the downward direction. For a highly selective cell, preferring only upward sweeps at 10 oct/s, it would be expected that the upward area–speed curves would contain a strong peak at 10 oct/s and have values near zero elsewhere, as well as a flat curve near zero for the downward direction response. The responses in Fig. 2D show that this neuron preferred FM sweeps at low sweep speeds in both directions. These curves show that for this neuron slower speeds elicit the strongest response and that in both directions a low-pass area–speed characteristic is present. Also, in comparing the normalized area–speed curves in both directions it is clear that the response in the upward direction is stronger, although the shapes of the two curves are similar (r = 0.98, P < 0.01, t-test). This was a general trend because the correlation between the upward and downward area–speed curves was normally quite high (see following text). The direction selectivity index (DSI) for this unit was calculated to be 0.14, in accord with the greater responses this neuron showed to upward sweeps. Best speed was 27 oct/s and the speed tuning was 0.57, in accord with this site's higher selectivity for sweeps at slower sweep speeds.
Another representative example of responses to FM sweep stimuli is shown in Fig. 3. The greatest response occurs at 30 oct/s for sweeps in the upward direction. In the downward direction the neuron, although not selective for one speed, did respond to the whole range of sweep speeds. Thus this example denotes a neuron that is most selective to upward sweeps and one sweep speed in particular while being responsive, though nonselective, to sweeps in the downward direction.
|
The responses of a downward sweep selective neuron are shown in Fig. 4. The greatest response occurs at 10 oct/s in the downward direction, although responses in the upward direction are comparable. Thus this example shows a neuron that is most selective to slower sweep speeds, with responses that show a similar selectivity for FM sweep speed in the upward and downward directions.
|
Comparison of pure-tone and FM-derived response properties
CF estimates from the FM sweep data and from the STRF are in close register (Fig. 5A). The FM estimate of CF was calculated as the average of the upward and downward trigger frequencies and compared with the CF estimated from the STRF (Linden et al. 2003
; Nelken and Versnel 2000
; Sen et al. 2001
). The data shown in Fig. 5A are from animal OM1 because sufficient STRFs were available only for that subject. Tone-pip CFs plotted against the CF estimate from FM sweep stimuli reveal a significant correlation (OM1: r = 0.86, P < 0.001, n = 68, t-test). Thus FM sweep data accurately predict the CF from pure-tone responses (Heil and Irvine 1998
; Heil et al. 1992b
; Tian and Rauschecker 1994
).
|
In many species the direction selectivity index is significantly correlated with CF. In this study we did not find a significant correlation (r = –0.04, P = 0.38, t-test). The lack of a significant negative correlation between DSI and CF is not in accord with results from other studies (Godey et al. 2005
; Mendelson and Ricketts 2001
; Zhang et al. 2003
). Because CF is mapped topographically in owl monkey AI (Imig et al. 1977
), this leaves an open question as to whether the DSI–CF correlation is present in the owl monkey, or whether our negative finding is a consequence of incomplete sampling of the full frequency range.
Population distributions of FM sweep preferences
Response distributions for the four FM sweep response parameters for both cases (OM1 and OM2) are shown in Fig. 6. The two columns represent the response distributions for each monkey. Direction selectivity: The median DSI across all neurons is 0.04, and that for each individual subject is 0.01 and 0.10, reflecting that, as a population, the neurons are equally responsive to both upward and downward sweeps. Best speed: The overall median is 36 oct/s, with 35 and 37 oct/s the median values for the individual populations. This corresponds to the middle of the range of FM sweeps found in New World monkey twitter vocalizations (Bieser 1998
; Nagarajan et al. 2002
). Sweep speed tuning: The two monkeys show similar distributions, with the median of the distributions at 0.57 and 0.45, leading to an overall value 0.51 that reflects moderate selectivity for specific sweep speeds. Thus whereas most neurons prefer sweeps that are between 25 and 45 oct/s, they are responsive to a broader range of sweep speeds.
|
Spectrotemporal receptive fields
For many recording sites (n = 68) randomly presented tone pips were presented and STRFs were constructed. An STRF can be interpreted as the average stimulus that precedes an action potential. Although additional characterization is required to determine the specificity of the response to the stimulus (Escabí et al. 2005
), the STRF is, in principle, suited to assess the time-dependent nature of a neuron's responses to complex stimulation patterns, including FM components. Examples of representative STRFs are shown in Fig. 7. Figure 7A shows a neuron's STRF with a CF of about 500 Hz. This STRF shows clear excitatory and suppressive regions. Of note is the slanted shape of the suppressive region, which is consistent with a neuron that prefers upward direction FM sweeps because a stimulus that begins at low frequencies and transitions to high frequencies would interact minimally with the suppressive region while interacting maximally with the excitatory portion of the receptive field (Shamma et al. 1993
; Suga 1965a
,b
, 1968
). These STRF features are consistent with the neuron's responses to FM sweep stimuli. The experimentally determined DSI for this neuron was +0.17, indicating upward sweep preferences.
|
To quantitatively assess how well STRFs can capture FM sweep responses, we predicted the responses to the sweep stimuli using approaches that were similar to those in previous reports (Andoni et al. 2007
; Brimijoin and O'Neill 2005
). The predicted response can be obtained by following the procedure illustrated in Fig. 8A. First, the convolution between the stimulus with the STRF, in the time domain, was calculated. This result was then summed across frequency. Last, the response was half-wave rectified to approximate the nonlinearity present in extracellular recordings (Andoni et al. 2007
). The resulting predicted FM response was then analyzed in the same manner as the observed FM PSTHs (shown in Figs. 2–4). This process was performed for each FM sweep stimulus (n = 10), and then used to calculate DSI and best speed values.
|
FM sweep responses derived best speed preferences and the corresponding STRF predictions are shown in Fig. 8C. Again, a significant correlation is seen between observed and predicted results (r = 0.67, P < 0.0001, t-test). However, the predicted speed range differs from the observed range of best speeds present in the FM data. The observed best speeds range from 10 to 60 oct/s, whereas the range of predictions is limited to 35 to 50 oct/s. This bias toward higher speeds in the STRF predictions may reflect systematic influences of the stimulus ensemble used to derive STRFs. For example, the duration of the tone pips was fairly short compared with the duration of the slow FM stimuli. Also, longer-duration adaptation and/or suppression effects might be underestimated by the STRF.
We conclude this study by noting that our result, which showed that spectral bandwidth cannot be adequately estimated from the responses to FM, is similar to that found in the anesthetized squirrel monkey (Atencio et al. 2005
; Godey et al. 2005
). A recognizable difference between awake and anesthetized monkeys, as illustrated in Figs. 2–4, is that awake monkeys have PSTHs that are robust and qualitatively more tonic (Atencio et al. 2005
; Godey et al. 2005
). To examine these response differences in further detail we compared the response durations, across all neurons, in the awake owl monkey in the present study, to those in the anesthetized squirrel monkey, in the study by Godey and colleagues (2005)
. Figure 9A shows the proportion of neuronal sites in the awake owl monkey and anesthetized squirrel monkey as a function of the response duration to 10 oct/s upward and downward sweeps. Upward and downward sweeps were combined because they were not statistically different in either preparation (P > 0.10, rank-sum test). The slowest experimental FM speeds were chosen because their responses are the most likely to reveal tonic components compared with very fast FM sweeps. The figure shows that a higher proportion of sites in the anesthetized squirrel monkey have shorter response durations (Owl Monkey median: 16.9 ms; Squirrel Monkey median: 10.0 ms, P < 0.0001, rank-sum test). The majority of squirrel monkey sites respond with a width of <15 ms, indicating a relatively phasic response pattern. The responses to 17 oct/s sweeps also follow these population profiles (Fig. 9B). Again, the responses in the awake preparation are longer than those in the anesthetized case (Owl Monkey median: 12.0 ms; Squirrel Monkey median: 7.6 ms, P < 0.0001, rank-sum test), although a considerable range of responses is evident. Figure 9, A and B shows the complete population responses. In Fig. 9C the mean response duration of the population data is shown for both preparations. The mean response duration in this plot is taken as the SD of the Gaussian fits to the PSTHs. The data show that at every speed the awake owl monkey responses are longer in duration than those of the anesthetized squirrel monkey (P < 0.001, t-test for each sweep speed). Thus the earlier description of qualitative differences between the two preparations is indeed statistically significant. In the awake owl monkey the higher proportion of longer-duration responses, which vary with changing FM sweep duration, may indicate a higher proportion of tonic, compared with phasic, responses in this preparation.
|
|
|
DISCUSSION |
|---|
|
Comparison to other species and studies
Three key findings have been consistently reported in population studies of FM sweep responses in AI. These are the population distributions of direction selectivity and best speed and the correlation between DSI and CF. The findings are often confounded by the different methodologies used, such as different FM sweep stimuli, different species, and different anesthetics. Despite these variations in published accounts, comparisons may still be made across studies, giving a general picture of FM processing in AI.
In anesthetized squirrel monkey, the global distributions of DSI and best speed in AI are similar to the current values in the awake owl monkey, suggesting that the impact of anesthesia on the coding of dynamic attributes may be much less compared with coding for static stimuli (Godey et al. 2005
; Lu et al. 2001
; Wang et al. 2005
). Distributions of DSI in ferrets are also similar to those in the awake owl monkey, although speed preferences are higher. The discrepancy between these best speed values may be due to species-specific differences in environment or processing, or because some FM sweeps in the ferret study were very fast (>150 oct/s) and thus resembled broadband transients (Kowalski et al. 1995
; Shamma et al. 1993
).
Rat and squirrel monkey AI, unlike that of the owl monkey, show a significant correlation between CF and DSI (Godey et al. 2005
; Mendelson and Ricketts 2001
; Zhang et al. 2003
). FM sweep responses from AI in the cat using logarithmic (Mendelson et al. 1993
) and linear (Heil et al. 1992b
) sweeps revealed a DSI population distribution that was centered about 0, as in the present study. A weak correlation between direction selectivity and CF was also found (Mendelson et al. 1993
). The lack of a CF and DSI correlation in the current study is puzzling but may be a consequence of limited CF sampling with the fixed recording array. Also, in contrast to a previous report in the awake cat, we did not find any case in which a neuron responded exclusively to one sweep direction, or evidence of two temporally separated phasic responses during the presentation of sweep stimuli (Whitfield and Evans 1965
).
In the bat auditory cortex, direction selectivity is more pronounced (Razak and Fuzessery 2006
). The majority of DSI values are >0.2, which implies significantly more selectivity, most likely as a consequence of the increased ethological significance of FM in bat echolocation. Although the selectivity is higher, the underlying mechanisms in auditory cortex, and in other stations, may be similar (Razak and Fuzessery 2006
). The timing between excitatory and inhibitory inputs may explain intra- and extracellular FM selectivity results (Gordon and O'Neill 1998
; Zhang et al. 2003
). Duration tuning may also contribute (Razak and Fuzessery 2006
), although this aspect requires a more thorough evaluation in New World monkeys than is presently available.
The FM sweep processing results just described appear to have behavioral consequences. In a study of the European starling, linear FM sweeps covered either high or low frequencies, but not the entire frequency range tested (Klump 1991
; Phillips et al. 1985
). Upward FM sweeps were most detectable when low frequencies were traversed and downward FM sweeps were most easily discriminated when the sweep traversed high frequencies, consistent with the correlation between FM direction selectivity and characteristic frequency. In the primate, similarly structured studies have yet to be initiated, although from preliminary work similar results would be expected because frequency transitions appear to have behavioral significance for the processing of Japanese macaque coo vocalizations (May et al. 1988
, 1989
). Any future primate work that incorporates these behavioral aspects with physiological recording techniques would be a significant advance.
The methodologies commonly used in FM studies were compared in AI of the barbiturate-anesthetized ferret (Nelken and Versnel 2000
). In the paradigm that most resembles the one used in our report there was a correlation of approximately +1 between the DSI values obtained from logarithmic and linear sweeps. Best speed preferences were also similar, in that neurons that preferred fast/slow linear sweeps also preferred fast/slow logarithmic sweeps. Thus despite different experimental paradigms and anesthetic differences meaningful conclusions may be drawn.
These remarks are qualified because our recordings were predominantly from laminae IIIb and IV. These laminae are the main targets of thalamic input to cortex and therefore display strong responses with latencies shorter than those found in other layers. Although these layers have been used almost exclusively in mapping studies in the auditory cortex very little work has shown how responses in different layers might lead to different functional maps. Few studies have attempted to relate FM sweep responses across different cortical depths (Mendelson and Cynader 1985
; Mendelson et al. 1993
; Shamma et al. 1993
). These studies concluded that best FM speed and direction selectivity are fairly constant in orthogonal cortical penetrations, although they were limited in the number of neurons sampled and in the consistency of penetration depths. FM response properties may vary as a function of cortical depth due to lamina-specific cortico-cortical connections and general receptive field differences (Code and Winer 1985
, 1986
; Winguth and Winer 1986
), although further clarification is needed.
Last, our analysis procedure implicitly uses spike count as the metric for response strength. Other response metrics, such as spike timing, may also convey information. An examination of this issue is beyond the scope of this work, although it points to an additional means that AI neurons may use to transmit FM information.
Similarities and differences of FM sweep response distributions across animals
The results displayed in Fig. 6 show that similar population response patterns are present across individual animals. Our analysis revealed that selectivity for upward or downward sweeps—the speed of these sweeps that gives the best response—and the correlation between upward and downward area–speed curves are similar across animals. For example, it is perhaps surprising that the DSI distributions are centered near zero in each animal. One might hypothesize a bimodal distribution for directional selectivity because owl monkeys must process either upward or downward sweeps that are present in their social calls, although we did not find this result. DSI values were approximately Gaussian distributed and centered at zero. For best speed the distributions showed consistent trends, with maximal responses at speeds between 20 and 60 oct/s, the normal range of FM sweep speeds found in New World monkey twitter vocalizations. These consistent response distributions imply that there is a general range of FM sweep parameters represented in AI. These response distributions do not permit inference to how these sweep parameters are organized spatially across animals in AI, yet they do point to constraints on the characteristics of populations of neurons that are needed for New World monkeys to process complex communication sounds.
STRF implications and linearity of responses
For a significant fraction of neurons in one awake owl monkey we computed STRFs. Although the applicability of the STRF metric to awake recordings has been debated (Barbour and Wang 2003b
), our results are consistent with those from other sensory systems, where this approach has been used successfully in primary cortical areas of awake monkeys (David et al. 2004
; deCharms et al. 1998
; DiCarlo et al. 1998
; Reich et al. 2000
; Rust et al. 2005
). A significant finding of our study was that the STRF accurately predicted FM sweep direction selectivity and best speed in the majority of cases. An STRF is a linear filter for a neuron optimized for the stimulus conditions used to create it (Blake and Merzenich 2002
). It predicts responses less well to stimuli that are less similar to the STRF-generating stimulus (Blake and Merzenich 2002
). We found that the directional selectivity index and speed tuning of neurons were significantly related to those predicted by the STRF in the awake monkey, indicating that spectrotemporal processing, in addition to first-order spectral and temporal processing (Linden et al. 2003
; Miller et al. 2002
; Sen et al. 2001
), may be estimated with some success from tone-pip–derived STRFs in AI.
STRF predictions were less successful for high speed preferences and high direction selectivity. This degradation of the predictions may result from the nature of the stimulus used. By their nature FM sweeps are spectrotemporally correlated stimuli and are fundamentally different from uncorrelated noiselike stimuli. If auditory neurons are activated in nonlinear ways by correlated stimuli then they may respond more robustly to correlated than to uncorrelated sounds (Escabí and Schreiner 2002
). This is expected from cortical neurons because natural sounds contain significant correlations (Attias and Schreiner 1997
; Brillinger and Irizarry 1998
; Voss and Clarke 1975
). Thus a possible conclusion is that highly direction selective neurons perform more nonlinear processing on acoustic sounds. Simply using linear prediction, then, may not be adequate, and further extensions of the model may need to be considered in the future. In fact, this realization has been a motivating factor behind recent calculations of higher-order filters in sensory systems (Sharpee et al. 2004
; Touryan et al. 2002
, 2005
; Yamada and Lewis 1999
).
In summary, the STRF procedure can be useful in estimating many receptive field properties, although limitations to the method must be kept in mind. For instance, even though the STRF is a useful descriptor of cortical processing and is well defined mathematically, only linear predictions may be deduced, although the inclusion of nonlinearities may improve its accuracy (Eggermont 1993
; Marmarelis and Marmarelis 1978
). Also, the construction of the STRF is dependent on the responsiveness of the neuron and the stimulus, so care must be taken when choosing a stimulus set. Given these caveats, the STRF remains a robust, multidimensional metric of cortical processing that should be useful in future awake animal preparations (Elhilali et al. 2004
; Fritz et al. 2005
).
Comparison between and awake and anesthetized New World monkey responses
The same analysis approach used in our previous anesthetized New World monkey study worked without modification in examining the results from the current study (Atencio et al. 2005
; Godey et al. 2005
). For neurons that responded to the FM sweep stimuli the Gaussian fits were well matched to the PSTHs. Additionally, the range of distributions for direction selectivity, best speed, speed tuning, and the correlation coefficient were similar to those found in the anesthetized squirrel monkey (see Table 1). It appears that the general response properties of New World monkeys to these dynamic sounds are broadly similar in spite of anesthetic state, although this does not exclude the possibility that differences may be present when behavioral tasks are used.
|
Best speed and speed tuning parameters also showed subtle differences from our anesthetized study. Median best speed in the present study was about 35.9 oct/s, a slightly lower value than that found in the anesthetized squirrel monkey (41.4 oct/s), although it is closer to the speeds of upward FM sweeps found in natural New World monkey twitter calls, which are typically between 30 and 35 oct/s. Likewise, speed tuning values were higher in the present study than those in the anesthetized squirrel monkey (0.52 vs. 0.43), indicating more selective processing of FM sweep speeds in the awake owl monkey. These differences suggest that further studies need to be completed to understand the factors surrounding the parameter differences that may be due to actual anesthetic effects, species differences, to single- versus multiunit responses, or to inherent variability in the response. Preliminary analysis does show that FM response parameters are different in the squirrel monkey under different anesthetic states (MA Heiser, personal communication).
Comparison of the onset duration in awake owl monkey and anesthetized squirrel monkeys did reveal a tendency toward longer response durations in the awake animals. This is in line with the notion that more sustained response patterns are a common feature in the awake preparation (Lu et al. 2001
). It is intriguing, however, that the global differences in FM processing between the anesthetized squirrel monkey and the awake owl monkey remain quite small, suggesting that dynamic stimulus aspects, as captured by the phasic response components, are less affected by anesthesia than tonic components.
In conclusion, most studies implicitly assume that the responses to FM sweeps are related to spectral receptive fields (sRFs). The rationale is that as an FM sweep traverses different frequencies it will reach the boundary of the receptive field and produce a response. Thus the sRF boundaries and FM sweep trigger frequencies should be related (Heil 1997
). The use of trigger frequencies allowed us to obtain accurate estimates of the CFs compared with those obtained from STRFs. However, bandwidth estimates were accurate in only one quarter of the sites. They were quite poor in the remaining sites, most likely because of inhibitory surround effects (Shamma et al. 1993
). A recent intracellular study suggests that other frequency-dependent asymmetries in sRFs, such as response magnitude and the timing of excitation and inhibition, may also be responsible for the expression of FM preferences (Zhang et al. 2003
).
Because the most significant difference between simple tones and FM sweeps is the stationary nature of tones, this suggests that the inaccuracy in our bandwidth estimates is due to the nonstationary nature of FM sweeps, as well as the partial picture of acoustic processing given by STRFs. Indeed, even though FM sweeps are parameterized sounds, and are less complex than natural vocalizations, they still contain feature correlations. These correlations, as shown in a previous study in the midbrain, may cause auditory neurons to process sounds in ways that are not predictable from the processing of simple, stationary sounds (Escabí and Schreiner 2002
). It follows then that most auditory neurons process spectrally dynamic sounds and spectrally simple sounds in different ways (Schreiner 1998
).
Our results show systematic relationships between tone-evoked receptive field response parameters and FM sweep response parameters. In many cases STRFs accurately predicted direction selectivity and best speed preferences of the neurons under study, although significant deviations were present. Therefore responses in AI to different acoustic properties are not completely described by the STRF prediction procedure, implying that underlying higher-order nonlinear interactions are responsible for their increased direction selectivity. Studies that use a stimulus that dynamically challenges cortical neurons, and can be used to calculate higher-order response properties, are clearly needed for a more complete picture of auditory processing in AI (Barbour and Wang 2003a
; deCharms et al. 1998
; Escabí and Schreiner 2002
; Miller et al. 2002
; Sharpee et al. 2004
).
|
|
GRANTS |
|---|
|
|
|
FOOTNOTES |
|---|
Address for reprint requests and other correspondence: C. A. Atencio, 513 Parnassus Ave., HSE 834, Box 0732, San Francisco, CA 94143-0732 (E-mail: craig{at}phy.ucsf.edu)
|
|
REFERENCES |
|---|
|
Atencio CA, Strata F, Blake DT, Bonham BH, Godey B, Merzenich MM, Schreiner CE, Cheung SW. Representation of frequency modulation in the primary auditory cortex of New World monkeys. In: Auditory Signal Processing: Physiology, Psychoacoustics, and Models, edited by Pressnitzer D, de Cheveigné A, McAdams S, Collet L. New York: Springer, 2005, p. 169–175.
Attias H, Schreiner CE. Temporal low-order statistics of natural sounds. In: Advances in Neural Information Processing Systems, edited by Mozer MC, Jordan MI, Petsche T. Cambridge, MA: MIT Press, 1997, p. 27–33.
Bain LJ, Engelhardt M. Introduction to Probability and Mathematical Statistics (2nd ed.). Boston, MA: PWS-Kent, 1992.
Barbour DL, Wang X. Contrast tuning in auditory cortex. Science 299: 1073–1075, 2003a.
Barbour DL, Wang X. Auditory cortical responses elicited in awake primates by random spectrum stimuli. J Neurosci 23: 7194–7206, 2003b.
Bieser A. Processing of twitter-call fundamental frequencies in insula and auditory cortex of squirrel monkeys. Exp Brain Res 122: 139–148, 1998.[CrossRef][Web of Science][Medline]
Blake DT, Merzenich MM. Changes of AI receptive fields with sound density. J Neurophysiol 88: 3409–3420, 2002.
Brillinger DR, Irizarry RA. An investigation of the second- and higher-order spectra of music. Signal Process 65: 161–179, 1998.[CrossRef]
Brimijoin WO, O'Neill WE. On the prediction of sweep rate and directional selectivity for FM sounds from two-tone interactions in the inferior colliculus. Hear Res 210: 63–79, 2005.[CrossRef][Web of Science][Medline]
Cheung SW, Bedenbaugh PH, Nagarajan SS, Schreiner CE. Functional organization of squirrel monkey primary auditory cortex: responses to pure tones. J Neurophysiol 85: 1732–1749, 2001.
Code RA, Winer JA. Commissural neurons in layer III of cat primary auditory cortex (AI): pyramidal and non-pyramidal cell input. J Comp Neurol 242: 485–510, 1985.[CrossRef][Web of Science][Medline]
Code RA, Winer JA. Columnar organization and reciprocity of commissural connections in cat primary auditory cortex (AI). Hear Res 23: 205–222, 1986.[CrossRef][Web of Science][Medline]
David SV, Vinje WE, Gallant JL. Natural stimulus statistics alter the receptive field structure of v1 neurons. J Neurosci 24: 6991–7006, 2004.
deCharms RC, Blake DT, Merzenich MM. Optimizing sound features for cortical neurons. Science 280: 1439–1443, 1998.
deCharms RC, Blake DT, Merzenich MM. A multielectrode implant device for the cerebral cortex. J Neurosci Methods 93: 27–35, 1999.[CrossRef][Web of Science][Medline]
DiCarlo JJ, Johnson KO, Hsiao SS. Structure of receptive fields in area 3b of primary somatosensory cortex in the alert monkey. J Neurosci 18: 2626–2645, 1998.
Draper NR, Smith H. Applied Regression Analysis (2nd ed.). New York: Wiley, 1981.
Eggermont JJ. Wiener and Volterra analyses applied to the auditory system. Hear Res 66: 177–201, 1993.[CrossRef][Web of Science][Medline]
Elhilali M, Fritz JB, Klein DJ, Simon JZ, Shamma SA. Dynamics of precise spike timing in primary auditory cortex. J Neurosci 24: 1159–1172, 2004.
Escabí MA, Nassiri R, Miller LM, Schreiner CE, Read HL. The contribution of spike threshold to acoustic feature selectivity, spike information content, and information throughput. J Neurosci 25: 9524–9534, 2005.
Escabí MA, Schreiner CE. Nonlinear spectrotemporal sound analysis by neurons in the auditory midbrain. J Neurosci 22: 4114–4131, 2002.
Fitzpatrick KA, Imig TJ. Auditory cortico-cortical connections in the owl monkey. J Comp Neurol 192: 589–610, 1980.[CrossRef][Web of Science][Medline]
Fritz JB, Elhilali M, Shamma SA. Differential dynamic plasticity of A1 receptive fields during multiple spectral tasks. J Neurosci 25: 7623–7635, 2005.
Godey B, Atencio CA, Bonham BH, Schreiner CE, Cheung SW. Functional organization of squirrel monkey primary auditory cortex: responses to frequency-modulation sweeps. J Neurophysiol 94: 1299–1311, 2005.
Gold B, Morgan N. Speech and Audio Signal Processing: Processing and Perception of Speech and Music. New York: Wiley, 2000.
Gordon M, O'Neill WE. Temporal processing across frequency channels by FM selective auditory neurons can account for FM rate selectivity. Hear Res 122: 97–108, 1998.[CrossRef][Web of Science][Medline]
Heil P. Aspects of temporal processing of FM stimuli in primary auditory cortex. Acta Otolaryngol Suppl 532: 99–102, 1997.[Medline]
Heil P, Irvine DR. Functional specialization in auditory cortex: responses to frequency-modulated stimuli in the cat's posterior auditory field. J Neurophysiol 79: 3041–3059, 1998.
Heil P, Langner G, Scheich H. Processing of frequency-modulated stimuli in the chick auditory cortex analogue: evidence for topographic representations and possible mechanisms of rate and directional sensitivity. J Comp Physiol A Sens Neural Behav Physiol 171: 583–600, 1992a.[Medline]
Heil P, Rajan R, Irvine DR. Sensitivity of neurons in cat primary auditory cortex to tones and frequency-modulated stimuli. I. Effects of variation of stimulus parameters. Hear Res 63: 108–134, 1992b.[CrossRef][Web of Science][Medline]
Heil P, Scheich H. Spatial representation of frequency-modulated signals in the tonotopically organized auditory cortex analogue of the chick. J Comp Neurol 322: 548–565, 1992.[CrossRef][Web of Science][Medline]
Imig TJ, Ruggero MA, Kitzes LM, Javel E, Brugge JF. Organization of auditory cortex in the owl monkey (Aotus trivirgatus). J Comp Neurol 171: 111–128, 1977.[CrossRef][Web of Science][Medline]
Jürgens U. The squirrel monkey as an experimental model in the study of cerebral organization of emotional vocal utterances. Eur Arch Psychiatry Neurol Sci 236: 40–43, 1986.[CrossRef][Medline]
Kewley-Port D. Measurement of formant transitions in naturally produced stop consonant-vowel syllables. J Acoust Soc Am 72: 379–389, 1982.[CrossRef][Web of Science][Medline]
Klump GM. Detection of upward and downward frequency sweeps in the European starling (Sturnus vulgaris). Naturwissenschaften 78: 469–471, 1991.[CrossRef][Web of Science]
Kowalski N, Versnel H, Shamma SA. Comparison of responses in the anterior and primary auditory fields of the ferret cortex. J Neurophysiol 73: 1513–1523, 1995.
Lindblom BE, Studdert-Kennedy M. On the role of formant transitions in vowel recognition. J Acoust Soc Am 42: 830–843, 1967.[CrossRef][Web of Science][Medline]
Linden JF, Liu RC, Sahani M, Schreiner CE, Merzenich MM. Spectrotemporal structure of receptive fields in areas AI and AAF of mouse auditory cortex. J Neurophysiol 90: 2660–2675, 2003.
Lu T, Liang L, Wang X. Temporal and rate representations of time-varying signals in the auditory cortex of awake primates. Nat Neurosci 4: 1131–1138, 2001.[CrossRef][Web of Science][Medline]
Marmarelis PZ, Marmarelis VZ. Analysis of Physiological Systems: The White Noise Approach. New York: Plenum, 1978.
May B, Moody DB, Stebbins WC. The significant features of Japanese macaque coo sounds: a psychophysical study. Anim Behav 36: 1432–1444, 1988.[CrossRef][Web of Science]
May B, Moody DB, Stebbins WC. Categorical perception of conspecific communication sounds by Japanese macaques, Macaca fuscata. J Acoust Soc Am 85: 837–847, 1989.
Mendelson JR, Cynader MS. Sensitivity of cat primary auditory cortex (AI) neurons to the direction and rate of frequency modulation. Brain Res 327: 331–335, 1985.[CrossRef][Web of Science][Medline]
Mendelson JR, Grasse KL. A comparison of monaural and binaural responses to frequency modulated (FM) sweeps in cat primary auditory cortex. Exp Brain Res 91: 435–454, 1992.[Web of Science][Medline]
Mendelson JR, Ricketts C. Age-related temporal processing speed deterioration in auditory cortex. Hear Res 158: 84–94, 2001.[CrossRef][Web of Science][Medline]
Mendelson JR, Schreiner CE, Sutter ML, Grasse KL. Functional topography of cat primary auditory cortex: responses to frequency-modulated sweeps. Exp Brain Res 94: 65–87, 1993.[Web of Science][Medline]
Miller LM, Escabí MA, Read HL, Schreiner CE. Spectrotemporal receptive fields in the lemniscal auditory thalamus and cortex. J Neurophysiol 87: 516–527, 2002.
Nagarajan SS, Cheung SW, Bedenbaugh P, Beitel RE, Schreiner CE, Merzenich MM. Representation of spectral and temporal envelope of twitter vocalizations in common marmoset primary auditory cortex. J Neurophysiol 87: 1723–1737, 2002.
Nelken I, Versnel H. Responses to linear and logarithmic frequency-modulated sweeps in ferret primary auditory cortex. Eur J Neurosci 12: 549–562, 2000.[CrossRef][Web of Science][Medline]
O'Shaughnessy D. Speech Communications: Human and Machine (2nd ed.). New York: IEEE Press, 2000.
Phillips DP, Mendelson JR, Cynader MS, Douglas RM. Responses of single neurones in cat auditory cortex to time-varying stimuli: frequency-modulated tones of narrow excursion. Exp Brain Res 58: 443–454, 1985.[Web of Science][Medline]
Pickett JM. The Sounds of Speech Communication: A Primer of Acoustic Phonetics and Speech Perception. Baltimore, MD: University Park Press, 1980.
Poon PW, Yu PP. Spectro-temporal receptive fields of midbrain auditory neurons in the rat obtained with frequency modulated stimulation. Neurosci Lett 289: 9–12, 2000.[CrossRef][Web of Science][Medline]
Razak KA, Fuzessery ZM. Neural mechanisms underlying selectivity for the rate and direction of frequency-modulated sweeps in the auditory cortex of the pallid bat. J Neurophysiol 96: 1303–1319, 2006.
Recanzone GH, Schreiner CE, Sutter ML, Beitel RE, Merzenich MM. Functional organization of spectral receptive fields in the primary auditory cortex of the owl monkey. J Comp Neurol 415: 460–481, 1999.[CrossRef][Web of Science][Medline]
Reich DS, Mechler F, Purpura KP, Victor JD. Interspike intervals, receptive fields, and information encoding in primary visual cortex. J Neurosci 20: 1964–1974, 2000.
Rust NC, Schwartz O, Movshon JA, Simoncelli EP. Spatiotemporal elements of macaque v1 receptive fields. Neuron 46: 945–956, 2005.[CrossRef][Web of Science][Medline]
Rutkowski RG, Shackleton TM, Schnupp JW, Wallace MN, Palmer AR. Spectrotemporal receptive field properties of single units in the primary, dorsocaudal and ventrorostral auditory cortex of the guinea pig. Audiol Neurootol 7: 214–227, 2002.[CrossRef][Medline]
Schreiner CE. Spatial distribution of responses to simple and complex sounds in the primary auditory cortex. Audiol Neurootol 3: 104–122, 1998.[CrossRef][Medline]
Sen K, Theunissen FE, Doupe AJ. Feature analysis of natural sounds in the songbird auditory forebrain. J Neurophysiol 86: 1445–1458, 2001.
Shamma SA, Fleshman JW, Wiser PR, Versnel H. Organization of response areas in ferret primary auditory cortex. J Neurophysiol 69: 367–383, 1993.
Sharpee T, Rust NC, Bialek W. Analyzing neural responses to natural signals: maximally informative dimensions. Neural Comput 16: 223–250, 2004.[CrossRef][Web of Science][Medline]
Stevens KN, Klatt DH. Role of formant transitions in voiced-voiceless distinction for stops. J Acoust Soc Am 55: 653–659, 1974.[CrossRef][Web of Science][Medline]
Suga N. Analysis of frequency-modulated sounds by auditory neurones of echo-locating bats. J Physiol 179: 26–53, 1965a.
Suga N. Responses of cortical auditory neurones to frequency modulated sounds in echo-locating bats. Nature 206: 890–891, 1965b.[CrossRef][Medline]
Suga N. Analysis of frequency-modulated and complex sounds by single auditory neurones of bats. J Physiol 198: 51–80, 1968.
Theunissen FE, Sen K, Doupe AJ. Spectral-temporal receptive fields of nonlinear auditory neurons obtained using natural sounds. J Neurosci 20: 2315–2331, 2000.
Tian B, Rauschecker JP. Processing of frequency-modulated sounds in the cat's anterior auditory field. J Neurophysiol 71: 1959–1975, 1994.
Tian B, Rauschecker JP. Processing of frequency-modulated sounds in the cat's posterior auditory field. J Neurophysiol 79: 2629–2642, 1998.
Touryan J, Felsen G, Dan Y. Spatial structure of complex cell receptive fields measured with natural images. Neuron 45: 781–791, 2005.[CrossRef][Web of Science][Medline]
Touryan J, Lau B, Dan Y. Isolation of relevant visual features from random stimuli for cortical complex cells. J Neurosci 22: 10811–10818, 2002.
Voss RF, Clarke J. 1/f noise in music and speech. Nature 258: 317–318, 1975.[CrossRef][Web of Science]
Wang X. On cortical coding of vocal communication sounds in primates. Proc Natl Acad Sci USA 97: 11843–11849, 2000.
Wang X, Lu T, Snider RK, Liang L. Sustained firing in auditory cortex evoked by preferred stimuli. Nature 435: 341–346, 2005.[CrossRef][Medline]
Wang X, Merzenich MM, Beitel R, Schreiner CE. Representation of a species-specific vocalization in the primary auditory cortex of the common marmoset: temporal and spectral characteristics. J Neurophysiol 74: 2685–2706, 1995.
Whitfield IC, Evans EF. Responses of auditory cortical neurons to stimuli of changing frequency. J Neurophysiol 28: 655–672, 1965.
Winguth SD, Winer JA. Corticocortical connections of cat primary auditory cortex (AI): laminar organization and identification of supragranular neurons projecting to area AII. J Comp Neurol 248: 36–56, 1986.[CrossRef][Web of Science][Medline]
Winter P, Ploog D, Latta J. Vocal repertoire of the squirrel monkey (Saimiri sciureus): its analysis and significance. Exp Brain Res 1: 359–384, 1966.[Web of Science][Medline]
Wright PC. The behavior and ecology of the owl monkey. In: Aotus: The Owl Monkey, edited by Baer JF, Weller RE, Kakoma I. San Diego, CA: Academic Press, 1994, p. 97–112.
Yamada WM, Lewis ER. Predicting the temporal responses of non-phase-locking bullfrog auditory units to complex acoustic waveforms. Hear Res 130: 155–170, 1999.[CrossRef][Web of Science][Medline]
Zhang LI, Tan AY, Schreiner CE, Merzenich MM. Topography and synaptic shaping of direction selectivity in primary auditory cortex. Nature 424: 201–205, 2003.[CrossRef][Medline]
This article has been cited by other articles:
![]() |
I-H. Hsieh and K. Saberi Detection of spatial cues in linear and logarithmic frequency-modulated sweeps Atten Percept Psychophys, November 1, 2009; 71(8): 1876 - 1889. [Abstract] [PDF] |
||||
![]() |
M. N. Insanally, H. Kover, H. Kim, and S. Bao Feature-Dependent Sensitive Periods in the Development of Complex Sound Representation J. Neurosci., April 29, 2009; 29(17): 5456 - 5462. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. A. Brown and R. V. Harrison Responses of Neurons in Chinchilla Auditory Cortex to Frequency-Modulated Tones J Neurophysiol, April 1, 2009; 101(4): 2017 - 2029. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. D. Washington and J. S. Kanwal DSCF Neurons Within the Primary Auditory Cortex of the Mustached Bat Process Frequency Modulations Present Within Social Calls J Neurophysiol, December 1, 2008; 100(6): 3285 - 3304. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Qin, J. Wang, and Y. Sato Heterogeneous Neuronal Responses to Frequency-Modulated Tones in the Primary Auditory Cortex of Awake Cats J Neurophysiol, September 1, 2008; 100(3): 1622 - 1634. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |