|
|
||||||||
Department of Biology, Humboldt University, Berlin, Germany
Submitted 24 November 2006; accepted in final form 13 February 2007
|
|
ABSTRACT |
|---|
|
|
|
INTRODUCTION |
|---|
|
An established method of investigating the processing of amplitude modulations (AMs) by auditory systems is based on modulation transfer functions [rate (rMTFs) or temporal (tMTFs)]. These assays reveal the range of AM frequencies within which auditory neurons provide specific information about the stimulus envelope (e.g., Joris and Yin 1992
; Joris et al. 2004
; Krishna and Semple 2000
). However, MTFs do not take into account the reproducibility of responses, which is an important factor that constrains neural encoding schemes. Moreover, these traditional procedures do not always provide an answer to the key question of how well different AM frequencies can indeed be discriminated on the basis of neural signalsthe only information available to the CNS.
Discrimination may be poor, for example, if neurons exhibit low-pass or broad band-pass MTFs. To assess the fidelity of discrimination based on spike train comparison alone, here we apply a method to determine metric distances between spike trains, as introduced by van Rossum (2001)
. The spike train metric method has the advantage of providing information about the temporal precision with which spike trains should be evaluated, to obtain optimal discrimination (Machens et al. 2003
). Thus it can help to infer the type of coding prevalent in neurons. We then compare these results with the description by the traditional tMTF and rMTF functions.
As an experimental system we chose the metathoracic auditory pathway of grasshoppers, a well-established model system for the processing of acoustic patterns (e.g., Gollisch et al. 2002
; Machens et al. 2001a
; Rokem et al. 2006
; Stumpner and Ronacher 1991
, 1994
; Stumpner et al. 1991
). This auditory system is conserved in evolution across species and is able to process a large variety of stimuli with a focus on amplitude modulations (Stumpner et al. 1991
; von Helversen and von Helversen 1997
). It is characterized by a hierarchical organization, consisting of three consecutive levels of processing: receptor neurons, local interneurons, and interneurons ascending to the brain, which are connected to receptors by only two or three steps of synaptic transmission. The classification of acoustic stimuli based on spike train similarities will be compared for these processing levels. This comparison can reveal how the representation of AM stimuli changes and what coding strategies may be used at different stages of the auditory pathway (Hennig et al. 2004
; Joris et al. 2004
; Narayan et al. 2006
; Vogel et al. 2005
).
|
|
METHODS |
|---|
|
All experiments were performed on adult locusts (Locusta migratoria), obtained from a commercial supplier. Legs, wings, head, and gut were removed and the animals were fixed with wax, dorsal side up onto a Peltier element, attached to a holder. The thorax was opened dorsally to expose the auditory nerve and the metathoracic ganglion, which houses the auditory pathway in question, was stabilized by a small NiCr platform. The whole torso was filled with locust Ringer solution (Pearson and Robertson 1981
). During the experiments the preparation was kept at a constant temperature of 30 ± 2°C.
Auditory receptors and auditory interneurons were recorded intracellularly in the frontal auditory neuropil of the metathoracic ganglion. Neural responses were amplified (SEC-05LX; npi electronic, Tamm, Germany) and recorded by a data-acquisition board (PCI-MIO-16E-1; National Instruments, Munich, Germany) with a sampling rate of 20 kHz.
The tips of the glass microelectrodes (borosilicate, GC100F-10; Harvard Apparatus, Edenbridge, UK) were filled with a 35% solution of Lucifer yellow (SigmaAldrich, Taufkirchen, Germany) in 0.5 M LiCl. After completion of the recordings the dye was injected into the recorded cell by applying hyperpolarizing current. After an experiment the thoracic ganglia were removed, fixed in 4% paraformaldehyde, dehydrated, and cleared in methylsalicylate. The stained cells were identified under a fluorescent microscope according to their characteristic morphology (Römer and Marquart 1984
; Stumpner and Ronacher 1991
). The experimental protocol complied with German law governing animal care.
Acoustic stimulation
All experiments were performed in a Faraday cage lined with foam prisms to attenuate echoes. Acoustic stimuli were broadcast by one of two speakers (D-28/2, Dynaudio, Skanderborg, Denmark), situated laterally at a distance of 30 cm from the preparation. The signals were amplified (Mercury 2000; Jensen, Pulheim, Germany) and attenuated (PA5; Tucker-Davis Technologies, Gainesville, FL). All acoustic stimuli were stored digitally and delivered by custom-made software (LabVIEW, National Instruments) using a 100-kHz D/A-conversion (PCI-MIO-16E-1, National Instruments). Sound intensities were calibrated with a Brüel & Kjær (Nærum, Denmark) microphone (1/2-in.), positioned at the site of the preparation, a Brüel & Kjær measuring amplifier (type 2209), and are given in decibels re 2 x 105 Nm2 (dB SPL).
First, acoustic search stimuli were presented to observe changes in firing rate, indicating that a cell could be considered as an auditory neuron. Second, a brief intensityresponse paradigm was run to determine the neuron's response threshold (100-ms pulses filled with white noise, bandwidth 0.530 kHz, 3070 dB in 10-dB steps, each intensity repeated four times). The sound intensity for further test stimuli was then adjusted to about 20 dB above threshold.
The acoustic stimuli used in this study (compare Fig. 1A) were sinusoidal amplitude modulations (10, 20, 40, 83, 125, 167, 250, 333, and 500 Hz; modulation depth 100%) of a broad-band noise carrier (0.530 kHz). Our intention was to sample over a broad range of modulation frequencies. A large proportion of high modulation frequencies was included for two reasons: 1) because there is evidence that modulation frequencies between 167 and 250 Hz are behaviorally relevant (von Helversen and von Helversen 1998
) and 2) 333 and 500 Hz were chosen to ensure that the corner frequencies of the tMTF curves are captured. Stimuli were generated in the LabVIEW programming environment (National Instruments). To ensure that each neuron was in a well-defined adaptation state, each stimulus was preceded by a 200-ms segment of unmodulated noise, followed by a 1-s segment of constant modulation frequency and depth, and another segment of 200 ms of unmodulated noise. The stimuli were repeated four times with 300-ms stimulus intervals. For a few cells another set of stimuli was tested that had a 4-s segment of constant modulation frequency, whereas all other stimulus parameters were identical. These stimuli were repeated only once. No systematic differences were observed in data analysis between the 1- and the 4-s stimuli.
|
Data analysis
From the digitized recordings, the spike times were extracted by means of a voltage threshold criterion. Lists of spike times were derived starting at the onset of the modulated part of the stimuli.
Analysis of spike train distances and classification success
To measure metric distances the spike trains were fragmented into 500-ms segments, such that for each stimulus type eight spike trains were analyzed. To compute the distance between two spike trains, spikes were first convolved with a filter function (Fig. 1B; Machens et al. 2003
; van Rossum 2001
)
![]() |
![]() | (1) |
(Fig. 1B)
![]() | (2) |
can be varied to study the influence of the temporal resolution with which spike trains are evaluated: if
is large it is mainly the difference in average firing rates that contributes to the distance, whereas for small
values the distance depends on differences in spike timing.
For each
value the difference of the convolved spike train traces of interest is computed, squared, and integrated. This integral gives the distance (i.e., the dissimilarity) between the two spike trains (for details see Machens et al. 2003
; van Rossum 2001
). This method yields results equivalent to those of the spike train metric introduced earlier by Victor and Pupura (1997)
(compare also Machens et al. 2001b
, 2003
; Narayan et al. 2005
; Victor 2005
).
Once the distances between all spike trains of a certain cell were computed (see Fig. 2, middle column) a cluster algorithm was applied to quantify the classification success and thus the discrimination performance of the stimuli based on the spike trains. For reliable discrimination, spike trains elicited by the same stimulus should have smaller distances to each other than spike trains elicited by a different stimulus, resulting in a tight clustering of spike trains. Clustering was examined by randomly picking a spike train as a "template" for each stimulus. The remaining spike trains were classified by assigning them to that template to which they exhibited the smallest distance. This procedure was repeated for all possible template permutations, resulting in an average probability of classification, shown in the classification matrices (see Fig. 2, right column). The diagonal elements correspond to correctly classified spike trains, the off-diagonal elements to misclassified spike trains.
|
The classification success over the whole range of stimuli (Fig. 3) was calculated as the mean percentage of correct classifications over all spike trains. Because the classification success depends on the resolution parameter
(see Fig. 6, AC), for each cell its optimal
value was used. These values ranged between 1 and 30 ms.
|
|
|
within an evaluated time window of 500 ms (Fig. 6, AC). From the maxima of these curves the optimal
values were determined (Fig. 6D). If there was a broad constant maximum (e.g., at 100% classification; see Fig. S1A1 ; blue curves for REC and BSN1) the midpoint
value of this maximum was taken. In addition, a range of
for near-optimal classification was determined representing the range of
values yielding a performance 10% below the maximum of the curves (compare Fig. 6, D and E). Modulation transfer functions
The responses to all AM stimuli as a function of modulation frequency were also characterized by rate modulation transfer functions (rMTFs) and temporal modulation transfer functions (tMTFs), based on vector strength, that describe the phase-locking of spikes to the stimulus envelope. Details of these procedures were given previously (for rMTF: Krishna and Semple 2000
; for tMTF: Joris and Yin 1992
; Krishna and Semple 2000
). In short, for the rMTF (compare Fig. 1C) mean spike rates of the modulated part of the stimuli were computed (averaged over a time window of 500 ms). If the spike rate changed in the tested range of MF by
33% (compare also Fig. 8C) a cutoff frequency was derived from the curves. This was the frequency at which the response fell to the minimum spike rate plus 10% of the difference to the maximum spike rate (Fig. 1C, left). If there were two distinct maxima of spike rate, the cutoff frequency according to the lower-frequency maximum was taken [following Krishna and Semple (2000)
; see point W1 in their Fig. 1E].
|
![]() | (3) |
i is the timing of spike i defined as phase of the modulation waveform and n is the number of spikes. The vector strength can vary from a minimum of zero to a maximum of 1, the latter indicating a "perfect" phase locking, i.e., all spikes falling in the same bin. From the resulting tMTF curves corner frequencies were obtained (see Fig. 1C; compare also with the rMTF analysis). The corner frequency was determined as the frequency where the curve dropped to the 90% point from the maximal vector strength (Fig. 1C, right). In all cases, linear interpolation was used if required. Only the cutoff frequencies derived from the rMTF curves and corner frequencies of the tMTF curves are shown here (Fig. 8). |
|
RESULTS |
|---|
|
The central question of this study is how well different AM stimuli can be discriminated by a nervous system, based on the sole information provided by the spike trains from auditory neurons. Discrimination will be impeded by the fact that spike trains exhibit some variability and usually differ even if an identical stimulus is presented several times. Thus a basic requirement for correct classification is that any spike trains elicited by repeated presentations of the same stimulus should be more similar compared with other spike trains elicited by a different stimulus. Our analysis therefore relies on an evaluation of spike train (dis)similarities, as introduced by van Rossum (2001)
(see also Machens et al. 2003
). Spike train distances within a stimulus reflect the response variability, whereas the distances across stimuli yield cues for discrimination, provided that across-stimulus distances are larger than within-stimulus distances.
The procedure shall be exemplified with the data of four neurons that are representative for the first three processing stages in the auditory pathway of the locust: receptor neurons, local interneurons, and ascending interneurons (Fig. 2). On the left, spike trains are shown as raster plots for the nine stimuli, each with eight stimulus presentations, whereas in the middle column the distances of each spike train to all other spike trains are shown in a color-coded two-dimensional matrix. Comparing a spike train to itself yields a distance of zero (as visible in the purple points on the diagonal). The distance matrix of the receptor cell shows distinct squares of dark blue (i.e., small distance values) aligned along the diagonal, up to a modulation frequency of 125 Hz (Fig. 2A). These 8 x 8 squares reveal a property of the spike trains that is crucial for the reliable classification of AM stimulithat is, that the spike trains evoked by repeated presentations of a certain AM frequency should exhibit high similarity. For the receptor neuron the distances to all other spike trains appear to be rather homogeneously distributed over the whole matrix. The matrices of the three other neurons, however, are radically different (Fig. 2, BD). The local bisegmental neuron (BSN1) exhibits a large region of uniformly small metric distances and therefore poor discrimination, for all spike trains produced by frequencies >83 Hz (zone of dark blue on the bottom right). The spike trains produced by 10, 20, and 40 Hz are very different from those produced by all other frequencies. The matrix of the ascending neuron AN3 (Fig. 2C) resembled that of BSN1, although the maximum distances were somewhat smaller. Again, a large homogeneous region of small distance values existed at >40 Hz. The pattern of spike train distances of the ascending neuron AN4 was different from that of the other three cells, exhibiting two symmetry axes. As a result of the almost complete cessation of spiking activity at modulation frequencies between 83 and 167 Hz (see raster plot on the left), all distances in the center of the matrix became very small. The responses to 10- and 20-Hz stimuli exhibited highest distances to these midfrequencies that elicited virtually no spikes, whereas the spike trains exhibited increased similarity to still higher frequencies as a result of the reappearing spike responses (compare Fig. 2D).
Stimulus classification based on spike train distances
As a next step the spike train distances were converted to a classification matrix, expressed as percentage of correctly classified stimuli, to obtain a measure for the discrimination performance (Fig. 2, right column). For each stimulus type one spike train was (randomly) selected, which served as a template for this stimulus. The distance data were then subjected to a cluster algorithm that assigned each spike train to that template to which it exhibited the smallest distance value. This was repeated for different template spike trains, resulting in percentages of correctly classified spike trains (values on the diagonal) and of misclassified spike trains (off-diagonal values in Fig. 2, right column; see METHODS). The receptor responses yielded a perfect assignment of spike trains to the respective stimulus class
125 Hz. As expected from the distance data of the middle column, for the local and ascending neurons, BSN1 and AN3, the range of modulation frequencies allowing for a correct assignment of spike trains was reduced, leading to many misclassifications for frequencies >40 Hz. The classification pattern of the AN4 neuron was different again. Only the two lowest modulation frequencies could be clearly discriminated from all others on the basis of the spike trains, whereas in the midfrequency range no reasonable discrimination was possible because of the almost complete cessation of spiking. The spike trains produced by high modulation frequencies were again similar to those of low frequencies, resulting in an X-like pattern of misclassifications characteristic for this neuron type.
Note that local neurons are restricted to the thoracic ganglia and transmit information from receptors to ascending neurons that send their axons to the brain. Thus in this system receptor neurons and ascending neurons are separated by only two (or perhaps three) steps of synaptic transmission (cf. Boyan 1992
, 1999
). Among ascending neurons one finds a general increase in spike train distances and remarkably complex spike train distance patterns. In addition, the discrimination performance as measured by the probability of correct classification diminishes at higher processing levels.
Spike rates and the speed of discrimination
A basic requirement for on-line processing of acoustic signals is speed. In many situations animals must respond quickly to external stimuli and thus the minimal time needed for correct classification may be crucial. It is therefore a central question how the fidelity of discrimination does depend on the length of the spike train segment evaluated and thus on the number of spikes available. To answer these questions we computed the percentage of correctly classified spike trains as a function of the duration of the evaluation time window (Fig. 3A), or of the number of spikes available (Fig. 3B), for the three processing levels. Two trends were evident in the data: first, most curves showed a rather steep slope during the first 100 to 150 ms and then saturated. Second, the maximum classification performance decreased considerably from receptors to ascending neurons. Most receptors attained a level of correct classifications between 70 and 85% and similarly high values were found for two local neurons with primary-like responses, TN1 and SN1. Note that 80% correct classification indicates a quite remarkable discrimination ability, considering that our stimulus regime contained a large proportion of high-modulation frequencies (four of nine: 167, 250, 333, and 500 Hz). Interestingly, most specimens of another local neuron with phasictonic response pattern, BSN1, attained a much smaller classification success rate and, in this respect, appear more similar to ascending neurons, for which the classification level rarely exceeded 50 to 60%. Among the ascending neurons, AN3 and AN11 types performed best, whereas the highly directional neurons, AN1 and AN2, performed worst.
There was a clear trend that classification performance decreased among ascending neurons. However, this could simply result from their lower average spike rates: Vogel et al. (2005)
reported that ascending neurons exhibit distinctly smaller average spike rates than local neurons or receptor neurons. We controlled for such an influence by converting the x-axis of Fig. 3A into number of spikes evaluated. This led to a somewhat tighter clustering of the curves, in particular for the ascending neurons (Fig. 3B). Again there was a steep increase in classification performance up to five to ten spikes. In other words, a spike train of 150- to 200-ms length, corresponding to a number of five to ten spikes, suffices for a rather reliable classification of AM stimuli, at least for receptors and local neurons. To further extend the evaluation time window >200 ms would yield only little improvement. Remarkably, the difference in classification performance between processing levels cannot be explained by differing spike rates alone: even at the same spike count the probability of correct classification was distinctly worse for the ascending neurons than for receptors and local neurons (Fig. 4, A and B). As a consequence, there was a clear tendency that larger time windows were necessary for ascending neurons to achieve a moderate 40% probability of correct classification (Fig. 4C). We then looked for possible correlations between spike rate and classification success separately for cells of the different processing levels (Fig. 4D). For receptor and local neurons classification performance did not significantly depend on spike rate, whereas for the ascending neurons a weakly significant positive correlation was observed (r = 0.59, P < 0.05).
|
In the previous section (Figs. 3 and 4) we examined the general discrimination performance of neurons of the three processing levels. We now take a more detailed look at the influence that different modulation frequencies exert on the classification success, to enable a direct comparison with the results of the modulation transfer function paradigm. From the classification matrices of all recorded neurons (see Fig. 2, right column) we computed the percentages of correct classifications for each modulation frequency separately (Fig. 5). The spike trains of receptor neurons and the two primary-like local neurons TN1 and SN1 allowed for a perfect or near-perfect classification up to modulation frequencies of 125 Hz, or even 167 Hz (Fig. 5, A and B). The high overall classification performance (Fig. 3) reflects this behavior. For these neurons, the intratype variability was also small. This was different with another local neuron, BSN1. Some specimens of this neuron enabled the identification of stimuli
125 Hz, whereas others approached chance level (11%) already at 83 Hz (see large SDs in Fig. 5B). This trend was even stronger with the ascending neurons. None of these allowed for a perfect classification at 125 Hz and most neurons already showed a steep decline of correct classifications between 20 and 83 Hz (Fig. 5C), which is the main reason for the drop in their overall classification success visible in Figs. 3 and 4. From the individual curves corner frequencies were determined, which are characterized by a value 10% below the maximum of the curves (similar as for the tMTF; see Fig. 1 and METHODS). The data indicate that for the ascending neurons and BSN1 there was considerable variability in the corner frequencies within one cell type (Fig. 5D). The corner frequencies of receptors TN1 and SN1 were tightly clustered at >125 Hz, indicating good discrimination up to nearly 150 Hz. Their distribution of corner frequencies (diamonds and open circles in Fig. 5D) had no overlap to that of ascending neurons, which were all <100 Hz, whereas BSN1 (crosses) occupied an intermediate position.
As a general picture, the responses of ascending neurons enabled a near-perfect discrimination of modulation frequencies only in a stimulus range that was much more restricted to low frequencies than that of receptors, TN1, and SN1. The two neurons that best encode sound direction (AN1, AN2; see Stumpner and Ronacher 1994
) performed worst in the classification task and also for these neurons a very high spike count variability was previously found by Vogel et al. (2005)
. The specimens of the local BSN1 neuron were intermediate between receptor neurons and ascending neurons. The relatively large range of corner frequencies found for BSN1 probably can be explained by the fact that this neuron type exists in two (or perhaps even three) copies on each side (twin neurons; see Römer and Marquart 1984
) that exhibit somewhat different response properties (Stumpner 1989
). Ascending neurons seem to perform a relatively coarse classification, allowing discrimination only between broader subsets of AM frequencies.
Discrimination performance and temporal resolution
The discrimination results mentioned earlier were obtained using for each neuron its specific optimal
-value. This parameter determines the width of the filter function by which each spike was replaced to compute the metric distances (see Fig. 1B and METHODS). Thus
describes the temporal precision with which the spikes are evaluated for the distance metric. A very large
(>200 ms) has the consequence that the timing of spikes is virtually neglected, for which reason the metric distance between spike trains reflects mainly spike count differences (van Rossum 2001
; see also Machens et al. 2003
). Very small
values imply that the vast majority of spikes increase the distance, except for spikes of the two spike trains that are truly coincident. Changing the value of
thus influences the distances between spike trains and, consequently, also the classification of stimuli. How the classification success varied with the resolution parameter
is shown in Fig. 6, AC for the different neuron types recorded in this study. From the curves of individual specimens both the optimal
value (i.e., at the peak of the curves) and the range of
values enabling a near-optimal classification (90%; see inset) were assessed (Fig. 6, D and E). For receptors, the optimal
was <5 ms and the same was true for all local neurons, except one BSN1 (Fig. 6D). In sharp contrast, the ascending neurons (except one AN3) exhibited optimal
5 ms; for AN12, AN4, AN2, and most AN1 the optimal
was >10 ms. As in other respects, the receptors and TN1, SN1 exhibited very similar curves. Although the optimal
values did not differ much between receptors and BSN1, the curves of the former extended even into the submillisecond range (Fig. 6, A and B). This can also be seen in Fig. 6E in which the ranges of
are shown that allowed a near-optimal classification, to avoid local biases in measuring the optima (see inset in Fig. 6).
An obvious objection to our approach is that the optimal resolution, and thus discrimination, may be strongly influenced by the stimulus ensemble used. Because our stimuli consisted of a large proportion of high-modulation frequencies, this might have biased the optima of the resolution parameter
. In the data analysis we checked for such influences of the stimulus space by restricting our stimulus ensemble in three ways: exclusion of the three highest-modulation frequencies; restricting the stimuli to 83, 125, and 167 Hz; and exclusion of the six highest frequencies. The first two restrictions had only minor effects on the optimal
(see Fig. S1 in supplementary information). If the stimulus ensemble was reduced to 1040 Hz, the range of optimal
became broader, extending up to about 50 ms, but still encompassed the
of the nine-stimulus ensemble. However, this extension toward larger
values does not necessarily imply a shift of the optimal
(or a change in the coding principle). Rather it indicates that it was simply not detrimental to increase the
as long as the spikes were distributed in large, well-separated clusters (see raster plots in Fig. 2).
Contribution of spike count and spike timing to discrimination
The metric distance measure offers a significant advantage: the width
of the filter function allows separation of the respective contributions of spike timing and of spike count differences to the discrimination of stimuli. This characteristic is visible in the curves of, say, AN3 or AN4 (Fig. 6). With very small
values (<0.5 ms) the stimulus classification dropped to chance level (11%), whereas for large
(>200 ms) performance was lowroughly 30 or 40%but consistently above chance level. The proportion of correct classification at
= 1,000 ms can be attributed exclusively to differences in spike count, whereas all information about spike times is ignored. Consequently, the difference between the percentage of correct classification at 1,000 ms and the peak value of the curves reflects the increment in classification performance that can be achieved by additionally taking into account temporal information about spike times (see double arrow in inset diagram in Fig. 6). These increment values based on spike times are plotted in Fig. 7A (gray bars), together with the discrimination success that can be achieved on the basis of spike count differences alone (black bars). The highest benefit that can be attributed to temporal information was attained by the primary-like TN1 and SN1 neurons, around 65%, followed by the receptors (roughly 60%). For these neurons the classification success depended nearly exclusively on spike timing, whereas spike count differences contributed only marginally (roughly 5% above chance level, Fig. 7A). The situation was clearly different with ascending neurons. For these the contribution of spike timing to discrimination dropped to nearly 20% and roughly equaled the contribution of spike count differences. The other local neuron, BSN1, resembled ascending neurons, with an increased spike count contribution and a reduced contribution of spike timing (only roughly 30%). The curves of Fig. 6 suggest a twofold trend across processing levels: toward an increased importance of spike count and toward reduced temporal resolution, i.e., larger optimal
values. There was indeed a strong negative correlation between the increment in discrimination attributable to spike times and the value of the optimal
(Fig. 7B; r = 0.85), which suggests a general trend within the auditory pathway. The negative correlation remained significant if receptors and the primary-like TN1, SN1 were excluded from the analysis (r = 0.63, P < 0.01, n = 23).
|
A further goal of the present study was a quantitative comparison of the spike train metric paradigm with the traditional MTF methods. To directly relate results obtained with the metric method and the standard MTF paradigm, we derived corner frequencies from the curves of Fig. 5, which indicate the upper limit of high discrimination performance (see Fig. 5D). These values were then compared with the corner frequencies and cutoff frequencies obtained in the MTF paradigm (see Fig. 1C). The corner frequency derived from the tMTF curves describes the frequency limit up to which a neuron is able to strongly phase lock to the stimulus envelope and may carry information in the timing of spikes. With respect to the rMTF curves, however, the situation is different (compare Fig. 1C, left): the dynamic range of the neuron's spike count extends between corner and cutoff frequency, whereas below the rMTF corner frequency the neuron responds with spike rates similar to those of all frequenciesat least for low-pass filters to which the majority of the neurons in this study belong. Thus there are no spike rate differences available that could be used for discrimination of different frequencies. Therefore in the rMTF paradigm it is the cutoff frequency that describes the most interesting frequency range to which a neuron responds. The comparison between the cutoff frequencies obtained in the rMTF data evaluation and the corner frequencies in the spike train metric paradigm revealed a discrepancy, the latter yielding much smaller values (Fig. 8A; data of receptors, TN1, and SN1 are missing in this figure because no meaningful cutoff frequencies could be determined from their all-pass rMTF curves). This discrepancy was not caused by our choice of cutoff frequencies for the rMTF: no significant correlation was found either if we compared the corner frequencies derived from the rMTF and metric corner frequencies (r = 0.21; P > 0.10; data not shown). In contrast, the comparison between metric and tMTF-based corner frequencies (Fig. 8B) yielded a rather close correspondence between the two measures for most cells (with the possible exception of BSN1). In particular, the data of receptors and TN1, SN1 resulted in very similar corner frequencies in both data evaluations.
Figure 8C compares the temporal precision parameter
(see Fig. 6) with the difference between maximal and minimal spike rates obtained in the rMTF functions as an indicator of the dynamic range (in this graph receptors and local neurons are included with their small rate differences). Taken over all neuron types there existed a strong positive correlation between the size of the optimal
and the maximal spike rate difference in the rMTF curves (compare Fig. 1C; r = 0.73, P < 0.0001). However, excluding the data from receptors and TN1, SN1, thus reducing the data set to ascending neurons and BSN1, the significance disappeared (r = 0.39, P
0.06). A comparison of the optimal
with the corner frequencies derived from the tMTF curves yielded a strong inverse correlation (Fig. 8D; r = 0.77, P < 0.0001). Thus in general the corner frequency obtained from the tMTF is a good predictor for the temporal resolution with which an optimal classification of AM stimuli can be achieved.
What could be the cause of these strong correlations between
and the MTF data? A hint comes from Figs. 7 and 8C. The cluster of small
values at low spike rate differences (Fig. 8C) confirms that any information about the stimulus envelope must be encoded in the timing of spikes if different stimuli do not elicit reliable differences in spike count (see also Fig. 7A). The reverse, however, is not true: a neuron could encode information in the timing of spikes and in spike count differences. However, the shift toward larger
values in ascending neurons (Fig. 8C) and the data of Fig. 7A show that among ascending neurons the spike timing information contributed less. Thus it must be asked why among ascending neurons is the temporal information about spike times obviously not very useful. A likely reason for this is a less-precise locking of spikes to the stimulus envelope (Fig. 2). Again the local BSN1 neuron, with rather small optimal
values in spite of larger spike count differences, seems to occupy an intermediate position between primary-like and ascending neurons.
|
|
DISCUSSION |
|---|
|
Representation of amplitude modulations at consecutive stages of the auditory pathway
In the auditory system of grasshoppers the ascending neurons are connected to receptor neurons by local neurons. Thus we have only few, in some cases only two to three, steps of synaptic processing that convert receptor spike trains to the responses of ascending neurons (Boyan 1992
, 1999
; Marquart 1985
; Stumpner et al. 1991
).
The responses of receptor neurons and the primary-like local neurons (TN1, SN1) allowed a remarkable discrimination, based on the classification of single spike trains, up to stimulus frequencies of about 150 Hz. Very short spike train segments and small numbers of spikes sufficed to attain this high classification success. Importantly, an extension of the evaluation time window >200 ms (corresponding to roughly 10 spikes) would yield only small improvements, thus implying a severe law of diminishing returns (Fig. 3). A short evaluation time window is in accord with behavioral observations in another grasshopper species, which recognizes conspecific signals even if these are shortened to 160- to 250-ms duration, in a one-shot manner (Ronacher and Krahe 1998
). In contrast to receptors and TN1, ascending neurons fail to resolve AMs at much lower frequenciesyielding near-chance performance for frequencies
83 Hz (Fig. 5). Interestingly, another local neuron, BSN1, that shares several features with ascending neurons, was previously shown to be an important distributor of information onto ascending neurons (Boyan 1992
).
Contributions of spike timing and spike count to discrimination
The classification procedure applied here searched for the optimal discrimination by varying the width
of the filter function used to determine spike train (dis)similarities (Figs. 1 and 6; see also Machens et al. 2003
). A specific power of the metric method resides in the possibility of separating the respective contributions of spike count, as determined in a large "evaluation window," and spike timing to the discrimination performance (Fig. 7). The comparison between the classification performance at optimal
and at
= 1,000 ms revealed that the increment in classification success that can be attributed to the timing of spikes became small for most ascending neuronsin sharp contrast to the receptor and primary-like neurons for which the classification success depended essentially on spike timing, whereas spike count differences contributed very little (Fig. 7A). It should be emphasized, however, that the separation of spike count and spike time contributions makes sense only if the optimal
is distinctly <1,000 ms, as was indeed the case in our system.
The optimal
values show with what temporal resolution such an evaluation of spike timing should take place to extract the maximum information about the stimulus envelopes. The very small optimal
values found for receptors and TN1 (Fig. 6D), mostly <3 ms, do not only demonstrate a high temporal precision of their spike trains (see also Rokem et al. 2006
), combined with a high trial-to-trial reliability, but also that the spike trains of these neurons should be processed with outstanding temporal resolution to extract all information. If we interpret the filter function of Fig. 1B as the shape of postsynaptic potentials in downstream neurons, half-widths of 13 ms incur questions about the physiological plausibility of such a hypothetical evaluation procedure. However, this resolution problem is appreciably relieved if we consider the ranges of
yielding 90% performance (Fig. 6E): without substantial loss of classification performance, these
values now extend into the range of 5 to 10 ms and thus are consistent with observed durations of postsynaptic potentials (Boyan 1999
; Franz and Ronacher 2002
; Marquart 1985
). For many ascending neurons the optimal
values were shifted to 1020 ms (see Fig. 6, D and E).
The classification procedure based on the spike train metric may be particularly suited to determine the characteristic timescale of a coding process. As Theunissen and Miller (1995)
pointed out, the duration of the "encoding time window" associated with a neural code must be in accord with the rate of dynamic stimulus variations, and thereby probably reflects an (evolutionary or current) adaptation to the timescales of amplitude fluctuations in naturally relevant sound stimuli. Our resultsin particular the controls with reduced stimulus ensembles shown in the supplementary Fig. S1indicate that the optimum of the resolution parameter
does not depend directly on the stimulus ensemble, but rather seems to reflect a specific property of neurons.
Obviously two factors may influence the observed values of the optimal
in Fig. 6: 1) spike count differences between AM frequencies and 2) the temporal precision of spike trains, which constrains the upper limits of useful temporal resolution. If no spike count differences are available, any correct classification must rely on the timing of spikes, as was found for receptors and primary-like local neurons with their all-pass rate-MTFs. Their
curves extend into the submillisecond range, indicating an outstanding temporal precision of spiking responses (Fig. 6; see also Rokem et al. 2006
). If distinct spike count differences exist, as is true for many ascending neurons, the
curves may extend to larger values to match this aspect (see Fig. 6). However, the shift of the left-hand flank to reduced temporal resolution (and the corresponding shift in optimal
) obviously does reflect a decrease in spike timing precision. The spike raster plots of Fig. 2 do indeed suggest an increase of spike jitter between auditory receptors, BSN1, and the ascending neurons. This impression is supported by quantitative data by Vogel et al. (2005)
. In the locust auditory pathway these authors report a substantial increase of spike train variability between receptors and ascending neurons. Thus the optimal temporal resolution visible in the curves of Fig. 6 appears to reflect a compromise between averaging out spike time jitter on one hand and the loss of information about the stimulus' temporal structure on the other (see also Narayan et al. 2006
).
There are several general trends common to our observations in the auditory pathway of locusts and the processing of amplitude-modulated sound in vertebrates. Peripheral neurons of vertebrates encode envelope information by synchronizing their activity to the modulation waveforms, whereas neurons at higher levels become progressively tuned in terms of average firing rate, rather than synchronizing to the envelope (for a comprehensive review see Joris et al. 2004
)compare with Figs. 6 and 7. Such a transformation from temporal-to-rate coding may be a general feature of auditory systems (Joris et al. 2004
; Lu and Wang 2004
; Narayan et al. 2005
, 2006
; the term "temporal coding" is used here in the sense proposed by Theunissen and Miller 1995
). Also at higher processing stages in vertebrates one observes a decrease in the highest AM frequencies that can influence the neural responses (Joris et al. 2004
; Krishna and Semple 2000
)compare Fig. 5, C and D. However, an important difference has to be mentioned. In mammals filter banks of neurons with band-pass properties were found that are sharply tuned to different modulation frequencies (see Fig. 8 in Langner 1992
). In the locust the neurons' stimulus classification properties exhibit mainly low-pass characteristics and suggest a binary type of discrimination. A particular AM frequency therefore could not be unequivocally inferred from the response of a single cellas is in principle possible in a filter bankbut should be deduced by a comparison of ascending neurons with different corner frequencies. Whether such discriminations are important for these animals and are implemented is not clear. A relatively coarse classification of AM patterns may suffice for the limited behavioral tasks faced by these insects because a main function of their auditory pathway may be to detect similarities or dissimilarities between the AM patterns of incoming acoustic signals and some stored templates.
Comparison with the MTF paradigm
Modulation transfer functions are a classical method to describe the response of a neuron or a sensory system to the temporal pattern of amplitude modulations. However, the MTF curves are based on period histograms that summarize the responses to many stimulus periods. As a consequence, MTF procedures tend to disregard the trial-to-trial variability of neuronal responses, which is, however, crucial for recognition tasks. The spike train metric, in contrast, is susceptible to response variability and thus enables a more realistic approach to sensory processing. Indeed, the results presented in Fig. 8A indicate that the rate-MTF method may miss important aspects relevant for discrimination of different modulation frequencies. In contrast, the corner frequencies derived from tMTF and spike train metrics correlate rather well, but still the tMTF describes the locking of spikes to the envelope and not primarily the discriminability of different modulation frequencies. Here we show for the first time that the discrimination performance of interneurons correlates well with the corner frequency obtained from the tMTF curves (Fig. 8).
Taken together, the classification method based on spike train metrics applied here offers three important advantages, as compared with the tMTF method: 1) it is sensitive to the trial-to-trial variations of spike trains; 2) it yields additional information about the optimal resolution parameter
, as a potential clue to the encoding strategies present at various stages in the nervous system; and 3) it allows one to separate the contributions of spike count differences and spike timing to the principal discriminability of amplitude-modulated acoustic stimuli.
|
|
GRANTS |
|---|
|
|
|
ACKNOWLEDGMENTS |
|---|
|
|
|
FOOTNOTES |
|---|
1 The online version of this article contains supplemental data. ![]()
Address for reprint requests and other correspondence: S. Wohlgemuth, Institut f. Biologie der Humboldt-Universität zu Berlin, Abt. Verhaltensphysiologie, Invalidenstr. 43, D 10099 Berlin (E-mail: Sandra.Wohlgemuth{at}rz.hu-berlin.de)
|
|
REFERENCES |
|---|
|
Boyan GS. Common synaptic drive to segmentally homologous interneurons in the locust. J Comp Neurol 321: 544554, 1992.[CrossRef][Web of Science][Medline]
Boyan GS. Presynaptic contributions to response shape in an auditory neuron of the grasshopper. J Comp Physiol A Sens Neural Behav Physiol 184: 279294, 1999.[CrossRef][Web of Science]
Franz A, Ronacher B. Temperature dependence of time resolution in an insect nervous system. J Comp Physiol A Sens Neural Behav Physiol 188: 261271, 2002.[CrossRef][Web of Science][Medline]
Gerhardt HC, Huber F. Acoustic Communication in Insects and Anurans. Chicago, IL: Chicago Univ. Press, 2002.
Gollisch T, Schütze H, Benda J, Herz AVM. Energy integration describes sound-intensity coding in an insect auditory system. J Neurosci 22: 1043410448, 2002.
Hennig RM, Franz A, Stumpner A. Processing of auditory information in insects. Microscopy Res Tech 63: 351374, 2004.[CrossRef]
Joris PX, Schreiner CE, Rees A. Neural processing of amplitude-modulated sounds. Physiol Rev 84: 541577, 2004.
Joris PX, Yin TCT. Responses to amplitude-modulated tones in the auditory nerve of the cat. J Acoust Soc Am 91: 215232, 1992.[CrossRef][Web of Science][Medline]
Krishna BS, Semple MN. Auditory temporal processing: responses to sinusoidally amplitude-modulated tones in the inferior colliculus. J Neurophysiol 84: 255273, 2000.
Kroodsma DE, Miller EH. Ecology and Evolution of Acoustic Communication in Birds. Ithaca, NY: Cornell Univ. Press, 1996.
Langner G. Periodicity coding in the auditory system. Hear Res 60: 115142, 1992.[CrossRef][Web of Science][Medline]
Lu T, Wang X. Information content of auditory cortical responses to time-varying acoustic stimuli. J Neurophysiol 91: 301313, 2004.
Machens CK, Prinz P, Stemmler MB, Ronacher B, Herz AVM. Discrimination of behaviorally relevant signals by auditory receptor neurons. Neurocomputing 3840: 263268, 2001b.[CrossRef][Web of Science]
Machens CK, Schütze H, Franz A, Stemmler MB, Ronacher B, Herz AVM. Auditory receptor neurons preserve characteristic differences between conspecific communication signals. Nat Neurosci 6: 341342, 2003.[CrossRef][Web of Science][Medline]
Machens CK, Stemmler MB, Prinz P, Krahe R, Ronacher B, Herz AVM. Representation of acoustic communication signals by insect auditory receptor neurons. J Neurosci 21: 32153227, 2001a.
Marquart V. Local interneurons mediating excitation and inhibition onto ascending neurons in the auditory pathway of grasshoppers. Naturwissenschaften 72: 4244, 1985.[CrossRef][Web of Science]
Narayan R, Ergün A, Sen K. Delayed inhibition in cortical receptive fields and the discrimination of complex stimuli. J Neurophysiol 94: 29702975, 2005.
Narayan R, Grana G, Sen K. Distinct timescales in cortical discrimination of natural sounds in songbirds. J Neurophysiol 96: 252258, 2006.
Pearson KG, Robertson RM. Interneurons co-activating hindleg flexor and extensor moto-neurons in the locust. J Comp Physiol 144: 391400, 1981.[CrossRef]
Pollack GS. Neural processing of acoustic signals. In: Comparative Hearing: Insects, edited by Hoy RR, Popper AN, Fay RR. New York: Springer-Verlag, 1998, p. 139196.
Pollack GS. Who, what, where? Recognition and localization of acoustic signals by insects. Curr Opin Neurobiol 10: 763767, 2000.[CrossRef][Web of Science][Medline]
Rokem A, Watzl S, Gollisch T, Stemmler M, Herz AVM, Samengo I. Spike-timing precision underlies the coding efficiency of auditory receptor neurons. J Neurophysiol 95: 25412552, 2006.
Römer H, Marquart V. Morphology and physiology of auditory interneurons in the metathoracic ganglion of the locust. J Comp Physiol A Sens Neural Behav Physiol 155: 249262, 1984.[CrossRef]
Ronacher B, Krahe R. Song recognition in the grasshopper Chorthippus biguttulus is not impaired by shortening song signals: implications for neuronal encoding. J Comp Physiol A Sens Neural Behav Physiol 183: 729735, 1998.[CrossRef]
Rosen S. Temporal information in speech: acoustic, auditory and linguistic aspects. Philos Trans R Soc Lond B Biol Sci 336: 367373, 1992.[Web of Science][Medline]
Shannon RV, Zeng F-G, Kamath V, Wygonski J, Ekelid M. Speech recognition with primarily temporal cues. Science 270: 303304, 1995.
Smith ZM, Delgutte B, Oxenham AJ. Chimaeric sounds reveal dichotomies in auditory perception. Nature 416: 8790, 2002.[CrossRef][Medline]
Stumpner A. Physiological variability of auditory neurons in a grasshopper. Naturwissenschaften 76: 427429, 1989.[CrossRef][Web of Science]
Stumpner A, Ronacher B. Auditory interneurones in the metathoracic ganglion of the grasshopper Chorthippus biguttulus 1. Morphological and physiological characterization. J Exp Biol 158: 391410, 1991.
Stumpner A, Ronacher B. Neurophysiological aspects of song pattern recognition and sound localization in grasshoppers. Am Zool 34: 696705, 1994.[Web of Science]
Stumpner A, Ronacher B, von Helversen O.Auditory interneurones in the metathoracic ganglion of the grasshopper Chorthippus biguttulus 2. Processing of temporal patterns of the song of the male. J Exp Biol 158: 411430, 1991.
Theunissen F, Miller JP. Temporal encoding in nervous systems: a rigorous definition. J Comput Neurosci 2: 149162, 1995.[CrossRef][Web of Science][Medline]
van Rossum M. A novel spike distance. Neural Comput 13: 751763, 2001.[CrossRef][Web of Science][Medline]
Victor JD. Spike train metrics. Curr Opin Neurobiol 15: 585592, 2005.[CrossRef][Web of Science][Medline]
Victor JD, Purpura KP. Metric-space analysis of spike trains: theory, algorithms and application. Netw Comput Neural Syst 8: 127164, 1997.[CrossRef]
Vogel A, Hennig RM, Ronacher B. Increase of neuronal response variability at higher processing levels as revealed by simultaneous recordings. J Neurophysiol 93: 35483559, 2005.
von Helversen D, von Helversen O. Recognition of sex in the acoustic communication of the grasshopper Chorthippus biguttulus (Orthoptera, Acrididae). J Comp Physiol A Sens Neural Behav Physiol 180: 373386, 1997.[CrossRef]
von Helversen D, von Helversen O. Acoustic pattern recognition in a grasshopper: processing in the time or frequency domain. Biol Cybern 79: 467476, 1998.[CrossRef][Web of Science]
Woolley SMN, Fremouw TE, Hsu A, Theunissen FE. Tuning for spectro-temporal modulations as a mechanism for auditory discrimination of natural sounds. Nat Neurosci 8: 13711379, 2005.[CrossRef][Web of Science][Medline]
Woolley SMN, Gill PR, Theunissen FE. Stimulus-dependent auditory tuning results in synchronous population coding of vocalizations in the songbird midbrain. J Neurosci 26: 24992512, 2006.
This article has been cited by other articles:
![]() |
F. Creutzig, S. Wohlgemuth, A. Stumpner, J. Benda, B. Ronacher, and A. V. M. Herz Timescale-Invariant Representation of Acoustic Communication Signals by a Bursting Neuron J. Neurosci., February 25, 2009; 29(8): 2575 - 2580. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. J. Hildebrandt, J. Benda, and R. M. Hennig The Origin of Adaptation in the Auditory Pathway of Locusts Is Specific to Cell Type and Function J. Neurosci., February 25, 2009; 29(8): 2626 - 2636. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Neuhofer, S. Wohlgemuth, A. Stumpner, and B. Ronacher Evolutionarily conserved coding properties of auditory neurons across grasshopper species Proc R Soc B, September 7, 2008; 275(1646): 1965 - 1974. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. J. Malone, B. H. Scott, and M. N. Semple Dynamic Amplitude Coding in the Auditory Cortex of Awake Rhesus Macaques J Neurophysiol, September 1, 2007; 98(3): 1451 - 1474. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |