|
|
||||||||
Laboratory of Auditory Neurophysiology, Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205
Submitted 10 January 2003; accepted in final form 26 September 2003
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
How much cortical representation of stimuli is based on spike count and how much on spike timing is an ongoing debate in the literature. Temporal codes can range from the timing of individual spikes such as first spike latency (e.g., Heil 1997a
; Phillips and Hall 1990
) to patterns of neural activity, such as stimulus-synchronized discharges (de Ribaupierre et al. 1972
; Goldstein et al. 1959
), and to even more complex temporal patterns. It is important to distinguish between temporal coding of "when" and "what" events occurred (Borst and Theunissen 1999
). The former, such as stimulus-synchronized discharges, provides information that directly correlates to the time a stimulus feature occurred. The latter cortical representation maps a specific stimulus feature to a temporal discharge pattern, such as odors in the olfactory system (Laurent et al. 1996
) or two-dimensional visual patterns in primate inferior temporal cortex (Optican and Richmond 1987
).
Traditionally, temporal discharge patterns have been analyzed using methods such as autocorrelation or synchronization index (based on vector strength). More recently, information theory has been applied to address the issue of rate codes versus temporal codes. In auditory cortex of cat, Furukawa and Middlebrooks (2002
) showed that full-spike patterns contained more information on stimulus locations than did spike counts and that transmitted information was sensitive to disruption of spike timing on a scale of more than
4 ms. In the rat somatosensory cortex, it was shown that spike timing could increase the information content by 44% compared with spike count alone (Panzeri et al. 2001
). The timing of the first spike was the largest contributor to the information content based on spike timing. In the LGN of the visual pathway, it was demonstrated that temporal patterns can contain redundant information, but more often they coded additional visual information (up to 24.9% more of the total information content) (Reinagel and Reid 2000
). Buracas et al. (1998
) showed that high temporal precision rather than low spike count variability accounted for high information rates.
In our recent studies in awake marmosets, we have shown that a population of auditory cortical neurons could code rapid temporal modulations of sounds by the magnitude of their average discharge rates and that slow modulations of sounds were reflected in another population by their stimulus-synchronized temporal discharge patterns (Liang et al. 2002
; Lu et al. 2001a
,b
; Wang et al. 2003
). These auditory cortical responses to time-varying sounds demonstrated the capacity of cortical coding based on average discharge rates. Although there was little evidence of stimulus-synchronized activity in the responses to the rapid temporal modulations in many cortical neurons, the question of whether such neurons had additional information in their non-stimulus-synchronized discharge patterns was not addressed. Vector strength measures used in our previous studies were unable to reveal intrinsic temporal patterns that were not synchronized to the stimulus waveform. Thus it remained unclear if cortical neurons lacking stimulus-synchronized discharges were temporally encoding the stimuli in other ways. In this report, we use statistical and information-theoretic methods (Shannon 1948
) to quantify the responses of auditory cortical neurons to time-varying stimuli. The present study demonstrates that, regarding how auditory cortical neurons encode time-varying signals, it is not simply an issue of whether spike timing is important, but when it is important (i.e., under what stimulus conditions and in which population of cortical neurons).
| METHODS |
|---|
|
|
|---|
Animal preparation and recording procedures were detailed in a previous report (Lu et al. 2001a
) and are only briefly described here. Marmoset monkeys (Callithrix jacchus), a highly vocal New World primate (Wang 2000
), were adapted to sit quietly in a semi-restraint device through an acclimation period of several weeks. The primary auditory cortex (A1) of the marmoset lies largely on the surface of the superior temporal gyrus (Aitkin et al. 1986
), and our procedure approaches A1 laterally through small holes (
1.0 mm) (Lu et al. 2001a
). All necessary precautions and steps were taken to ensure both animal comfort and sterility during all procedures. All experimental procedures were approved by the Institutional Animal Care and Use Committee of the Johns Hopkins University following NIH guidelines.
Single-unit activity was recorded using tungsten microelectrodes of impedance ranging from 2 to 5 M
(A-M Systems). For each cortical site, the electrode was inserted nearly perpendicular, under visual guidance with an operating microscope, to the cortical surface by a micromanipulator (Narishige) and advanced by a hydraulic microdrive (David Kopf Instruments). Action potentials were detected by a template-based spike sorter (MSD, Alpha Omega Engineering) and continuously monitored by the experimenter while data recordings progressed. Typically, one to three well-isolated single neurons were studied in each session. The data presented were mainly obtained from A1 and may include a small number of neurons from immediately adjacent areas. Single neurons were encountered at all cortical layers, but the majority of the recorded data were from upper layers, and to a lesser extent, from middle layers, judging by the depth of recording and response characteristics. The location of A1 was determined by its tonotopic organization, its relationship to the lateral belt area (which was more responsive to noises than tones), and by its response properties (short latencies, and highly responsive to tones).
Acoustic stimuli
All recording sessions were conducted within a double-walled, sound-proof chamber (Industrial Acoustics). Acoustic signals were generated digitally and delivered in free-field through a speaker located
1 m in front of the animal. The characteristic frequencies (CFs) and thresholds were precisely determined by a computer-controlled stimulus protocol. Once a neuron was isolated and its basic response properties determined (e.g., CF, latency and rate-level characteristics), other stimulus protocols were executed in randomized blocks for 5 or 10 repetitions. Spontaneous activity was recorded for 500 ms before and after each trial. Intertrial intervals were
1.5 s long. For all stimuli, sound intensity was typically set at the peak in the neuron's rate-level function if it was nonmonotonic or 10-30 dB above threshold if it was monotonic.
Acoustic stimuli used were click-trains with regular or irregular (Poisson-distributed) interclick intervals (ICIs) (Lu and Wang 2000
; Lu et al. 2001b
), ramped and damped sinusoids (Lu et al. 2001a
; Patterson 1994
), and sinusoidal amplitude-modulated (sAM) tones (Liang et al. 2002
). The click trains with regular ICIs (regular click trains) had fixed intervals ranging from 3 to 100 ms and consisted of either broad-band (rectangular) or narrow-band (Gaussian) clicks. The click-trains with irregular ICIs (irregular click trains) had random intervals based on a modified Poisson distribution (
= 70 ms) with a dead time of 3 ms. Ramped and damped sinusoids were tone carriers that were modulated by an exponentially growing or decaying envelope. The sAM stimuli had modulation frequencies ranging from 2 to 512 Hz. All narrow-band stimuli were centered at the CF of the neuron being studied.
Data analysis
MEAN LATENCY AND COEFFICIENT OF VARIATION. The coefficient of variation, CV =
/µ, of spike time latencies (relative to the click that evoked the response) was used to measure the precision of spike timing. The mean latency, µ, and SD,
, were calculated from a "postclick" histogram of response. This is similar to the period histogram in that the responses to each click were accumulated into a histogram, but the window-sizes are not necessarily equal to the stimulus period. For synchronized neurons, the window sizes were adjusted manually to include all of the onset response (generally, the 1st cluster of spikes responding to the 1st click) for each neuron. At long ICIs (such as 100 ms), the analysis window was shorter than the ICI. This was done because it provides a possible means to compare the onset response with any potential subsequent responses to the clicks, particularly for responses to the midrange and shorter ICIs. The window size was meant to include the cluster of onset responses and was generally larger than the period at these shorter intervals. The CV of the onset response was calculated from the spikes occurring within the first window (which had been adjusted to include the initial cluster of spikes) and did not include later spike times. Window sizes for nonsynchronized neurons were fixed at 50 ms, as many did not show a clear burst of activity in response to stimulus onset, and there was no obvious selection of a window size. Spike count CV was also analyzed, using the same window sizes as with the spike timing analysis.
INFORMATION MEASURES. We measured the information content of the stimuli based on spike counts, inter-spike intervals (ISI), and period histograms. The entropy of the response is given by the formula
![]() |
Stimulus-specific mutual information was computed using the following formula
![]() |
4.25 bits. The response measure for the ISI was based on the time between successive action potentials in a given stimulus presentation trial. The distribution of these ISIs was used as an estimate of their probability of occurrence. Bin widths were 1 ms, and a total of 1,000 bins were used. The total entropy of the response was calculated from the ISI distribution over all stimulus conditions.
Spike count distributions were generated using a sliding window for each trial. The length of the window was chosen to be 100 ms, which was similar to the window size used in another study (Buracas et al. 1998
). The distribution of the spike counts, with a bin width of 1 spike and a total of 100 bins, was used in calculating the entropy.
Given the choices of bin widths, the analysis was not sensitive to the total number of bins. Although the maximum theoretical entropy increases with the total number of bins, unoccupied bins add nothing to the entropy calculation (i.e., zero probability of occurrence, and thus no information). For example, two distributions of 100 and 1,000 bins, with 4 occupied bins each (e.g., with 1 count in each bin) and the others empty, will have the same calculated entropy (2 bits).
Because small sample sizes can bias the mutual information (Panzeri and Treves 1996
), we implemented a bias correction
![]() |
SPIKE-JITTERING MANIPULATIONS. To measure the amount of temporal information in the responses, we compared the entropy of the response with a temporally modified version of the response. These temporal manipulations were accomplished by two methods. The first was to jitter spike times by adding or subtracting a random amount to or from each time. Spike times across all stimulus conditions were jittered independently according to a uniform distribution with a maximum jitter time of (±1/2)
t. The values of
t were systematically varied from 3 to 100 ms. The entropy calculations in this case were based on period histograms using a bin size of 1 ms.
The second method was to randomize spike times according to a uniform distribution to remove any temporal structure in the response and therefore maximize entropy. Because spike times were randomized rather than added or removed, the discharge rates at the trial-by-trial level were unchanged. The entropy of the ISIs of the control was compared with that of the randomized version. This provided a basis to determine if any difference in the temporal structure of the response was due to the stimuli, and a ratio near 1 indicates that the spike time distribution in the control was as nearly uniform as the randomized version. The ratios of the ISI entropy were averaged over the population for each ICI condition. The temporal structure of the spontaneous activity was analyzed similarly.
| RESULTS |
|---|
|
|
|---|
Stimulus-synchronized and nonsynchronized temporal discharge patterns of responses to time-varying signals
Figures 1 and 2 illustrate the two types of temporal discharge patterns that we observed in auditory cortex of awake marmosets: stimulus-synchronized and nonsynchronized responses, using a variety of time-varying sounds. For each example, the dot raster (left), ISI histogram (middle), and conditional ISI plot (right) are shown to highlight temporal structures of the recorded spike trains. These examples illustrate several important properties of cortical responses to time-varying signals as analyzed in the following text.
|
|
Because it is possible that the duration of an ISI can be dependent on that of the immediately preceding ISI and thus temporally structuring a unit's discharge pattern, each ISI in the response is plotted against each subsequent ISI (Fig. 1A, right). The presence of any dependencies in the temporal structure of the response would appear as clusters of data points in the conditional ISI plot as demonstrated by the representative example of a synchronized neuron (Fig. 1A, right). The regularity of the discharges in response at each click train ICI value resulted in a diagonal band of clusters ranging from 30-100 ms that indicated equal values of preceding and subsequent ISIs. Its range reflects the ICI values to which the unit showed synchronized responses (Fig. 1A, left). The off diagonal bands result from synchronized responses that skipped a click because they were not 100% entrained, resulting in ISI values that were multiples of the ICI. In addition, there were also a number of clusters in the conditional ISI plot near 2-3 ms values (Fig. 1A, right). These clusters indicated short bursts of action potentials in response to each click stimulus. Bursting was present in almost all synchronized units we encountered. Both the diagonal bands and the bursting-associated clusters demonstrate the variety of temporal patterning of the neural spike trains in the synchronized response.
In contrast to the synchronized responses, a population of neurons was found to respond in a non-stimulus-synchronized manner such as the example in Fig. 1B. Units with responses of this type will be referred to as nonsynchronized units. The timing of the spikes appeared to be nearly random and unrelated to the timing of individual clicks in the click-train stimulus, and the ISI distribution appeared to be Poisson-like with a dead time (<5 ms) due to the unit's refractory period (Fig. 1B, middle). The conditional ISI plot for a nonsynchronized unit showed no clear clustering (Fig. 1B, right). Most of the ISIs were loosely distributed between 10 and 1,000 ms. In this case, ISIs of any length could be followed by ISIs of any other length, demonstrating the lack of structured temporal relationships between subsequent spikes to the preceding ones. Although not shown in this example, bursting can occur in nonsynchronized neurons as well.
The two types of temporal discharge patterns can also be demonstrated using irregular click trains (Fig. 1, C and D). For the synchronized unit example (Fig. 1C), discharges to multiple repetitions of the stimulus clearly show a repeatable temporal discharge synchronized to the timing structure of the click train (Fig. 1C, left), and clusters of points are visible in the conditional ISI plot (Fig. 1C, right). The nonsynchronized unit by comparison did not show any stimulus-synchronized structures in its responses to the same stimulus (Fig. 1D, left), and no strong ISI dependencies in the form of clustered points were observed (Fig. 1D, right).
Different neural populations, regardless of whether they were synchronized or nonsynchronized, encoded different temporal aspects of the stimuli within a short time window (Fig. 2, A vs. B). An example of a synchronized unit responding to ramped and damped sinusoidal stimuli is shown in Fig. 2A. The clustering in response to 25-ms stimulus intervals is clearly seen in both the ISI histogram (Fig. 2A, middle) and the conditional ISI plot (Fig. 2A, right). This unit responded preferentially to damped rather than ramped sinusoids at several half-lives (Fig. 2A, left). Preference to the ramped sinusoids was also observed in synchronized units (Lu et al. 2001a
). The nonsynchronized unit in Fig. 2B responded to the same set of ramped and damped sinusoids with discharge patterns different from those in Fig. 2A as revealed by the ISI histogram (Fig. 2B, middle) and the conditional ISI plot (Fig. 2B, right). Clusters associated with temporal patterns at the 25-ms stimulus repetition period were absent (Fig. 2B, right). This unit responded preferentially to ramped than damped sinusoids (Fig. 2B, left). Preference to the damped sinusoids was also observed in nonsynchronized units (Lu et al. 2001a
).
Figure 2, C and D, is a further example of the two response properties illustrated in Figs. 1 (A-D) and 2 (A and B), respectively, using sAM stimuli. Responses to each cycle of the sAM stimuli were generally more dispersed than responses to click stimuli (e.g., Fig. 1A). The stimulus-synchronized temporal patterns are nevertheless reflected as clusters in the conditional ISI plot (Fig. 2C, right). As with the other examples, the nonsynchronized responses to the sAM stimuli (Fig. 2D, left) did not show stimulus-related temporal patterns in either the ISI histogram (Fig. 2D, middle) or the conditional ISI plot (Fig. 2D, right). Both of these units, however, showed similar modulation frequency selectivity near 32 Hz based on their mean firing rates.
None of the examples of nonsynchronized units (Figs. 1, B and D, and 2, B and D) appeared to contain obvious temporal structures observable by dot-raster plots, ISI histograms, or conditional ISI plot. However, these analyses can overlook temporal structures quantifiable by more rigorous methods.
Information content based on the period histogram
The importance of spike timing for the representation of the click-train periodicity was analyzed in both synchronized and nonsynchronized units. Spike times were either systematically jittered or completely randomized, and their effects on the entropy, as defined by information theory (Shannon 1948
), of the period histogram of the response was measured. This method quantifies the importance of coding by spike timing within a single stimulus period, e.g., ICI. Entropies that increased due to the manipulation would indicate that a nonrandom temporal pattern was present in a unit's response because a random spike train achieves maximum entropy.
We jittered the spike times of the responses systematically, from 3 to 100 ms, in the same step sizes as with the ICI, where the values given indicate the total width of the jitter. Figure 3A demonstrates the effect on the responses of a synchronized unit. The discharge rate was only slightly affected as some spikes were jittered out of the calculation window (Fig. 3B, left). More importantly, the Rayleigh statistic, a measure of how well the response was stimulus-synchronized, decreased with increasing jitter (Fig. 3B, right).
|
0.8 to the click train of 100-ms ICI. As the jitter was increased, the entropy of the jittered responses increased monotonically from
0.8 (no jitter) to
1.0 (maximum jitter of 100 ms). Similar trends were observed for responses at other ICIs. In general, the entropy of responses increased with increasing ICI at a given jitter value and for responses with no jitter. These results highlight the fact that the spike times of the responses of synchronized units are crucial in representing timing information in the stimulus such as ICIs. The same analysis was applied to the nonsynchronized population (Fig. 3C, gray curves). Unlike the synchronized population, the calculated entropy of the responses to any click train ICI was insensitive to spike-time jittering. It remained consistently near the value of 1. In contrast to the synchronized-population, there were little differences in the entropy values across the ICIs for both "no jitter" and jittered conditions. These results indicate that within stimulus periods, the spike timing of nonsynchronized neurons does not convey the information of click train ICI.
Information content based on ISIs
To further explore the role of spike timing in two populations of cortical neurons, spike times over the duration of the stimulus were uniformly randomized, and the entropies of their ISI distributions were computed. The entropies of the unaltered spike times were then compared with those from the randomized version. The randomness, and therefore the entropy, of a spike train would increase if temporal structures were originally present in the spike train. The example in Fig. 4A shows the effect of this manipulation on a synchronized unit. The discharge rate remained the same (Fig. 4A, middle) but any synchronized responses were eliminated (Fig. 4A, right). The result of the randomization of spike times on the population of synchronized neurons was that they no longer had stimulus-synchronized spike trains (Fig. 4B). Average discharge rates, on the other hand, remained unchanged as compared with those based on unaltered spike trains (Fig. 4B, inset).
|
-, - + -). For comparison, the entropy ratio was also calculated for spontaneous discharges (Fig. 4C, - - -). Note that the entropy ratios for spontaneous discharges had values <1, indicating that spontaneous firing was not distributed completely randomly. In the case of the synchronized population (Fig. 4C, -
-), as the ICI increased, the entropy ratio began to drop near 30 ms (the mean ability of neurons to synchronize reported in Lu et al. 2001 was
21 ms ICI) and became significantly different from the entropy ratio of spontaneous discharges at 50 ms. It continued to be significantly different for all larger values of ICI, having a value nearly 0.85 at 100 ms ICI. This difference in entropy ratios demonstrated that the randomness of the spike trains for the synchronized population was less than that of their randomized versions at ICIs longer than
30 ms. The curve for the average entropy ratio of the nonsynchronized population was fairly flat over the range of ICIs tested (Fig. 4C, - + -). In addition, none of the ratios were significantly different from that of the spontaneous discharges. The fact that the entropy ratio of the nonsynchronized population was consistently higher than that of spontaneous discharges across ICIs indicated that stimulus driven discharges in these neurons were at least as random as those of spontaneous firing.
Spike timing precision in two populations of neurons
We quantified the precision of spike timing of the responses to click train stimuli for each of the two populations of neurons and plotted the histogram of the CV for each population (Fig. 5). Figure 5A contains CVs from synchronized units for both regular and irregular click trains including all ICI conditions (median = 0.40, [25%, 75%] = [0.26, 0.54]). The subset of CVs corresponding to long ICIs (black bars,
100 ms for regular click trains and >150 ms for irregular click trains) were clustered at small values of CV (median = 0.34, [25%, 75%] = [0.20, 0.45]) indicating a smaller dispersion in the spike time when units responded to clicks separated by long ICIs.
|
0.2-0.3 at ICIs greater than
30-40 ms and increased rapidly with decreasing ICIs (<30 ms). By 3 ms, the mean CV was near 0.6. It is noted that there was an inflection point in the curve near 25 ms in the mean CV of the population (Fig. 5C), which was also near the median synchronization boundary (21.3 ms) of the stimulus-synchronized population (Lu et al. 2001b
0.35 at long ICIs and reached near 0.7 at short ICIs. The results shown in Fig. 5, B and C, indicate that, for synchronized units, spike time precision was higher in the onset response and in responses to more sparsely occurring stimulus events. Similar results were observed using both regular and irregular click trains. The same analyses were applied to nonsynchronized units (Fig. 5, right). In contrast to synchronized units (Fig. 5A), there were proportionally fewer values of CVs <0.5 (median = 0.52, [25%, 75%] = [0.41, 0.58]; Fig. 5D), and the subset of CVs resulting from the longer ICIs (black bars) appears to be as widely distributed as the rest. Because many nonsynchronized neurons did not have a precise onset response, the data points for onset CV (median = 0.26, [25%, 75%] = [0.07, 0.50]) were widely scattered while the data points for the total CV (median = 0.53, [25%, 75%] = [0.44, 0.58]) tended to gather near 0.5 (Fig. 5E). There was a small drop in mean CV for the responses to the regular click trains with increasing ICIs (Fig. 5F, black curve), a reflection of these units' nonsynchronized responses. The responses to the irregular click trains showed a wide scatter but no systematic trend (Fig. 5F, gray curve). These results show that, compared with synchronized units, nonsynchronized units had greater spike timing dispersion in both onset (1st 50 ms window) and sustained responses.
Variation in spike count in two populations of neurons
The trial-to-trial variation in spike count was analyzed for both synchronized and nonsynchronized neurons. As with the latency, the histogram for the spike-count-based CVs is shown in Fig. 6A (median = 0.57, [25%, 75%] = [0.39, 0.86]). The subset of CVs calculated from the longer intervals (black bars) was not significantly lower. We also compared the CVs of the onset responses (median = 0.66, [25%, 75%] = [0.47, 0.94]) versus the CVs calculated from the total responses (median = 0.47, [25%, 75%] = [0.35, 0.70]; Fig. 6B). The median for onset CVs was significantly higher than that of the total CVs (P « 0.001, Wilcoxon rank sum), in contrast to the latency analysis (Fig. 5B), in which the opposite was observed. The values of the CVs for spike count were also considerably greater than those for spike latency. Whereas CVs for spike latency were always <1, there was a significant fraction of points for spike count CVs that were >1, indicating there was much more relative dispersion for the spike counts. The CV averaged over the population did not show the dependency on the inter-click interval for both regular click and irregular click trains (Fig. 6C).
|
Information measures based on spike count and ISI
To quantify the capacity of these neurons to transmit information, the entropy and mutual information were calculated based on either trial-by-trial spike count or by ISIs. Figure 7 shows the distribution of entropy and mutual information based on both ISI and spike count measures, pooled from both populations. The ISI-based entropy, HISI, had a median of 6.52 [25%, 75%] = [5.87, 6.90] bits (Fig. 7A), whereas that for the spike-count-based measurement HSpkC was 1.80 [25%, 75%] = [1.22, 2.57] bits (Fig. 7B). The distributions of mutual information had medians of 0.21 [25%, 75%] = [0.08, 0.50] bits for ISI-based calculations, IISI (Fig. 7C), and 0.07 [25%, 75%] = [0.04, 0.12] bits for spike-count-based measures, ISpkC (Fig. 7D). These results indicate that ISI-based codes may potentially encode more information than spike-count-based codes.
|
|
| DISCUSSION |
|---|
|
|
|---|
The current study investigates several issues not addressed in our previous reports (Liang et al. 2002
; Lu et al. 2001a
,b
). The main issue concerns the possibility of temporal structures existing in the responses of the nonsynchronized neurons to time-varying stimuli such as click trains. Although our previous studies have shown that spike rate, but not stimulus synchrony-based measures, contained stimulus-specific information, we did not fully address whether additional information existed in these nonsynchronized neurons in the form of temporal coding not synchronized to stimulus period.
Based on the quantitative analyses reported here, there was not much indication of temporal coding beyond what is achieved with stimulus-synchronized responses in auditory cortical responses to repetitive stimulus events. We found little evidence of cortical coding based on ISIs that was not directly stimulus-synchronized. Unlike the responses for the synchronized neurons, we observed that nonsynchronized rate-response neurons did not show a significant increase in entropy when the spike times were jittered or randomized. In fact, the spike timing in these neurons appeared to be nearly as random as that of spontaneous discharges (Fig. 4C). Although temporal discharge patterns based on ISI could potentially code more information than spike-count-based measures in nonsynchronized neurons, it remains unclear whether and how such information is used by the auditory cortex. The possibility cannot be ruled out that temporal coding, not detectable by our present analyses and data set, may be used by auditory cortical neurons to encode finer temporal or spectral changes in a complex sound or sound sequences than those tested in our studies.
In summary, the experimental observations from our previous studies and theoretical analyses of the present study lead to the following conclusions. First, temporal and rate coding may operate at separate stimulus domains, encode the same stimulus space in parallel by different neuronal populations, or encode different stimulus aspects. Second, spike timing, but not mean firing rate, is crucial to encode stimulus periodicity in "synchronized" neurons. Third, "nonsynchronized" neurons contain no clear spike timing information for encoding stimulus periodicity. These neurons exhibit a transformed representation of the stimulus in the auditory cortex and deserve greater attention in future studies. Fourth, spike timing on the occurrence of stimulus events is more precise at the first event (onset) than at successive ones, and at sparsely occurring stimulus events than at densely occurring stimulus events.
Statistical variations of spike timing and count in onset and sustained responses
Because the reproducibility of responses is an important factor in considering potential neural codes, we studied the relative dispersion of both spike time precision and spike count variation in responses to click trains (Figs. 5 and 6). The results of the CV analysis have implications for both the spike time latency reported in the literature and the inferences that can be drawn from them to extrapolate to whole stimulus encoding. Although it has been shown that the precision of the first spike latency can be well preserved from the auditory nerve to A1 (Heil and Irvine 1997
), the timing precision of subsequent responses to an ongoing stimulus has not been addressed. This may be due, in large part, to the phasic responses that dominate anesthetized auditory cortex, the preparation with which many previous studies of onset responses were conducted (e.g., Heil 1997a
,b
; Phillips and Hall 1990
). Because of this, and the more limited synchronizing capability of neurons in anesthetized than awake auditory cortex (Goldstein et al. 1959
), the issue of temporal precision of auditory cortical neurons during an ongoing stimulus was difficult to address.
For the synchronized neurons, we found that spike time dispersion was generally smaller in onset responses and in the responses to sparsely occurring acoustic events (separated by >30-40 ms) than in the responses to densely occurring acoustic events (Fig. 5, B and C). These findings held for click trains with either static (regular) or dynamic (irregular) ICIs. The latter was used to reduce potential effects of adaptation. As shown in Fig. 5, spike time precision is much reduced in the responses of nonsynchronized neurons. This is consistent with the suggestion that these neurons use their firing rates instead to encode stimulus parameters (Lu et al. 2001b
).
The CV analysis of spike count showed a much larger relative scatter than with the spike latency (Fig. 6). We found that there was no clear correlation between spike count CV and click-train ICI (Fig. 6C) for synchronized neurons. In the case of nonsynchronized neurons, there was indeed a correlation (Fig. 6F), where CV values were smaller with shorter ICIs. These results mirror those for the spike timing (Fig. 5, C and F), where the respective operating domains (long vs. short ICIs) for each type of neuron (synchronized vs. nonsynchronized) have smaller CV values but only for the proper coding method (spike time vs. spike count).
Effects of neuronal adaptation on information measures
Neuronal adaptation (a change in response strength as a function of time) may potentially affect the calculation of the entropy and mutual information that was based on the entire stimulus duration. Adaptation typically occurs prominently within the first 100 ms of the response, usually as part of an onset burst. In Fig. 9, we compared the entropy and mutual information for both spike count and ISI measures between the first and second half of the response, both with (Fig. 9, A and B) and without (Fig. 9, C and D) the discharges within the first 100 ms (onset spikes). The entropies were closely correlated between the two halves of the response (Fig. 9, A and C). The slopes of the linear fits of the data points in Fig. 9, A and C, were 0.84 (all spikes) and 0.95 (without onset spikes) for ISI measures and 0.96 (all spikes) and 1.01 (without onset spikes) for spike count measures. Although the slight shift in Fig. 9A for the spike-count-based entropy (as compared with Fig. 9C) suggests that the onset response can cause the entropy to be overestimated in some cases, there is little evidence of adaptation such that the entropy measures of the second half of the response are much smaller than those of the first half. Figure 9C shows that sustained responses (after onset spikes) had consistent values of entropy throughout stimulus duration, suggesting that adaptation is not a significant factor for the majority of sustained responses that we recorded in awake marmoset auditory cortex. Many examples of highly sustained discharges for various time-varying stimuli have been shown in our prior studies (e.g., Liang et al. 2002
; Lu et al. 2001a
,b
). The comparison of the mutual information measures between the first and second half of responses (Fig. 9, B and D) also showed similar trends as the entropies. The slopes of the linear fits of the data points in Fig. 9, B and D, were 0.68 (all spikes) and 0.74 (without onset spikes) for ISI measures and 0.96 (all spikes) and 0.96 (without onset spikes) for spike count measures, respectively.
|
The values that we found for both the ISI and spike-count entropies (0-8 bits) and mutual information (0-1.6 bits) were comparable to published data from other studies of cortical neurons. The range of information rates reported in the literature for the cortex is large due in part to the different stimulus ensembles used. Neurons in the inferior temporal area in alert macaques were reported to have an information rate of 1 bits/s (Optican and Richmond 1987
). It was shown that V1 neurons can transmit 5-30 bits/s with 25% efficiency in response to pseudorandom stimuli and that spikes with short ISIs carried a disproportionate amount of information (Reich et al. 2000
). The rate of information transmission was reported in MT of alert macaque monkeys to be
1 bit/s for constant stimuli to 12 bits/s and as high as 29 bits/s for variable stimuli (Buracas et al. 1998
). In the orbitofrontal cortex of primates, it was 0.09 bits for olfactory neurons (Rolls et al. 1996
). Primate temporal cortex was shown to have a range of
0.4-1.2 bits (Tovee et al. 1993
). It is possible that variations in the information theoretic analyses are attributed to the cortical area recorded from as well as the stimuli used.
Some previous methods computed the first few principal components and used those values as a basis for the information measure (Optican and Richmond 1987
; Rolls et al. 1996
; Tovee et al. 1993
). Currently, one of the more predominant methods involves representing the number and spacing of spikes with binary strings (de Ruyter van Steveninck et al. 1997
; Reinagel and Reid 2000
; Rieke et al. 1998
). The information content is then calculated from the frequency distribution of the resulting strings. Although calculating the information content based on binary strings could, in theory, account for all possible temporal patterns, it can be difficult to do in practice because of the enormous amount of data required to estimate the frequency of occurrence of particular words. The number of stimulus presentations used in our experimental protocols was too few to adequately and robustly calculate information based on binary strings (de Ruyter van Steveninck et al. 1997
; Rieke et al. 1998
; Warzecha and Egelhaaf 1999
). The approach taken here was better suited to our existing data set.
We observed little evidence indicating any reliable temporal structure (based on ISIs) in the responses to rapidly modulated stimuli, but the information calculations of ISI in the present study did not take into account absolute spike time relative to stimulus onset (i.e., latency). As shown in a somatosensory cortex study, the location of whisker stimulation can be encoded by the latency of the response (Panzeri et al. 2001
). The authors in that study concluded that the temporal information in their data appeared to be based mainly on the response latency. However, because the responses they observed consisted of largely onset discharges recorded in deeply anesthetized animals, their conclusion regarding the role of spike timing cannot be generalized to situations where there is an abundance of sustained discharges, such as the nonsynchronized responses we have observed in the auditory cortex of awake primates.
The contribution of our present study is to demonstrate that coding by cortical neurons is not simply an issue of rate code versus temporal code, but when each of the coding methods is put into use and by which population of cortical neurons.
| ACKNOWLEDGMENTS |
|---|
|
|
|---|
GRANTS
This work was supported by National Institute on Deafness and Other Communication Disorders Grant DC-03180, a Presidential Early Career Award for Scientists and Engineers (X. Wang), and a grant from the National Organization for Hearing Research Foundation.
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: X. Wang, Dept. of Biomedical Engineering, Johns Hopkins University School of Medicine, 720 Rutland Ave., Ross 424, Baltimore, MD 21205 (E-mail: xwang{at}bme.jhu.edu).
| REFERENCES |
|---|
|
|
|---|
Aitkin LM, Merzenich MM, Irvine DR, Clarey JC, and Nelson JE. Frequency representation in auditory cortex of the common marmoset (Callithrix jacchus jacchus). J Comp Neurol 252: 175-185, 1986.[CrossRef][ISI][Medline]
Barbour DB and Wang X. Temporal coherence sensitivity in auditory cortex. J Neurophysiol 88: 2684-2699, 2002.
Bieser A and Müller-Preuss P. Auditory responsive cortex in the squirrel monkey: neural responses to amplitude-modulated sounds. Exp Brain Res 108: 273-284, 1996.[ISI][Medline]
Borst A and Theunissen FE. Information theory and neural coding. Nat Neurosci 2: 947-957, 1999.[CrossRef][ISI][Medline]
Buracas GT, Zador AM, DeWeese MR, and Albright TD. Efficient discrimination of temporal patterns by motion-sensitive neurons in primate visual cortex. Neuron 20: 959-969, 1998.[CrossRef][ISI][Medline]
Creutzfeldt O, Hellweg F-C, and Schreiner C. Thalamocortical transformation of responses to complex auditory stimuli. Exp Brain Res 39: 87-104, 1980.[ISI][Medline]
deCharms RC, Blake DT, and Merzenich MM. Optimizing sound features for cortical neurons. Science 280: 1439-1443, 1998.
de Ribaupierre F, Goldstein MH Jr, and Yeni-Komshian G. Cortical coding of repetitive acoustic pulses. Brain Res 48: 205-225, 1972.[CrossRef][ISI][Medline]
de Ruyter van Steveninck RR, Lewen GD, Strong SP, Koberle R, and Bialek W. Reproducibility and variability in neural spike trains. Science 275: 1805-1808, 1997.
Eggermont JJ. Rate and synchronization measures of periodicity coding in cat primary auditory cortex. Hear Res 56: 153-167, 1991.[CrossRef][ISI][Medline]
Furukawa S and Middlebrooks JC. Cortical representation of auditory space: information-bearing features of spike patterns. J Neurophysiol 87: 1749-1762, 2002.
Goldstein MH Jr, Kiang NY-S, and Brown RM. Response of the auditory cortex to repetitive acoustic stimuli. J Acoust Soc Am 31: 356-364, 1959.[CrossRef]
Heil P. Auditory cortical onset responses revisited. I. First-spike timing. J Neurophysiol 77: 2616-2641, 1997a.
Heil P. Auditory cortical onset responses revisited. II. Response strength. J Neurophysiol 77: 2642-2660, 1997b.
Heil P and Irvine DR. First-spike timing of auditory-nerve fibers and comparison with auditory cortex. J Neurophysiol 78: 2438-2454, 1997.
Johnson DH. The relationship between spike rate and synchrony in responses of auditory-nerve fibers to single tones. J Acoust Soc Am 68: 1115-1122, 1980.[CrossRef][ISI][Medline]
Joris PX and Yin TCT. Responses to amplitude-modulated tones in the auditory nerve of the cat. J Acoust Soc Am 91: 215-232, 1992.[CrossRef][ISI][Medline]
Langner G. Periodicity coding in the auditory system. Hear Res 60: 115-142, 1992.[CrossRef][ISI][Medline]
Laurent G, Wehr M, and Davidowitz H. Temporal representations of odors in an olfactory network. J Neurosci 16: 3837-3847, 1996.
Liang L, Lu T, and Wang X. Neural representations of sinusoidal amplitude and frequency modulations in the primary auditory cortex of awake primates. J Neurophysiol 87: 2237-2261, 2002.
Lu T, Liang L, and Wang X. Neural representations of temporally asymmetric stimuli in the auditory cortex of awake primates. J Neurophysiol 85: 2364-2380, 2001a.
Lu T, Liang L, and Wang X. Temporal and rate representations of time-varying signals in the auditory cortex of awake primates. Nat Neurosci 4: 1131-1138, 2001b.[CrossRef][ISI][Medline]
Lu T and Wang X. Temporal discharge patterns evoked by rapid sequences of wide- and narrowband clicks in the primary auditory cortex of cat. J Neurophysiol 84: 236-246, 2000.<