|
|
||||||||
The Journal of Neurophysiology Vol. 86 No. 6 December 2001, pp. 2789-2806
Copyright ©2001 by the American Physiological Society
1Keck Center for Integrative Neuroscience and Department of Physiology, 2Sloan-Swartz Center for Theoretical Neurobiology, and 3Department of Otolaryngology, University of California, San Francisco, California 94143-0444
| |
ABSTRACT |
|---|
|
|
|---|
Liu, Robert C., Svilen Tzonev, Sergei Rebrik, and Kenneth D. Miller. Variability and Information in a Neural Code of the Cat Lateral Geniculate Nucleus. J. Neurophysiol. 86: 2789-2806, 2001. A central theme in neural coding concerns the role of response variability and noise in determining the information transmission of neurons. This issue was investigated in single cells of the lateral geniculate nucleus of barbiturate-anesthetized cats by quantifying the degree of precision in and the information transmission properties of individual spike train responses to full field, binary (bright or dark), flashing stimuli. We found that neuronal responses could be highly reproducible in their spike timing (~1-2 ms standard deviation) and spike count (~0.3 ratio of variance/mean, compared with 1.0 expected for a Poisson process). This degree of precision only became apparent when an adequate length of the stimulus sequence was specified to determine the neural response, emphasizing that the variables relevant to a cell's response must be controlled to observe the cell's intrinsic response precision. Responses could carry as much as 3.5 bits/spike of information about the stimulus, a rate that was within a factor of two of the limit the spike train could transmit. Moreover, there appeared to be little sign of redundancy in coding: on average, longer response sequences carried at least as much information about the stimulus as would be obtained by adding together the information carried by shorter response sequences considered independently. There also was no direct evidence found for synergy between response sequences. These results could largely, but not entirely, be explained by a simple model of the response in which one filters the stimulus by the cell's impulse response kernel, thresholds the result at a fairly high level, and incorporates a postspike refractory period.
| |
INTRODUCTION |
|---|
|
|
|---|
To understand the coding of information by neurons, it is important to quantify the variability in their responses. When this variability is driven by changes in the stimulus, the neurons can use this to distinguish between stimuli. On the other hand, when this variability occurs in repeated responses to the same stimulus, it acts as noise that reduces the neurons' potential capacity to code information.
The study of neuronal variability has recently seen a rebirth of
interest in association with the renewed use of information-theoretic techniques for analyzing neural coding (Bair 1999
;
Borst and Theunissen 1999
; Buracas and Albright
1999
; de Ruyter van Steveninck et al. 1997
;
Meister and Berry 1999
; Rieke et al.
1997
; Victor 1999
). In the visual system, the
precision of spike times and counts has been investigated in several
neural areas, although only a few have looked at the lateral geniculate
nucleus (LGN) (Guido and Sherman 1998
; Hartveit
and Heggelund 1994
; Kara et al. 2000
; Keat et al. 2001
; Reich et al. 1997
;
Reinagel and Reid 2000
; Sestokas and Lehmkuhle
1988
). In this paper, we further explore the degree of
precision found in LGN neurons of barbiturate-anesthetized cat by
examining both spike count and timing measures. We go on to quantify
the amount of information transmitted by neurons about the stimulus and
to determine the degree to which models of response based on linear
integration of inputs can account for the observed precision.
A unique feature of the present approach is that we closely examined
the dependence of neuronal variability on the degree of specification
of the stimulus. To do this, we employed a pseudorandom binary stimulus
known as an M-sequence (Sutter 1992
). We focused only on
characterizing the neurons' response to temporally varying stimuli by
showing full-field bright and dark frames, ignoring the center-surround
spatial structure of LGN neurons. M-sequences provide a statistically
efficient and convenient method for analyzing responses because they
have the nice property that every sequence of bright and dark frames of
a given length (up to some limit) is repeated the same number of times
somewhere throughout the sequence (see METHODS). This
allowed us to simultaneously examine the responses
both the mean
response and the variability in the response
to every
sequence of a given length, giving us a detailed characterization of
the neural code for such sequences. By varying this length, we examined
how much of the stimulus had to be specified to maximize the precision
of a neuron's response: e.g., if the neuron's response was influenced
by the last 10 frames and only 5 frames were specified, then the
response would be averaged over the unspecified frames, causing the
neuron's responses to appear more variable than they would be if the
stimulus were fully specified. The variability remaining when the
stimulus was fully specified reflected the neuron's intrinsic response variability.
It is common to characterize a cell's response by its linear temporal
kernel, which
as computed from an M-sequence stimulus and neglecting
normalization (see METHODS)
is the difference between its
mean response to a single bright frame and its mean response to a
single dark frame. We found that average responses to a single bright
or dark frame within a sequence showed Poisson-like spike count
variability and temporal dispersion over tens of milliseconds, and the
kernel was correspondingly temporally broad. But by specifying more of
the stimulus
e.g., specifying eight consecutive frames
the response
could become far more precise, with sub-Poisson spike count variability
and temporal precision of 1-2 ms. The information conveyed by the
neuron correspondingly increased, containing as much as 3.5 bits/spike
about longer stimulus sequences. We found that this information
depended on the specification of spike times down to 1-ms resolution
and that the information in consecutive spikes showed little redundancy
or synergy. Finally, we determined that the precision obtained when
multiple frames were specified could be largely, but not entirely,
explained if the spike rate arose from a filtering of the stimulus by
the cell's temporal kernel followed by thresholding, along with
imposition of a postspike refractory period.
Some of this work was previously presented in abstract form (Liu
et al. 2000
; Tzonev et al. 1997
).
| |
METHODS |
|---|
|
|
|---|
Experiments
We performed experiments on adult cats under a protocol approved
by the University of California, San Francisco Committee on Animal
Research. Cats were initially anesthetized with isoflurane (1-5%),
and placed on a feedback-controlled heating pad to maintain body
temperature at 37.5-38°C. We established an intravenous line and
thereafter maintained anesthesia via thiopental sodium or pentobarbital
sodium (the latter was given once anesthesia was stable). The heart
rate, respiratory rate, core temperature, O2 saturation, expiratory CO2, and lung pressure
were all continually monitored. After performing a tracheotomy, the
animal was respirated with nitrous oxide in a 1:1 ratio with oxygen. We
performed a craniotomy, and then paralyzed the animal by infusing
gallamine (10 mg · kg
1 · h
1 in lactated dextrose Ringers). The
electroencephalogram (EEG) was subsequently monitored continuously. We
reflected the optic disk onto a white background using a fiber optic
light source, and inserted contact lenses to focus the eyes at a
distance of 35-40 cm.
We recorded extracellularly using tetrodes (Gray et al.
1995
) advanced through a guide tube inserted to within a few
millimeters of the LGN. The LGN was recognized by the small (relative
to surrounding structures) and monocular visual receptive fields, and
by the match of topography across repeated penetrations to published accounts (Sanderson 1971
). The electrodes were
constructed from 13-µm-diam nickel chromium insulated wire (~20
µm including the insulation). The tips were beveled and gold-plated,
and the typical impedance was in the range of 0.8-1.5 M
. Tetrode
signals were amplified and then digitized at 20 or 30 kHz with 12-bit
resolution. The digitized data were continuously streamed to the disk.
To separate signals from different neurons, we sorted based on the spike amplitudes measured at the four tetrode wires. Clustering was
done manually using different two-dimensional projections of the
four-dimensional space.
Stimulus
For visual stimulation, sequences of full-field bright and dark
frames were presented on a computer monitor at the rate of 120 Hz,
yielding a frame duration of tf
8.3 ms. Each frame varied randomly between bright or dark, with a
photopic mean luminance; contrast [measured as (L
D)/(L + D) where L and D
were the luminances of bright and dark frames, respectively] for each
full sequence was chosen from 6, 14, 20, 40, or 80%.
We generated random frames using a binary M-sequence, which is essentially a stream of pseudorandom bits having some special properties (see following text). A bit value of 1 corresponded to a bright frame, and 0 corresponded to a dark frame.
An M-sequence of order n consists of
2n
1 bits. The full sequence can be viewed as
a collage of overlapping k-bit sequences, k
n, drawn from the list of all possible binary
combinations of k bits. For example, for k = 2, the possible binary combinations are: (0) 00, (1) 01, (2) 10, and
(3) 11. Thus a portion of the full sequence consisting of the bits
0110100 can be decomposed as the overlapping combination of the
sequences (1), (3), (2), (1), (2), (0). The same decomposition
procedure can be applied for any k. The M-sequence has the
convenient property that all subsequences of length k
n randomly appear within the full sequence the same
number of times, namely 2n
k occurrences
(except that the all-zero sequence of length k appears 2n
k
1 times). Because of this statistical
regularity of the M- sequence, it is an excellent tool for the
investigation of a cell's neural code.
Analysis
Cells were selected for analysis based on the following
criteria. To ensure single cell isolation, we chose only cells with clearly isolated clusters in the various two-dimensional projections of
the four-electrode amplitude space; clusters with clipped responses due
to amplifier saturation were avoided. To achieve reasonable estimates
of the information rates,
1,000 spikes were required during the whole
stimulus. Finally, only cells with ON or OFF linear temporal kernels (see following text) were studied, since this
formed the basis for the definition of response events. In total, 12 cells (4 ON, 8 OFF) in one cat were studied at
five contrast levels
80% (9 cells), 40% (6 cells), 20% (3 cells),
14% (1 cell), and 6% (2 cells)
yielding a total of 21 trials.
Response events and precision analysis
To study the precision of spikes, we attempted to classify each
individual spike as part of a spike event evoked in response to a
specific sequence of k frames. This was done by applying the
following algorithm, described here for an OFF cell. We
determined the average stimulus before a spike, and defined the cell's
mean conditional latency (conditioned on a spike) as the time to the zero-crossing between peak and trough in the spike-triggered-average stimulus (illustrated in Fig. 1). Then,
as shown in Fig. 2, for each spike in
the train, we looked back in time from the spike by the mean
conditional latency and found the closest OFF transition (bright frame followed by dark frame) within a window of ±1.5 frames;
the spike was assigned to that transition. If there was no such
transition, the spike was unclassified. We characterized sequences by
their length k and the location t of the
transition within the sequence (e.g., k = 8, t = 3 labeled an 8-frame sequence with a transition at
the onset of the 3rd frame
that is, between the 4th and 3rd frames,
where the 1st frame was the latest in time). For a given choice of
k and t, a given transition was uniquely associated with a surrounding sequence, and the spike was assigned to
that sequence. All spikes associated with the same sequence were
labeled as part of the same event. The percentage of total spikes that
were unclassified served as a measure of the level of "spontaneous"
activity that was not driven by transitions.
|
|
Once the events were identified for a given choice of k and
t, the probability that a specific sequence produced an
event was computed by dividing the number of times some spike response (
1 spike) was obtained for that sequence, by the total number of
presentations of that sequence (i.e., 2 × 214
k times). This quantity was called the
event probability.
We assessed the timing precision of the first spike in an event for
each sequence consisting of a specified number of frames, k, with transition location t. A
distribution for the times to the first spike in an event (of 1 or more
spikes) was obtained from the numerous presentations of a particular
k-frame sequence. A jackknife estimate of the standard
deviation of this first-spike time was used as the index of the timing
precision (Thomson and Chave 1991
), and the error was
taken as the square root of its variance. We approximated the overall
first-spike timing jitter for a given k and t by
the median standard deviation across all k-frame sequences
with transition location t. The timing jitter was then
studied as a function of k and t.
To determine whether the timing jitter was correlated with the event
probability, we computed the Spearman rank-order correlation (Press et al. 1992
, p. 639-642) for eight-frame
sequences that had t = 3, the transition position that
generally resulted in the smallest timing jitter. In several cases,
there were sequences with very small event probabilities and hence very
few event responses from which to estimate the timing jitter. This
could result in particularly large or particularly small jitters. To
test whether this may have biased our estimate of the correlation, we
calculated the Spearman rank-order correlation under two conditions:
using all sequences and using only those sequences with event
probabilities above a minimum probability. This minimum probability was
arbitrarily taken to be 1/
We also assessed the spike count precision of the events for each sequence of a specified k and t. In this case, we generated a histogram of the number of spikes in the event responses for each sequence, allowing for the possibility of no spikes. A jacknife estimate of the variance of that distribution was used as the index of that sequence's count precision. The error was again taken as the square root of the variance of this estimate. To summarize the results across all sequences of length k with a given t, the Fano factor (variance divided by the mean) for each sequence was also estimated by jacknife. The median spike count Fano factor was then used to show the dependence of spike count precision on k for a given t.
Information analysis
The information in the spike train about the stimulus was
quantified using the "direct" method (de Ruyter van
Steveninck et al. 1997
; Strong et al. 1998a
,b
).
This method estimates the mutual information between stimulus and
response "directly" from the spike trains without regard to the
details of the stimulus/response relationship and with very few
assumptions about the coding strategy. This method relies on the fact
that the mutual information between the stimulus and response can be
written as the difference of two spike train entropies. First, the
maximum amount of information that a spike train response
can
provide about the stimulus is just given by the entropy of the spike
train itself, H(
). This is estimated from the probability
distribution of spike responses over the course of the whole experiment
without specific knowledge of the stimulus. Second, the information the
spike train carries about the stimulus is reduced from this maximum by
the degree to which there is variability or noise
in the repeated
responses to an identical stimulus, as measured by the spike train
noise entropy, H(
). This is estimated from the
probability distribution of spike responses to multiple, identical
presentations of the same stimulus, averaged over stimuli.
With the M-sequence, responses to the repeated presentations of each
k-frame stimulus sequence were easily obtained. For each occurrence of a specific k-frame sequence, the response
beginning at a delay
(ranging from 0 to 130 ms) relative to the
onset of the initial frame of the sequence was divided into bins of size 
(usually 1 ms) containing the number of spikes in each bin.
These bins were combined to form spike "words" of length T = M
, where M was an integer number
of bins. For example, for M = 3, the joining of three
bins containing 2, 0, and 1 spikes, respectively, would yield the word
201 (note that the absence of spikes in a bin can be informative, and
its contribution was included).
We then computed the entropies for each choice of k, T, and

by building the probability distribution of these words
across the whole experiment for
Hk,
,T(
) and across the
multiple repeats of the ith k-frame stimulus
sequence (i = 1, ... , 2k)
at time-shift
for
Hi,
,k,
,T(
). Note that the location of a transition, t, within the
k-frame sequence was now irrelevant and not specified;
instead all k-frame sequences contributed equally to this
analysis. Both T and 
were varied to obtain estimates
of the entropy on different time scales. For a given T and

, the average information about the k-frame sequence
that began at time
before a response word was then given by
Hk,
,T(
)
Hi,
,k,
,T(
)
i, where
H(
)
i was the average
noise entropy across all k-frame stimulus sequences (i.e.,
average over i). We assigned the information about
k-frame sequences, for the given T and 
, as
the maximum information across
(see following text).
First though, for each combination of T, 
,
k, and
, we corrected for finite-data errors. This was
done by computing the mutual information for different partitions of
the data: the whole data set, and the average over each half of the
set, over each third, and each fourth. This average information was
then plotted as a function of the number of partitions N,
and fit to the functional form, I = I0 + I1/N + I2/N2
(Strong et al. 1998b
).
I0 therefore represented the true
information rate extracted from the limit of infinite data for a given
T, 
, k, and
. Note, however, that when
the amount of the data were too small, even this correction failed.
Empirically, this occurred when the ratio of
I2 to
I0 became large. We used a ratio of
2 × 10
3 as the border between sufficient
and insufficient data and show results only for cases in which data
were sufficient by this criterion. In practice, the corrections for
finite data were typically tiny, and the point of this procedure was
primarily to screen out cases (e.g., too-large k or
too-large T) for which data were insufficient.
Given the corrected information, we assigned the information about
k-frame sequences as follows. For the given k, T,
and 
, we determined the
that maximized the information. The
information, I, was then assigned to be the average
information over the bins within ±4 ms around this maximum. (We chose
this to correspond to about a frame width, so that averaging smoothed
out any frame-related artifacts.) The information rate of the spike
train, in units of bits/time, was I/(M
). We
converted this to units of bits/spike Isp by dividing by the neuron's
average spike rate, r, assessed over the entire
two-M-sequence stimulus: Isp = I/(rM
).
This method worked well only for relatively short response words. Long
response words required long stimulus sequences to minimize the
randomizing effect of different stimulus contexts on early or late
portions of the response word. However, since each sequence repeated
2 × 214
k times, as k
increased, our estimate of the entropies degraded due to sampling
problems. Thus to consider very long response words, we employed a
different strategy: we estimated a lower bound on the
information carried by the spike train about the stimulus by applying
the direct method to the two repeats of the full M-sequence. Assuming
that the only thing in common between the two presentations of the
M-sequence was the stimulus itself and that therefore the noise in the
two cases were uncorrelated, the information that one response
1 carried about the second response
2,
I
,T(
1,
2)
should be a lower bound to the information between either response
and the stimulus
,
I
,T(
,
)
(Strong et al. 1998b
). We took each response to be the
spike train generated by each full M-sequence, minus the first and last
200 ms. We then computed each spike train's entropy,
H
,T(
i),
i = 1, 2, for words of length T, and the
joint entropy,
H
,T(
1,
2),
for the co-occurrence of words in the two spike trains. These were
computed from the probability distributions for words by using
overlapping intervals (incremented by 
, to increase the effective
number of samples). To correct for finite-data errors, data size
scaling was applied in this case directly to the entropy estimations
(rather than to the mutual information as in the data size scaling
described above); an example is shown in Fig.
3A. The mutual information between the two responses was then
|
(1) |

was small. Hence, to summarize
the dependence for a particular bin size, the infinite-word-length limit was taken by obtaining a linear fit to the plots of the (infinite
data limit) entropies versus 1/T, and using the y
intercept as the (infinite word limit) entropy rates in the calculation of the information rate. The fit was performed only over the range of
1/T where sufficient data were available to accurately
estimate the entropy rates, as illustrated in Fig. 3B. In
practice, T's ranged from 8 to 48 ms. Finally, the
information per second from words of spikes was converted into the
information per spike by dividing by the mean spike rate across the
whole experiment.
|
Models
We constructed quasi-linear threshold models of driven LGN
spiking activity to investigate whether the observed precision could be
explained by simple mechanisms. All models convolved the full
M-sequence stimulus, binned at one-sixth the frame period, with the
cell's temporal kernel to generate a firing function, f(t) (linear part). These responses were
thresholded and perhaps squared (nonlinear part) to generate firing
rates r(t), as follows. We defined
r(t) = 
([f(t)
]+)p, where
[x]+ = x, x > 0; = 0, otherwise; p = 1 for a linear function and
p = 2 for a quadratic function; and

was chosen to make the mean of
r(t) equal to the observed mean firing rate. The
value of the threshold
was fit as described in the following text. Finally, spikes were generated as a Poisson process from these rates,
perhaps along with a refractory period, as will be described in the
following text.
The temporal kernel was determined as the spike-triggered-average
stimulus, divided by the autocorrelation (or in Fourier space, the
power spectrum) of the M-sequence stimulus (the power in the M-sequence
at frequency f is proportional to [sin
(f/rf)/f]2,
where rf = 120 Hz is the frame rate). This
division yields the linear filter that, applied to the stimulus, gives
the best estimate of the response in the sense of least mean-square
error (Rieke et al. 1997
). The spike-triggered average
and temporal kernel for one cell can be seen in Fig. 1. The division is
done in Fourier space, where it simplifies to a frequency-by-frequency
division; otherwise it would involve multiplying one matrix by the
inverse of another matrix. However, one does not want to continue
dividing up to arbitrarily high frequencies where the power in the
stimulus approaches zero, as this will just amplify high-frequency
noise. We chose to do the division up to some cutoff frequency, and to set all power above that cutoff frequency to zero. To choose a cutoff
frequency, we tried cutoffs from 75 to 100 Hz in 5-Hz steps. For each
cutoff, we applied the corresponding filter to the M sequence to obtain
the output f(t), converted this to a rate
function r(t) as described in the preceding text
using p = 1, and chose the threshold
as that which
minimized the mean-square error difference between the predicted
Poisson rate function and the eight-frame PSTH for the actual data. We
then chose the cutoff frequency that gave the least mean-square error;
this best cutoff was 90 Hz. This kernel was used subsequently in all
models to draw actual spikes for PSTH comparison (see following text).
The conversion from r(t) to spikes was as
follows. We interpolated r(t) to achieve a
temporal resolution of 1/60 of a frame (the spike-triggered average and
temporal kernel had been computed in bins of 1/6 of a frame or ~1.39
ms). For the simple Poisson case, spikes were then generated in each
time bin
t with probability r(t)
t, using
t = 139 µs. For the case of a Poisson process with a refractory period, a
free firing rate, q (Berry and Meister 1998
),
was generated assuming a specific refractory period, µ, by taking
q(t) = r(t)/[1
r(t)µ]. Spikes were then drawn as in the
Poisson case but using q(t) rather than
r(t). In the case of only an absolute refractory
period, the probability of a spike was set to zero for µ ms after
each spike. We also tried adding an exponential recovery after the
absolute refractory period, setting µ = µabs + µrel, where
µabs was the absolute refractory period and
µrel was the exponential recovery of the
probability from zero up to q(t). This
implementation for a relative refractory period is reasonable when
µrel is smaller than the characteristic time
over which the firing rate remains relatively constant.
For each of the models, an optimal threshold and refractory period(s) (if applicable) were selected simultaneously to minimize the mean-square error between the real data and the model of the segment of the eight-frame PSTHs defined by the 18 ~1.39 ms bins before and the 7 bins after the end of the eight-frame sequence. This was done by trying every threshold from 1 to 5 in steps of 0.2, (if applicable) absolute refractory periods from 1 to 4 ms and relative refractory periods from 0.5 to 4 ms in steps of 0.5 ms for which q(t) remained positive, and then selecting the combination of threshold and refractory periods that gave the least mean-square error. These ranges seem reasonable because in no case was the optimum parameter at an extreme of the range explored for that parameter. The mean firing rate over the whole stimulus in the model was typically matched to within a few percent of the data's mean.
| |
RESULTS |
|---|
|
|
|---|
Full-frame, binary, 14-bit M-sequence stimuli were presented at different contrast levels. In general, this stimulus drove cells in the LGN well. Average spike rates across all cells and stimulus conditions ranged from 4.6 to 25.3 Hz. Neural responses were usually triggered by transitions from either bright to dark frames (OFF cell), or vice versa (ON cell); we referred to two-frame sequences of bright/dark or dark/bright as an OFF or ON transition, respectively. Each cell's polarity was determined by reverse correlating the spike train with the M-sequence stimulus. Figure 1 presents the spike-triggered-average stimulus for one of our good OFF cells (cell 4, 80% contrast) that had a strongly driven response producing nearly 7,000 spikes. We use this cell to illustrate the main results of our analysis. A spike at time 0 for this cell was generally preceded by a transition from bright to dark ~32 ms earlier. This time delay was referred to as the cell's mean conditional latency. Figure 1 also illustrates the cell's temporal kernel (see METHODS), which represents the cell's temporal receptive field and has the same 32 ms mean conditional latency; we will return to this later.
An initial 1,200 frames (10 s) from the beginning of the M- sequence
were presented to adapt the cells to the stimulus ensemble before
showing the M-sequences used in data analysis. After the conditioning,
two repeats of the full M-sequence were displayed without delay. A
total of 2 × 214
k repetitions of
each k-frame sequence (k
14) occurred,
e.g., 128 repeats of each eight-frame sequence. Because of this
convenient property, it was natural to focus on responses to the set of
k-frame sequences for different k.
Mean response: the PSTH matrix
The M-sequence stimulus presented frames of random stimuli in series rather than in isolation. To obtain an average response to a specific stimulus sequence, we extracted the individual spike responses to the multiple presentations of that sequence in the full M sequence. Consider first the case of one-frame stimuli. The average response to single bright or dark frames of stimuli was generated in the form of a matrix of PSTHs (Fig. 4). The shading in each 1-ms bin corresponds to the total number of spikes from all presentations of this sequence at that time relative to the frame onset. Note that there was a nonzero spike rate even at the time origin that was nearly the same for both bright and dark frames. This reflects the fact that at early times, the spikes were responses to earlier frames over which we had averaged. The response to the particular bright or dark frame was most clear at ~32 ms as expected from the cell's mean conditional latency.
|
One advantage of visualizing a PSTH matrix is in the ability to display the neuron's average responses to stimuli more complex than just a single frame, as shown in Fig. 5 for two-frame sequences. This clearly shows that spikes tended to be generated near the mean conditional latency in response to an OFF transition (stimulus 2), whereas spiking was clearly suppressed near the mean conditional latency by an ON transition (stimulus 1). Note that the response to a dark frame (stimulus 0 in Fig. 4) was now broken down according to whether the preceding frame was dark or bright (stimuli 0 and 2, respectively, in Fig. 5).
|
Figure 6 displays the PSTH matrix (with 1-ms time bins) for the response to seven-frame sequences, sorted according to the rightmost two frames, f1 and f2 (we usually numbered frames in a k-frame sequence consecutively as fn, n = 1, ... , k, with f1 the latest in time and fk the earliest). This grouped together all responses to sequences with an OFF transition in the most recent two frames. As expected, a large vertical band of spikes centered at ~32 ms appeared in response to the OFF transition. One striking feature was the slight slant in time of the OFF response band near 32 ms. Qualitatively, for this cell, the time to the first spike was correlated with the amount of time the stimulus had been bright prior to the final transition to dark: the longer this time, the earlier the occurrence of the first spike in the response.
|
Moreover, the spikes in this band were noticeably isolated in time on both sides by regions of virtually no spikes, suggesting that there was a high degree of temporal precision in the response when seven frames of the stimulus were specified. To examine this, each spike should ideally be classified as part of a response to a particular sequence. In the PSTH matrix though, each spike occurred multiple times, each time associated with a different time frame and sequence. Hence, echoes of the main OFF response appeared in the other quadrants of the PSTH matrix where an OFF transition occurred earlier in the sequence.
Event classification
To classify a spike to a unique sequence, a search was performed to find the OFF transition that was most likely to be responsible for a given spike. All spikes classified to the same transition were then grouped together as the spike "event" in response to the sequence containing that transition (see METHODS). In practice, this algorithm reproduced the event structure quite well, as can be seen from the comparison of Figs. 7 and 8. These show the PSTH matrix and the extracted unique spike events, respectively, for the 1/4 of eight-frame sequences having an OFF transition in their final two frames. The band of spikes near 32 ms was clearly reproduced in the spike events. Virtually all spikes in the train were accounted for by this technique; only 1.8% of the spikes were unclassified. (Note that spikes placed at random would show 5/16, or 31%, unclassified.)
|
|
In general, for the group data across all cells, 10 of 21 trials had unclassified percentages <5%, while for the remaining 11 trials this was larger than 5%. Qualitatively, the unclassified percentage was correlated with the degree to which spikes were locked to the stimulus as evidenced by visual isolation of spikes around the mean conditional latency in the PSTH matrix. When the spikes around the mean conditional latency could be visibly isolated (10 of 21 trials), the algorithm appeared to yield fairly low unclassified percentages (9 of those 10 trials). The one exception was a 40% contrast trial for an ON cell in which the events in response to an ON transition were fairly well isolated yet the unclassified percentage was nevertheless high (26%), probably because spikes were also produced without a transition when the stimulus had been bright for several frames. In cases when locking was evident but poor (5 of 21 trials had bands of increased spiking, but these were not well isolated) or when spiking was more indiscriminate (6 of 21 trials had poorly distinguishable bands), the unclassified percentage tended to be larger (10 of these 11 trials had unclassified percentages above 5%). The one exception was a 6% contrast trial for an OFF cell with a weak linear kernel-its events were not well isolated, but its unclassified percentage was nevertheless low (3.5%).
For each sequence, we defined its event probability to be the percentage of its occurrences that evoked an event of one or more spikes.
Response variability
SPIKE TIMING PRECISION. Using the binary k-frame sequences to characterize the stimulus, and the spike events to characterize the response, we turn to the next issue of this paper: a study of the reliability and precision of responses and their dependence on the stimulus. The timing precision of these events was examined by determining the jitter in the time of the first spike in the events associated with a particular sequence. This is shown in Fig. 9A for the only possible two-frame sequence with an OFF transition. This sequence generated a spike response 49% of the time, and the time of the first spike had a standard deviation of 3.25 ± 0.04 ms. Because the responses to all possible combinations of stimulus frames before and after the two frames of the transition were averaged together, this standard deviation represented the precision achieved by the two frames of the OFF transition alone, when the other frames were unspecified. Its value was already less than the standard deviation expected (7.2 ms) if the first spike times were distributed uniformly over the three-frame search window that defined events.
|
|
mean of 1.56 ± 0.39 (SD) ms (n = 9, excluding the outlier). Two trials (1 cell at 40% contrast, another at
80% contrast) had high unclassified percentages (26 and 22%,
respectively) but nevertheless had small median standard deviations
(1.97 ± 0.06 and 2.28 ± 0.13 ms, respectively). The remaining nine trials that had >5% unclassified spikes clustered at
~4.01 ± 0.58 ms. Many of these trials were less well driven, as
evidenced by their generally lower firing rate, as shown in Fig.
11B. Because this group of trials often responded more
diffusely in time, making classification of spikes difficult, their
poorer precision was not surprising. However, given that their
precision was well below the 7.2 ms expected from random placement of
spikes, it seems likely that this reflected a true property of the
cells rather than an artifact of the classification method.
|
SPIKE COUNT PRECISION.
The timing precision analysis focused on how the stimulus affected the
jitter of a single spike (namely the 1st spike in an event). To study
the precision of the remaining spikes in an event, we analyzed the
precision of the number of spikes in the events evoked by a stimulus.
This spike count precision was characterized by examining the variance
in the number of spikes per event versus the mean number of spikes in
an event. In the case of a Poisson process, the variance is equal to
the mean. At the other extreme, the minimum possible variance for a
discrete counting process with a given mean m is obtained if
the number of spikes in every event is either ceil(m) (the
smallest integer
m) or floor(m) (the
largest integer
m). This minimum variance varies
periodically with the mean, dropping to zero at each integer and
forming a scalloped curve between integers.
|
|
|
Information transmission
The spike timing and count variability measures discussed above gave some indication of the precision of LGN neurons. How much information did this level of precision allow the cells to transmit?
To address this, we changed our analysis method. The preceding analyses of variability depended on defining events that associated each spike with a unique sequence that evoked it. This required specifying both sequence length and the location within the sequence of the transition (because spikes were associated with transitions and these 2 facts uniquely linked transitions to sequences). For the information analysis, we instead considered all sequences of a given length, without regard for the presence of a transition, and simply examined the response at some fixed time interval after the initiation of the sequence.
We computed information using the direct method (see
METHODS). We binned time into discrete units of size

, typically 1 ms, and defined the "letters" of the response
"alphabet" as the number of spikes in a bin (0 or 1 for 1-ms bins).
A string of M such letters formed a response "word"
for
M = 1, the word was simply the number of spikes in a
single bin. Ideally, the choice of bin size should reflect the degree
of temporal resolution in the code, while the word size should reflect
the longest time scale of temporal correlations in the code. The timing
precision analysis suggested that a reasonable bin size was ~1 ms.
Initially ignoring correlations between bins, we calculated the
information about k-frame stimuli by considering only
single-bin words at this resolution (Fig.
15). The information grew with time
from the onset of the stimulus sequence, provided that further stimulus frames continued to be specified, up to at least nine frames. At this
point, the maximum information was ~3.5 bits/spike and appeared to be
nearing a plateau. The existence of a plateau was reasonable since a
given response time bin should give little or no information about
stimulus frames that occurred far in the past. For longer sequences
(k
4), the information began to drop from its peak
at ~24-26 ms after the onset of the last frame in the sequence, or
~16-18 ms after the onset of the first unspecified frame. This
suggests that 16-18 ms was the minimum delay for a frame to
significantly influence the response. This was in rough agreement with
our previous results that one and perhaps two frames after the
transition frames can influence the spike count by vetoing or allowing
spikes induced by the transition; if the response occurs 32 ms after
the transition, then these frames would have onsets ~15 and 24 ms
before the response that they influence.
|
We also compared the maximal observed information rate of 3.5 bits/spike to the cell's maximum possible information rate, as measured by the entropy of its spike train. Achieving this maximum would imply that all of the cell's response variability (as measured in single 1-ms bins) was used to encode the stimulus. In fact, the coding efficiency, the ratio of the actual information coded to that which could possibly be encoded, was ~51% (for k = 9), so that the cell transmitted information in individual 1-ms bins at a level that was within a factor of two of its limit.
We next examined the role of time resolution in information encoding by
varying the binwidth. We considered 8-ms words of the spike train, and
binned these words using either 1-, 2-, 4-, or 8-ms resolution. If the
precise timing of the spikes at these resolutions within the word were
important for transmitting information, then we expected more
information at smaller bins than larger bins. Finer resolution
increases the possible information the spike train can code; if the
actual information coded also grows, then the coding efficiency would
not significantly change with increasing resolution. On the other hand,
a fall-off of the coding efficiency would indicate that the increased
resolution is not being used to code information. We computed maximum
information rates for eight-frame sequences to ensure that there were
sufficient repeats of each sequence to allow us to estimate the
information for multiple-bin response words. The information rate
increased from 2.4 bits per spike at 8-ms bins to 3.1 bits per spike at 2-ms bins, a 29% increase (Fig.
16A), while the spike train
entropy increased by 36% over the same range. That is, 0.29/0.36 = 81% of the increase in entropy associated with this increase in
resolution was used to encode information. As a result, the coding
efficiency stayed relatively flat, decreasing only ~5% from a bin
size of 8 to 2 ms. Thus the position of spikes at
2-ms resolution was significant for coding information. Improving the resolution by a
factor of 2 from 2 to 1 ms yielded an additional 3% increase in
information to 3.2 bits per spike, compared with an increase in entropy
of 15%, suggesting that only 20% of the entropy change encoded
information. Thus while more information was encoded at this finer
resolution, there was a diminishing return as the noise became a
proportionately larger contributor to the cell's increased variability.
|
Redundancy or synergy in coding
Given that a temporal resolution down to 1 ms was useful, another
important question to address is the manner in which patterns of spikes
in these bins contributed to information transmission. Three
possibilities exist: different 1-ms bins may code information independently; they may encode information redundantly, so that M-bin words code less information than M times
the one-bin-word information; or they may interact synergetically so
that M-bin words code more than M times the
one-bin-word information. Note that the degree of redundancy or synergy
may change with M
for some word sizes, the responses may be
more redundant, whereas for other word sizes, they may become synergetic.
We investigated the degree of synergy and redundancy in the LGN responses in three ways. First, we compared the information in 8-ms words with 1-ms bins to that found in 1-ms words. For our example cell, the 3.2 bits per spike for 8-ms words with 1-ms bins was close to the 3.3 bits per spike for 1- ms words found for eight-frame sequences in Fig. 15, indicating only a little redundancy and no synergy between the responses of adjacent 1-ms bins. This near independence of spikes in 8-ms words was not simply due to Poisson firing since the distribution of the number of spikes within the 8-ms window showed a much larger probability for two spikes (0.21) than would be expected from the square of the one spike probability (0.008). This result suggests that the cell was bursting, although the bursts apparently did not lead to a large level of redundancy. This could happen because redundant patterns (such as bursts) might be used synergetically to code for the stimulus. It is important to point out that our measure looks at the average level of redundancy or synergy so that the combination of different groups of redundant and synergetic spikes could appear independent at this time scale.
Second, to determine whether this lack of significant redundancy or synergy survives at longer time scales, we examined the information in much longer words. Unfortunately, the direct method as applied to the repetitions of the k-frame sequences could not be used to study response word lengths longer than ~8 ms (for 8-frame stimuli) due to data insufficiency. Instead, we estimated a lower bound on the information in the entire response to the full M-sequence by using the two repeats of the full M-sequence (see METHODS). Assuming that the stimulus was the only common drive for the two responses, then the information between the responses to the two repeats bounded from below the information either could carry about the stimulus. We compared this lower bound extracted in the limit of infinitely long response words to the exact information rates computed from eight-frame sequences and 8-ms responses (Fig. 16). The lower bound came reasonably close to the information rates computed from 8-ms words across all bin sizes considered. This implies that there is little redundancy over times longer than 8 ms but leaves open the possibility of synergy (if the true infinite-word information were much higher than our lower bound).
As a final test of the redundancy or synergy between spikes, we
compared the exact information transmitted by individual 1-ms bins to
the lower bound on the information transmitted by infinitely long words
of 1-ms bins. Figure 17 plots these two
measures for all cells and trials in the data in terms of both bits per
spike and per second. Nearly all of the trials fell close to the
diagonal line where the information from single bins equaled the lower bound to the information from infinitely many bins. To the right of
this diagonal line, coding is synergetic: more information is conveyed
on average by combinations of spikes in 1-ms bins than by single 1-ms
bins. To the left of this diagonal line, coding is redundant: less
information is transmitted on average by the words of bins than by the
single 1-ms bin. Hence, the fact that the trials aligned close to, but
to the left of, the diagonal suggested that there was at most a slight
amount of redundancy for real cells. Since the infinite word
information is a lower bound, we can only be certain that the true
information lay to the right of the plotted data
that is, there was
little or no redundancy and possibly some synergy.
|
Comparison to quasi-linear threshold models
The results reported here suggest that, while there was variation among the population, visual thalamic cells could exhibit very precise responses that conveyed considerable amounts of information per spike on average. Responses of these cells are often modeled as resulting from the convolution of the cell's temporal kernel (shown in Fig. 1) with the stimulus, followed by a nonlinear thresholding to generate a firing rate (see METHODS). Is the degree of precision consistent with this picture?
We first chose the threshold that gave a best match of the model PSTH to the data and assumed that spikes were generated randomly according to an inhomogenous Poisson process with the model PSTH (appropriately scaled to yield the same mean rate as the data). The best-matched model gave a broader and more symmetric PSTH than was observed in the data, suggesting that the precision of model spikes was significantly worse. Extracting spike events as described above allowed us to directly compare the precision of the model (Fig. 18B) to the data (Fig. 18A), in response to eight-frame sequences that had a bright to dark transition in the rightmost two frames. The model spike events were clearly more diffuse in time and did not capture the details of the dependence of response onset times on stimulus sequence. We next considered models in which spikes were generated from a Poisson process with an absolute refractory period. We considered this for the case in which the firing rate was a linear function (Fig. 18C) or a quadratic function (not shown) of the thresholded filter output. In each case, refractory period and threshold were chosen together to optimally match the data (least mean-square error in PSTH). The PSTH of the linear refractory model was slightly narrower than that derived without a refractory period but continued to be wider than the data and to not show the temporal irregularity of the data. The model using a quadratic function gave results similar to, but slightly poorer than, those of the linear refractory model, so we do not consider it further. A relative refractory period in addition to an absolute refractory period also yielded quantitatively similar results as the case of an absolute refractory period alone.
|
The model's failure to capture the detailed structure of response onset times is specifically due to an underestimation of longer onset times, while shorter onset times were well reproduced by the model (Fig. 19A). This discrepancy can be understood from an examination of the PSTH matrices (Fig. 18): it appears that when two or more consecutive dark frames preceded the bright to dark transition, this lengthened first-spike times in the data; but this effect was not picked up by any of the models. The models also reasonably reproduced the mean spike counts observed in the data, but showed a tendency to underestimate smaller mean counts and overestimate higher ones (Fig. 19B).
|
The Poisson model did a poor job of reproducing the observed
variability in spike timing or spike count (Fig.
20,
). The inaccuracy in spike count
precision is not surprising, because a Poisson model will always have a
Fano factor of 1. However, the model incorporating a refractory period
came much closer to reproducing the precision of the data (Fig. 20,
). This model tended to slightly overestimate smaller
first-spike-time standard deviations and Fano factors and to
underestimate larger ones, showing less overall diversity of
first-spike-time standard deviations and Fano factors than the data.
|
| |
DISCUSSION |
|---|
|
|
|---|
We have found that LGN neurons can show great precision in their
responses to M-sequence stimuli. For at least a subset of cells, spikes
occur in discrete events triggered by an ON or
OFF transition with spike rates close to zero at other
times. The time of the first spike in an event can be precise to 1-2
ms, and this precision can be maintained even for unreliable events (events that occur with low probability). The four frames before the
transition frames influence the event timing, so these frames must be
specified to discern the cell's spike timing precision. The number of
spikes in an event can also show great precision, with Fano factor
(ratio of variance to mean) approaching 0.3 (vs. a value of 1 expected
for a Poisson process). The frames after a transition can "veto" or
allow an event so that two frames after the transition frames as well
as the four before must be specified to discern the cell's spike count
precision. This precision of response allows cells to carry up to 3.5 bits per spike of information about the stimulus. The coding efficiency
of information transmitted in 1-ms bins can be within a factor of two
of the limit set by the spike train's entropy
a limit that is
achieved when all of the cell's variability is used to code
information. The coding efficiency remains relatively constant as the
temporal resolution for specifying spike times increases to at least 2 ms, and still more information is gained by increasing resolution to 1 ms, indicating that the timing of spikes at these resolutions carries
information about the stimulus. By comparing the information carried by
1-ms response words to that in 8-ms words and to the lower bound on the
information transmitted by infinitely long response words, we find that
there is at most only a modest amount of redundancy in the coding by
successive spikes, and we find no evidence for synergy. Finally, this
precision can be largely, but not entirely, accounted for by a model in
which firing rate is generated by filtering the stimulus with the
cell's temporal kernel and applying a threshold, followed by spike
generation as a Poisson process with an absolute refractory period.
Previous work on spike timing and count precision
Our work adds to a growing body of work finding high response
precision and high information rates in the LGN in response to
full-field noise stimuli. Keat et al. (2001)
, in work
contemporary with the present work, found 1-2 ms SD for the time to
the first spike in an event in response to full-field Gaussian white
noise in close agreement with the present results for binary white
noise. Reinagel and Reid (2000)
reported a particularly
low width (SD) of 0.6 ms for one PSTH peak in one cell's response to a
full-field "naturalistic" noise stimulus but did not more generally
report on timing precision. Both of these papers and Kara et al.
(2000)
demonstrated sub-Poissonian Fano factors in LGN
responses to full-field Gaussian noise in agreement with the present
findings. Comparable precision of spike timing and count in response to
full-field noise stimuli has been reported in the retina (Berry
and Meister 1998
; Berry et al. 1997
; Kara
et al. 2000
; Keat et al. 2001
).
Measures of response to other stimuli often do not show similar
precision. Thus Guido and Sherman (1998)
measured the
jitter in the time to first spike in responses to spots flashed in the center of the LGN cell receptive field and reported standard deviations ranging from ~3 to 35 ms, depending on the mode of firing (burst vs.
tonic). The greater variability seen in this case of a single flashed
spot is akin to the spread of the PSTH seen when only a single frame is
specified (Fig. 4) and may reflect the lack of specification of the
cell's initial state. That is, when stimulated only by a blank screen
(before stimulus onset), spontaneous activities may lead a cell to
wander through a state space of comparable diversity to that created by
the set of binary stimulus sequences that could precede a single frame
in our experiments. Similar reasoning might also explain why statically
flashed, spatially nonuniform stimuli have produced Fano factors larger
than 1 in several LGN studies (Hartveit and Heggelund
1994
; Levine et al. 1996
; Sestokas and
Lehmkuhle 1988
). Reich et al. (1997)
reported a
PSTH standard deviation of 5 ms for one LGN cell in response to a
slowly drifting sine grating, but at least some of this jitter was due
to a slow drift in response phase across many trials, which may have
represented a slow change in cell state; responses over a small set of
adjacent trials showed considerably greater precision.
Specifying neuronal state
The idea that responses to temporally modulated stimuli can show
great precision, even while responses to more static stimuli may show
greater variability, has already a long history (e.g., Buracas
et al. 1998
; de Ruyter van Steveninck et al.
1997
; Mainen and Sejnowski 1995
) and has stirred
controversy (e.g., Egelhaaf and Warzecha 1999
). Our
findings add a focus on stimulus history, showing that sufficient
specification of a temporally varying stimulus is key to revealing
neural precision. By extension, this emphasizes the importance of
control of neuronal state: noise may not be intrinsic to a neuron or a
piece of neural tissue but may instead simply represent variables that
are not under the experimenter's control. While a dynamic stimulus may
control neural firing and thus control a given cell's state, lack of a
stimulus (a blank screen) yields spontaneous activities that are
stochastic, being triggered at least in part by spontaneous quantal
events in photoreceptors (Mastronarde 1989
), and these
in turn may lead a cell's state to wander in an uncontrolled way,
presenting an uncontrolled initial condition at the moment of a flashed
stimulus. A related argument was made by Buracas et al.
(1998)
, who showed that whether or not a given stimulus evoked
a spike in a cell of area MT was strongly correlated to the local field
potential at the given time and place.
It is interesting that, at least for our binary stimuli, specification of 8 frames (67 ms) seems adequate to specify the LGN state to sufficient precision to saturate spike timing and count precision, while 9-10 frames (75-83 ms) saturate the information coded by spikes. These numbers are in rough agreement with the width of the cell's temporal kernel (Fig. 1), which differs from zero over a span of ~65-70 ms.
Previous work on neuronal information transmission in the LGN
The information rates we have found
2-3.5 bits/spike, 20-90
bits/s
are similar to those found by others in LGN who, like us, used
"direct" methods (Eckhorn and Pöpel 1975
;
Reinagel and Reid 2000
). These methods directly estimate
the information carried by the spike train about the stimulus, without
a requirement for explicit decoding, by assaying certain stimulus and
response probability distributions (Eckhorn and Pöpel
1974
; Strong et al. 1998a
,b
). Indirect methods,
such as the stimulus reconstruction method (Rieke et al.
1997
), rely on being able to "decode" the response. These methods provide only a lower bound to the information rates: any information that is successfully decoded was present, but there is no
guarantee that all information that was present was successfully decoded. Rates found using indirect methods in LGN have generally been
quite low
only ~2 bits/s (Dan et al. 1998
;
McClurkin et al. 1991
; Reinagel et al.
1999
)
suggesting that much information present in the LGN
spike trains was missed by those methods.
Stimuli better matched to the receptive field may yield more
information. Eckhorn and Pöpel (1975)
found that
spatially uniform stimuli yield lower LGN transmission rates (25-40
bits/s at the best flash rate) than spots isolated at the receptive
field center (60-80 bits/s) (Eckhorn and Pöpel
1975
). Their spot and full-field stimuli were only briefly
flashed at slow, periodic intervals (
30 Hz). Full-field stimuli
modulated randomly at higher rates drive relatively high LGN
information rates, as shown both by our work and that of
Reinagel and Reid (2000)
. The latter sees a range of
information rates similar to what we have found even though their
naturalistic stimulus distribution contained much more entropy than our
binary distribution (924 bits/s in their distribution vs. 120 bits/s in
ours). That suggests that we may be seeing the limits of what an LGN
cell can code, at least to full-field stimuli. On the other hand, the
results of Eckhorn and Pöpel (1975)
suggest that
both we and Reinagel and Reid (2000)
might have seen
even higher information rates if we had restricted flashes to the
cells' receptive field centers.
High information rates like those reported here have also been observed
in a variety of other systems, including retina, visual cortex, and
insect motion-detecting neurons (e.g., Berry et al. 1997
; Buracas et al. 1998
; de Ruyter van
Steveninck et al. 1997
; Reich et al. 2000
;
Strong et al. 1998a
), suggesting that the precision found here may not be a special property of LGN or thalamic neurons.
Minimal redundancy
The issue of redundancy or synergy in the neural code has been
addressed in numerous papers, but we are aware of only a few (Brenner et al. 2000
; Reinagel and Reid
2000
) that have looked at the issue in terms of temporal coding
in a single neuron rather than population coding across multiple
neurons. Reinagel and Reid (2000)
found that LGN neurons
can sometimes code more information on average in patterns of spikes
than if those spikes were considered independently. The synergy they
reported, however, was at most only ~20%, and many neurons were
slightly redundant (
10%) or only very weakly synergetic. Our results
are consistent with this in the sense that we also observe at most only
mild redundancies in the coding by individual neurons. We cannot rule
out synergies, but to the degree that our lower bound closely
approximates the true information, the fact that none of our neurons
lay to the right of the independence line in Fig. 17 suggests that
there are also no large synergies in the coding by individual neurons.
Models of response generation
We have found that a simple model of response generation, based on thresholding the output of the cell's temporal kernel applied to the stimulus and imposing a refractory period, can match much but not all of the precision of response that we observed. To achieve this result, it was critical that the cell's temporal kernel be used and not simply the spike-triggered average; use of the latter gave noticeably less precision in both timing and spike count (not shown).
The discrepancies between the precision of the model and the observed
data most likely arise from the linear filter model rather than the
specifics of the spike generation mechanism. The PSTH matrix generated
by the model is somewhat wider and considerably more regular than that
of the data. Most strikingly, the model fails to show the lengthening
of first-spike times observed in the data when two or more consecutive
dark frames preceded the bright/dark transition. This yields a less
"jagged" left edge for the model PSTH compared with the data PSTH.
This jagged edge is dominated by a response's first spikes, which are
unaffected by refractoriness. Accordingly, the error is unlikely to be
in our model of spike generation and refractoriness but rather in the
model of PSTH generation by linear filtering. This is also suggested by
the fact that the temporal kernel and the spike-triggered average both
give similarly smooth leading edges (data not shown), so that it seems
unlikely that a better filter would alter this result. It is further
suggested by the results of Kara et al. (2000)
, who
found that they could successfully model the spike count variability of
LGN cells by beginning with the observed PSTH (rather than deriving the
PSTH from a filter as we are doing) and adding both absolute (~1 ms)
and relative (~20 ms) refractory periods extracted from the cell's
interspike interval distribution.
Accounting for the observed PSTH presumably requires a more complex
nonlinearity in our model of firing-rate generation than the
thresholding used here; it would be interesting to determine whether
contrast-gain-control mechanisms (Shapley and Victor
1978
; Victor 1987
) might be sufficient to
reproduce the response onsets and improve the agreement between the
model and data precision measures. Nonetheless, it should be noted that
the model as it stands is significantly nonlinear. The optimal
threshold value (optimal in the sense of least meansquare
error in matching the data PSTH) was 80% of the root-mean-square of
the output of the filtering of the stimulus by the temporal kernel (see
legend to Fig. 18); that is, it was necessary to set a significant
fraction of positive filter outputs to zero. The optimal absolute
refractory period was 3 ms, long compared with probable biophysical
absolute refractory periods of
1 ms. (When both an absolute and a
relative refractory period were used, the optimum was similar, 2.5 ms
absolute plus 0.5 ms relative refractory period.)
An alternative approach to modeling the neural responses observed here
is to dispense with a firing rate model altogether and instead directly
model the spike generation process. Berry et al. (1997)
found that responses of retinal neurons to full-field noise stimuli
consisted of brief response events surrounded by substantial periods of
zero spike rate, similar to the cases in our experiment in which most
spikes could be accounted for by response events locked to
ON or OFF stimulus transitions. This has led
the same group more recently (Keat et al. 2001
) to
suggest that a rate description of such responses, in which spike
probability is zero for extended periods interrupted by brief events,
may be inadequate. Instead they proposed predicting the spikes
themselves rather than a spike rate by regarding the output of a
cell's linear filter applied to the stimulus as a voltage-like
variable rather than a rate and counting upward-going threshold
crossings of this voltage as spike times. Parameterizing the filter,
adding a spike-induced "hyperpolarization" to represent
refractoriness, and adding appropriate noise yielded a 20-parameter
model (15 parameters describing the filter and 5 additional
parameters). They showed that such a model, fit individually to each
cell by optimizing a cost function incorporating precision measures,
could do a good job of replicating the cell's spiking events and their
statistics for both retinal and LGN cells in response to Gaussian noise
stimuli. We have no reason to doubt that the same models would well
describe the responses to binary noise stimuli studied here.
Conclusion
LGN cells can show remarkable precision in their responses and code information at high rates and with high coding efficiency. Revealing this precision requires sufficient specification of the stimulus history. This points to the possibility that measurements of neuronal precision may be limited as much by the degree to which the experimenter controls the variables relevant to a cell's response as by the intrinsic precision of neural processing.
| |
ACKNOWLEDGMENTS |
|---|
We thank W. Bialek and the referees for useful comments.
This work was supported by a grant from the Sloan Foundation (S. Tzonev and R. C. Liu), the University of California President's Postdoctoral Fellowship (R. C. Liu), National Institutes of Health Grants R01-NS-33787 and R01-EY-13595, and gifts from the Swartz Foundation.
| |
FOOTNOTES |
|---|
Address for reprint requests: R. C. Liu, UCSF Sloan-Swartz Center, Physiology Box 0444, 513 Parnassus Ave., HSE806, San Francisco, CA 94143-0444 (E-mail: liu{at}phy.ucsf.edu).
Received 7 March 2001; accepted in final form 14 September 2001.
| |
REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
T. Masquelier, E. Hugues, G. Deco, and S. J. Thorpe Oscillations, Phase-of-Firing Coding, and Spike Timing-Dependent Plasticity: An Efficient Learning Scheme J. Neurosci., October 28, 2009; 29(43): 13484 - 13493. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Kumbhani, M. J. Nolt, and L. A. Palmer Precision, Reliability, and Information-Theoretic Analysis of Visual Thalamocortical Neurons J Neurophysiol, November 1, 2007; 98(5): 2647 - 2663. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. S. Gaudry and P. Reinagel Benefits of Contrast Normalization Demonstrated in Neurons and Model Cells J. Neurosci., July 25, 2007; 27(30): 8071 - 8079. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. P. Billimoria, R. A. DiCaprio, J. T. Birmingham, L. F. Abbott, and E. Marder Neuromodulation of spike-timing precision in sensory neurons. J. Neurosci., May 31, 2006; 26(22): 5910 - 5919. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Pillow, L. Paninski, V. J. Uzzell, E. P. Simoncelli, and E. J. Chichilnisky Prediction and Decoding of Retinal Ganglion Cell Responses with a Probabilistic Spiking Model J. Neurosci., November 23, 2005; 25(47): 11003 - 11013. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. N. Aldworth, J. P. Miller, T. Gedeon, G. I. Cummins, and A. G. Dimitrov Dejittered Spike-Conditioned Stimulus Waveforms Yield Improved Estimates of Neuronal Feature Selectivity and Spike-Timing Precision of Sensory Interneurons J. Neurosci., June 1, 2005; 25(22): 5323 - 5332. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. E. Latham and S. Nirenberg Synergy, Redundancy, and Independence in Population Codes, Revisited J. Neurosci., May 25, 2005; 25(21): 5195 - 5206. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Bair and J. A. Movshon Adaptive Temporal Integration of Motion in Direction-Selective Neurons in Macaque Visual Cortex J. Neurosci., August 18, 2004; 24(33): 7305 - 7323. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. J. Uzzell and E. J. Chichilnisky Precision of Spike Trains in Primate Retinal Ganglion Cells J Neurophysiol, August 1, 2004; 92(2): 780 - 789. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. L. Passaglia and J. B. Troy Impact of Noise on Retinal Coding of Visual Signals J Neurophysiol, August 1, 2004; 92(2): 1023 - 1033. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. L. Passaglia and J. B. Troy Information Transmission Rates of Cat Retinal Ganglion Cells J Neurophysiol, March 1, 2004; 91(3): 1217 - 1229. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Bisley, B. S. Krishna, and M. E. Goldberg A Rapid and Precise On-Response in Posterior Parietal Cortex J. Neurosci., February 25, 2004; 24(8): 1833 - 1838. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Blitz and W. G. Regehr Retinogeniculate Synaptic Properties Controlling Spike Number and Timing in Relay Neurons J Neurophysiol, October 1, 2003; 90(4): 2438 - 2450. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. J. Chichilnisky and R. S. Kalmar Temporal Resolution of Ensemble Visual Motion Signals in Primate Retina J. Neurosci., July 30, 2003; 23(17): 6681 - 6689. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Nirenberg and P. E. Latham Decoding neuronal spike trains: How important are correlations? PNAS, June 10, 2003; 100(12): 7348 - 7353. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |