|
|
||||||||
The Journal of Neurophysiology Vol. 79 No. 2 February 1998, pp. 1017-1044
Copyright ©1998 by the American Physiological Society
1 Computational Neurobiology Laboratory, Howard Hughes Medical Institute, The Salk Institute for Biological Studies, La Jolla, 92037; 2 Division of Neural Systems, Memory, and Aging and Department of Psychology, Arizona Research Laboratories, University of Arizona, Tucson, Arizona 85724; and 3 Department of Biology, University of California, San Diego, La Jolla, California 92093
| |
ABSTRACT |
|---|
|
|
|---|
Zhang, Kechen, Iris Ginzburg, Bruce L. McNaughton, and Terrence J. Sejnowski. Interpreting neuronal population activity by reconstruction: unified framework with application to hippocampal place cells. J. Neurophysiol. 79: 1017-1044, 1998. Physical variables such as the orientation of a line in the visual field or the location of the body in space are coded as activity levels in populations of neurons. Reconstruction or decoding is an inverse problem in which the physical variables are estimated from observed neural activity. Reconstruction is useful first in quantifying how much information about the physical variables is present in the population and, second, in providing insight into how the brain might use distributed representations in solving related computational problems such as visual object recognition and spatial navigation. Two classes of reconstruction methods, namely, probabilistic or Bayesian methods and basis function methods, are discussed. They include important existing methods as special cases, such as population vector coding, optimal linear estimation, and template matching. As a representative example for the reconstruction problem, different methods were applied to multi-electrode spike train data from hippocampal place cells in freely moving rats. The reconstruction accuracy of the trajectories of the rats was compared for the different methods. Bayesian methods were especially accurate when a continuity constraint was enforced, and the best errors were within a factor of two of the information-theoretic limit on how accurate any reconstruction can be and were comparable with the intrinsic experimental errors in position tracking. In addition, the reconstruction analysis uncovered some interesting aspects of place cell activity, such as the tendency for erratic jumps of the reconstructed trajectory when the animal stopped running. In general, the theoretical values of the minimal achievable reconstruction errors quantify how accurately a physical variable is encoded in the neuronal population in the sense of mean square error, regardless of the method used for reading out the information. One related result is that the theoretical accuracy is independent of the width of the Gaussian tuning function only in two dimensions. Finally, all the reconstruction methods considered in this paper can be implemented by a unified neural network architecture, which the brain feasibly could use to solve related problems.
Reconstruction is a useful strategy for analyzing data recorded from populations of neurons, in which external physical variables such as the orientation of a light bar on a screen, the direction of hand movement in space, or the position of a freely moving animal in space are estimated from brain activity. Reconstruction is sometimes called decoding, where "coding" describes the tuning of neuronal responses to the variables of interest. A wide range of neural coding and decoding problems have been studied previously (Abbott 1994 Representative example of reconstruction
Position reconstruction based on hippocampal place cell activity provides an excellent example that captures all major aspects of reconstruction (Wilson and McNaughton 1993
Place cell data
The experimental data analyzed in this paper were obtained by methods that have been described by Wilson and McNaughton (1993) Basic description
The goal of this section is to quantify the basic tuning property of a place cell. Reconstruction only needs two distribution functions derived from the data. The first one is the probability P(x) for the animal to visit each spatial position x = (x, y), a vector that denotes the position of the animal's head on the horizontal plane. In our dataset, x was discretized on 256 × 256 grid and sampled at 20 Hz. Let N(x) be the number of times the animal was found at position x during a time period, say, of several minutes; then the probability of finding the animal at x can be estimated as
Firing rate modulation
The description in the preceding section can be broadened to incorporate the modulation of firing rate by additional variables other than the position x, such as running speed, the heading direction, and the theta rhythm. For our dataset, including firing rate modulation only slightly improved reconstruction accuracy.
RUNNING VELOCITY.
The firing rate of a place cell typically increases with running speed and also depends on the heading direction when movement is confined to a narrow track (McNaughton et al. 1983
THETA RHYTHM.
The timing of spike firing relative to the phase of the theta waves in the hippocampal local field potential encodes additional information about the location of the rat (Brown et al. 1996
Direct basis and reciprocal basis
A basis function method uses a linear combination of fixed templates, or basis functions, with the coefficients in the linear combination proportional to the activity of the neurons. The template functions can be chosen arbitrarily. Define ni as the number of spikes fired by cell i within the time window, and
Simple example of reciprocal basis
The reciprocal basis method is illustrated by a hypothetical example of coding in the motor system (Fig. 4). Imagine two motoneurons each of which, when activated alone, will drive movement in the direction f1 or f2, respectively. We call fi the driving direction of neuron i. Assume that the two neurons are activated randomly and independently and that the actual movement is the vector sum
Special cases of basis functions
TEMPLATE MATCHING.
The template matching method for reconstruction has been used for modeling stereo hyperacuity (Lehky and Sejnowski 1990 VECTOR METHODS.
Vector reconstruction is widely applied to data from neural populations. Vector methods such as the population vector and the optimal linear estimator compute
POPULATION VECTOR WITH SCALING.
For place cells, the population vector needs to be scaled by the total activity so that the reconstructed position
Background
The Bayesian approach is a natural one for reconstruction. It is optimal within a probabilistic framework and, as shown here, yields the best reconstruction results for our dataset. Once the position dependence of the firing rates of a group of place cells is known, the Bayes formula directly addresses the inverse problem: given the firing rates of these cells, how can we infer the most probable position? Bayesian reconstruction has been applied by Földiák (1993) Basic method
Assume that using the methods in the preceding section we have measured the firing rate maps f1(x), f2(x), . . . , fN(x) of a population of N place cells as well as the animal's position distribution P(x) during a period of time. At an arbitrary later time t, given the numbers of spikes fired by these place cells within the time interval from t Testing the assumptions
The Bayesian reconstruction as applied in Eq. 36 relies on two assumptions, namely, Poisson spike distribution and independent firing of different cells. These assumptions are reasonable and convenient approximations, although neither is precisely true.
POISSON SPIKE STATISTICS.
The Poisson firing model commonly is used for neuronal spike statistics (Tuckwell 1988
![]()
INTRODUCTION
Abstract
Introduction
References
; Bialek et al. 1991
; Optican and Richmond 1987
; Salinas and Abbott 1994
; Seung and Sompolinsky 1993
; Snippe 1996; Zemel et al. 1997
; Zohary et al. 1994
).
, 1989
; Schwartz 1994
) and the template matching method applied to disparity selective cells in the visual cortex (Lehky and Sejnowski 1990
) and hippocampal place cells during rapid learning of place fields in a novel environment (Wilson and McNaughton 1993
). In these examples, reconstruction extracts information from noisy neuronal population activity and transforms it to a more explicit and convenient representation of movement and position. In this paper, various reconstruction methods that are theoretically optimal under different frameworks are considered; the ultimate theoretical limits on the best achievable accuracy for all possible methods also are derived.
![]()
DESCRIPTION OF PLACE CELL FIRING
). The task is to infer the position of the rat's head based on the patterns of spiking activity from simultaneously recorded place cells. As illustrated in Fig. 1, a place cell is often quiet and fires maximally only when the animal's head is within certain restricted regions in the environment, called the place fields (McNaughton et al. 1983
; Muller et al. 1987
; O'Keefe and Dostrovsky 1971
; Olton et al. 1978
).

View larger version (44K):
[in a new window]
FIG. 1.
Firing rate maps of 25 hippocampal place cells simultaneously recorded in a rat (animal 1) running on elevated track of a figure-8 maze, with clockwise movement in upper loop and counterclockwise movement in lower loop, and slowing down at corners for food. Restricted regions with high firing rates are called place fields. Another 32 simultaneously recorded cells in same animal were active during sleep but virtually silent on this maze and therefore were not shown. Maps were computed from 7 min of continuous data. Reciprocal fields were obtained from firing rate maps by using a pseudoinverse. In each plot, scaling is linear with zero value corresponding to 0 in color map and maximum positive value corresponding to 1. Here reciprocal fields roughly resemble place fields, but have negative values in surrounding regions.

View larger version (74K):
[in a new window]
FIG. 9.
Sudden jumps of reconstruction error (here showing Bayesian 1-step method for animal 1) often occurred when instantaneous firing rate averaged over all cells was low, which in turn was correlated with low running speed. In firing rate diagram, dotted curve shows inverses of average firing rates; this serves only as a visual guide to emphasize moments of low firing rates. Running speed and reconstruction error also had correlates with hippocampal local field potentials, because sharp waves and theta rhythm are known to be more prominent during immobility and motion, respectively. Amplitude diagram shows absolute value of local field potential signal of channel 1, which was slightly blurred and shown in arbitrary units. Continuous Fourier power spectra were obtained by sliding a Hanning window with a length 10 times the period of each frequency. Channel 1 was sampled at 1 kHz to capture sharp waves, which can be seen in Fourier power spectra as thin vertical strips ~150-200 Hz. Channel 2 was sampled at 200 Hz to capture theta rhythm, which can be seen in Fourier power spectra as horizontal dark bands ~8 Hz with 2nd harmonics around 16 Hz.
, Skaggs, McNaughton, Wilson, and Barnes (1996), and Barnes, Suster, Shen, and McNaughton (1997). The accuracy of spike timing data of neurons from areas CA1 or CA3 of the right hippocampus was 0.1 ms. The head position was tracked by infrared diodes on the head stage that were sampled at 20 Hz and at a resolution of 256 × 256 pixels (~111 × 111 cm). Several channels of hippocampal local field potentials were recorded at 200 or 1,000 Hz.
This probability is sometimes called spatial occupancy.
(1)
where
(2)
t is the time interval of position tracking so that N(x)
t is the total time spent at position x. The derived firing rate map f(x) was smoothed by convolving it with a Gaussian kernel (Parzen window) with a standard deviation of 1 cm. In theory, the time average of the firing rate should equal to the spatial average fmean =
f(x)P(x). This is no longer exactly true after the aforementioned smoothing, and therefore can be used as a cross-check.
This is equivalent to Eq. 2, as is verified readily by using Eq. 1 and the definition of two additional quantities, namely, the conditional probability
(3)
for an arbitrarily chosen spike to fall into the spatial position x, and the overall average firing rate
(4)
with
(5)
S(x) being the total number of spikes and
N(x)
t being the total time of recording. The firing rate distribution f(x) describes the intrinsic tendency for the cell to fire at each position x and is independent of how often the animal visits this position.
; Muller et al. 1994
) may be formulated by running velocity modulation of firing rate as in the next section.
). Both the effects of speed and direction can be modeled as multiplicative modulation of firing rate by the running velocity. For our dataset, the animals run mostly in one direction, and we need only to consider the firing rate modulation by speed.
where functions F(x) and G(v) depend on x and v only, respectively. The simplest case is linear speed modulation
(6)
That is, G(v) = a
(7)
+ b is a linear function of the speed
= |v|, with a and b being constant.
) (see also Fig. 8). A similar modulation by speed occurs for cells in the motor cortex of monkeys. Schwartz (1993
, 1994)
has shown that adding up the population vector head to tail approximately reproduces the hand trajectory. This implies that the length of the population vector is roughly proportional to reaching speed. Thus to a first approximation, the reaching speed modulates the average firing rate linearly also in the motor cortex.

View larger version (14K):
[in a new window]
FIG. 8.
Modulation of place cell firing rate by running speed was nearly linear. A: histogram of speed distribution. Speed was computed after convolving position trajectory with a Gaussian kernel. B: average firing rate across all cells as a function of binned running speed. For animal 1, mean speed was 19.6 cm/s, mean firing rate was 0.92 Hz, and linear fit was 0.028
+ 0.39 with correlation coefficient r = 0.991. For animal 2, corresponding parameters were 9.3 cm/s, 1.1 Hz, and 0.037
+ 0.73, with r = 0.992. Only 1st half of data are shown (7 min for animal 1 and 10 min for animal 2). For 2nd half of data, parameters for animal 1 were 18.0 cm/s, 0.91 Hz, and 0.033
+ 0.31 with r = 0.991; for animal 2, they were 7.0 cm/s, 1.38 Hz, and 0.023
+ 1.25, with r = 0.956.
; Jensen and Lisman 1996
; O'Keefe and Recce 1993
; Samsonovich and McNaughton 1996
; Skaggs et al. 1996
; Tsodyks et al. 1996
). Using phase information would be equivalent to splitting a single place field into several smaller regions while preserving the total number of spikes. How this might affect reconstruction accuracy is described quantitatively by Eq. 53. However, information from field potential recordings was not used here because with 20-30 simultaneously recorded cells, the optimal time window for reconstructing position was ~1 s (Fig. 6), which spanned around 8 theta cycles and averaged out the phase information from a single cycle.

View larger version (18K):
[in a new window]
FIG. 6.
Mean error of Bayesian 2-step reconstruction method depends on time window and shift of its center. A: length of time window was altered systematically, whereas all other parameters were identical to those in Fig. 5 with all cells. B: center of time window (length fixed at 1 s) was shifted relative to current instant of time. Only data from first half session were used for sampling. Reconstruction was performed on data from both first half session (light gray curves and symbols) and second half session (black curves and symbols, with two outlier data points removed). For animal 1, each half session lasted 7 min and for animal 2 each lasted 10 min. For both animals, reconstruction for second half session was most accurate when center of time window was slightly shifted towards past.
![]()
LINEAR APPROACH: BASIS FUNCTIONS
i(x) as an arbitrary basis function or template function associated with this cell. The basic computation is the linear sum
which is a distribution function over two-dimensional (2-D) physical space indexed by x. The peak position of this distribution can be taken as the reconstructed position
(8)
of the animal; that is
where argmax means the value of x that maximizes the function.
(9)

View larger version (41K):
[in a new window]
FIG. 2.
Snapshots of reconstructed distribution density for position of a rat (animal 1) as compared with its true position. True position occupied a single pixel on 64 × 64 grid, corresponding to 111 × 111 cm in real space. Probabilistic method (Bayesian 1-step) often yielded a sharp distribution with a single peak, whereas direct basis method typically led to a broader distribution with multiple peaks. Total numbers of spikes collected from all 25 cells for three snapshots were 5, 22, and 31, respectively. Time window for each snapshot was 0.5 s.
), the template matching method (Lehky and Sejnowski 1990
; Wilson and McNaughton 1993
), and the optimal linear estimator (Salinas and Abbott 1994
) as special cases. A more detailed discussion of these methods follows in a later section.
where the distribution function
(10)
(x) describes the position of the animal and is 1 at the animal's current position and is zero elsewhere. Equation 10 remains valid even when
(x) is an arbitrary probability distribution density.
(x) by combining some fixed template function
i(x) associated with each cell i. For an ideal reconstruction, we should have
for an arbitrary position distribution
(11)
(x). In practice, we seek an approximate reconstruction by minimizing the error.
where n = (n1, n2, . . . , nN)T is a column vector, F = [f1, f2, . . . , fN] is a matrix with each column vector fi obtained by concatenating all the pixels in the firing rate map fi(x). The exact method of the concatenation does not affect the final result, as long as it is used consistently. Ik is the kth column of the identity matrix I. The sole nonzero element of Ik represents the pixel of the animal's position after the concatenation. The requirement Eq. 11 for perfect reconstruction now becomes
(12)
where, in the last step, Eq. 12 has been used, and matrixG = [g1, g2, . . . , gN] is constructed in the same way as F, and each column vector gi corresponds to the basis function gi(x) to be determined. If Eq. 13 holds true for all k, that is, for all positions of the animal, then we must have I = GFTI. Thus in general we should try to find G that minimizes
(13)
where
(14)
X
2 means the sum of the squares of all the elements of matrix X. The solution to this least-mean-square problem is well known
where
(15)
denotes Moore-Penrose pseudoinverse and T denotes transpose. F and G are reciprocal in the sense that we also have
In other words, applying the pseudoinverse and transpose twice returns to the original basis. An example of reciprocal basis functions for place fields is shown in Fig. 1.
(16)
). Starting from the expansion F = UDVT, where U and V are orthogonal matrices and D is a rectangular diagonal matrix, we immediately obtain G = UD
VT, where D
is the diagonal matrix obtained from DT by replacing each nonzero element
i with its inverse 1/
i, hence the reciprocal property. The familiar method of computing pseudoinverse from the normal equation
by inverting the correlation matrix FTF yields identical results, provided that the matrix is not singular or near singular.
(17)
or equivalently, the tuning functions f1(x), . . . , fN(x) are uncorrelated
(18)
where C is a proportional constant and
(19)
ij is Kronecker delta that equals 1 when i = j and 0 otherwise. The conditions for orthogonality Eq. 18 or Eq. 19 are approximately satisfied for the recordings in Fig. 1, which explains why the reciprocal fields look similar to the place fields (direct basis) and why the corresponding performances are also similar (Figs. 3 and 5).

View larger version (62K):
[in a new window]
FIG. 3.
True X and Y positions of animal 1 running on figure-8 maze as compared with positions reconstructed by different methods with 25 place cells. Same 60-s segment is shown in all plots. Time window for reconstruction was 0.5 s, which was moved forward with a time step of 0.25 s. For a fair comparison of different methods, if none of 25 cells fired within time window, reconstructed position at preceding time step was used. Probabilistic or Bayesian methods were especially accurate and erratic jumps in reconstructed trajectory were reduced by a continuity constraint by using information from two consecutive time steps.

View larger version (28K):
[in a new window]
FIG. 5.
Mean errors of different reconstruction methods applied to data from two different rats. Errors all decreased as more cells were used, and lowest errors were comparable with intrinsic error of tracking system, which was about 5 cm (horizontal lines). Shaded region: reconstruction errors prohibited by lower bound derived by Fisher information and Cramér-Rao inequality (Eq. 53). Each data point and error bar represent average and standard deviation of mean errors in 40 repetitive trials in which a subset of cells were drawn randomly from whole sample. Animal 1 ran on figure-8 maze; 7 min of data were used for sampling and subsequent 7 min of data were used for reconstruction. Animal 2 ran on rectangular maze (upper loop of figure-8 maze) and both sampling and reconstructing time intervals were 10 min. Time window was 1 s.
i(x) = fi(x), and the optimal template function that minimizes the mean square error is the reciprocal basis
i(x) = gi(x) given by Eq. 15. For nonuniform P(x), instead of using the raw firing rate maps fi(x) as the basis functions, choose
This weights the basis functions by the prior probability for the animal to visit each position. As expected, simulations confirm that this modification reduces reconstruction error. Similarly, we can use
(20)
as the reciprocal basis functions. Other modifications of the reciprocal basis are considered in APPENDIX A and compared in Table 1. In our comparison of different methods in Figs. 3 and 5, the simpler Eqs. 20 and 21 were used.
(21)
View this table:
TABLE 1.
Relative mean errors of different reconstruction methods
i(x) linearly as
ni
i(x), and then take the peak position x ^as the reconstructed position of the rat as described by Eq. 9. The whole trajectory is constructed by sliding the time window forward (Fig. 3).
where n1 and n2 are the activity of the two neurons relative to their resting levels.
(22)

View larger version (16K):
[in a new window]
FIG. 4.
A hypothetical two-neuron motor system showing distinction between driving directions and preferred directions, which form a pair of reciprocal vector bases. Apparent preferred directions differ from true driving directions unless system is orthogonal. Here reciprocity means that if vectors g1 and g2 are taken as driving directions, then vectors f1 and f2 will become apparent preferred directions.
(23)
(24)
is the transposed inverse. This is a special case of the reciprocal basis in Eq. 15 because the pseudoinverse is identical to the ordinary inverse for an invertible square matrix. Expressing G = [g1, g2] in terms of two column vectors g1 and g2, the solution to Eq. 24 becomes
(25)
Therefore, the activity ni of each neuron is the inner product between the actual movement direction f and a fixed direction gi, which is the preferred direction. The driving directions and the preferred directions are reciprocal in the same sense as in the preceding section. They are identical only when the two driving directions are orthogonal. In general, Eq. 25 means FTG = I (unit matrix), or equivalently, f1·g1 = f2·g2 = 1 and f1·g2 = f2·g1 = 0. That is, in general, the preferred direction g1 of neuron 1 is always perpendicular to the driving direction f2 of neuron 2, and vice versa. The distinction between a pair of reciprocal vector bases is equivalent to that between contravariant and covariant tensors (Pellionisz 1984
(26)
; Pellionisz and Llinas 1985).
) and place cells (Fenton and Muller 1997
; Wilson and McNaughton 1993
). This method is equivalent to the direct basis method. At each fixed position x, the average firing rates of all the cells can be described by a vector f = (f1(x), f2(x), . . . , fN(x)). A vector template is stored for each position x and is later used to match the actual number of spikes fired by these cells within a time window: n = (n1, n2, . . . , nN). During reconstruction, the matching method searches for the position in which the dot product n·f is maximized. This is equivalent to maximizing n·f =
nifi(x) using fi(x) as basis functions. In other words, the reconstructed position is
Uniform scaling by the numbers of spikes does not affect the peak position or the reconstruction result. Position-dependent scaling such s(x) = 1/
(27)
fi(x)2 can be accommodated by redefining each template function fi(x) as fi(x)s(x).
where xi is a fixed vector for each neuron i, and ni is its activity level, such as the total number of spikes during a time interval or the firing rate.
(28)
; Salinas and Abbott 1994
; Sanger 1994
).
vector in Eq. 28 is identical to the peak position of the following scalar function
where vector x is a free variable in the same space as
(29)
vector but has unit length, and the template function
is a cosine function of the angle between the two vectors. In other words, the direction of the reconstructed vector defined by Eq. 28 also can be computed by
(30)
where |
(31)
vector| is the length of the vector
vector. To see this, take the dot product of the both sides of Eq. 28 with the unit vector x and then use the definitions in Eqs. 29 and 30 to obtain
(x) = x·
vector, which is maximized only when the unit vector x is aligned with
vector. Therefore basis functions with cosine tuning can precisely implement a vector method (see also Salinas and Abbott 1994
).
where xi is the center of the place field of cell i. This method has been considered by several authors (Abbott and Blum 1996
(32)
; Blum and Abbott 1996
; Burgess et al. 1994
; Muller et al. 1987
). It will be shown later that the scaling in Eq. 32 can be justified by the Bayesian method.
xi), where xi is the center of the place field of cell i. To find the peak position of
nifi(x), set its derivative with respect to x to zero. This leads to
which can be compared with Eq. 32. Because eachG(
(33)
basis
xi) is small unless the estimated position
basis is sufficiently close to the place field center xi, the overall effect may be interpreted as a population vector with an additional weighting scheme in favor of those place fields centered close to the estimated position. Consequently, an occasionally active cell with a place field far away from the estimated position does not affect the final result of the basis function method as much as in the case of a population vector.
![]()
PROBABILISTIC APPROACH: BAYESIAN METHODS
to visual orientation tuning, by Sanger (1996)
to motor directional tuning, and by Ginzburg (Gerrard et al. 1995
) and Brown (Brown et al. 1996
) to place cells.
. Alternatively, without using an explicit spike generation model, the stimulus-response relation can be obtained empirically as a look-up table (Földiák 1993
).
/2to t +
/2, where
is the length of time window, the goal is to compute the probability distribution of the animal's position at time t. Notice that what is to be computed here is a distribution of positions instead of a single position; we always can take the most probable position, which corresponds to the peak of the probability distribution, as the most likely reconstructed position of the animal (Fig. 2).
The goal is to compute P(x|n), the probability for the animal to be at position x given the numbers of spikes n. P(x) is the probability for the animal to be at position x, which can be measured experimentally. The probability P(n) for the numbers of spikes n to occur can be determined by normalizing the conditional probability P(x|n) over x and therefore does not have to be estimated directly.
(34)
where fi(x) is the average firing rate of cell i at position x, and
(35)
is the length of the time window.
(36)
By sliding the time window forward, the entire trajectory of the animal's movement can be reconstructed from the time varying activity in the neural population.
(37)
). It is equivalent to assuming that the firing is as random as possible. If points are distributed randomly over a long time interval, with equal chance for each point to fall at any location independently of each other, then the number of points contained in a small time window has a Poisson distribution. A direct test for a Poisson distribution is not easy for place cells because their mean firing rates are low and depend on the location, which an animal may visit only transiently. Nonetheless, the interspike intervals can be compared directly with the exponential distribution expected for a Poisson process. Consider cell i with firing rate map fi(x). According to the inhomogeneous Poisson model with mean firing rate depending on the position x, the overall probability density for interspike interval s should be
where P(x) is the probability for the animal to visit position x. The above formula implies that dpi(s)/ds < 0 always holds true; that is, the interspike interval histogram must be monotonically decreasing.
(38)
). Second, instead of decreasing monotonically, the actual interspike interval histograms tended to have a small bump within the theta frequency range (4-12 Hz), reflecting the periodic drive from the septal pacemakers (Stewart and Fox 1990
). Third, the Poisson spiking model does not have a refractory period to rule out extremely short intervals. A related finding by Fenton and Muller (1997)
is that place cell firing for similar movement trajectories in an open environment was more variable that expected from a Poisson distribution. Despite these limitations, the inhomogeneous Poisson model proved to be adequate for the purpose of reconstruction.
INDEPENDENCE OF DIFFERENT CELLS. The activities of two cells are statistically independent only if the joint probability of firing is a product of the individual probabilities. For two place cells, 1 and 2, their independence means that at each position x
|
(39) |
Continuity constraint: using two time steps
The trajectories reconstructed by all methods suffer from occasional erratic jumps, which often are caused by low instantaneous firing rates of the recorded place cells, especially when the animal stops running (Fig. 3). Clearly, if all recorded cells stop firing, there is not enough information for accurate reconstruction.
1 at the preceding time step is used as an additional piece of information, we can compute the conditional probability P(xt|nt, xt
1) from the general Bayes formula P(xt
1|xt, nt)P(xt, nt) = P(xt|nt, xt
1)P(nt, xt
1) and the assumption that
In the sense of probability, this assumption means that the activity nt at the current time step cannot directly affect the position xt
(40)
1 at the preceding time step. With the help of simple equalities like P(xt, nt) = P(xt|nt) P(nt) and P(nt, xt
1) = P(xt
1|nt)P(nt), we immediately obtain the final equation
where k = 1/P(xt
(41)
1|nt) is a scaling factor that does not depend on xt and can be determined by normalizing P(xt|nt, xt
1) over xt.
Compared with the single step Eq. 36, the only new factor in Eq. 41 is the conditional probability P(xt
(42)
1|xt). Intuitively, the continuity constraint should require that given the current location xt, the preceding location xt
1 cannot be too far away. For a precise formulation, we assume a 2-D Gaussian distribution
where the standard deviation
(43)
t is related to the speed
t at time t by the formula
(44)
t of the Gaussian prior is too large, the continuity constraint has little effect. On the other hand, if
t is too small, the constraint may become too restrictive and the reconstructed position might get stuck in the same position if the real position has moved away by a distance much larger than
t. In our analysis, we simply used d = 1 for linear movements and confined
t to between 20 and 60 cm (good empirical values). The scaling in Eq. 44 was chosen so that
t would just reach the allowed maximum of 60 cm at top running speed. For generality, we did not include any bias for the direction of movement. As shown in Figs. 3 and 5, the continuity constraint is effective in reducing reconstruction errors caused by erratic jumps.
Incorporating firing rate modulation
Under the assumption of multiplicative modulation of firing rate such as in Eq. 6, the Bayesian Eq. 36 should be modified by replacing the fixed tuning function fi(x) of each cell i by
|
(45) |
|
(46) |
|
(47) |
Relationship to basis functions
GENERAL EQUIVALENCE.
In general, the performance of all basis function methods can be mimicked by the Bayesian method if appropriate template functions are used. To implement the sum
ni
(x) with the basis functions
i(x), choose the tuning function
|
(48) |
|
(49) |
|
(50) |
RELATIONSHIP WITH POPULATION VECTOR WITH SCALING.
Under special assumptions, the Bayesian method can lead to the population vector method with scaling by total activity. This scaling is needed for reconstruction of a positional vector. Suppose the tuning function for each cell i is a Gaussian fi(x) = Ai exp(
x
xi
2/2
2i), where xi is the center of the place field of cell i and
i characterizes its width. Suppose the centers xi and the peak firing rates Ai are uniformly distributed in space, then the sum
fi(x) is approximately constant in space. In this case, the optimal position
in Bayesian reconstruction is given by
|
(51) |
i =
. This justifies scaling the positional population vector by the total activity
ni in Eq. 32.
| |
RESULTS OF PLACE CELL RECONSTRUCTION AND DISCUSSION |
|---|
Comparison of different methods
RECONSTRUCTION ACCURACY. The performance of different methods can be compared qualitatively in Fig. 3 and quantitatively in Table 1 and Fig. 5. The effects of various parameters are discussed later.
. This result makes sense because of the essential equivalence between the matching method and the direct basis method as discussed before.
; Wilson and McNaughton 1993
).
). However, data exclusion requires additional criteria. For all methods in this paper, we reconstructed the whole trajectory without introducing additional parameters.
PROBLEM WITH VECTOR METHODS. The population vector method was the least accurate in reconstructing position in space for our dataset. The major problem is that it sometimes yielded implausible results. To see why, imagine that only two cells were active within the time window; then the weighted average of their positions would lie somewhere on the line connecting the two centers of the place fields. The estimated position may be in the center of the maze, which was never visited by the rat. This is a general problem for all vector methods including the reciprocal vector or optimal linear estimator. As long as the shape of the environment is not convex, a vector method may suffer from this problem. The existence of systematic bias in vector methods has been pointed out by Muller, Kubie, and Ranck (1987). In contrast, both the basis function method and the probabilistic method take into account the entire 2-D distribution functions, and the final peak always occurs in a plausible position. See also the discussion after Eq. 33.
REMARK ON SILENT TIME STEPS. For a fair comparison of different methods, in all of our simulations we have used the heuristic that if all the recorded cells were silent within a time window, the current reconstructed position was set to be the reconstructed position at the preceding time step. We needed this rule because when all recorded cells are silent within the time window, all basis function methods fail, whereas the Bayesian methods are robust. With a time window of 0.5 s, ~15% of the time steps contained no spike in animal 1, and the percentage was only ~0.2% in animal 2. For a time window of 1 s, the percentages dropped to 4.3 and 0% for animals 1 and 2, respectively. There were fewer silent time steps in animal 2 because its maze was smaller, but there were more cells so that the overall average firing rates were higher than that of animal 1.
Effects of parameters
NUMBER OF CELLS. When more cells were used in reconstruction, the errors tended to decrease (Fig. 5). Different methods maintained their relative accuracy for different numbers of cells. In theory, the error of various reconstruction methods should be inversely proportional to the square root of the number of cells (Eq. 56). However, the inverse square root law is valid only when a sufficiently large number of cells is used, as shown by simulation (Fig. 12). Here the number of cells was too small to apply the law.1
|
TIME WINDOW. A time window with a duration of ~1 s was a good choice for the Bayesian two-step method for these data (Fig. 6A). The results of other methods were similar (not shown), although the errors for large time windows sometimes leveled off or only increased slightly. In theory, the error is expected to be inversely proportional to the square root of the time window (Eq. 56). This can account for the general tendency for the reconstruction to improve with a longer time window. However, this theoretical consideration no longer holds when the time window exceeds the optimal size.
TIME SHIFT.
In the Bayesian two-step method, the center of the time window was shifted relative to the current time to see what shift produced the best accuracy (cf. Blair and Sharp 1995
; Blair et al. 1997
; Muller and Kubie 1989
; Taube and Muller 1995
). Alignment of the time window to within 100 ms of the current time gave the best results. Figure 6B compares the reconstruction errors from the first half of the session to that from the second half. As expected, the errors for reconstructing the second half were larger than that of self-reconstruction because all parameters for the reconstruction algorithm were determined using data from only the first half session.
TIME BLOCKS: SLOW TRENDS.
As shown in Fig. 7, when a single 3-min time block of data was used to reconstruct the movement trajectory of other time blocks, the average reconstruction error tended to drift systematically. That is, reconstruction tended to get worse when going further into either the past or the future. Because there was no overlap between the time blocks, the tendency for gradual drift of error must be caused by gradual changes of either behavioral or neural origin.
FIRING RATE MODULATION.
Including firing rate modulation in reconstruction had only a small effect on the accuracy, even though the average firing rate was modulated clearly by running speed (Fig. 8), as reported by McNaughton, Barnes, and O'Keefe (1983).
OTHER PARAMETERS.
The reconstruction results were not sensitive to parameters like the exact width of the spatial grid and the exact radius of the blurring kernel as long as they were kept below the intrinsic error of the tracking system. In all reconstruction data presented in this paper, we used64 × 64 grid, corresponding to 111 × 111 cm in real space. Adding a small constant background value to the firing rate maps sometimes slightly improved the Bayesian reconstruction.
Discontinuity in reconstructed trajectory
As shown in Figs. 3 and 9, reconstruction errors were relatively small most of the time but were punctuated by large jumps. Most of these jumps occurred when the animal stopped running (Fig. 9). This also can be seen in Fig. 3, where the jumps tend to occur between the corners, the places where the animal stopped for food reward. The results from animal 2 were qualitatively similar, but the jumps in reconstruction errors were less frequent because more cells were used in reconstruction and their average firing rates were higher.
Fisher information
The accuracy of different reconstruction methods varied greatly. What is the maximum accuracy that can possibly be achieved? The problem is to estimate the maximum amount of information inherent in the neuronal spikes that can be extracted for reconstruction.
Minimal error under Gaussian tuning in 2-D space
To estimate Fisher information in place cell data, several properties of the place fields must be provided: the average shapes of place fields, the spiking statistics of the cells, the density of the place fields per unit area, and the statistics of the tuning parameters. Instead of using noisy real data for direct numerical evaluation, we use analytical models based on simplifying assumptions. Although these assumptions are not rigorously true, they are reasonably good approximations that allow useful analytical estimates to be derived. First, assume that the spatial tuning function is a 2-D Gaussian
66 ms (
34
32 ms) for animal 1 and
68 ms [
93
(
25) ms] for animal 2. Conversely, when the second half of data was used to reconstruct the trajectory of the first half, the minimum appeared to drift towards the future with respect to the minimum of self-reconstruction (not shown). Using the same smoothing method, the value of the drift was 118 ms [107
(
11) ms] for animal 1 and 93 ms [17
(
76) ms] for animal 2.

View larger version (26K):
[in a new window]
FIG. 7.
Mean reconstruction errors of Bayesian 2-step method in consecutive and nonoverlapping time blocks, each lasting 3 min. Only a single block (shaded) was used as data block for sampling and reconstruction was performed on all blocks. Graphs for two animals are aligned along data blocks. Average reconstruction errors tended to change gradually even though there was no overlap between contiguous time blocks.
t of allowable jumps in Eq. 44 varied between 20 and 60 cm, which gave good performance for the data of animal 1. Smaller values slightly improved the reconstruction for animal 2, which did not have many erratic jumps in the first place. Changing these values affects the reconstruction error only slightly, as long as they are not too small or too large.
) (see also Figure 8); 3) occurrence of large amplitude irregular activity and more frequent sharp waves in local field potentials (Buzsáki 1986
; Vanderwolf et al. 1975
) (in Fig. 9, the sharp waves can be seen as the thin vertical dark bands ~150-200 Hz in the Fourier power spectra of channel 1 of the local field potentials); and 4) the disappearance of the theta rhythm (Vanderwolf et al. 1975
) (in Fig. 9, theta modulation can be seen as the fuzzy horizontal dark bands ~8 Hz in the Fourier power spectra of channel 2 of the local field potentials).
). This is consistent with the report that place cell firing represented an animal's position somewhat more faithfully during the theta rhythm (Kubie et al. 1984
).
, 1989
). Sharp waves are more frequent during immobility. It is possible that the activity during sharp waves might be involved with other functions such as memory consolidation and reactivation of learned correlations rather than merely reflect the current spatial position of the animal. Reconstruction based on sharp wave activity may yield an incorrect position. However, because the jumps did not always coincide with the occurrence of sharp waves, we can only conclude that sharp waves are one possible contributing factor for discontinuity in reconstructed path.
; Rolls et al. 1995
). One cannot completely rule out the possibility that the activity of some cells in rat hippocampus might be similarly modulated by the animal's view rather than by the current position alone. If this is true, the reconstructed position may jump when a rat looks around or shifts attention.
![]()
RECONSTRUCTION ACCURACY: THEORETICAL LIMITS AND BIOLOGICAL IMPLICATIONS
; Seung and Sompolinsky 1993
; Snippe 1996). It differs from Shannon information, which has been used more commonly in estimating information contained in spikes (for a recent review, see Rieke et al. 1997
). Although these two information measures are related, their relationship is not straightforward (Cover and Thomas 1991
). Fisher information depends on the shape and the slope of the tuning function (see APPENDIX B). In the place cell example, if the spatial bins are scrambled, the amount of Fisher information would be different. By contrast, Shannon information would be the same regardless of the ordering because it is not directly related to the slope of a distribution function. Shannon information offers a useful measure on the distribution of place fields (Skaggs et al. 1993
) but no existing theory has linked it to reconstruction accuracy. See also Treves, Barnes, and Rolls (1996) for related discussion.
; Kay 1993
; Rao 1965
; Scharf 1991
; Zacks 1971
). This directly yields an estimate of the minimal reconstruction error that can possibly be achieved.
(52)
where
(53)
2 =
/2 is a constant correction factor, 
2
is the average of the square of the tuning width for different place fields, and Nspikes is total number of spikes collected from all the cells within the time window, namely,
Here N is the total number of cells recorded, fmean is their mean firing rate, and
(54)
is the length of the time window. The mean firing rate is related to the parameters of the Gaussian place fields by the formula
where A is the area of interest, in which the N place fields are uniformly distributed, and
(55)
fmax
is the average peak firing rate of all place cells. Here the assumption of the independence of peak firing rate and the tuning width has been used.
The equivalence of the two Eqs. 56 and 53 follows immediately from Eqs. 54 and 55.
(56)
Minimal error under Gaussian tuning in arbitrary dimensional space
To generalize the results in the preceding section, we can derive error formulas for an arbitrary spatial dimension D by assuming Gaussian tuning function, Poisson spike statistics, uniform distribution of the center of the Gaussians in space, and statistical independence of the firing of different cells as well as their peak rate fmax and tuning width
. As the counterpart of Eq. 53, the minimal reconstruction error based on Fisher information is
|
(57) |
means average over all the cells. The counterpart of Eq. 56 is
|
(58) |
|
(59) |
D is the correction factor for dimension D (see Table B1 and Eq. B26);
is the width and fmax is the peak firing rate for the Gaussian tuning function (52) in D-dimensional space; and Nspikes is the total number of spikes collected from N cells within the time window
|
(60) |
|
(61) |
is the cell density. For example, suppose N cells are distributed randomly within a D-dimensional cube with edge length L, we have
|
(62) |
= N/L is the number of cells per unit length; in the 2-D case,
= N/L2 is the number of cells per unit area; and in the 3-D case,
= N/L3 is the number of cells per unit volume. Parameter
is a characteristic length so that
D is the average length, area, or volume per cell when D = 1, 2, or 3, respectively (Fig. 10A). When D = 2; the new Eqs. 57 and 58 reduce to the old Eqs. 53 and 56 for place fields.
|
|
was not too small compared with the characteristic length
= 1. In this example, it follows from Eq. 58 that
In other words, as the tuning width
(63)
increases, the error increases with
in the 1-D case, keeps constant in the2-D case, and decreases with 1/
in the 3-D case.
Minimal error under cosine tuning and comparison with population vector
Cosine tuning is widely used to model motor cortical cells (Georgopoulos et al. 1988
). We will show that under cosine tuning, the Bayesian method can achieve the best possible accuracy by reaching the theoretical lower bound, whereas the performance of the population vector is well above the bound.
where
(64)
is the angle for the test direction and
i is the preferred direction, which is drawn randomly from a uniform distribution around the circle, and
are constants. Suppose the spikes have Poisson statistics and different cells are independent. The average reconstruction error
(65)
Bayes for the Bayesian method Eq. 36 is expected to approach the minimal achievable error defined by Fisher information and Cramér-Rao bound, provided that a large number of cells is used (APPENDIX C). The minimal angular error (arc) derived by Fisher information is
where
(66)
1 is the 1-D correction factor (Table B1), N is the total number of cells, and
(67)
where
(68)
with A and B given by Eq. 65. In the 2-D case, corresponding to 3-D work space for reaching, Eq. 68 needs to be modified by replacing
(69)
1 by
2 and Q1 by
Because the error for the population vector in Eq. 68 and the error for the Bayesian method in Eq. 66 are both inversely proportional to the square root of the number of cells (see also footnote 1), the ratio
(70)
is independent of the number of cells. It can be verified analytically that J1Q1 < 1 regardless of the values of fmin, fmax, and the time window
(71)
. This means that here the Bayesian method is always more accurate than the population vector regardless of the choice of parameters.
Place cell reconstruction error compared with theoretical limits
COMPARISON WITH LOWER BOUND.
As shown in Table 1 and Fig. 5, our best reconstruction error by the Bayesian two-step method using all the cells was larger than the theoretical limit in Eq. 53 by a factor of about 2. To compute the theoretical lower bound, the width of the place fields was estimated using the first half of data by the method described in APPENDIX B. For animal 1, the average width of place fields was 

= 10.1 cm and
= 11.2 cm. For animal 2, 

= 8.6 cm and
= 9.6 cm. The mean firing rates of the 25 cells from animal 1 was fmean = 0.92 Hz; and the average of the 30 cells from animal 2 wasfmean = 1.09 Hz.
CORRECTION FACTOR AND SPATIAL DIMENSION.
The correction factor
D is the ratio of the mean error over the square root of the mean square error. Its theoretical value depends only on the dimension D of the work space, assuming Gaussian error distribution (APPENDIX B). As a consequence, its empirical value could be used to estimate the dimension D. For reconstruction errors obtained for all time steps by the Bayesian two-step method, the empirical value was 0.8705 for animal 1 and 0.8855 for animal 2, surprisingly close to the value of the 2-D correction factor
2 = 0.8862 but quite different from
1 and
3 (Table B1). However, the actual error distribution differed from the theoretical probability density
2(r) (Table B1 and Fig. 16) because it peaked at a smaller value than the standard deviation (r = s) and had a longer tail for large errors. This might be related to the fact that the rats run in 2-D space but within the confinement of a track so that the work space was neither purely two dimensional nor purely one dimensional.
COUNTERINTUITIVE EFFECT OF PLACE FIELD SIZE.
Imagine we have a fixed number of place cells all having place fields within a given region. Suppose we can change the size of the place fields without altering the positions of the place field centers and the peak firing rates. How would this change affect the accuracy of spatial information encoded in the population? The same question in a different form is how the minimal achievable reconstruction error depends on the place field size. This question is related to how the accuracy of spatial representation is affected by the broader place fields found in ventral hippocampus of rats (Jung et al. 1994
) and in the hippocampus of the genetically engineered mouse (see further). The size of place fields also tends to increase during repetitive running on a track (Mehta and McNaughton 1997).
of the place fields. Here we have assumed that the place fields are of Gaussian shape, the spikes are Poisson, and different cells are independent.
in Eq. 53 so that the width
is absent in the equivalent Eq. 56.
; Rotenberg et al. 1996
). A decease of the peak firing rate of place cells was reported explicitly by Rotenberg, Mayford, Hawkins, Kandel, and Muller (1996). According to Eq. 55 such a decrease also was implied in the observation that the mean firing rate of the genetically altered mice were the same as the normal (McHugh et al. 1996
). The increase of reconstruction error for these mice (McHugh et al. 1996
) may be attributed to a decrease in peak firing rate rather than to an increase in place field size per se because Eq. 56 suggests that reconstruction error should be independent of place field size. Equivalently, if the mean firing rate is fixed, it is equally valid to apply Eq. 53, which suggests that in this case the error should be larger for larger place fields. Instability of place fields is another possible source of reconstruction error (cf. Rotenberg et al. 1996
). A more detailed analysis of the data would be required for quantitative comparison.
General biological discussion
The minimal achievable reconstruction considered in the preceding sections is actually a general measure of how accurately a physical variable is encoded by a population of neurons, regardless of the decoding method used. As a consequence, the theoretical limit is valid not only for the computational problem of reconstruction but also for all neurophysiological and behavioral measures of acuity or accuracy in the sense of mean square error, as long as the information is derived solely from the same neuronal population.
Assuming a mean peak firing rate fmax ~ 15 Hz and a biologically plausible time window
(72)
~ 200 ms, we get N ~ 103. That is, a minimum of ~1,000 cells would be required. As another example, using up all the place cells in a rat, say, N ~ 105 (Amaral and Witter 1995
), what is the maximum area that can be covered under the acuity of 1 cm? Using the same parameters considered earlier, we find the total area is A ~ 106 cm2, the same as a 10 × 10 m square.
). In contrast, in3-D space or space of higher dimensions, a sharper tuning width implies worse information encoding. So the 2-D example of place cell is a critical case where the tuning width does not matter. To increase representation accuracy, one should use neurons with broader tuning functions to represent a variable higher than two dimensions, assuming that the peak firing rate is fixed. However, broader tuning width implies higher energy consumption because more neurons are activated at any given time. See Hinton, McClelland, and Rumelhart (1986), Baldi and Heiligenberg (1988)
, Zohary (1992)
, Gerstner, Kempter, van Hemmen, and Wagner (1996) for related discussions on the consequences of broad tuning.
| |
BIOLOGICAL PLAUSIBILITY OF RECONSTRUCTION METHODS |
|---|
Unified formulation
The main result in this section is that all the reconstruction methods discussed in this paper can be implemented as linear feedforward neural networks. This result is in sharp contrast with the general belief that reconstruction methods such as the Bayesian method or the template matching method are merely mathematical techniques without any biological relevance.
Biological implications
The unified formulation of reconstruction methods in the preceding section can be implemented readily by a biologically plausible neural network structure. The first computational step in Eq. 74 is linear combination of fixed basis functions. Here the coefficients are the numbers of spikes, which are proportional to the instantaneous firing rates within the given time window, which is plausibly a fraction of a second. The linear combination only requires a feedforward network in which the basis functions are implemented by the connection weights (Fig. 14). More precisely, the sum
The goal of this paper has been in part to compare different reconstruction methods, to improve the accuracy of these methods, and to assess their performance against the theoretical lower bound on reconstruction accuracy for all possible methods. For trajectory reconstruction based on spike trains from simultaneously recorded hippocampal place cells, probabilistic methods based on Bayesian formula were exceptionally accurate, especially when information about the reconstructed position from the previous time step was used to discourage discontinuities in the trajectory. In our best reconstruction with an average of up to ~30 spikes within the time window, the mean errors were in the range of the 5 cm intrinsic error of the position tracking system.
We thank H. Kudrimoti and W. E. Skaggs for valuable help in data processing and for many discussions, J. Wang for cluster cutting, M. S. Lewicki for discussions on Bayesian methods, A. Pouget for discussions on population coding and Fisher information, A. B. Schwartz and R. U. Muller for helpful suggestions on the manuscript, and two anonymous reviewers for useful comments. Data acquisition by J. L. Gerrard and C. A. Barnes.
This work was supported by Howard Hughes Medical Institute and National Institute of Neurological Disorders and Stroke Grant NS-20331.
In this section we modify the reciprocal basis by taking into account a nonuniform probability distribution for visiting different positions as well as the randomness in firing rate. As with the original reciprocal basis, the modified basis functions are optimal with respect to mean square error. However the reciprocal relationship is lost. For our dataset, the performances of these modified methods were sometimes better than the original reciprocal basis, as shown in Table 1.
In general, consider a basis function method
The reciprocal basis functions obtained by Eq. A2 are appropriate only if the animal visits each spatial position equally frequently. When the visiting probability is nonuniform, there are several ways to modify the matrix G or the basis functions. First the simple Eq. 21 is the same as the prescription
in all reconstruction methods considered above can be expressed as
and function
(73)
(x) can be written in the form
where as before ni is the number of spikes that cell i fired in a certain time window,
(74)
i(x) is a basis function associated with cell i. The function A(x) is an additive bias, which is independent of the activity of the cells and is reminiscent of a regularization term (Poggio et al. 1985
; Tikhonov and Arsenin 1977
). See also Zemel, Dayan, and Pouget (1997) for related discussion. The bias is needed only for implementing Bayesian methods. For all basis function methods, there is no bias term; that is, A(x)
0. See Table 2 for a summary.
View this table:
TABLE 2.
Unified formulation of reconstruction methods
fi(x)ni is equivalent to maximizing the sum
ni
i(x), where
To avoid taking the logarithm of zero, an arbitrary small positive number may be added to the tuning function fi. The other factors in the Bayesian Eq. 36 becomes the additive bias
(75)
which does not depend on activity ni. An intuitive interpretation of the bias is that it favors positions that animal visits frequently, that is, places with high P(x), and at the same time disfavors places that are over-represented by a large number of cells or cells with excessively high firing rates. The bias term becomes a constant and can be ignored if the animal is equally likely to visit all positions and the average firing rates of all cells are uniform across space.
(76)
In Bayesian method, the crucial part of the main Eq. 36 is the product
(77)
The product is like a logical "and" operation and the final peaks correspond to those positions where all contributing template functions in the product are not close to zero. By contrast, the sum behaves like a logical "or" operation and all peaks from contributing template functions are preserved. This difference explains why output distributions of basis function methods are usually much broader and more likely to contain multiple peaks than that of the Bayesian methods (Fig. 2).
(78)

View larger version (12K):
[in a new window]
FIG. 13.
An example of addition and multiplication of 2 basis functions. Addition yields a distribution with 2 peaks, yet multiplication yields only 1 peak. Key operation of Bayesian reconstruction method is multiplication and its output distribution is often sharper and less likely to contain multiple peaks than output distribution of basis function method, whose key operation is addition. See Fig. 2 for an actual example of comparison.
ni
i(x) is computed as
(79)

View larger version (10K):
[in a new window]
FIG. 14.
A unified network implementation for various reconstruction methods. First layer contains N cells tuned arbitrarily to an variable x. Cells in 2nd layer represent value of x by their locations in layer. Activity distribution in 2nd layer represents how likely it is for each possible value of x to be true value.
; Ullman and Basri 1991
), parietal spatial representation (Pouget and Segnowski 1994, 1997; Zipser and Andersen 1988
), modeling place cells (O'Keefe and Burgess 1996
; Redish and Touretzky 1996
; Zipser 1985
), and/or as a general strategy for neural representation (Anderson and Van Essen 1994
; Poggio 1990
; Zemel et al. 1997
).
).
). This approach has some biological plausibility because attractor dynamics might actually be used by the brain, for example, in the hippocampal system (McNaughton et al. 1996
; Samsonovich and McNaughton 1997
; Tsodyks and Sejnowski 1995a
; Zhang 1996
), the primary visual cortex (Ben-Yishai et al. 1995
; Somers et al. 1995
; Tsodyks and Sejnowski 1995b
), and motor cortex (Lukashin et al. 1996
).
![]()
CONCLUSIONS
![]()
ACKNOWLEDGEMENTS
![]()
APPENDIX A: RECIPROCAL BASIS WITH MODIFICATIONS
where N is the total number of cells, n = (n1, . . . , nN) are the numbers of spikes fired by these cells within the time window
(A1)
and (g1, . . . , gN) are the basic functions to be determined. Here
is included in the formula for the convenience of scaling; it does not change the result. The discrete version of the basis function gi(x) is the column vector gi, which is defined by first digitizing the position x and then concatenating the pixels. The vectors are collected into a single matrix G = [g1, . . . , gN]. As shown in Eq. 15, the simple reciprocal basis is related to the firing rate matrix F by
where the second equality is valid only when the matrix inverse is nonsingular.
(A2)
This is the formula used in our systematic comparison of different reconstruction methods in Fig. 5. Alternatively, we may choose
(A3)
A third possible formulation is
(A4)
Here three diagonal matrices have been used, which are defined as
(A5)
(A6)
and
(A7)
where K is the total number of position pixels, Pk is the probability for visiting position k, and
(A8)
i is the average firing rate of cell i.
These formulas are related. In Eq. A5, the D term is related to firing rate variability (see below). For large time window
, the D term vanishes, and Eq. A5 is reduced to Eq. A4. When the visiting probability Pk is the same for all positions, Eq. A4 is reduced to the original reciprocal basis Eq. A2.
Equation A5 has been derived by minimizing the error function
|
(A9) |
|
(A10) |
When firing rate variability is ignored, the equation
|
(A11) |
|
(A12) |
| |
APPENDIX B: MINIMAL RECONSTRUCTION ERROR BASED ON FISHER INFORMATION |
|---|
Basic approach
We provide an intuitive argument for using Fisher information and then sketch our method for estimating a lower bound for reconstruction error using the Cramér-Rao inequality.
Derivation of error formula in arbitrary dimensions
Following the same approach considered in the preceding section, we list the keys steps in deriving the general formula for minimal reconstruction error in arbitrary spatial dimension D. The mean firing rate of each cell is assumed to obey a Gaussian tuning function in a D-dimensional space
. Because of random variation in the exact number of spikes, the formula
is true only on average but not for each individual trial, When we use this single cell to estimate the position x from the actual firing rate n/
(B1)
and the shape of the tuning curve f(x), the fluctuation of the number of spikes would result in an error
x that obeys
Thus the mean squared error is
(B2)
In the last step, Poisson spiking statistics was assumed; that is, the variance
(B3)

n2
is equal to the mean
n
=
f(x). Here the reconstruction error is expressed in terms of the tuning function f(x) and its slope f
(x). It makes sense that a larger slope implies smaller error because a small change of position would imply a large and noticeable change in the firing rate (Fig. B1).

View larger version (12K):
[in a new window]
FIG. B1.
Intuitive argument for Fisher information in estimating position x from instantaneous firing rate and a known tuning curve f(x). Fluctuation in firing rate leads to an error
x in estimated position, which decreases with slope of tuning curve.
; Kay 1993
; Rao 1965
; Scharf 1991
; Seung and Sompolinsky 1993
; Zacks 1971
):
where the last equality follows from straightforward calculation under the assumption that the probability P(n|x) of firing n spikes at position x has Poisson distribution with the mean firing rate
(B4)
f(x).
whose validity originates from the universal Cauchy-Schwartz inequality. This result agrees with the intuitive Eq. B3.
(B5)
This leads to an effective composition rule of minimal errors:
(B6)
where
(B7)
2 is the minimum achievable value of 
x2
by using all the cells from 1 to N, and
2i is defined as the same error using only cell i.
where J is the Fisher information for variable x alone. The minimal reconstruction error is estimated as
(B8)
2
. The final result is given by Eq. 56 or Eq. 53.
(B9)
The Fisher information with respect to a single dimension, x1 for example, is
(B10)
where the first equality is a definition, the second equality follows from the Poisson statistics Eq. B10 and the last equality follows from the Gaussian tuning Eq. B9.
(B11)
where ci is the center of the Gaussian tuning function for cell i, and
(B12)
is the number of cells per unit volume in D-dimensional space (see Eq. 62). The sum is replaced by the integral under the assumption that the number of cells N is large and the centers c1, . . . , cN are uniformly distributed in space. Because of the symmetry with respect to different dimensions, we can use the same argument in Eq. B8 to obtain the Fisher information for all N cells with all dimensions included
If different cells have different peak firing rate fmax and tuning width
(B13)
, one can easily see that Eq. B13 should be modified to
where the average
(B14)
is with respect to the population of cells with different peak firing rate and tuning width. Assuming independence of the tuning parameters, we have
Finally
(B15)
where the correction factor FD is given by Eq. B26. The result is Eq. 58. In addition, it is easy to derive Eq. 61, which immediately leads to the equivalent Eq. 57.
(B16)
Estimating tuning width
To evaluate the Fisher information, the width
in the Gaussian tuning model needs to be estimated. Given the spatial distribution of the average firing rate f(x, y) measured experimentally for a single cell, we want to estimate the tuning width
assuming a Gaussian model
|
(B17) |
. The first method uses entropy or Shannon information
|
(B18) |
In the above, all averages
(B19)
should be interpreted as follows. Suppose each unit square in the grid has the area h2 and f(i, j) is the value of average firing rate at grid point (i, j), then
f
= h2
f(i, j) and so on. Equations B18 and B19 are closely related to the information content measure and sparsity measure considered by Skaggs et al. 1996
. One difference is that here the probability for spatial occupancy should not be included.
). This linearity is consistent with the Gaussian model in Eq. B17, which gives
Because A depends linearly on ln fcutoff, both
(B20)
and fmax can be obtained by linear regression.
Distribution of reconstruction error and correction factor
A correction factor is needed because the Cramér-Rao bound only gives an estimate of the variance of unbiased reconstruction, that is, the average square of the reconstruction error
As shown in Figs. 10 and 12, Bayesian reconstruction in Eq. 36 can achieve the best possible accuracy in the sense of mean square error defined by Fisher information and the Cramér-Rao bound, provided that the conditions of Poisson spike statistics and independence of different cells are satisfied. In such a situation, the optimality of Bayesian reconstruction method can be verified directly as follows. This analysis can provide an explicit estimate of the reconstruction error in each individual trial. In general, for a uniform prior probability distribution, Bayesian reconstrution is equivalent to maximum likelihood estimation, which can reach the Cramér-Rao bound in the limit of large sample (Cramér 1946 Consider the Fisher information for a 1-D reconstruction under a uniform prior probability:
Consider how fluctuation in the instantaneous firing rate ni/
of the uncut Gaussian by a modified variance method. First, concatenate the pixels in the 2-D distribution f(i, j) to obtain a 1-D distribution f(i) and then assume a 1-D Gaussian distribution to obtain the 1-D tuning width
Here
(B21)
f
= h
f(i) and so on, with h being the interval in the1-D grid. The 2-D tuning width
should equal approximately
1 divided by the number of grid points spanned by the width of the track. This is because when a 2-D Gaussian is cut by an arbitrary straight line, the resultant 1-D distribution is a Gaussian of the same tuning width. The above procedure is valid if different 1-D slices cut in parallel with the track have approximately the same peak rate.
View this table:
TABLE B1.
Correlation factor for estimating minimal reconstruction error
r2
. Its square root
differs slightly from the direct average of reconstruction error
r
. The task of this section is to determine the ratio
D =
r
/
analytically.
(B22)
(B23)
D(r) for reconstruction error can thus be obtained from Eqs. B22 and B23:
(B24)
For example, when m = 2 the formula has a particularly simple form
(B25)
r2
= Ds2, and (r/s)2 obeys the chi square distribution with D degrees of freedom.
r
/
, is thus
Note that
(B26)
D depends only on the spatial dimensionD. The width parameter s for the error distribution has disappeared. Although
D < 1 for all dimension D, it approaches 1 monotonically as D
.
).

View larger version (17K):
[in a new window]
FIG. B2.
Distributions of reconstruction errors in 1-, 2-, and 3-D spaces. Continuous curves are theoretical distributions in Table B1, which agree nicely with histograms obtained by simulations of Bayesian reconstructions repeated for 5,000-10,000 trials, assuming Gaussian tuning function and Poisson spike statistics. Tuning width was
= 3. All other parameters were identical to those in Fig. 10.
![]()
APPENDIX C: OPTIMALITY OF BAYESIAN RECONSTRUCTION
).
where fi(x) is the tuning curve for cell i and
(C1)
is the time window (see APPENDIX B). Let
x be the reconstruction error of the Bayesian method, which seeks the position x that maximizes
(C2)
(C3)
when a large number of cells is used. That is, when the Cramér-Rao lower bound is achieved.
(C4)
affects the estimated x because without the fluctuation the reconstruction would always be perfect. By definition, the average of ni/
is fi(x). Calling the difference
i, we write
Replace x by x +
(C5)
x in Eq. C3 and then make Taylor expansion, keeping only the first-order term of
x . After straight forward algebra we find
This shows how the estimated position depends on the firing rate fluctuation in an individual trial. The above approximation procedure is valid if a large number of cells is used so that the error is small. Because by definition
(C6)

i
= 0, it follows from Eq. C6 that the estimated position must be unbiased, i.e., 
x
= 0. Squaring Eq. C6 and averaging yield
where all cross terms
(C7)

i
j
with i
j have vanished because of the independence of different cells. By using the definition Eq. C1 and Poisson spike statistics 
2i
= fi(x)/
, we obtain Eq. C4 from Eq. C7 as desired.
| |
APPENDIX D: ERROR OF POPULATION VECTOR UNDER COSINE TUNING |
|---|
2-D work space
Consider N cells with cosine tuning functions and preferred rections scattered randomly around the circle with a uniform distribution. Let the spikes of different cells occur independently of each other and obey a Poisson distribution. The task is to estimate the average angular error of the population vector.
T, we mean that the preferred directions of all cells are fixed and the average is performed over repetitive trials where the only randomness is the Poisson statistics of the spikes; by average over configurations, denoted by
C, we mean averaging over random choices of the preferred directions. The error of population vector comes from both the randomness of the spikes and the randomness of the preferred directions.
where A and B are constants (cf. Eq. 64) and vector ui = (uix, uiy) is of unit length and represents the preferred direction of cell i. The population vector is defined as
(D1)
with the coefficients given by
(D2)
where ni is the number of spikes for cell i within the time window
(D3)
and the background firing rate B is subtracted to improve reconstruction accuracy (Georgopoulus et al. 1988). The x component of the population vector is estimated as follows
in which the two approximation steps are valid if the number of cells N is large. In the above derivation, we have evaluated several averages
(D4)
follows directly from the definition Eq. D3 and the fact that
(D5)
ni
T =
fi; the equations
are a consequence of the uniform distribution of preferred direction around the circle.
(D6)
Ux and the angular error of the population vector can be approximately as
where Ux is a constant given by Eq. D4 and Uy =
(D7)
ciuiy with ci given by Eq. D3. It follows immediately that
After expanding
(D8)
U2y
T, evaluate each term using
which follows from Poisson spike statistics and the independence of different cells. Next, substitute
(D9)
ni
T =
fi =
(Auix + B) into the expansion and note that all terms containing odd powers such as uixu2iy and uixujxuiyujy will vanish after averaging over configurations. Thus we need only keep terms with even powers and obtain
which can be evaluated by applying Eq. D6 again. The final result of the average angular error is
(D10)
1
=
1
with Q1 given by Eq. 69.
3-D work space We need to consider only a single test direction along the x-axis: u = (1, 0, 0). The method is very similar to that in the case of 2-D work space, except that now
|
(D11) |
|
(D12) |
2
=
2
with Q2 given by Eq. 70.
| |
FOOTNOTES |
|---|
1
For comparison, we tested square root law on data taken from Fig. 3 in paper by Georgopoulos, Kettner and Schwartz (1988), where error of population vector was computed by using up to 475 motor cortical cells. Errors were given in terms of 95% confidence cone, which was expected to be proportional to mean angular error. Square root law held quite well: on log-log scale, a linear fit yielded a slope of
0.519 with a correlation coefficient of 0.9984, as compared with theoretical slope of
1/2.
Address reprint requests to K. Zhang.
Received 27 May 1997; accepted in final form 25 August 1997.
| |
REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
G. Santhanam, B. M. Yu, V. Gilja, S. I. Ryu, A. Afshar, M. Sahani, and K. V. Shenoy Factor-Analysis Methods for Higher-Performance Neural Prostheses J Neurophysiol, August 1, 2009; 102(2): 1315 - 1330. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. van Duuren, G. van der Plasse, J. Lankelma, R. N. J. M. A. Joosten, M. G. P. Feenstra, and C. M. A. Pennartz Single-Cell and Population Coding of Expected Reward Probability in the Orbitofrontal Cortex of the Rat J. Neurosci., July 15, 2009; 29(28): 8965 - 8976. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Lisman and A.D. Redish Prediction, sequences and the hippocampus Phil Trans R Soc B, May 12, 2009; 364(1521): 1193 - 1201. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Diba and G. Buzsaki Hippocampal Network Dynamics Constrain the Time Lag between Pyramidal Cells across Modified Environments J. Neurosci., December 10, 2008; 28(50): 13448 - 13456. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. Cunningham, B. M. Yu, V. Gilja, S. I. Ryu, and K. V. Shenoy Toward Optimal Target Placement for Neural Prosthetic Devices J Neurophysiol, December 1, 2008; 100(6): 3445 - 3457. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. van Duuren, J. Lankelma, and C. M. A. Pennartz Population Coding of Reward Magnitude in the Orbitofrontal Cortex of the Rat J. Neurosci., August 20, 2008; 28(34): 8590 - 8603. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. V. Anderson and A. J. Fuglevand Probability-Based Prediction of Activity in Multiple Arm Muscles: Implications for Functional Electrical Stimulation J Neurophysiol, July 1, 2008; 100(1): 482 - 494. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Thiel, M. Greschner, C. W. Eurich, J. Ammermuller, and J. Kretzberg Contribution of Individual Retinal Ganglion Cell Responses to Velocity and Acceleration Encoding J Neurophysiol, October 1, 2007; 98(4): 2285 - 2296. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Q. Quiroga, L. Reddy, C. Koch, and I. Fried Decoding Visual Inputs From Multiple Neurons in the Human Temporal Lobe J Neurophysiol, October 1, 2007; 98(4): 1997 - 2007. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. M. Yu, C. Kemere, G. Santhanam, A. Afshar, S. I. Ryu, T. H. Meng, M. Sahani, and K. V. Shenoy Mixture of Trajectory Models for Neural Decoding of Goal-Directed Movements J Neurophysiol, May 1, 2007; 97(5): 3763 - 3780. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Karmeier, J. H. van Hateren, R. Kern, and M. Egelhaaf Encoding of Naturalistic Optic Flow by a Population of Blowfly Motion-Sensitive Neurons J Neurophysiol, September 1, 2006; 96(3): 1602 - 1614. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. J. Sejnowski and O. Paulsen Network Oscillations: Emerging Computational Principles J. Neurosci., February 8, 2006; 26(6): 1673 - 1676. [Full Text] [PDF] |
||||
![]() |
A. Amarasingham, T.-L. Chen, S. Geman, M. T. Harrison, and D. L. Sheinberg Spike Count Reliability and the Poisson Hypothesis J. Neurosci., January 18, 2006; 26(3): 801 - 809. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Fuhs, S. R. VanRhoads, A. E. Casale, B. McNaughton, and D. S. Touretzky Influence of Path Integration Versus Environmental Orientation on Place Cell Remapping Between Visually Identical Environments J Neurophysiol, October 1, 2005; 94(4): 2603 - 2616. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Karmeier, H. G. Krapp, and M. Egelhaaf Population Coding of Self-Motion: Applying Bayesian Analysis to a Population of Visual Interneurons in the Fly J Neurophysiol, September 1, 2005; 94(3): 2182 - 2194. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. E. Kass, V. Ventura, and E. N. Brown Statistical Issues in the Analysis of Neuronal Data J Neurophysiol, July 1, 2005; 94(1): 8 - 25. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. B. Averbeck, M. V. Chafee, D. A. Crowe, and A. P. Georgopoulos Parietal Representation of Hand Velocity in a Copy Task J Neurophysiol, January 1, 2005; 93(1): 508 - 518. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Grunewald and E. K. Skoumbourdis The Integration of Multiple Stimulus Features by V1 Neurons J. Neurosci., October 13, 2004; 24(41): 9185 - 9194. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Jarosiewicz and W. E. Skaggs Hippocampal Place Cells Are Not Controlled by Visual Input during the Small Irregular Activity State in the Rat J. Neurosci., May 26, 2004; 24(21): 5070 - 5077. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. E. Brockwell, A. L. Rojas, and R. E. Kass Recursive Bayesian Decoding of Motor Cortical Signals by Particle Filtering J Neurophysiol, April 1, 2004; 91(4): 1899 - 1907. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Schneidman, W. Bialek, and M. J. Berry II Synergy, Redundancy, and Independence in Population Codes J. Neurosci., December 17, 2003; 23(37): 11539 - 11553. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. B. Averbeck and D. Lee Neural Noise and Movement-Related Codes in the Macaque Supplementary Motor Area J. Neurosci., August 20, 2003; 23(20): 7630 - 7641. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. M. Seifert and A. J. Fuglevand Restoration of Movement Using Functional Electrical Stimulation and Bayes' Theorem J. Neurosci., November 1, 2002; 22(21): 9465 - 9474. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Jensen and J. E. Lisman Position Reconstruction From an Ensemble of Hippocampal Place Cells: Contribution of Theta Phase Coding J Neurophysiol, May 1, 2000; 83(5): 2602 - 2609. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Tresch and O. Kiehn Population Reconstruction of the Locomotor Cycle From Interneuron Activity in the Mammalian Spinal Cord J Neurophysiol, April 1, 2000; 83(4): 1972 - 1978. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Nadasdy, H. Hirase, A. Czurko, J. Csicsvari, and G. Buzsaki Replay and Time Compression of Recurring Spike Sequences in the Hippocampus J. Neurosci., November 1, 1999; 19(21): 9497 - 9507. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Laurent A Systems Perspective on Early Olfactory Coding Science, October 22, 1999; 286(5440): 723 - 728. [Abstract] [Full Text] |
||||
![]() |
D. Jancke, W. Erlhagen, H. R. Dinse, A. C. Akhavan, M. Giese, A. Steinhage, and G. Schoner Parametric Population Representation of Retinal Location: Neuronal Interaction Dynamics in Cat Primary Visual Cortex J. Neurosci., October 15, 1999; 19(20): 9016 - 9028. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. B. Stanley, F. F. Li, and Y. Dan Reconstruction of Natural Scenes from Ensemble Responses in the Lateral Geniculate Nucleus J. Neurosci., September 15, 1999; 19(18): 8036 - 8042. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. W. Goodwin and H. E. Wheat Effects of Nonuniform Fiber Sensitivity, Innervation Geometry, and Noise on Information Relayed by a Population of Slowly Adapting Type I Primary Afferents from the Fingerpad J. Neurosci., September 15, 1999; 19(18): 8057 - 8070. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. M. Maynard, N. G. Hatsopoulos, C. L. Ojakangas, B. D. Acuna, J. N. Sanes, R. A. Normann, and J. P. Donoghue Neuronal Interactions Improve Cortical Population Coding of Movement Direction J. Neurosci., September 15, 1999; 19(18): 8083 - 8093. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Zhang and T. J. Sejnowski A Theory of Geometric Constraints on Neural Activity for Natural Three-Dimensional Movement J. Neurosci., April 15, 1999; 19(8): 3122 - 3145. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. E. Skaggs and B. L. McNaughton Spatial Firing Properties of Hippocampal CA1 Populations in an Environment Containing Two Visually Identical Regions J. Neurosci., October 15, 1998; 18(20): 8455 - 8466. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. N. Brown, L. M. Frank, D. Tang, M. C. Quirk, and M. A. Wilson A Statistical Paradigm for Neural Spike Train Decoding Applied to Position Prediction from Ensemble Firing Patterns of Rat Hippocampal Place Cells J. Neurosci., September 15, 1998; 18(18): 7411 - 7425. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Jancke Orientation Formed by a Spot's Trajectory: A Two-Dimensional Population Approach in Primary Visual Cortex J. Neurosci., July 15, 2000; 20(14): RC86 - RC86. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |