|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1Department of Statistics, University of Connecticut, Storrs, Connecticut; 2Department of Statistics, Florida State University, Tallahassee, Florida; and 3Department of Organismal Biology and 4Anatomy, University of Chicago, Chicago, Illinois
Submitted 28 April 2006; accepted in final form 15 November 2006
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Template matching has been the basis of many pattern-identification procedures (Abeles et al. 1988
, 1993
; Chi et al. 2003
; Dave and Margoliash 2000
; Louie and Wilson 2001
; Nádasdy et al. 1999
). In these procedures, a template is constructed based on exemplar spike trains that exhibit a certain pattern of interest. During data analysis, each segment of the data is scored in terms of its similarity to the template. Segments with high similarity scores are interpreted as exhibiting the pattern of interest and are thus extracted for further analysis.
In most of the current procedures, the template is treated as nearly time locked to stimuli or behavioral events. Except for some local variability, such as random perturbation in spike timing, global variability is expressed as uniform timescaling, which multiplies all the interspike intervals (ISIs) by a single factor (Chi et al. 2003
; Louie and Wilson 2001
; Nádasdy et al. 1999
). However, in some systems spiking activity can exhibit "time warping," i.e., nonuniform time compression or expansion on a relatively large timescale (Dave and Margoliash 2000
; Nádasdy et al. 1999
). As far as we know, the procedure introduced by Victor and Purpura (1996)
is the only one that explicitly addresses time warping. The procedure was developed to analyze the precision of temporal coding during specific short time intervals of sensory stimulation. In the procedure, the ISIs are assumed to vary independently and similarity scores are computed with dynamic time warping (DTW). Such scores are known as "text edit distances" in the field of computer science (Wagner and Fischer 1974
). Similar procedures based on text edit distances are extremely useful in bioinformatics (Aach and Church 2001
; Bishop and Thompson 1986
; Needleman and Wunsch 1970
; Waterman et al. 1987
). Sequences of discrete behaviors such as syllables in birdsongs are also occasionally analyzed using the text edit distance (Margoliash et al. 1991
; Sankoff and Kruskal 1983
). DTW is also commonly used in speech recognition (Huang and Gray 1988
; OShaughnessy 1999
; Rabiner and Juang 1993
), birdsong recognition (Anderson et al. 1996
; Ito and Mori 1999
; Kogan and Margoliash 1998
), and other applications.
The variability of spiking activity at local versus relatively large timescales can be roughly described as follows (Fig. 1A). Think of the occurrences of a pattern of interest as "texts." The "codewords" of the texts are short intervals with spikes. The definition of codeword should be problem dependent. For different systems, a codeword may contain a single spike, multiple spikes, or a high-frequency burst. Within the codewords, the detectable variability can be reasonably characterized as random local perturbation in spike timing and spike count. Between the codewords, on the other hand, the gaps often have quite visible nonuniform variability, resulting in time warping. The gaps need not be activity free. Instead, they can contain "noise" spikes that do not repeat in different occurrences of the pattern.
|
The local comparison should be based on specific representations of spiking activity. We shall consider two types of representations. The point process representation registers spike times continuously. As a result, the neural activity is characterized as processes of points in time. The binary representation divides the neural activity into small time bins, and assigns 1 to a bin if there is at least one spike in it and 0 otherwise. As a result, the neural activity is characterized as sequences of 0 and 1 s. Although the local comparison algorithms for these two representations look quite different, they derive from nearly identical arguments based on likelihood tests.
By using DTW, the template-based approach automatically "parses" each identified data segment into codewords and gaps in between, thus allowing a more detailed interpretation of the identification results (Fig. 1A). Such parsing can yield additional insight. If the spiking activity pattern is preselected in association with a certain behavior, and each of its codewords is reliably associated with a landmark event during the behavior, then the parsing provides a way to infer from the neural activity when the landmark events occur within the overall behavior. Presently, the dominant procedures for inference of the time course of behavior are based on regression or state-space models (Brockwell et al. 2004
; Carmena et al. 2003
; Serruya et al. 2002
; Smith and Brown 2003
; Wu et al. 2004
, 2006
). They are designed to estimate spatial trajectories of motor output with minimized average error over time. The estimation of the timing of events during motor output can therefore provide a useful new perspective to the decoding of neuronal activity.
In what follows, under METHODS, we describe the identification procedures and present pseudocode for their implementations. The procedure for the point process representation was implemented in C++, and that for the binary representation in MATLAB (The MathWorks, Natick, MA). Under RESULTS, we illustrate the utility of the procedures with two brief neurobiological examples. The procedure for the point process representation is applied to a single-unit (SU) recording collected during sleep from the nucleus robustus arcopallium (RA) of zebra finches (Dave and Margoliash 2000
). The procedure for the binary representation is applied to multielectrode recordings from the primary motor cortex (MI) of a behaving macaque monkey during a target pursuit task (Hatsopoulos et al. 2004
; Serruya et al. 2002
). In this example, the times when a target was reached and the following target appeared in each successful trial can be estimated with error <0.2 s. This is somewhat surprising in view of the relatively high level of spike-timing variability in MI. For each procedure, we conduct a sensitivity analysis, demonstrating that the procedures are robust.
| METHODS |
|---|
|
|
|---|
Because of the nature of the recording technology and cortical activity, the levels of accuracy of these two recordings were very different. First, in terms of signal-to-noise ratio (SNR), the accuracy of separating action potentials from background was much higher for the recordings from RA. The SNRs of spikes from SU recordings in the RA of zebra finches are generally >10, whereas those from the array recordings in macaque monkeys can be as low as 3. The difference in SNRs result from the different experimental designs and thus different recording technologies are used: movable but small numbers (one to four) of electrodes in the bird versus a fixed array of large numbers (>100) of electrodes in the monkey. Second, in terms of neurophysiological characteristics, the spike-timing variability of zebra finch RA units is much lower as well (Yu and Margoliash 1996
; cf. Hatsopoulos et al. 2004
). The differences are likely to influence the performance of pattern identification. The sampling rate of recording is 20 kHz for the RA data and 30 kHz for the MI data.
Part one: point process representation
The procedure described in this part is suitable when 1) the pattern of interest consists of well-defined spike bursts; 2) across occurrences of the pattern, the bursts exhibit low variability; and 3) the data being analyzed constitute a single spike train, consisting of time registry of spikes accurately estimated from the SU recording. Under these conditions, an exemplar spike train expressing the pattern may be used as a template. Under RESULTS, more detail will be given on how templates are selected for SU activity in RA.
The procedure directly compares the spike times. Figure 1B illustrates how to compare a template with data registered around a time location. In general, the template need not start or end with a spike. Together with its bursts, the intervals at the beginning and at the end of the template and those between the bursts are all parts of the template and will be referred to as interburst intervals (IBIs). The bursts are used as the codewords of the template pattern. The IBIs can be thought of as independent "springs," and the bursts as relatively incompressible "blocks" interconnected by the springs. By compressing or stretching the IBIs, the template bursts are shifted accordingly and compared with the data registered in the corresponding time intervals. Given a set of modified IBIs, the score of similarity between the warped template and the data segment aligned with it is a sum of the similarity scores associated with the bursts, minus a weighted sum of the amount of changes in the IBIs and the number of spikes in the data segment that do not match any template spikes. Optimizing the weighted sum over a range of modified IBIs then yields the optimal score at the time location. We next consider appropriate similarity scores and their computation.
TEMPLATE.
A template spike train has to be registered in an interval [0, D] (Fig. 1B). If the template has n bursts, then in general it has n + 1 IBIs, including the interval between its onset (t = 0) and first spike, and the interval between its last spike and offset (t = D). A subset of template spikes is a burst if all its ISIs are less than a specified threshold. Fix parameter
> 0, which controls the precision for comparing spike times. Let Hn+1 = D, T0 = 0, and for 1
i
n
![]() |
LOCAL SCORES FOR TEMPLATE BURSTS.
First, we fix a kernel function K(x). This will be used to measure the temporal discrepancy between a data spike and a template spike. Some commonly used kernel functions defined on [1, 1] are constant 1 ("square" kernel), 1 |x| ("triangular"), 1 x2 ("Epanechnikov"), and (1 x2)2 ("biweight"). To evaluate the similarity between the ith template burst and a data burst s1, s2,... in [x, x + Bi], the idea is to translate both to [0, Bi] so they can be compared directly. If the ith template burst consists of spikes t1, t2,..., then the similarity score between the two bursts is
![]() | (1) |
0 the maximum absolute negative contribution a nonmatching data spike can make to the local score. From Chi et al. (2003)
i.
GLOBAL SCORE FOR THE ENTIRE TEMPLATE.
Denote by N(a, b) the number of data spikes between a and b. For the ith IBI, let Gi(x)
0 be a cost function for its warping. Suppose the warped template has its IBIs changed by v1, ..., vn+1. Denote
![]() |
![]() | (2) |
i
n + 1. Data spikes in these intervals are counted as "noise" and reduce the similarity. This gives rise to the third term in Eq. 3.
By the probabilistic interpretation given in the APPENDIX, the overall similarity score assigned to the time location x is defined as
![]() | (3) |
1(x), ...,
n+1(x) be a solution. By Fig. 1B, we can establish a one-to-one correspondence between the intervals I1, ..., In+1 in the template and I1, ..., In+1 in the data segment and identify the latter as a set of optimal matching IBIs. Then we can identify the intervals B1, ..., Bn between I1, ..., In+1 as a set of optimal matching burst intervals. The interlacing I1, B1, I2, ..., Bn, In+1 form a partition of the data segment between x and the right endpoint of In+1, which is an optimal match to the template that one can possibly find at time x under the similarity score. The optimal matching IBIs Ii and burst intervals Bi are computed as follows
![]() |
![]() | (4) |
DTW FOR THE GLOBAL COMPARISON.
To compute L(x) and
1(x), ...,
n+1(x), let
![]() |
![]() |
![]() |
![]() |
1(x), ...,
n+1(x) can be computed by DTW. Table 1 summarizes the procedure.
|
1n+1(x)] are extracted.
, where
is a given threshold;
r}, with r the radius for local comparison; and
r. This is to make sure that L is not a constant in [x r, x + r]. A reasonable choice for r is D, the duration of the template. When some of the extracted segments overlap in time, a disjoint subset of the segments may be selected.
The selection of the threshold value
in general is a difficult problem. One important reason is that the stochastic properties of the data can be difficult to grasp. In RESULTS we will describe an ad hoc method for the RA activity that tries to simulate the noise in the data. A different ad hoc method was used for the data collected from the MI of monkey, which will also be described in RESULTS.
ADAPTIVE SELECTION OF PARAMETER VALUES.
The parameters
and
can be set based on estimation. For
, we first compute the mean value d of the template ISIs, which can be regarded as the minimum resolution to distinguish template spikes. If the squarekernel is used in Eq. 1 to compute the local scores, then set
= d/2. To use a different kernel K,
has to be adjusted. The reason is that except for x = 0, K(x) is strictly less than the square kernel. Without adjusting
, the kernel puts less total weight to data spikes that match the template spikes, resulting in artificially lower similarity scores. We set
![]() | (5) |
From Eq. A1 in the APPENDIX,
log (
0/
)/log (
'/
0), where
' is the firing rate within template bursts,
is the firing rate within template IBI, and
0 is the mean firing rate in nonmatching spike trains. Because for a Poisson process the density is the reciprocal of the mean ISI, we set the value of
based on the approximation
![]() | (6) |
is adaptive.
NUMERICAL ISSUES.
In data analysis, L(x) is computed on a discrete time grid jd, with j = 0, ±1,..., and d > 0 a fixed step size. Also, the warping of each IBI has to be bounded. Table 2 contains a version of the algorithm that takes into account these constraints. It requires that Fi(x0 + jd) be computed and stored for a given x0. We apply the subroutine below to compute Fi(x0 + jd). Let
a
denote the smallest integer no less than a, and
a
the largest integer no greater than a.
i(s x0 jd), where j0 =
(s x0
)/d
and j1 =
(s x0)/d
.
|
The procedure described in this part is suitable when 1) the spiking activity is recorded from multiple single units and exhibits significant amount of variability in spike timing across trials and/or 2) the neural signals have lower SNRs compared with SU recordings; typically, the spike trains are obtained from multielectrode recordings by spike sorting. Under these conditions, the spike train of a single unit typically cannot represent the variability in a pattern of interest and therefore it alone cannot be directly used as a template for pattern identification. Instead, we shall construct a template as a linear filter that is made up of log-likelihood ratios estimated from sample spike trains of all the recorded single units. The statistical model involved in the construction is an integral part of the pattern identification procedure and is described below.
BASIC SETUP. Suppose the spiking activity pattern we are interested in is associated with a sequence of n landmark events during the course of a certain type of behavior. For example, if a behavioral task requires a monkey to reach n targets in sequence, then for i = 1, ..., n, the ith event may be defined as the cursor reaching the ith target (see more detail in RESULTS). The n 1 intervals between the times of consecutive events during an occurrence of the event sequence will be referred to as interevent intervals (IEIs).
We suppose that the multielectrode spike trains are recorded from the same set of C units and each data spike has been associated with one of the units by spike sorting. Fix parameter
> 0 as the duration of time bin. For c = 1, ..., C, the binary representation for thespiking activity of the cth unit is a binary sequence xc(1), xc(2), xc(3), ..., such that
![]() |
is small enough, almost all time bins can have at most one spike, and the binary representation induces little loss of information on spike count.
To start, one needs to collect a sample of multielectrode spike trains associated with multiple occurrences of the event sequence. For each sample spike train, if T1, ..., Tn are the associated event times, then the segments of the spike train in surrounding intervals [Ti A
, Ti + B
] are registered as codewords, where A, B
0 are fixed integers. As in the case of point process representation, the codewords cannot be warped in time.
STATISTICAL MODEL. The procedure is based on the following assumptions.
, t + B
] are independent, such that the activity of each unit forms a Poisson process. For i = 1, ..., n, c = 1, ..., C, and j = A, ..., B, let pi,c(j) be the probability that the cth unit generates spikes in the jth time bin away from the ith event. The probability can be estimated using a peristimulus time histogram method. That is, for the ith event and cth unit, we first align the sample spike trains of the unit at the event, then estimate pi,c(j) as the fraction of those sample spike trains that have spikes in the jth time bin relative to the time of the event.
For i = 1, ..., n 1, we use q(t,
i) to model the probability density of the duration of the ith IEI. The parameter
i can be estimated based on the sample as well.
LOCAL SCORES FOR SPIKING ACTIVITY PATTERNS AROUND EVENTS.
To identify the occurrences of the whole event sequence, the first step is to evaluate the likelihood of the occurrence of a single event in each time bin. The derivation of the likelihood is given in the APPENDIX. Here we consider the computational procedure only. For the ith event and the cth unit, we introduce
i,c(j), such that for j = A, ..., B, it is evaluated as
![]() |
![]() | (7) |
i,c*xc(t) is the tth entry of the convolution between
i,c and xc. The derivation of the formula is given in the APPENDIX. We therefore use
![]() |
i,c acts as a discrete linear filter. By convolving
i,c with the entire binary sequence xc, the score is obtained for all time bins.
GLOBAL SCORE FOR THE EVENT SEQUENCE.
For the ith IBI, i = 1, ..., n 1, and t
1, let Gi(t) = log q(t
,
i). Then for any time bin t and I1, ..., In1, the logarithm of the probability that the tth time bin contains the onset of an occurrence of the event sequence with IEIs I1
, ..., In1
can be expressed as l(t, I1, ..., In1) plus a constant, where
![]() | (8) |
![]() | (9) |
Denote by Î1(t), ..., În1(t) the maximizers associated with L(t). As in the case of the point process representation, L(t), Î1(t), ..., În1(t) can be computed using DTW. The algorithm is sketched in Table 3.
|
![]() |
. In practice, because the score function L(t) can be quite noisy, it may need to be smoothed to extract meaningful local maximum points.
To summarize, the numerical procedure for the binary representation starts with estimating pi,c(j) and q(x,
i). Next the procedure constructs filters
i,c for i = 1, ..., n, c = 1, ..., C, and also constructs Gi(x) for i = 1, ..., n 1. The scores and potential occurrences of the event sequence are then computed by the algorithm in Table 3.
| RESULTS |
|---|
|
|
|---|
We applied the procedure for point process representations to the SU spiking activity in RA of adult male zebra finch. The data were high signal-to-noise ratio recordings from chronically implanted movable electrodes in RA, collected in a previous study (Dave and Margoliash 2000
). Male birds directed their songs toward females housed in adjacent half-cages.
For each RA unit, the goal was to identify segments in its spontaneous spiking activity during sleep that exhibited temporal patterns similar to its premotor activities associated with daytime singing. Such segments have been interpreted as replay of premotor activity during sleep that may be involved in learning and maintaining song (Margoliash 2002
). The female-directed songs of adult male zebra finches are highly stereotyped intense behaviors (Sossinka and Böhner 1980
), acquired through learning (Brenowitz et al. 1997
), and are dynamically maintained in adults by auditory feedback (Nordeen and Nordeen 1992
). The songs constitute sequences of "syllables" repeated in a fixed pattern to form one or more repeated "motifs." Premotor spiking activity in RA associated with song renditions are highly stereotyped and exhibit precise temporal patterning within bursts, with the SD in spike timing as low as 0.2 ms (Chi and Margoliash 2001
; Yu and Margoliash 1996
). Therefore the point process representation is preferred to the binary representation for RA activity.
For these data, the spontaneous activity of a unit during sleep can be probed with one or more templates. Once a signature motif of the bird was identified, any spike train of the unit associated with a rendition of the motif could serve as a template (Fig. 2, A and B). To partition a template into bursts and IBIs, we aligned the spike trains associated with all the identified renditions of the motif by their onset times (Fig. 2C). The raster plot revealed common temporal structures of the spike trains that had a many-to-many correspondence to the acoustic structures of the motif. Based on the structures, we partitioned the spike trains into bursts and IBIs by the criteria that adjacent bursts be at least 20 ms apart. Within the bursts of these spike trains, the mean ISI was d' = 3.06 ± 2.68 ms, and within their IBIs the mean ISI was d = 73.02 ± 55.97 ms. The same partition was obtained when we applied a more refined alignment procedure which minimized the total L1 distances of the spike trains (Chi and Margoliash 2001
).
|
To implement the procedure in Table 2, we used the biweight kernel to compute local similarity scores by Eq. 1. The parameter to control the temporal precision for spike matching was
= d'/2 x 1.875 = 3.06/2 x 1.875 = 2.869 ms by Eq. 5. To calculate the global similarity scores by Eq. 2, we set Gi(x) = 0 and fixed the maximum proportion of warp at 1/5 for each template IBI. The score function was computed on a grid with step size 0.5 ms.
During the spontaneous activity of the unit in sleep, the mean ISI was d0 = 49.05 ± 20.85 ms. By Eq. 6, the penalty on each nonmatching spike was
= 0.1434. We then calculated the similarity scores across the data to identify segments that exhibited similar patterns to the template.
The results from one session of the data are shown in Fig. 3; plot A displays the similarity scores across. To identify matching segments, we set the threshold for similarity score
= N/3, where N = 41 was the total number of spikes in the template. Four above-threshold local peaks were identified in this session, with scores 16.6, 19.2, 15.9, and 15.4, respectively. The second and fourth peaks are marked by x and z in Fig. 3A. By Eq. 4, we extracted the optimal matching segments at the peak locations and partitioned them into burst intervals and IBIs that were mapped one to one to those in the template (Fig. 3, B and C). The identified segments exhibited a high degree of similarity to the template and also showed nonuniform time warping of the IBIs.
|
To see whether DTW was necessary in this example, we collected all the identified optimal matching segments from the data. There were 15 of them, with mean score 16.6 ± 1.9. To see how much warping of the template was involved to match these segments, we analyzed the two longest IBIs in the template, which were the fourth and fifth IBIs. The time compressions/expansions of the two IBIs, 4.2 ± 21.5 and 2.5 ± 9.6 ms per segment, showed no significant correlation with each other (r = 0.34, P = 0.22, 95% CI = [0.21, 0.73], t-test). For instance, to match the segment at x in Fig. 3A, the two IBIs were compressed by 33.5 ms and expanded by 6.5 ms, respectively. This suggests that nonuniform, wide-ranging time warping was present in the data, supporting the use of DTW for the pattern identification.
Another question is whether the probability distribution of the time warping in the premotor activity could be used in the pattern identification for the sleep data. The answer seems to be negative. There is evidence that the variability of IBI in the spontaneous activity during sleep differed from that during wakeful behavior. In the preceding data, the variance of the fourth IBI in the identified matching segments was significantly larger than the one in the exemplar spike trains associated with motif renditions. The ratio of the two was r = 18.61 (P < 105, 95% CI = [6.25, 55.44], F-test). The variances of the fifth IBI on the other hand did not exhibit significant difference (r = 2.20, P = 0.15, 95% CI = [0.74, 6.54]). The change in the statistical properties of neuronal activity across different behavioral states poses difficulty for the selection of parameter values, as we tried to use the templates obtained in one state to analyze the activity in another.
Sensitivity to temporal precision of spike matching
We had used
= 1.875
, where
= 1.53 ms was the half mean ISI within the bursts of the exemplar spike trains associated with the motif renditions, and the coefficient 1.875 "standardizes" the biweight kernel. To examine how sensitive the pattern identification was to
, we compared the results to those obtained with different preselected values of
= 1.875
while keeping the other parameters unchanged. For larger
, the temporal precision required for template matching was lower and more segments with above-threshold similarity scores were expected. The question is how sensitive the number of identified segments was to the change in
. Denote by n the number of segments that were identified using a preselected value of
but missed by the original procedure that used
= 1.53 ms and n' the number of those the other way around
![]() |
around the value selected based on the exemplar spike trains. Except for
= 0.5, all the extra segments identified using the preselected values of
were not completely overlooked by the original procedure. They actually showed up as local peaks with relatively high scores (>10). Therefore the increasing trend in n could be offset by lowering the threshold in the original procedure. The extra segment identified with
= 0.5 was also not completely overlooked by the original procedure. It had roughly 50% overlap in time with a segment identified by the procedure. Sensitivity to the penalty on mismatching spikes
In the procedure, the penalty
= 0.1434 was selected adaptively, based on the firing rates of the premotor activities associated with motif renditions and the spontaneous activity during sleep. We compared the results to those obtained with different fixed values of
while keeping the other parameters unchanged. Let n be the number of segments identified by using a preselected value of
but missed by using the adaptively selected value and n' the number of those the other way around:
![]() |
= 0.1434, 15 optimal matching segments were identified. The summary shows that for values of
around the adaptively selected value, the identification results were quite similar. Sensitivity to kernel
We evaluated how sensitive the pattern identification procedure was to the kernel. We implemented the procedure using the square, triangular, and Epanechnikov kernels, respectively. By Eq. 5,
had to be adjusted accordingly. Denote by n the number of segments that were identified using a different kernel but missed by using the biweight kernel and n' the number of those the other way around:
![]() |
= 41/3, with differences 0.54, 1.20, 1.47, and 0.50, respectively. Simulations for threshold selection and performance evaluation
For the pattern identification procedure, the issues of threshold selection and sensitivity to noise are complicated by the fact that the statistical properties of the sleep data are likely to be different in some unknown but important aspects from the premotor activity that generated the templates. The simulation method is based on the idea that by testing the pattern identification procedure on artificial data that incorporate certain statistical properties of the sleep data, some reasonable inference might obtain.
We generated simulated data using the exemplar spike trains associated with motif renditions (Fig. 2A) because they had unambiguous interpretation. The pattern identification procedure was then applied to the simulated data to yield score functions. Because the peak values in these score functions were associated with "true targets" (i.e., spike trains known to exhibit the pattern of interest), we could use their distributions to draw inferences on the ability of the procedure to find true targets corrupted by noise.
Given an exemplar spike train T registered in [0, D], a simulated spike train was generated as follows.
N(0,
).
was added to I as "noise" spikes.
.
Because DTW automatically accounts for time warping of IBIs, we did not randomly modify the IBIs. In all the simulations,
= 1 ms, which is a plausible lower bound for the ISI in RA. We also fixed B = 500 ms in all the simulations. The value of
was selected based on the average firing rate of the spontaneous activity during sleep, which was estimated at 21 Hz. We set
= 20 Hz. The parameters q and
are unobservable from the spontaneous activity during sleep. We set
around or above the half-mean
(=1.53 ms) of the ISI within the bursts of the exemplar spike trains, so that there was a good chance for a spike in the exemplars to have a large "jitter" relatively to the ISIs. We conducted four simulations, with q = 1/4, 1/3 and
= 1.5, 2 ms, respectively.
For each of the exemplar spike trains, 100 simulated spike trains were generated. The raster plot of a total of 1500 simulated spike trains with
= 1.5 ms and q = 1/3 is shown in Fig. 4. Despite the relatively high chance of spike deletion in the true targets, the simulated spike trains maintained the overall temporal structure of the exemplar spike trains. However, not all of them exhibited enough similarity to the exemplars. A small percentage a of the simulated spike trains either failed to yield similarity scores above
= N/4, with N the number of spikes in the template, or failed to have peak locations in score function within 50 ms from the onsets of the true targets. Excluding these spike trains, denote by S the peak value in the score function of any remaining spike train. The distribution of S was close to a Normal distribution. As
or q increased, a increased and the distribution of S shifted toward 0.
![]() |
S. For all the simulations, this value was a little less than the actually used threshold value N/3. Therefore a few more segments would have been identified, and the sum of a and the tail probability under N(µS,
S) (
2.5%) could serve as a nominal false negative rate.
|
= 50 Hz. In these simulations, the signal-noise ratio became much lower, greatly reducing the number of identified segments. For example, with all the other parameters the same as in the second simulation, the number of identified segments was only 87 (results not shown). This result, however, made no sense for the data we analyzed. Our suggestion is that to conduct meaningful simulations, the parameters in the simulations should be tuned as much as possible according to the data. Application to MI of macaque monkey
We applied the procedure for the binary representation of neuronal activity to recordings obtained from the arm area of MI in the left hemisphere of a trained adult macaque monkey. The data were collected using a chronically implanted silicon microelectrode array composed of 100 electrodes (1.0-mm electrode length; 400-µm interelectrode separation; Cyberkinetics Neurotechnology System, Foxboro, MA). Based on histological evidence from previous implants that we have performed, the electrode tips are likely to be in lower portions of layer 3 or in layer 5. To acquire extracellular action potentials, signals were amplified (gain = 5,000), band-pass filtered (250 Hz to 7.5 kHz) and sampled at 30 kHz per channel. Only waveforms that crossed a threshold were stored and spike-sorted using Off-line Sorter (Plexon, Dallas, TX). The monkey was trained to repeatedly perform a sequence of movements (see following text) to receive a juice reward. From a decoding perspective, it is of interest to have a means to estimate the times of certain landmark events in the behavior during each trial based on the associated neuronal activity. These estimates were compared with the actual event times recorded during each trial.
The monkey was operantly trained to perform a target pursuit task by moving a cursor to a target with its right arm. The cursor and the target were displayed on a horizontal projection surface in front of the monkey. The monkeys arm rested on cushioned arm troughs secured to links of a two-joint robotic arm (KINARM system; Scott 1999
) underneath the projection surface. The shoulder joint was abducted 90° such that shoulder and elbow flexion and extension movements were made in the horizontal plane. The target was programmed to appear on the four corners of a fixed square. A trial could start at any time after the end of the previous one. At the start of a trial, the target appeared at the upper right corner of the square and the monkey had to move the cursor to it. Once reached, the target jumped counterclockwise to the next corner and the monkey had to move the cursor again, and so on (Fig. 5). A trial was registered as successful only if the monkey reached the target at all of the four corners within 5 s. In the data we collected, there were 240 successful trials; see Fig. 5 for some sample traces of hand movement in successful trials. A total of up to C = 49 MI single units were simultaneously recorded.
|
3 s. We fitted the distribution by the Gamma distribution
![]() |
i = (ai, bi). We shall mainly report results attained by incorporating the fitted Gamma distributions in the scoring. Later, we will show that similar results were attained as well without exploiting these distributions.
The temporal pattern of the spiking activity in MI has more variability than in RA. Taking this into account, the time bin duration was selected so that 99.9% of the time bins each contained at most one spike. Under this condition, the largest time bin duration was about 10 ms. We therefore chose
= 10 ms. The binning reduced the size of the data while keeping most of its information. The hand position data were also down-sampled at 10-ms resolution to estimate the times when the corners were reached.
We used the movement sequences in the first 200 successful trials as the training data set to estimate the firing probabilities pi,c(j) within 1 s around each event, with i = 1, ..., 4 indexing the events, c = 1, ..., 49 indexing the units, and j = 100, ..., 100 indexing the time bins within 1 s around a time of event. The parameters
i for the Gamma distributions were estimated as well. The estimates of pi,c(j) and
i were then used to construct the template. The recording after the 200th successful trials was used to test the accuracy of the identification procedure.
The score function L(t) was computed according to Table 3 over the entire recording and then smoothed with a low-pass filter with cutoff frequency 0.5 Hz. Because successful trials occurred at a much lower frequency (
0.21 Hz) than the cutoff frequency, the smoothing had no aliasing effect on the estimation of the onset times of the movement sequences (i.e., the times when the first corner was reached) in successful trials.
Across the entire recording period, 622 time locations ti were identified where the smoothed score function
(t) peaked locally (Fig. 6). Based on the assumption that the scores were higher near the onsets of the movement sequences in successful trials than elsewhere, only ti with
(ti) >
(ti±1) were selected, resulting in 202 estimated onset times. For each of the selected ti, the corresponding event times were estimated as in Table 3 (Fig. 7).
|
|
1 s was defined as a false positive. The counts of true positives, false positives, selected ti, and events that actually occurred in the training data set, test data set, and the entire data set are as follows
![]() |
To see how sensitive the above evaluation of power was to the threshold for the mean error, we varied the threshold value from 0.5 to 1.5 s, causing the power to change from 62 to 73% (Fig. 8A). The slight change in the power indicates that the evaluation was robust to the threshold selection thus justifies using 1 s as the threshold value.
|
Finally, we tested the procedure without incorporating the estimated distributions of the IEIs in scoring. Equivalently, all Gi were set to 0. For the test data set, we again attained a high power (29/40) and a high rate of true positives (29/41). The mean error in the estimated event times was 0.226 ± 0.196 s and the errors associated with the four corners were 0.536 ± 0.607, 0.045 ± 0.062, 0.092 ± 0.189, and 0.231 ± 0.433 s. The results for the entire data set and those for the training data set were similar to these reported results. The results were only marginally inferior to those attained by exploiting the estimated distributions of the IEIs.
Comparisons with Kalman filter decoding method
A number of decoding methods have been proposed for MI of Macaque monkeys, including population vectors, linear filters, artificial neural networks, or Kalman filters (Carmena et al. 2003
; Georgopoulos et al. 1986
; Serruya et al. 2002
; Wu et al. 2006
). These methods have been demonstrated to be effective and accurate in estimating kinematic parameters such as hand movement direction or hand trajectory. However, none of them addresses the estimation of the timing of landmark events during a movement. This is largely attributed to the experiment paradigms these methods are designed for, which generate fairly straightforward hand movement of monkey, making it unnecessary to decode event timing during the movement. In contrast, in our squared movement paradigm, the monkey moved its hand following four corners of a square in each trial. In this paradigm, the times when the hand reached the corners provided important information on the movement. This raises a new perspective toward decoding: if a movement can be well defined by certain landmark events, then one may aim to decode the timing of the events without estimating the movement trajectory.
We compared the proposed procedure with the Kalman filter method, which was shown to yield better trajectory estimation than other current methods (Wu et al. 2006
). We modified the Kalman filter procedure reported in Wu et al. (2006)
so it could perform pattern identification. First, for each successful trial, we registered the time interval that started 1 s before the reaching the first corner and ended 1 s after the reaching the fourth corner. Then the procedure was applied to estimate the hand positions at discrete time points within the interval using the firing rates of all the 49 MI units. Finally, for each corner, the nearest 5% estimated hand positions in terms of the Euclidean distance to the corner were identified, which accounted for about 25 time points per trial. The average of these time points was used as an estimate of the time when the corner was reached. In our study, the estimation was fairly robust when we varied the percentage of nearest hand positions from 5% to nearby values.
Note that unlike the template matching procedure, for the Kalman filter procedure, the spike trains had to be segmented into intervals containing individual trials. The segmentation was necessary to make sure that the Kalman filter procedure only used the nearest positions within the same trial to estimate the event times. On the other hand, the segmentation provided the Kalman filter procedure with critical information on time that was not accessible to the template matching procedure. Consequently, the estimated event times by the Kalman filter procedure were automatically within the trials, and so by our criteria on true positives, the procedure had power 1.
To make a reasonable comparison, we computed the mean errors in the estimated event times obtained by the Kalman filter procedure, in parallel to those obtained by the template matching procedure. Across the last 40 trials, which were automatically true positives for the Kalman filter procedure, the mean error was 0.260 ± 0.123 s, and the errors associated with the four corners were 0.262 ± 0.178, 0.231 ± 0.343, 0.184 ± 0.156, and 0.362 ± 0.327 s, respectively. Except for the estimated event times associated with the first corner, which benefited from the segmentation, the errors in the estimated event times associated with the other corners were significant greater than the template matching procedure. The same conclusion was drawn if we restricted to the 29 true positives identified by the template matching procedure. Across these trials, the Kalman filter procedure yielded mean error 0.255 ± 0.128 s, and errors associated with the four corners 0.269 ± 0.166, 0.203 ± 0.266, 0.184 ± 0.164, and 0.373 ± 0.373 s, respectively.
Essentially, the worse performance of the Kalman filter procedure was explained by the fact that it is intrinsically designed for the reconstruction of kinematic parameters, e.g., direction, speed, trajectories, and kinematic decoding and time decoding are rather different concepts. Figure 9 shows an example in which the Kalman filter procedure yielded a very good trajectory reconstruction but provided poor estimation for when the corners were reached. This example indicates that the estimation of timing has interest of its own and is worthy of investigation in the motor cortical study.
|
| DISCUSSION |
|---|
|
|
|---|
In neurophysiological analysis, various template matching methods have been developed to identify segments of patterned activity from neuronal recordings (Abeles et al. 1988
, 1993
; Chi et al. 2003
; Dave and Margoliash 2000
; Louie and Wilson 2001
; Nádasdy et al. 1999
, 2000
). From the perspective that spike timing plays an important role in neural coding, most of the methods directly match spike times or ISIs registered on a continuum (Abeles et al. 1988
, 1993
; Dave and Margoliash 2000
; Nádasdy et al. 1999
, 2000
). In these methods, similarity to a template is computed by counting the number of matching spikes or ISIs. Although the counting approach is suitable for patterns with a small number of spikes, in our experience, it significantly slows down for patterns with a moderate number of spikes, which is undesirable when relatively complex patterns need to be analyzed for large data sets. It is also not clear how to apply the approach effectively when simultaneous spiking activity of many units over hundreds of time bins needs to be compared with a template, as in the case of MI data.
When a pattern is represented by many spikes, template matching can be based on cross-correlating firing rate functions (Louie and Wilson 2001
). Because this approach requires kernel smoothing, it is suitable for patterns without much fine temporal structure. However, as in the case of the RA data, when precise temporal structure is important for pattern identification, the firing ratebased approach is insufficient. The filtering method in Chi et al. (2003)
is an attempt to balance between high temporal precision and computational efficiency. On the one hand, it evaluates similarity based on temporal closeness between data spikes and template spikes; on the other hand, it computes the similarity scores using a smooth profile of the template, in which each template spike is blurred into a "bump" that is a few milliseconds wide. Similarity scores across a data spike train are computed efficiently by linear convolution with the profile.
In all these methods, a template cannot be warped. To incorporate timescaling, the template has to be compressed or expanded uniformly in time beforehand. This is in contrast to the DTW approach of Victor and Purpura (1996)
, which adjusts template ISIs "on the fly" during template matching. This report proposes an approach that combines linear convolution and DTW for template matching. In this approach, linear convolution is applied at the level of individual components of a template, whereas DTW is applied at the level of the entire template. Because both linear convolution and DTW can be implemented efficiently, satisfactory computational efficiency can be achieved.
A combinatorial approach was recently proposed for pattern identification. The approach aims to identify data segments that exhibit a specific temporal order in which a set of neurons fire, regardless of the temporal details of the firing pattern (Lee and Wilson 2002
, 2004
; Smith and Smith 2006
). This goal is different from the type of pattern identification considered here.
Flexible combination of local matching and time warping
The proposed approach is flexible in modeling and combining information at two timescales. We showed that linear convolution can be applied for both point process representations and binary representations of spiking activity. Linear convolution can be applied for other representations as well. In general, as long as local similarity scores are defined as weighted sums of measures of spiking activity at different time points, they can be computed by linear convolution with a filter consisting of the weights involved. For example, to cross-correlate firing rates, the filter is simply the firing rate function of the template registered backward in time. In most cases, however, it is a major issue how to select appropriate filters. We showed that based on the principle of likelihood testing, some of the filter parameters can be estimated from data. The estimation-based approach reduces the number of parameters that have to be manually tuned, thus allowing a procedure to be more easily applied for different data sets and more objective comparisons of the results.
In our proposed pattern identification procedures, linear convolution and DTW are more or less two separable modules. Depending on the problem at hand, one may replace linear convolution with another method to compute local similarity scores, and then apply DTW to combine the local scores into similarity scores for the entire template. The point of using DTW is that it is capable of dealing with variability at a relatively large timescale. Such variability is evident in the spontaneous activity during sleep in RA (Dave and Margoliash 2000
); it is even more evident in the analyzed MI activity, presumably because it was influenced by many factors. The distinction between local and more global temporal variabilities adapted herein is a simplification. Spiking activity can exhibit temporal variability at multiple timescales (Reich et al. 2000
, 2001
; Zugaro et al. 2005
). From a computational perspective, to handle the variability at more finely graded timescales, one could introduce a hierarchy of procedures, each one computing the similarity scores at one timescale by combining scores obtained at a smaller timescale.
The DTW step in our approach has two useful properties. First, it estimates the times of individual components within each identified occurrence of a preselected pattern. The estimation of timing can be useful for decoding neural activity. Although our study does not preclude the possibility that the spike rates of neurons could be used to identify behaviorally relevant event times, there is a precedent for using spike patterns to identify the event times. Riehle et al. (1997)
trained monkeys to expect go cues (i.e., cues to initiate movement) at certain predefined times. The study found that transient spike synchronization between two motor cortical neurons occurred at the expected cue times even when the cue did not appear. Interestingly, the spike rates of the individual neurons did not modulate during these expected times. One unanticipated result in our experiment was that the pattern identification procedure could attain very good estimates on when a macaque monkey moved a cursor to different targets. Further controlled experiments are necessary to determine the biological significance of this observation. For example, it is unclear whether these patterns related to aspects of the monkeys movement to the target or were associated with the visual appearance of the next target. Nevertheless, the example shows that, when appropriately designed, a computational method can reveal intriguing phenomena that otherwise are difficult to detect. The discovery of very fine submillisecond neuronal "drift" in zebra finch RA spiking activity during singing is another such example (Chi and Margoliash 2001
).
The second useful property of the DTW step is that it estimates the amount of warping for each compressible interval in the template. This may provide extra parameters to describe the large timescale dynamics of neural activity under certain condition. For example, as the experiment on the RA data indicates, one possibility is to test to what degree the assumption of uniform timescaling holds for the spontaneous spiking activity during sleep. On the other hand, although in principle extra parameters can provide more information for pattern identification, our experiment on the MI data indicates that it yielded only marginal improvement to incorporate a parametric statistical model for time warping. It remains to be seen whether parametric models can be more effectively exploited for pattern identification.
Selection of representation
Although our pattern identification procedures for the point process representation and the binary representation appear to be quite different, they are based on the same perspective. That is, a spiking activity pattern can be divided into components that are incompressible in time and those that can be scaled in time, and the information about the pattern is mostly incorporated in the incompressible components. Most of the current pattern identification methods are based on point process representations. To avoid potential side effects arising from discretization of time, point process representations should be the default choice for pattern identification, even for spiking activity with higher (electrophysiological) variability than that in RA. Higher variability can be accommodated by lowering the temporal precision for spike matching or using multiple filters to account for different aspects of variability (Chi et al. 2003
).
If the precision of spike time acquisition or computational load is of concern, then binary representations are more suitable. The choice between the two types of representations should be problem dependent. For the MI data, because the goal was to estimate behavioral timing of a monkey, which intrinsically could be highly variable, it was reasonable to discretize spike times to the extent that the resulting loss of temporal information caused no abrupt changes in the estimation. The binary representation was tested with multiple SUs extracted from simultaneous records drawn from an electrode array. It should be noted that continuous multiunit (MU) data, as opposed to single-unit or multielectrode data, can also be transformed into binary representations. Depending on the quality of the MU recording, it may be advantageous to set an amplitude threshold, count the number of threshold crossings, and quantify this as a binary signal. Alternatively, for recordings that sum over a larger number of units, it may be advantageous to measure the total amplitude of the rectified signal over some bin (e.g., Margoliash 1986
) and quantify this as a binary signal. The pattern identification approach may then be applied to the binary signal.
Limitations and possible improvements
One important situation that the proposed approach cannot handle is where the relative timing of the spiking activity of multiple units exhibit variability. Such "interunit temporal variability" may be significant if the recorded units are in different nuclei. Our procedure for the point process representation of neuronal activity is applicable only for SU recordings. The procedure for the binary representation essentially assumes that the units are time-locked with each other in their spiking. Therefore it cannot apply when the interunit timing variability has to be taken into account. One possible approach to this type of variability is to treat the relative timing of the units the same way as IBIs with appropriate DTW procedures. However, even with a moderate number of recorded units, the computational load can become heavy because the optimization involved in DTW may have to take into account the relative timing for many pairs of units. An intermediate approach may be to group the units into clusters, such that within each cluster, the interunit variability can be ignored. Then, appropriate DTW procedures may be developed for the clusters.
A second issue is how to combine the two pattern identification procedures developed here. This can be useful when different classes of neurons exhibit different levels of temporal precision in their spiking activity. For example, in the birdsong system, the spiking activity in the nucleus HVC, which is afferent to RA, has projection neurons with high temporal precision and interneurons with lower temporal precision (Hahnloser et al. 2002
; Yu and Margoliash 1996
). In general, given the interest in simultaneous recordings from multiple areas (Hoffman and McNaughton 2002
), it is valuable to develop pattern identification procedures for the joint activity in multiple areas. To manipulate behavior directly based on spiking activity, it would also be useful to extend our procedures for real-time pattern identification.
The proposed pattern identification approach does not offer a recipe for threshold selection for similarity scores. The simulation-based method in the experiment for RA could provide a reasonable range for the threshold values. However, because no simulations can account for all the statistical properties of spiking activity, simulation methods should be used with caution. Despite its statistical derivation, the proposed approach is essentially of observational nature. In the end, how large a similarity score should be to identify a data segment as a match is based on an experimenters own judgment, irrespective of whether it is supported by statistical significance measures such as P values. The biological significance of the identified patterns can be determined only by controlled experiments.
| APPENDIX |
|---|
|
|
|---|
The score function is derived from a likelihood ratio test on the pattern of interest versus background. We model the background as a Poisson process with constant density
0. Let T = {t1, t2,...} be a template representing the pattern of interest. It randomly generates a data spike train as follows.
![]() |
is the maximum "jitter" in spike times, N is the maximum number of spikes a template spike can generate, and Z = ZN is a normalizing constant. Although N depends on the recorded unit, it will be seen that different values of N lead to the same form of similarity scores up to an additive constant. Therefore we can use the same formula for template matching without having to estimate N.
<
0 is added into the warped IBI as noise spikes. The density of spikes within the pattern is assumed to be continuous. Therefore
= g(
) = g(
).
For a spike train S that starts at time x and consists of spikes s1 < s2 <..., the log-likelihood that it is generated from T in such a way that the IBIs of T are changed by v1, ..., vn+1 and the (hi1 + 1)th, ..., (hi)th spikes in S are generated by ti is
![]() |
0 x (total number of spikes in S). The log-likelihood ratio is then
![]() |
0] as (1 +
)K(x/
)
, where
0 and 0
K(x)
1 is defined on [1, 1] with K(0) = 1, K(±1) = 0. To this end, divide the log-likelihood ratio by M = log [g(0)/
0], and define
![]() | (A1) |
Score function for the binary representation
We first derive the score function associated with a single event, i.e., Eq. 7. For each t = 1, 2,..., let
![]() |
![]() |
![]() |
To verify Eq. 8, following the same idea for the point process representation (cf. Fig. 1B), it can be seen that the probability that the tth time bin contains the onset of an occurrence of the event sequence with IEIs I1
, ... In1
is
![]() |
![]() |
| GRANTS |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: Z. Chi, 215 Glenbrook Rd., U-4120, Storrs, CT 06269 (E-mail: zchi{at}merlot.stat.uconn.edu)
| REFERENCES |
|---|
|
|
|---|
Abeles M, Bergman H, Margalit E, Vaadia E. Spatiotemporal firing patterns in the frontal cortex of behaving monkeys. J Neurophysiol 70: 16291638, 1993.
Abeles M, Gerstein GM. Detecting spatiotemporal firing patterns among simultaneously recorded single neurons. J Neurophysiol 60: 909924, 1988.
Anderson SE, Dave AS, Margoliash D. Template-based automatic recognition of birdsong syllables from continuous recordings. J Acoust Soc Am 100: 12091219, 1996.[CrossRef][Web of Science][Medline]
Bishop MJ, Thompson E. Maximum likelihood alignment of DNA sequences. J Mol Biol 190: 159165, 1986.[CrossRef][Web of Science][Medline]
Brenowitz EA, Margoliash D, Nordeen KW. An introduction to birdsong and the avian song system. J Neurobiol 33: 495500, 1997.[CrossRef][Web of Science][Medline]
Brockwell AE, Rojas AL, Kass RE. Recursive Bayesian decoding of motor cortical signals by particle filtering. J Neurophysiol 91: 18991907, 2004.
Buzsáki G. Two-stage model of memory trace formation: a role for "noisy" brain states. Neuroscience 31: 551570, 1989.[CrossRef][Web of Science][Medline]
Buzsáki G. Memory consolidation during sleep: a neurophysiological perspective. J Sleep Res 7, Suppl. 1: 1723, 1998.[Medline]
Carmena JM, Lebedev MA, Crist RE, ODoherty JE, Santucci DM, Dimitrov DF, Patil PG, Henriquez CS, Nicolelis MAL. Learning to control a brainmachine interface for reaching and grasping by primates. PLoS Biol 1: 193208, 2003.[CrossRef][Web of Science]
Chi Z, Margoliash D. Temporal precision and temporal drift in brain and behavior of zebra finch song. Neuron 32: 899910, 2001.[CrossRef][Web of Science][Medline]
Chi Z, Rauske PL, Margoliash D. Pattern filtering for detection of neural activity, with examples from HVc activity during sleep in zebra finches. Neural Comput 15: 23072338, 2003.[CrossRef][Web of Science][Medline]
Dave AS, Margoliash D. Song replay during sleep and computational rules for sensorimotor vocal learning. Science 290: 812816, 2000.
Georgopoulos AP, Schwartz AB, Kettner RE. Neuronal population coding for movement direction. Science 233: 14161419, 1986.
Hahnloser RHR, Kozhevnikov AA, Fee MS. An ultra-sparse code underlies the generation of neural sequences in a songbird. Nature 419: 6570, 2002.[CrossRef][Medline]
Hahnloser RHR, Kozhevnikov AA, Fee MS. Sleep-related neural activity in a premotor and a basal-ganglia pathway of the songbird. J Neurophysiol 96: 794812, 2006.
Hatsopoulos NG, Joshi L, OLeary JG. Decoding continuous and discrete motor behavior from motor and premotor cortical ensembles. J Neurophysiol 92: 11651174, 2004.
Hoffman KL, McNaughton BL. Coordinated reactivation of distributed memory traces in primate neocortex. Science 297: 20702073, 2002.
Huang S-S, Gray RM. Spellmode recognition based on vector quantization. Speech Recogn 7: 4153, 1988.
Ito K, Mori K. Dynamic programming matching as a simulation of budgerigar contact-call discrimination. J Acoust Soc Am 105: 552559, 1999.[CrossRef]
Kogan JA, Margoliash D. Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: a comparison study. J Acoust Soc Am 103: 21852196, 1998.[CrossRef][Web of Science][Medline]
Lee AK, MA. Memory of sequential experience in the hippocampus during slow wave sleep. Neuron 36: 11831194, 2002.[CrossRef][Web of Science][Medline]
Lee AK, Wilson MA. A combinatorial method for analyzing sequential firing patterns involving an arbitrary number of neurons based on relative time order. J Neurophysiol 92: 25552573, 2004.
Louie K, Wilson MA. Temporally structured replay of awake hippocampal ensemble activity during rapid eye movement sleep. Neuron 29: 145156, 2001.[CrossRef][Web of Science][Medline]
Margoliash D. Preference for autogenous song by auditory neurons in a song system nucleus of the white-crowned sparrow. J Neurosci 6: 16431661, 1986.[Abstract]
Margoliash D. Evaluating theories of bird song learning: implications for future directions. J Comp Physiol A Sens Neural Behav Physiol 188: 851866, 2002.[CrossRef][Web of Science][Medline]
Margoliash D, Staicer CA, Inoue SA. Stereotyped and plastic song in adult indigo buntings, passerina cyanea. Anim Behav 42: 367388, 1991.[CrossRef]
Nádasdy Z. Spike sequences and their consequences. J Physiol (Paris) 94: 505524, 2000.
Nádasdy Z, Hirase H, Czurkó A, Csicsvari J, Buzsáki G. Replay and time compression of recurring spike sequences in the hippocampus. J Neurosci 19: 94979507, 1999.
Needleman S, Wunsch C. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol 48: 443453, 1970.[CrossRef][Web of Science][Medline]
Nordeen KW, Nordeen EJ. Auditory feedback is necessary for the maintenance of stereotyped song in adult zebra finches. Behav Neural Biol 57: 5866, 1992.[CrossRef][Web of Science][Medline]
OShaughnessy D. Speech Communication: Human and Machine (2nd ed.). New York: WileyIEEE Press, 1999.
Rabiner L, Juang BH. Fundamentals of Speech Recognition. Lebanon, IN: Prentice Hall, 1993.
Reich DS, Mechler F, Purpura KP, Victor JD. Interspike intervals, receptive fields, and information encoding in primary visual cortex. J Neurosci 20: 19641974, 2000.
Reich DS, Mechler F, Victor JD. Independent and redundant information in nearby cortical neurons. Science 294: 25662568, 2001.
Riehle A, Grün S, Diesmann M, Aertsen A. Spike synchronization and rate modulation differentially involved in motor cortical function. Science 278: 19501953, 1997.
Sankoff D, Kruskal JB. Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison. Reading, MA: AddisonWesley, 1983.
Scott SH. Apparatus for measuring and perturbing shoulder and elbow joint positions and torques during reaching. J Neurosci Methods 89: 119127, 1999.[CrossRef][Web of Science][Medline]
Serruya MD, Hatsopoulos NG, Paninski L, Fellows MR, Donoghue JP. Brainmachine interface: instant neural control of a movement signal. Nature 416: 141142, 2002.[CrossRef][Medline]
Skaggs WE, McNaughton BL. Replay of neuronal firing sequences in rat hippocampus during sleep following spatial experience. Science 271: 18701873, 1996.[Abstract]
Smith AC, Brown EN. Estimating a state-space model from point process observations. Neural Comput 15: 965991, 2003.[CrossRef][Web of Science][Medline]
Smith AC, Smith P. A set probability technique for detecting relative time order across multiple neurons. Neural Comput 18: 11971214, 2006.[CrossRef][Web of Science][Medline]
Sossinka R, Böhner J. Song types in the zebra finch (poephila guttata castanotis). Z Tierpsychol 53: 123132, 1980.[Web of Science]
Victor JD, Purpura KP. Nature and precision of temporal coding in visual cortex: a metric-space analysis. J Neurophysiol 76: 13101326, 1996.
Wagner RA, Fischer MJ. The string-to-string correction problem. J Assoc Comput Mach 21: 168173, 1974.
Waterman MS, Gordon L, Arratia R. Phase transitions in sequence matches and nucleic acid structure. Proc Natl Acad Sci USA 84: 12391243, 1987.
Wilson MA, McNaughton BL. Reactivation of hippocampal ensemble memories during sleep. Science 265: 676679, 1994.
Wu W, Black MJ, Mumford D, Gao Y, Bienenstock E, Donoghue JP. Modeling and decoding motor cortical activity using a switching Kalman filter. IEEE Trans Biomed Eng 51: 933942, 2004.[CrossRef][Web of Science][Medline]
Wu W, Gao Y, Bienenstock E, Donoghue JP, Black MJ. Bayesian population decoding of motor cortical activity using a Kalman filter. Neural Comput 18: 80118, 2006.
Yu AC, Margoliash D. Temporal hierarchical control of singing in birds. Science 273: 18711875, 1996.
Zugaro MB, Monconduit L, Buzsáki G. Spike phase precession persists after transient intrahippocampal perturbation. Nat Neurosci 8: 6771, 2005.[CrossRef][Web of Science][Medline]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |