|
|
||||||||
Sloan-Swartz Center for Theoretical Neurobiology, W. M. Keck Center for Integrative Neuroscience and Department of Physiology, University of California, San Francisco, California
Submitted 22 August 2006; accepted in final form 16 December 2006
| ABSTRACT |
|---|
|
|
|---|
20% of the error observed on each movement, despite being unaware of the visual shift. The state of adaptation is also driven by internal dynamics, consisting of a decay back to a baseline state and a "state noise" process. State noise includes any source of variability that directly affects the state of adaptation, such as variability in sensory feedback processing, the computations that drive learning, or the maintenance of the state. This noise is accumulated in the state across trials, creating temporal correlations in the sequence of reach errors. These correlations allow us to distinguish state noise from sensorimotor performance noise, which arises independently on each trial from random fluctuations in the sensorimotor pathway. We show that these two noise sources contribute comparably to the overall magnitude of movement variability. Finally, the dynamics of adaptation measured with random feedback shifts generalizes to the case of constant feedback shifts, allowing for a direct comparison of our results with more traditional blocked-exposure experiments. | INTRODUCTION |
|---|
|
|
|---|
Recently, a number of researchers used analytic techniques from engineering to study the trial-by-trial dynamics of sensorimotor adaptation. These studies found that adaptation occurs rapidly, on the timescale of single trials, when a random shift was added to the visual feedback of the fingertip (Baddeley et al. 2003
) or when perturbing forces were introduced during reaching (Donchin et al. 2003
; Scheidt et al. 2001
; Thoroughman and Shadmehr 2000
). However, subjects in these studies were aware of the experimental manipulations, either as a result of the explicit instructions regarding the visual shift or of the presence of noticeable force perturbations. Therefore learning likely reflected a combination of automatic sensorimotor processes and strategic cognitive approaches to the task. These two forms of learning obey very different underlying learning rules. For example, when subjects are aware of a shift in visual feedback, they are able to learn much more complex shift patterns then when they are not aware of the shift (Bedford 1993
). The goal of the present study was to quantify the trial-by-trial dynamics of the automatic processes of sensorimotor adaptationthat is, the processes that are presumably responsible for the ongoing maintenance of sensorimotor calibration. We therefore study the adaptive responses of naïve subjects to surreptitious shifts in visual feedback.
As in previous studies, we model the trial-by-trial dynamics of learning as a linear dynamical system (LDS). However, we make use of the analytic methods described in Cheng and Sabes (2006)
, allowing us to explore two issues that were not dealt with in earlier studies. First, instead of using least-squares regression to perform model fits, we take a more general maximum-likelihood approach. This allows us to build explicit models of the sources of variability in task performance and to fit these models to experimental data. As we will show, such variability plays an important role in the dynamics of adaptation. Second, we consider multiple potential error-feedback signals and attempt to determine which of these signals drive learning.
This study focused on the sequence of reach errors induced by a sequence of visual feedback shifts. We view these errors as a reflection of the underlying state of reach adaptation. By using a random, time-varying sequence of feedback shifts, we obtained a statistically rich sequence of reach errors that was modeled as an LDS (Cheng and Sabes 2006
). We found that this class of models is sufficient to describe the adaptation dynamics: the LDS model captures both the changes in the mean reach endpoint and the temporal correlations between these errors and the visual shift.
We draw several conclusions from the resulting model of adaptation dynamics. First, significant adaptation to visual feedback shifts occurs on single trials. Second, we explicitly measure the internal dynamics of adaptation and show that the state of adaptation decays over time. Third, a significant source of movement variability is internal state noise that directly affects the state of adaptation and thus accumulates across trials. This form of variability only indirectly affects the performance on a particular trial. We show that the state noise accounts for at least a quarter of the overall movement variability, thus offering a very different perspective on the sources of movement variability.
Finally, to relate these results to previous research using the blocked experimental design, we used the model derived from stochastic feedback trials to predict the sequence of reach errors induced by blockwise-constant feedback shifts. We find that the adaptation dynamics generalizes across feedback shift conditions. We conclude that the dynamics of adaptation are not specific to the sequence of feedback shifts and that the LDS model can provide insight into the general mechanisms of reach calibration.
| METHODS |
|---|
|
|
|---|
This study was approved by the University of California, San Francisco Committee on Human Research, and subjects gave informed consent. Ten right-handed subjects (four male, six female, ages 1828 yr) with no known neurological history and normal or corrected-to-normal vision participated in this study. Subjects were naïve to the purpose of the experiment and were paid for their participation.
The experiment consisted of trials in which subjects reached toward visual targets with their right arm with virtual visual feedback (Fig. 1A). Throughout each session, the right arm remained on or just above a horizontal table with direct view of the arm (and the rest of the body) occluded by a mirror and a drape. The right wrist was fixed with a brace in the neutral, pronate position and the index finger was extended and fixed with a splint. An infrared light-emitting diode was attached to the tip of the index finger and its position was recorded at 200 Hz with an Optotrak infrared tracking system (Northern Digital, Waterloo, Canada). Because the subject's hand was essentially restricted to a two-dimensional workspace, we analyzed only two components of the recorded positions: the positive x-axis points right and the positive y-axis points sagittally away from the subject.
|
Task design
The purpose of this experiment was, first, to identify the adaptation dynamics in response to stochastic feedback shifts (STOCH-P) and, second, to compare the dynamics with those observed in trial blocks with constant feedback shift (CONST-P), the traditional paradigm for inducing adaptation.
An experimental session consisted of four repetitions of the following sequence of trial blocks: 25 transition trials (see following text), 35 CONST-P trials, 10 transition trials, and 50 STOCH-P trials. Within each CONST-P trial block the feedback shift was constant, but the shift varied between the four CONST-P block. The four shifts were a random ordering of the four vectors with ±3 cm along both the x- and y-axes. Transition trials with either unshifted visual feedback or no visual feedback were inserted to minimize the possibility that subjects became aware of manipulations in the visual feedback. Each CONST-P trial block was preceded by 15 trials with unshifted feedback and then 10 trials without visual feedback. When a CONST-P block followed a STOCH-P block, the shift was ramped down to zero over the first five of these transition trials. Each CONST-P block was also followed by 10 trials without visual feedback. In total, each session consisted of 480 trials, with 200 STOCH-P trials and 140 CONST-P trials. An example session is shown in Fig. 2.
|
From one trial to the next, the shift in visual feedback (p; see Fig. 1B) changed incrementally by one of the following (x, y) vectors, selected with equal probability: (0, 0), (0, sy), (sx, 0), or (sx, sy)/
2, where |sx| = |sy| = 5.2 mm, and initial signs of sx and sy were assigned randomly at the beginning of each block. The maximum magnitude of the feedback shift in either dimension was limited to ±3 cm. If the selected increment would have caused the x or y component of the feedback shift to exceed this range, then the sign of sx or sy, respectively, was reversed (reflecting boundaries). In each trial block, the feedback shift in the first trial was the value used in the last preceding trial with feedback. Example random-walk sequences are shown in the STOCH-P blocks of Fig. 2.
REACH TRIALS.
Every trial in the experiment consisted of reaching to a visual target. At the beginning of each of these trials, subjects were guided to a start position without visual feedback about either the start location or their hand ("arrow field" technique; Sober and Sabes 2005
). Specifically, an array of 3 x 3 identical arrows appeared in a randomized position of the workspace. The arrows corresponded to the vector from the current finger position to the start position, with a maximal length of 9 cm. Subjects were instructed to move their finger in the direction indicated by the orientation of the arrows until the arrows disappear, which occurred when the fingertip was within 5 mm of the unmarked start position. The start position was the same for all trials and was located a few centimeters in front of the subject, roughly along the midline.
Subjects were required to hold the start position for a random delay (0.51 s) until the reach target appeared (an open green circle, radius 6 mm). The target location gt was drawn randomly from a uniform distribution over a 4-cm square centered 26 cm distally from the start location. The appearance of the target together with an audible tone constituted the go signal for initiating the reach. Subjects were instructed to make a single quick and smooth movement toward the target. The trial terminated when the velocity of the finger fell to <2 mm/s.
To minimize variation in movement speed across trials, a loose timing constraint was used. The movement duration was defined as the length of the time interval between when the finger first moved >1 cm from the start location and when the tangential velocity of the finger first fell to <15% of its peak value at the end of the movement. These landmarks were chosen to exclude effects of variable reaction time and movement corrections at the end of the reach. A tone was sounded if reaches were too slow (movement duration >500 ms) or too fast (movement duration <300 ms). Subjects generally had no difficulty meeting these criteria.
Subjects received "velocity-dependent terminal feedback." Visual feedback of the finger tip position appeared only near the end of the movement, when the tangential finger velocity fell to <15% of the peak value and feedback continued until the end of the trial. This arrangement satisfies two constraints. The feedback appears sufficiently late in the movement so that we are able to assess the endpoint of the initial reach before visual feedback is able to drive corrective changes. The endpoint is thus taken as a measure of the state of the reach adaptation. However, the brief period of visual feedback while the hand is still moving provides a richer feedback signal for learning than would static feedback after the completion of the movement. Feedback was in the form of a white disk, 3-mm radius, located at the fingertip or displaced by the feedback shift pt. Finally, subjects were instructed to correct any reach errors after completing the first reach and trials terminated when the finger remained stationary within the target circle for 500 ms.
In some transition trials subjects received no visual feedback of their reaching arm during any part of the trial. These trials were identical to those with visual feedback, except that the target was marked by a filled green circle with radius 6 mm, providing a cue of the feedback type before reach initiation. Before data collection subjects were given sufficient practice trials to ensure that all task constraints were met.
Data analysis and a model for the dynamics of adaptation
Velocities were determined by first-order numerical differentiation of the positional data after smoothing with a 5-Hz low-pass Butterworth filter. The reach endpoint ft on trial t was defined as the finger position when the tangential velocity first fell to <5% of its maximum value on that trial. Typical velocity profiles were unimodal bell-shaped curves (cf. Atkeson and Hollerbach 1985
), corresponding to the primary reaching movement, followed by one or more smaller peaks attributed to corrective movements. In a few trials the velocity fell to <5% criterion only after the second velocity peak. In these cases visual inspection usually indicated a clear endpoint for the primary reach (velocity dip and curvature peak); in the rare cases, however, where an endpoint could not be identified confidently the trial was discarded. The reach error et is defined as the difference between the target position gt and the reach endpoint ft (Fig. 1B): et
ft gt.
We use a linear dynamical systems (LDS) model to describe the adaptation dynamics (Cheng and Sabes 2006
; Donchin et al. 2003
; Scheidt et al. 2001
). The output of the system yt is a noisy readout of the internal state of the sensorimotor map xt. Here we define yt to be the reach error et. This means that the internal state of the system is defined as the mean reach error one would observe across trials if adaptation could be frozen in time. Given the limited set of reach vectors used in this experiment, the state can be described with a single two-dimensional variable. Formally we write
![]() | (1a) |
N(0, R). We model adaptation as the trial-by-trial change in the state arising from sensory feedback on the preceding trial. Formally
![]() | (1b) |
N(0, Q). The term But represents error corrective learning, Axt represents the decay of the state of adaptation back to baseline, and qt represents variability in these two processes. Because the spatial variables are all two-dimensional vectors, the parameters of the LDS model, A, B, Q, and R, are 2 x 2 matrices. There are no direct "feedthrough" inputs affecting yt because terminal visual feedback prevents on-line visual correction and the ongoing reaches were not otherwise externally perturbed.
An important variable in the LDS model of Eq. 1 is the feedback signal that drives adaptation ut. Our experimental design makes the selection of candidate signals easier because visual feedback is given only near the end of the movement, representing only the location of the index fingertip (Fig. 1B). Under these conditions, there are two likely candidates for the input signal: the visually perceived error vt, defined as the difference between the position of the visual feedback at the reach endpoint and the target position, vt
ct gt, and the artificial feedback shift pt (Fig. 1B). We also consider the reach error et as a potential input signal. Note that the three error signals are linearly related: et = vt pt. Therefore any model that includes two of these variables effectively includes the third one as well.
Given a sequence of inputs ut and outputs yt, the maximum-likelihood estimates of the model parameters A, B, Q, and R are determined using an expectation-maximization (EM) algorithm (Cheng and Sabes 2006
; Ghahramani and Hinton 1996
; Shumway and Stoffer 1982
) (Matlab routines for performing this analysis are freely available at http://keck.ucsf.edu/
sabes/software/). The algorithm takes into account that data were collected in separate blocks by resetting the state to an initial Gaussian distribution (with the necessary additional model parameters) at the beginning of each block. The EM algorithm is guaranteed only to converge to a local maximum of the log likelihood. In fact, however, we find that the parameter estimates were robust to running the estimation algorithm with different sets of initial values, suggesting that no local minima exist close to the identified parameter estimates.
Model comparisons and goodness of fit
The EM algorithm will find a maximum-likelihood model for any input and output sequence. Therefore it is important to be able to assess model performance. We begin with model selection, asking whether particular model parametersin this case input signalsprovide significant explanatory power. We then describe two approaches to quantifying whether the best-fit model is sufficient to account for important statistical features of the inputoutput sequences. These analyses are described in detail below.
LIKELIHOOD RATIO TEST FOR MODEL SELECTION.
To determine whether the inclusion of a particular input variable or other parameter provides significant explanatory power we use the generic likelihood ratio test (LRT) for maximum-likelihood estimation (Stuart and Ord 1987
). Consider a model class M1 with d1 free parameters and a second model class M0, with a subset of free parameters, d0 < d1. For example, M1 could be the model with a particular two-dimensional input variable ut and M0 could be the null model with no input variables. M0 has four fewer free parameters because it has no input weighting matrix B. Given each model class, we can find the maximum-likelihood model parameters and the values of likelihood they achieve, Li for model Mi. The inclusion of additional model parameters will always result in a better fit to the data, i.e., L1 > L0. However, under the assumption that the data come from a model in M0, the distribution of gains in log likelihood resulting only from overfitting is known in the limit of large data sets
![]() | (2) |
PREDICTING THE STATISTICS OF THE REACH ERRORS. The most direct approach to assessing model sufficiency would be to compare the empirical output sequence yt with the outputs predicted from the model and the input sequence. However, the outputs depend heavily on the specific sequence of state noise terms qt, which are not directly observable. Instead, we determine how well models are able to predict the statistics of the sequence of reach errors and the relationship to the sequence of visual shifts.
The sequence of reach errors is characterized by two measures: the variance
e2 and the autocorrelation
e(
), which is a function of the time lag
at which the autocorrelation is measured. Similarly, the statistical relationship between the reach error and the visual shift vector is characterized by two measures: the covariance
ep and the cross-correlation function
ep(
). These measures are defined as follows
![]() | (3a) |
![]() | (3b) |
![]() | (3c) |
![]() | (3d) |
represents the mean value of a vector a across trials, and aTb represents the inner product of the vectors a and b. The statistics defined earlier provide a summary of the adaptive response of the reach endpoint to the shifted visual feedback. These statistics were not used explicitly when performing the maximum-likelihood model fitting. Thus if the model is able to accurately predict these statistics, then it is sufficient to capture the key elements of the response. The model predictions for these statistics are obtained using a Monte Carlo approach. Given the LDS model parameters and actual sequence of visual shifts experienced by the subject in a given trial block, we simulate a sequence of state and output variables using Eq. 1, with state and output noise terms generated independently for each simulated trial. For each subject, we compute the desired statistics from 100 combined runs of these Monte Carlo simulations and compare the values to those obtained from the empirical data.
ONE-STEP-AHEAD PREDICTION AND THE PORTMANTEAU TEST.
Our second approach to assessing sufficiency is to test whether an LDS model is demonstrably insufficient to account for the dynamics of the sequence of reach errors. We start with the one-step-ahead predictor
t, which is the expected value of yt given a model and all the inputs and outputs up to trial t 1. The
t values are obtained from the Kalman filter using the estimated model parameters (Anderson and Moore 1979
; Shumway and Stoffer 1982
). However, if the LDS model being used to predict the same data set on which it was fit, a cross-validation procedure is used: the one-step-ahead predictions
t for each block of 25 trials are computed using a model fit to the data set with that block excluded.
If the model satisfactorily captures the adaptation dynamics, then the one-step-ahead prediction residuals yt
t should be free of temporal correlations, i.e., the residuals should be a white-noise sequence. We can compare this null hypothesis to the alternative (significant residual correlations exist) using a portmanteau test forserial autocorrelations (Hosking 1980
). This test is based on the autocorrelations of the prediction errors at a lag of
trials
![]() | (4) |
![]() | (5) |
and (thus) the Pm should be smaller than if the model were not sufficient. In the case of no inputs to the system and in the limit of large T, Pm is
2 distributed under the null hypothesis (Hosking 1980
Although the portmanteau test can be used for any maximum lag m, the statistical power of the test decreases with the lag (Davies and Newbold 1979
). On the other hand, larger lags need to be included in the portmanteau statistic to test for long-range residual autocorrelations. As a compromise, we will present the portmanteau statistic for maximum lags m up to eight trials, although no substantial differences were noted for maximum lags
20 trials.
| RESULTS |
|---|
|
|
|---|
The goal of this work is to quantify and characterize how such error sequences arise from a combination of error-corrective learning processes and the various sources of variability, including a stochastic component of the internal dynamics of adaptation. In the rest of this paper we use the LDS model of adaptation to accomplish this goal.
Error-corrective learning
We begin by determining which feedback signals, if any, drive error-corrective learning. In terms of the LDS model, this means identifying the inputs that lead to a significantly betterfit to the data. The three candidate inputs signals each have a corresponding LDS update (learning) equation
![]() | (6) |
![]() | (7) |
|
The relative contributions of these two feedback signals could, in principle, be quantified with the LDS approach. However, there is not sufficient statistical power to accomplish this when there is a strong correlation across trials between the two signals. In our case, the visually perceived error and the feedback shift are related by the expression vt = et + pt, and so we expect them to be correlated. Indeed, across subjects the mean (±SD) correlation coefficient between vt and pt in the STOCH-P trial blocks is 0.35 ± 0.12. It is thus not surprising that, although both the Mv and Mp single-input models are significant, the two-input model Mvp does not yield a significant improvement over either. The same qualitative results are obtained when the model selection procedure is applied to artificial data generated with the best-fit Mv or Mp model, underscoring the lack of power to quantify the relative roles of the two feedback signals. Thus in the remainder of the paper we will analyze both the Mv and Mp model classes.
Model parameters
We next consider the maximum-likelihood parameter values for the Mv and Mp model classes (Fig. 4). We will show that the learning rates are quite large, with subjects correcting for about one third of the error feedback after each trial. We will also show that the parameters governing state decay ("forgetting"), state noise, and output noise suggest an important role for each of these processes. Finally, we will show that, although the values for these latter parameters differ between the Mv and Mp model fits, the two model classes are nearly equivalent from the perspective of these data sets.
|
|
The parameters Q and R represent the state and output noise, respectively. The best-fit parameter values differ across the two model classes (Fig. 4). However, in both cases the magnitude of the state noise is comparable to that of the output noise. For example, in the Mv model fit, the SD of the state noise along its most variable axis (first eigenvalue) is 8.7 mm, compared with 12.3 mm for the output noise (Table 1). We will return to this comparison later.
Last, we analyze the differences between the Mv and Mp model fits. It might seem odd that the models agree so closely on the learning parameter B, which is applied to different input signals in the two models, whereas they disagree on the state decay parameter A. Nonetheless, these differences are expected given the relationship between the visually perceived error vt and the feedback shift pt. To show this we rewrite the state update equations for the two models (Eq. 6) using the LDS output (Eq. 1a) and the fact that et is the LDS output
![]() | (8) |
![]() |
This analysis shows that the Mv and Mp model classes are essentially equivalent, formally differing only in the structure of the noise terms. Because in the following we focus on these noise terms, we will continue to present results for both model classes, in that they represent endpoints in the continuum of models in which both signals contribute with varying strengths.
Model sufficiency
In the two previous sections, we showed that the visually perceived reach error and/or the visual feedback shift drive a large and significant adaptive response. Here we ask whether our LDS models of that response are sufficient, that is, whether they do a "good enough job" of explaining the trial-by-trial sequence of reach errors.
SAMPLE DATA.
Figure 5 shows the sequence of reach errors in the STOCH-P trial blocks for a single subject. In addition, this figure shows one-step-ahead model predictions for three different models, i.e., the predicted reach errors for each trial given the actual inputs and outputs up to the previous trial. The one-step-ahead predictions of the best-fit Mv and Mp models (Fig. 5, red and blue traces, respectively) largely overlap each other, as expected. These predictions appear to track the errors well, capturing the general trends in the data. However, two other features of these traces should be noted. First, although according to the LRT the Mv and Mp models fit the data significantly better than the null model M
(black trace), the difference in the one-step-ahead predictions is rather small. This is explained by the fact that the predictor is using all inputs and outputs up to a given trial to predict the output on the next trial. Second, all three models leave a large residual of unpredicted reach error in this sample data set.
|
PREDICTING REACH ERROR STATISTICS. Our first test of model sufficiency is how well a model predicts the statistical structure of the adaptive response to shifted feedback. As described in METHODS, we chose four statistical measures: the variance and autocorrelation of the reach errors and the covariance and cross-correlation between the reach errors and feedback shifts (Eq. 3). For each subject, we computed these measures from the empirical sequence of reach errors in the STOCH-P trial blocks. We also computed the measures from 100 combined Monte Carlo simulations of that sequence, generated from the best-fit LDS model and the true sequence of visual feedback shifts (see METHODS).
We first observe that the Mv and Mp models provide a nearly perfect account of the reach variance and autocorrelation (Fig. 6, A and B). However, the null model performs just as well by these measures. This result shows that the state decay A, state noise Q, and output noise R parameters of the LDS are sufficient to account for the second-order statistics of the reach errors. Furthermore, it shows that the maximum-likelihoodfitting procedure implicitly fits these quantities.
|
The close agreement between the empirical and predicted error-shift cross-correlation functions raises the question of how sensitive a measure this is. We want to be sure that the predictions depend on the details of the model and are not, for example, dominated by the sequence of visual shifts. Therefore we performed a sensitivity analysis to quantify how the predicted cross-correlation function depends on the model parameters. We generated Monte Carlo simulations with individual parameters altered from their actual best-fit values. The predicted cross-correlations were found to be sensitive to all four parameters and even fairly small parameter changes can produce large discrepancies between the data and model predictions (Fig. 6E, results for only Mv are shown).
RESIDUALS OF ONE-STEP-AHEAD PREDICTIONS. If the one-step-ahead model predictions captured the full dynamics of adaptation, then the residual errors should have no statistical structure, i.e., they should be white noise. This can be assessed by the portmanteau test for serial autocorrelations. If significant correlations existed in the model prediction residuals, then the model would be insufficient to account for the dynamics of adaptation and the model would be rejected. For all subjects, the one-step-ahead predictions of both the Mv and the Mp model leave no significant residual correlations in a cross-validation test for the STOCH-P trial blocks (portmanteau test, P > 0.05, max. lag m = 8).
These results suggest that the LDS models are sufficient to capture the trial-by-trial dynamics of adaptation. However, because the portmanteau test pools all residuals and all time lags into a single statistic, it has relatively low statistical power. This means that more subtle model inaccuracies might not be detectable with this approach. Detecting such inaccuracies is especially important when interpreting the state and output noise terms because model inaccuracies will appear in our model fits as additional noise. Therefore we performed a variety of additional analyses on the model residuals, testing for violations of normality, stationarity, and model linearity. We found no significant evidence for any of these effects, as described in detail in the APPENDIX.
Together, these analyses suggest that the LDS models considered here are indeed sufficient to explain the dynamics of reach adaptation in the STOCH-P trial blocks.
State noise
The two sources of variability in the LDS modelthe state noise and the output (or sensorimotor) noiseare conceptually quite different and both will contribute to the overall variability in any measure of performance. Although the output noise is uncorrelated across trials, state noise is accumulated in the state and thus its contribution to the reach variability is correlated across trials. Given that the state noise is often overlooked in models of motor variability, we want to confirm here that the level of state noise is indeed significant.
We use the LRT to compare the Mv or Mp model class to a null hypothesis class with no state noise. In practice, Q cannot vanish entirely or the parameter estimation algorithm would become unstable. Thus for the null hypothesis we use a model with the state noise covariance fixed to a negligible value (Q = 0.1 mm2) relative to the output noise covariance R. The LRT shows that the addition of state noise significantly improves the model fit in the STOCH-P condition for both learning models and for all subjects (Mv: P < 104; Mp: P < 0.003; n = 10).
We confirm this result by applying the portmanteau test to the best-fit null hypothesis model (i.e., the best model with Q = 0.1 mm2). These models could not capture the temporal structure of the reach errors: the residuals were significantly correlated across trials (Mv rejected in 10/10 subjects, Mp rejected in 7/10; P < 0.05, max. lag m = 8). These results establish that state noise contributes significantly to the trial-by-trial sequences of reach error.
Next, we assess the magnitude of the state noise by comparing it to the output noise. As a measure we choose the ratio of the largest eigenvalues of the state and output covariance matrices, vQ and vR, respectively, (see also Table 1)
![]() | (9) |
|
![]() |
![]() |
Adaptation dynamics generalizes to constant feedback shift
Up to this point we have examined only the adaptation dynamics in the STOCH-P trial blocks. One concern that might arise is that the results we have found, such as the magnitudes of the learning rates and state noise, are specific to the stochastic sequence of feedback shifts. It is important to verify that the conclusions drawn from studies using these dynamical systems techniques will generalize to other experimental paradigms, in particular to the blocked-exposure design traditionally used in studying reach adaptation. Therefore we quantified how well LDS models that were fit to the STOCH-P data predict the results of the CONST-P condition, which is similar to a blocked-exposure design. We focused on predictions of the steady state of adaptation, the statistics of the sequence of reach errors, and the residuals of the one-step-ahead model predictions.
In general, LDS models predict that adaptation should converge to a "steady state" when the input is held constant. However, as a result of the presence of state and output noise, there will be random fluctuations in both the state and the output even after this convergence has occurred. The LDS model predicts both the magnitude of this steady-state reach error and the fluctuations around it. We compared these model predictions to values obtained from the data. The Mv and Mp models make nearly identical predictions for the steady-state error magnitude and these values correlate well with the empirical data across subjects (Fig. 8A). There is a small bias, however: the models consistently predict a slightly smaller steady-state error than what is empirically observed (mean bias is 2.7 mm for Mv and 2.8 mm for Mp). Both models make accurate predictions for the SD of the reach error during the steady state (Fig. 8B).
|
|
Taken together these comparisons suggest that the dynamics of adaptation largely generalizes from a stochastic sequence of visual shifts to a constant shift paradigm. Furthermore, the LDS model fit to the stochastic shift data is able to quantitatively predict the key features of the of the blocked-exposure paradigm.
| DISCUSSION |
|---|
|
|
|---|
20% of the error observed in the preceding movement, on average. The data presented here are consistent with two candidate error signals: the visually perceived reach error and the artificial visual shift. Second, adaptation is driven by the internal dynamics of learning, including a decay back to a baseline state and the accumulation of an internal "learning" or state noise. Finally, the LDS model generalizes from the case of stochastic feedback shifts to the more traditional case of constant feedback shifts. Sufficiency and generalization of the LDS model
One finding of this study is that simple LDS models are sufficient to describe the dynamics of adaptation. If our model captured all the temporal structure in the data, the model prediction residuals should be a white-noise sequence. We therefore analyzed the correlations among and between residuals and various key task variables. We found no significant evidence for autocorrelations in the model residuals, nonstationarity and nonlinearities in the dynamics, or non-Gaussian noise.
Nevertheless, another indication that the LDS models are indeed capturing the dynamics of adaptation is the fact that the models generalize from stochastic to constant feedback shifts. Although the bulk of our analyses support this conclusions, there are three deviations from the model predictions that should be considered. First, the predicted covariances between reach errors and feedback shift were systematically smaller than the empirical values (Fig. 6C). Second, the models predicted a slightly smaller steady-state adaptation in the CONST-P condition (Fig. 8A). Third, the portmanteau test for generalization to the CONST-P case failed for a minority of the subjects.
All three of these observations could be explained by an underestimate of about 10 to 20% in the magnitude of the learning rate B. The learning rate controls how much the external input affects the state of adaptation and so a more negative B would increase the covariance between reach error and feedback shift in the STOCH-P condition. In addition, the magnitude of the steady-state error in the CONST-P condition depends only on the decay parameter A and the learning rate B. Increasing the magnitude of either parameter would lead to a larger predicted steady state (Cheng and Sabes 2006
). Similarly, if the estimated magnitude of B is low, there will be a persistent bias in the one-step-ahead prediction of the reach error in the CONST-P trial blocks, resulting in a significant correlation in the prediction residuals across trials. This would explain the occasional failure of the portmanteau test for generalization to the CONST-P condition. Visual inspection confirms that predictions of CONST-P reach errors are indeed biased in the very cases for which the portmanteau test failed (data not shown). Finally, a higher learning rate (more negative B) would not significantly degrade the predictions of the cross-correlations between reach errors and feedback shifts (Figs. 6E and 9E).
Why might the learning rate B have been underestimated? One obvious reason would be a bias in the maximum-likelihoodfitting procedure. However, we found no evidence for such a bias in control analyses in which we estimated the LDS parameters from artificial data generated from a known LDS (data not shown). Another possibility is that adaptation is a higher-order systemi.e., subjects maintain a memory of more than just the immediately preceding inputand these older error signals also influence learning. To address this possibility, we fit the stochastic shift data with augmented models in which learning is driven by the two preceding inputs. This model yielded a significant improvement for some subjects (LRT, Mv 2/10 subjects, Mp 6/10, with P < 0.05). Because both the feedback shift and the visually perceived error had sizable autocorrelations at a lag of one trial, this analysis had limited power. Therefore it is plausible that such second-order effects are present in all of our data sets. This extra source of input would explain the prediction errors described here, even in the absence of a fitting bias.
A second explanation for imperfect generalization to the constant-shift data could be the presence of multiple timescales of reach adaptation (Smith et al. 2006
). Suppose that there were two state variables that contribute to the trial-by-trial task performance: one that learns on the fast timescales described above and one with a much slower learning rate. The effects of the slow-learning system would not be apparent when the visual feedback shift changes on a trial-by-trial basis because its effects would average out across trials. However, when a constant feedback shift is used, the slow learning system would contribute to the state of adaptation. This contribution could account for the discrepancies we observed between the constant shift data and the model predictions.
State noise
We have found that state noise accounts for at least a quarter of the overall trial-by-trial variability in reaching, after discounting the changes arising from our artificial feedback shifts. The presence of significant state noise implies that sensorimotor calibration is changing continually, even without exogenous driving inputs. This model offers a strong counterpoint to the traditional view of motor variability as arising largely from limitations in the sensory and motor peripheries (Donchin et al. 2003
; Gordon et al. 1994
; Harris and Wolpert 1998
; Messier and Kalaska 1990
; Osborne et al. 2005
; Thoroughman and Shadmehr 2000
; van Beers et al. 1998
). Neglecting the presence of such state noise in studies of sensorimotor variability can lead to overestimates of those variances. Also, because state noise leads to correlations in movement variability across trials, the application of statistical models that assume independent noise across trials (e.g., Donchin et al. 2003
; Thoroughman and Shadmehr 2000
) may lead to incorrect conclusions (Cheng and Sabes 2006
).
There are several potential sources for the state noise we observed: variability could arise in the sensory processing of the error feedback signals; variability could be injected into the state during the process of adaptation, i.e., as a by-product of the computations that underlie learning; or variability can be introduced in the memory or maintenance of the state across trials. In fact, it is likely that at least some of the state noise comes from each of these sources. Quantifying the relative importance of sources of variability is a direction for further research.
An alternative explanation for the apparent state noise is that our LDS models are deficient in some respect. The resulting error in the state update would then be subsumed into the state noise when estimating the LDS parameters. In the APPENDIX, we argue that the state noise is unlikely to be the result of nonlinearity, nonstationarity, or nonnormal noise in the true process of adaptation. Of course, those analyses would be unable to detect high-order nonlinearity or rapid nonstationarity. As an extreme example, consider the case where the neural circuit that underlies adaptation is made up of a large number of deterministic, nonlinear "units" that combine to approximate a linear learning rule. This circuit is deterministic, but there will be many small fluctuations about the linear learning rule. Although such variability may in fact be "deterministic," from a practical perspective we may view it as "noise."
Another model assumption that could potentially be incorrect is that there is a single process giving rise to the adaptation we study here. We tested whether multiple processes with different timescales (Smith et al. 2006
) could account for the state noise. In fact, when we fit the data with a two-timescale model, the state noise was still significant for all subjects (likelihood ratio test). Furthermore, despite having many extra model parameters, the best-fit state noise in the two-timescale model was appreciably lower in only three subjects (data not shown) and, even in those cases, the state noise covariance was within the range of values found for other subjects with the single-timescale model. We conclude that the existence of multiple timescales could not account for the state noise that we observed.
The LDS models would also be deficient if they were missing a significant input signal. Two candidate signals come easily to mind. One candidate is the error feedback from earlier trials, as discussed earlier. However, the estimated state noise is qualitatively unchanged when these earlier inputs are included in the model (data not shown). Another variable that we did not include in our models is the actual position of the reach target, which was drawn uniformly from a 4-cm square for each trial. However, adding the target position improved the Mp model fit in only two subjects (LRT,
= 0.05) and lowered the estimated state noise covariance only marginally.
Finally, we recall that the distinguishing difference between state and output noise in the LDS model is the fact that the state noise creates variability, which is correlated across trials (a random walk), whereas the output noise in uncorrelated (white noise). Therefore noise in the sensory or motor periphery would mimic state noise if it were correlated across trials (on the order of tens of seconds to minutes). Although we know of no evidence for such correlations, they might exist and could account for some fraction of our observed state noise.
Error-driven learning
One clear result from this study is that the trial-by-trial dynamics of reach adaptation are well modeled by an error-corrective learning rule. Furthermore, the rate of learning is quite rapid, with an average correction in the state of
20% of the last error after each movement. These rapid adaptation dynamics appear to be inconsistent with slower models of learning, such as those based on Hebbian learning rules (Hua and Houk 1997
; Salinas and Abbott 1995
).
What is less clear, however, is the specific sensory signal that drives learning. We have shown that the visually perceived error and the artificial feedback shift provide equally good explanations of the trial-by-trial changes in reach perfor