|
|
||||||||
Laboratory for Computational Motor Control, Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, Maryland
Submitted 6 December 2006; accepted in final form 4 April 2007
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
More recent computational models of motor learning suggest that motor error engages multiple adaptive processes of different timescales: one process strongly responds to error but has poor retention and another has poor sensitivity to error but has better retention (Smith et al. 2006
). This model and its Bayesian variant (Kording 2007) emphasize that passage of time is an important variable that influences content of memory. Can such models help explain the massed versus spaced training effects?
Here we begin with an experiment that confirms the previous findings that reach adaptation in force fields is indeed faster (i.e., require fewer trials) when the intertrial interval (ITI) is increased from 4 to 14 s. We demonstrate that the multiple-timescale model under a Bayesian formulation can account for this result. Because time passage increases the Bayesian learner's uncertainty about his/her environment and increased uncertainty promotes the incentive to adapt, longer ITIs lead to fewer trials required for adaptation. The model predicts that the learner will learn more from a trial that immediately follows a long delay than one that immediately follows a short delay.
To test this prediction, we performed another experiment where ITIs were randomly distributed. However, we found results inconsistent with the predictions of the Bayesian model. People learned more from a trial that preceded a long delay, not a trial that followed a long delay. Therefore the improved adaptation rates in spaced trials were unlikely to be a result of increased uncertainty. Rather, our results suggest that movement errors produced a trace that continued to benefit the learner during the ITI. It appears that this trace has a time constant of about 4 s.
| METHODS |
|---|
|
|
|---|
![]() | (1) |
is in meters per second. No forces were applied during "catch trials" and trials in the null training set. Behavioral training
All procedures were approved by the Johns Hopkins Medicine Institutional Review Board. Subjects gave consent before their participation in the study. All subjects were healthy and right-handed. They were naive to the purpose of the experiment and had never participated in any experiment with our device before. Participants were seated in front of a flat-screen monitor situated at eye level (Fig. 1). With their right hand they grasped the handle at the end of the manipulandum to navigate a cursor (a white dot) on the screen. Subjects were trained to maintain the cursor at the center of the screen as indicated by a yellow crosshair until a 1-cm2 green target box appeared, which also served as the "go" signal. They were also told to "aim for the target as it appears and try making a straight point-to-point movement to the target in a fast and smooth fashion." The yellow crosshair vanished as soon as subject started to move.
|
Performance measure
Movement errors were measured as the signed perpendicular displacements (PDs) of the reach at peak speed with respect to a straight line to the target. We grouped 32 movements into one movement bin and averaged the errors in field and catch trials to arrive at
and
. Next, we computed a learning index (LI) (Criscimagna-Hemminger et al. 2003
; Hwang et al. 2003
)
![]() | (2) |
The denominator of this expression is a measure of limb compliance; smaller stiffness gives rise to larger differences in errors between catch and field trials. The numerator is a measure of change in motor output with respect to null trials. Therefore the ratio is a measure of learning, normalized with respect to limb compliance. As subjects learn to predict the forces, their movement errors in field trials decrease, whereas errors in catch trials increase. Thus we would expect the learning index to grow and plateau as the subjects adapt. Complete adaptation would result in a learning index of one. However, catch trials cause unlearning (Thoroughman and Shadmehr 2000
) and prevent the index from reaching one. The ratio of field trials to catch trials (i.e., 5/6) dictates the theoretical limit of learning index, which is 0.83. In our experience, the highest actual learning performance is slightly below this number (Criscimagna-Hemminger et al. 2003
). Movements were excluded if they did not meet the following criteria: maximal tangential speed was between 0.20 and 0.55 m/s, movement duration was between 0.3 and 1.2 s, and the total movement trajectory length was <20 cm.
Experiment 1: constant ITI
In this experiment we sought to replicate the results of Francis (2005)
and Bock et al. (2005)
. The experiment was divided into four sets (Fig. 1A), each containing 192 reach trials: set A (null baseline), set B (field), set C (null washout), and set D (field). In sets B and D, the field was randomly removed in one sixth of the trials (catch trials). Short breaks of 5 min were given between sets. Subjects (n = 24) were randomly assigned to two counterbalanced groups. One individual did not follow instructions and was excluded from data analysis. Both groups performed the same sequence of movements in each of the four sets with one difference: in set B of the first group, after the hand returned to the center location the target presentation was delayed by 10 s, whereas target presentation in all other sets was delayed by only 0.5 s (sets A, C, D). In the second group, target presentation was delayed by 10 s in set D and by 0.5 s in all other sets (sets A, B, C). We defined ITI to be the time between the onsets of consecutive reaching trials and it included all delays and movement time. Thus the presentation delays resulted in mean ITI of either 4 or 14 s.
Because we found that longer ITI produced significantly faster rates of adaptation, we considered two general models that might account for the data.
MODEL 1 (BAYESIAN MULTIRATE MODEL): MOTOR ADAPTATION AS OPTIMAL INFERENCE AT MULTIPLE RATES.
One way to account for time-dependent changes in motor performance is to envision that the learner is a Bayesian estimator that assumes that motor error arises from multiple causes: some perturbations go away quickly but tend to be highly variable (fast system; e.g., fatigue), whereas other perturbations tend to go away slowly and tend to be less variable (slow system; e.g., disease). We recently formalized this idea and demonstrated that a multiple-system model can account for a large body of data in saccade and reach adaptation (Kording 2007). The principal idea in this model is that movement error results in a credit assignment problem for the nervous system. To solve a credit assignment problem, we need to determine how to vary the contributions of the two systems to a common task: what is the timescale of perturbation that is most likely responsible for the current error? Is the perturbation likely to go away quickly or is it likely to be sustained? If it is likely to be sustainedas with spaced perturbationsthe learner should increase his/her error sensitivity for slower timescales. To show how such a model will be affected by ITI, suppose that the learner assumes that the perturbations (e.g., force f imposed on the limb) are caused by a linear combination of two sources, each with its own states, timescale, and noise properties
![]() |
![]() |
![]() | (3) |
0 is the noise in our sensors that measure the perturbation and
1 and
2 are noises associated with the fast and slow perturbations (we assume that a1 < a2 < 1 and
1 >
2). The learner's "knowledge" on each trial is a sum of contributions of the fast and slow systems with associated noises. It is convenient to rewrite Eq. 3 in vector format as
![]() |
![]() | (4) |
(n|n1) and a predicted perturbation
(n) = xT
(n|n1). The optimal way that it can distribute the error f(n)
(n) to each of the two potential sources is described by the gain k(n) (the Kalman gain). The mean of the posterior estimate will become
![]() |
![]() | (5) |
Now if during an iteration of the model, the learner makes a movement, it will make an observation, and therefore the posterior uncertainties will change
![]() | (6) |
![]() |
![]() | (7) |
![]() | (8) |
MODEL 2 (ERROR TRACE MODEL): ADAPTATION AS CONTINUOUS INTEGRATION OF MOTOR ERROR.
Model 1 assumes that adaptation in response to error is an instantaneous process that completes by the next iteration. A different way to view adaptation is to imagine that it is a process initiated with the experience of error, but continues as long as the error memory trace is available. Let us assume that the error trace is an exponentially decaying function of the form
![]() |
is the time variable, and r is the time constant for the temporal decay. Suppose that the learner continuously learns from the error trace that was initiated at the time of trial n until the new error trace at trial n + 1 interrupts this process, the learner's state at tn+1 (immediately before experiencing the error in trail n + 1) becomes
![]() | (9) |
![]() | (10) |
PREDICTIONS OF THE TWO MODELS. Models 1 and 2 both predict that longer ITIs will affect rates of adaptation, but their mechanisms are different. In Model 1, when a trial is followed by a long delay, parameter uncertainties can increase, which in turn increases the sensitivity to error in the trial that follows the long delay. In other words, longer ITIs signal the learner to pay attention. In Model 2, when a trial is followed by a long delay, the error that was experienced in the preceding trial is integrated over a longer time interval. In other words, longer ITIs allow more samplings from the error. This increases the sensitivity to error in the trial that preceded the long delay. Therefore the key experiment is one that measures the learner's sensitivity to error in trial n, as a function of the time that either preceded or followed that trial.
Experiment 2: variable ITI
This experiment (Fig. 1B) consisted of three sets (each with 192 trials) separated by short breaks: set A (null), set B (field), and set C (field). During the null training set, presentations of targets were delayed by 0.5 s as in experiment 1. During the field sets, one sixth of the trials were catch trials. However, unlike experiment 1, the delay before target presentation was pseudorandomly selected to be 0.5, 5, 10, or 20 s. Equal numbers of different delays for a given movement were presented in each set. Because an ITI also included the time for movements (that remained relatively constant for each participant), imposing this "go" signal delay resulted in mean ITIs of 4, 9, 14, and 24 s.
We recruited a new group of naive subjects for this experiment (n = 31). To assess sensitivity to error and the influence of ITI, we fitted the trial-by-trial movement errors to a state-space model (Wainscott et al. 2005
). The state-space model included a hidden state for each of the eight directions of movement. We estimated a trial-to-trial generalization function from the direction in which error was experienced to all possible directions. This generalization function was a measure of sensitivity to error. The models predicted that changing the ITI would affect this sensitivity. In particular, Model 1 predicted that the sensitivity to error experienced in trial n should increase as a function of the delay that preceded that trial. Model 2 predicted that the sensitivity to error experienced in trial n should increase as a function of the delay that followed that trial.
Experiment 3: variable ITI with channel trials and random forces
The results of experiment 2 were consistent with predictions of Model 2 but not Model 1. To test the assumptions of Model 2 more directly, we performed a final experiment. In this experiment, we measured state of the motor system directly through the use of "channel trials" (Hwang et al. 2006a
; Scheidt et al. 2000
; Smith et al. 2006
). In a channel trial, the robot restricts the hand's motion along a straight line to the target. Although it prevents errors during the reach, it allows us to measure how much force the subject expected to experience for that trial. This expectation is equal to the force that the subject produces against the channel wall. Previous work found that during reach adaptation, these forces gradually approximate the force field that the robot produces during free movements (Hwang et al. 2006). Our idea here was to use channel trials to measure the change in the expected force as a function of error in the previous reach trial.
Experiment 3 (Fig. 1C) consisted of three sets (each 192 trials), separated by short breaks: set A (null baseline), set B (random field), and set C (random field). We recruited a new group of naive subjects (n = 28). Unlike Experiments 1 and 2, however, there were only two targets: either up or down with respect to the center position. In the field sets subject were given, in a pseudorandom order, a clockwise curl-force field (Eq. 1, +V, 3/8 of the movements), a counterclockwise curl-force field (V, 3/8 of the movements), or a "channel trial" in which movements perpendicular to the target direction were prevented by a stiff one-dimensional spring/damper (2 kN/m, 45 Ns/m). The seamless production of the channel force was based on the hand position in the center start box and unperceivable to participants unless they purposely tried to move in the perpendicular direction. Such movements were not observed and participants reported that they were not aware of this force pattern at all.
We were interested in the change in the force exerted by the subject against the channel walls when two channel trials were separated by a trial in which there was a movement error. There were 34 such channelforcechannel trial triplets dispersed pseudorandomly in set B and 35 in set C. In the triplet, between the first and the second movements, the ITI was kept at a constant 4 s. Between the second and the third movements, we varied the ITI in a random fashion identical to the experiment 2 design. Thus for each triplet we looked at the difference in the force output in the first and third movements (both were channel trials) as a function of the time passed since the error experienced in the second movement. Model 2 predicts that when one experiences a reach error in trial n, on trial n + 1 one will produce a force against the channel walls that is proportional to this error and this sensitivity will grow as a function of the time between trial n and trial n + 1.
Bootstrap methods for estimating the confidence limits
We followed the procedures described in our previous publications (Donchin et al. 2003
), which followed procedures laid out by Efton and Tibshirani (1993)
, to estimate the SEs (Fig. 4) in the model parameters in the analysis of experiment 2. We resampled data from the 31 subjects with replacement and estimated the parameters from the averaged sample. We iterated this procedure 200 times and the SD of the 200 estimations of the parameters yielded an estimation of the SE of the parameters.
|
| RESULTS |
|---|
|
|
|---|
In experiment 1, we sought to replicate the results of Francis (2005)
and Bock et al. (2005)
. We quantified the effect of massed versus spaced training in a within-subject design (Fig. 1A). Subjects were randomly assigned to train with either a long (13.60 ± 0.13 s, mean ± SD) or short delay (3.88 ± 0.14 s) before each trial. We found that they showed better adaptation in the longer ITI setcompensation in field trials was stronger and aftereffects in catch trials were larger (Fig. 2, AC). We measured performance with a learning index in each of the six 32-movement bins in each set. Repeated-measures three-factor ANOVA analysis (ITI, set order, and movement bin) on the learning index showed no significant effect resulting from set order [F(1,21) = 0.020, P = 0.89]. We therefore combined data from the two groups and considered the effect of ITI on the learning (Fig. 2D). We found a significant interaction effect between ITI and movement bin [F(5,105) = 4.07, P = 0.002]. Post hoc analysis showed significantly better performance in bins 2, 3, and 4 within subjects (paired t-test, P < 0.05 for each bin) for the long ITI set. Therefore the adaptation rate was enhanced with the longer ITI.
|
Existing models and predictions
To account for the observation that performance during motor learning exhibited time-dependent changes such as savings and spontaneous recovery, our group previously proposed a deterministic multirate model that suggests that motor output is a sum of at least two systems: a fast adapting system that rapidly forgets and a slow adapting system that has good retention (Smith et al. 2006
). In this model, the learner's state (represented by variable z) is a reflection of two underlying systems: one system (represented by state z1) is highly sensitive to errors and changes rapidly, but has a limited capacity and will tend to quickly forget. Another system (represented by state z2) has low sensitivity to error and changes slowly, but has large capacity and will tend to remember. The learner's "knowledge" on each trial is a sum of contribution of the fast and slow systems: z = z1 + z2. In this model, after an error y(n) in trial n, the state of the learner {z1, z2} changes deterministically as follows
![]() |
![]() | (11) |
|
The link between trial-to-trial sensitivity to error, generalization, and overall learning rate
In the Bayesian model, Eqs. 58 describe how the state of the learner changes on a trial-to-trial basis. We showed that in simulation, an increased trial-to-trial sensitivity to error sufficiently expedites overall learning rate (Fig. 3A). The trial-to-trial sensitivity to error, however, is not uniform across all movement directions. Gandolfo et al. (1996)
demonstrated that the amount of trial-to-trial sensitivity depends on the angular disparity between movement directions. That is, errors in one direction affect the states of the learner in other directionsa phenomenon termed generalization. Therefore if increased ITIs increase trial-to-trial learning and in turn overall learning rate, the effect will produce a modulation of generalization as a function of ITIs.
Experiment 2: sensitivity to error in trial n increased with time between trials n and n + 1
In experiment 2, the delay between movements was drawn from a multinomial random variable. The ITIs were approximately 4, 9, 14, and 24 s [respectively, 4.14 ± 0.17, 8.63 ± 0.14, 13.63 ± 0.16, or 23.70 ± 0.12 s (mean ± SD)]. We were interested in measuring the learner's sensitivity to error in each trial as a function of the ITI. To estimate this sensitivity, we used a state-space approach (Donchin et al. 2003
). For each subject we measured the error y*(n) in each trial n, force f(n) produced by the robot at that velocity, and target direction L(n). Because the movement directions were identical between subjects, we averaged the movement errors and forces across subjects and arrived at a single sequence.
To determine how the error in each trial affected the movement that the subject made in the subsequent trial, we fitted this sequence to a hidden state-space model (Donchin et al. 2003
; Wainscott et al. 2005
). The hidden states represented the knowledge of the learner about the perturbation in each direction of movements. On experience of an error in a given direction, we estimated how this error was generalized to other directions. We also estimated how this generalization was modulated by the time spent between the trials. The sequence of movement errors were fitted to the following dynamical system
![]() |
![]() | (12) |
, where
= tn tn1.
Using nonlinear optimization (the lsqnonlin function in Matlab with default settings), we fitted Eq. 12 to the measured data [F(35,349) = 33.95, r2 = 0.7730, P < 0.0001]. The resulting generalization function B (with SEs of the mean estimated through a bootstrap procedure) is plotted in Fig. 4A. This function had its peak at 0 ° and decreased with angular distance in a pattern similar to those recorded in other studies where ITIs were kept constant (Donchin et al. 2003
; Wainscott et al. 2005
). We found that the function a remained extremely close to 1 (Fig. 4B), suggesting that there was little or no forgetting during the seconds that passed between trials. Arm compliance D was consistent with previous measurements of limb compliance (Fig. 4C) (Mussa-Ivaldi et al. 1985
). However, contrary to the predictions of the Bayesian model, we found that k did not monotonically increase with the time that preceded the trial (Fig. 4D, gray line).
The error trace model
Because the results of our fit were inconsistent with the Bayesian model, we considered an alternative model. It is possible that in spaced training, adaptation rate is faster because errors produce a memory trace that decays with time but that the learner continues to benefit from the trace during the ITI period (Fig. 3C), effectively continuing to learn from the error trace. In this model, the time that is of importance is the period that follows a trial, not the time that precedes it (Eq. 10). The error trace model predicts that sensitivity to error in trial n should increase as a function of the time between trials n and n + 1. (In contrast, the Bayesian model predicted that the sensitivity to error in trial n should increase a function of time between trials n 1 and n.) To test for this prediction, we slightly modified Eq. 12 to represent this idea and then fitted Eq. 13 to the same data observed in experiment 2
![]() |
![]() | (13) |
The results produced a highly significant fit [r2 = 0.7733, F(35,349) = 34.01, P < 0.0001]. Although there were few or no changes in the model parameters B, a, or D (Fig. 4, AC), the important parameter k changed significantly: it now became a function that monotonically increased with ITI (Fig. 4D). That is, results of experiment 2 suggested that the sensitivity to error in trial n increased as a function of postmovement ITI, not premovement ITI. This result was consistent with the error trace model but not the Bayesian model.
Was the sensitivity to error significantly greater at longer ITIs? To check for this, we asked whether the slope of the function k(
) in Eq. 13 was significantly greater than zero. We first computed k(
) from each of the bootstrapped groups and then estimated the slope by fitting a straight line. The P value was estimated by counting the number of nonpositive slopes and then dividing by the total number of bootstrapped samples. We found a significantly positive slope (P < 0.005). On average, the subjects' generalization was 45% higher during those trials with the longest ITI with respect to those with the shortest ITI.
What was the time constant of the error trace? To estimate this time constant, we replaced the term k(
) in Eq. 13 with the exponential function in Eq. 10 and refitted the system of equations to the measured data. We again found a highly significant fit [r2 = 0.77, F(29,355) = 40.8, P < 0.0001]. The fit estimated an error trace time constant of about 4 s (r = 3.77 ± 0.6 s).
In summary, we found that improved performance in the spaced trials was attributed to increased error sensitivity as a function of the period that followed the movement, as would be predicted by the error trace model, and not the period that preceded the movement, as would be predicted by the Bayesian model.
Cross-validation of the error trace model
To validate the error trace model, we performed two tests. First, we asked whether the specific parameter values found in the random ITI of experiment 2 could explain the performances in the constant ITI of experiment 1. Second, we performed experiment 3 to specifically measure the change in motor output as a function of ITI.
In our first test, we asked whether the model of Eq. 13 could predict the specific shape of the learning function that we had measured in the short- and long-delay conditions of Fig. 2D. We ran the dynamic system of Eq. 13 on the target sequence L(n) and force sequence f(n) of experiment 1, generating a sequence of errors in field and catch trials. We then computed a learning index on this sequence of movement errors in the same way that we had computed the performance of our subjects (Eq. 2). For simplicity, predicted and actual performances were computed on a single sequence of movements by first averaging data across subjects; the results were similar to those shown in Fig. 2D. Figure 4E shows performance of the predicted and the actual data from experiment 1. There was an excellent correspondence between the predicted performance and the measured data.
Experiment 3: predictions of the error trace model
A crucial prediction of the error trace model is that when one experiences a motor error, the longer one waits, the larger will be the effect of this error on the motor output in the next trial. To test this prediction, we used force channels, first introduced by Scheidt et al. (2000)
. In a "channel" trial, the hand is guided and restricted along a straight line to the target. The motor output that is relevant to the task is the force that the subject produces against the walls of this channel. In a triplet of "channelfieldchannel" trials, one can measure the change in motor output between the two channel trials as a function of the error in the intervening field trial. The prediction of the error trace model is that this change should monotonically increase with the time period between the field trial (when error was experienced) and the second channel trial.
In a channel trial n, let us label the force perpendicular to the direction of motion produced at maximum velocity as u(n). Assume u(n) =
(n)
(n), where
(n) is the subject's best guess at the constant V in Eq. 1, and that
(n+1) =
(n) because the subject did not experience an error in trial n (as it was a channel trial). Now in trial n + 1 (field trial), the subject experienced a movement with error y(n+1), and this results in adaptation
![]() | (14) |
= tn+2 tn+1. Thus for any channelforcechannel triplet, we have
![]() | (15) |
|
| DISCUSSION |
|---|
|
|
|---|
We considered two models of motor adaptation that are sensitive to passage of time: a Bayesian multirate model (Eq. 3) and an error trace model (Eq. 10). Our previous work suggested that motor errors result in an adaptive response in at least two "systems": a fast system that rapidly learns but has poor retention and a slow system that is less sensitive to error but hardly forgets (Smith et al. 2006
). Unfortunately, this model could not explain the results of experiment 1. A Bayesian variant of this model recasts it in a probabilistic framework (Kording 2007). It hypothesizes that spaced training leads to improved rates of adaptation because during the time between trials n 1 and n, the brain becomes uncertain about its internal model. The increased uncertainty results in increased sensitivity to subsequent motor errors. The model predicts that if one could measure error sensitivity on each trial, one would find that the sensitivity to error in trial n increases as a function of the time between trials n 1 and n. In experiment 2, we tested and found results that were inconsistent with this prediction: the sensitivity to error in trial n monotonically increased with the time period that followed that trial, not the period that preceded that trial. Therefore spaced training improved rates of performance not because time delay made the learner more sensitive to the error in the next movement, but because it made the learner learn more from the error in the last movement.
To account for this result, we proposed that the error might be represented by a trace that exponentially declined with time, effectively allowing the nervous system to learn from the trace for as long as it was available. Results of experiment 2 suggested that the error trace had a time constant of 4 s. To test the model more directly, we performed a final experiment where movements experienced random errors, but were sandwiched between "channel" trials from which we could measure change in motor output from error. Consistent with the trace model, we found that the sensitivity to error experienced in trial n increased with the delay between trials n and n + 1.
We assumed that the error trace was "reset" by the next trial. We also considered the possibility that the error trace lingered beyond the immediate next trial. If the time constant of such a "lingering" trace is more than a few seconds, one expects that the generalization function will be independent of the ITI. This was inconsistent with our results in the second experiment.
We believe cognitive strategies had minimal effect in our data. We examined the effect of conscious policy versus an implicit memory in a recent report and found that there was a high probability that subjects would become conscious of the force field pattern when the number of targets was small (three) and the forces were consistent (Hwang et al. 2006b
). Furthermore, in that experiment we did find that subjects who became conscious of the force pattern had a small but significant boost on learning versus those who did not. In the current experiment, we designed our procedures to minimize this effect. First, we had eight targets rather than three. Second, we performed a control experiment where forces were random. Importantly, we observed that the effect of ITI in the random experiment was consistent with the ITI effect in the constant force field experiment. Furthermore, we asked subjects what they were thinking during the intertrial intervals in postexperimental questionnaires (experiment 3). No subject was aware of the force pattern (because it was random) and only one subject in 28 answered that it was an attempt to come up with a cognitive strategy. Because participants were required to manually keep the cursor centered during the wait, the vast majority answered that they were tying to focus on centering the cursor.
The idea that adaptation might take place during the time between trials is a common theme among computational models of learning where events produce an eligibility trace for synaptic plasticity. For example, Sutton and Barto (1981)
suggested that a stimulus or error signal that excites a neuron may produce an eligibility trace on the neuron's synapses that acts as a low-pass filter of that input. When the input is removed, the trace declines exponentially in time. As long as the error and stimulus traces are available, their coincidence results in modification of the synapse associated with the stimulus. If each new error or stimulus cancels the trace of the previous input, such models predict that the effect of a given error should grow with the ITI between trials. The results of experiments 2 and 3 are consistent with this framework.
For the reaching task considered here, a candidate area where such computations may be performed is the cerebellum (Smith and Shadmehr 2005
). An eligibility trace (or in our terms, sensitivity to error) may be represented by the concentrations of second-order messenger chemicals. Several studies suggest that parallel fiber activity is responsible for the graduate rise of this trace (Kettner et al. 1997
; Raymond and Lisberger 1998
). Similar proposals were previously suggested for timed learning in delayed conditioning of eye blinks (Fiala et al. 1996
); parallel fiber activity leads to increased phosphorylation of receptors over time and, in turn, reduces Purkinje cell firing during the interval between the sustaining conditioned stimulus (CS) and the onset of unconditioned stimulus (US). Interestingly, it was previously observed that delayed conditioned response can be learned if CS precedes the US by
4 s (Gormezano 1966
), suggesting that such persistent phosphorylation of Purkinje cell receptors would have similar time course put forth here in the error trace model.
From a neurobiological perspective, synaptic changes that are produced by spaced training produce memories that are dependent on protein synthesis (Comas et al. 2004
; Josselyn et al. 2001
; Locatelli et al. 2002
; Maldonado et al. 1997
; Scharf et al. 2002
; Tully et al. 1994
). Indeed, cellular response in animal models to spaced stimuli may prime additional memory traces and give rise to resistance to memory interference in different temporal phases (Isabel et al. 2004
). However, such cellular processes generally occur on a much longer timescale than what we examined here. The structure of our model might imply that the error trace is kept in some kind of buffer that continues to benefit the learner. Any mechanism with which the influence of the error on the next trial can grow as a function of ITI will produce the same result. What might be the neural basis of such a mechanism?
Neurons that are stimulated with longer ITIs produce larger long-term potentiation (Scharf et al. 2002
) and are more resistant to depotentiation. Staddon et al. (Staddon and Higa 1996
; Staddon et al. 2002
) and Fusi et al. (2005)
proposed a cascade model of synaptic plasticity that can account for this. For example, in the model of Fusi et al. (2005)
, a given synaptic strength is supported by a synaptic state that may be shallow or deep in its cascade. The probability of transition in the synaptic state depends on the depth of that state: the deeper the state, the more resistant it is to change. If we imagine that it takes time for the internal state of the synapse to transition from one depth in its cascade to another and that the time needed increases with the depth of the state, then events that can cause synaptic change are more effective when they come spaced in time. At short ITI, only those synapses change that have a shallow internal state. With increased ITI, one engages not only the shallow state synapses, but also has a higher likelihood of causing state change in the deeper state synapses. Neural models of adaptation that rely on such synapses should exhibit the ITI-dependent patterns of generalization that we found here.
There are a number of limitations to our error trace model of motor adaptation. By itself, the model cannot account for much of the rich body of data that was recently highlighted in Smith et al. (2006)
. For example, if there is an error trace, at this point we do not know how that trace affects the fast and slow systems that were inferred from that study. To account for those data, one idea is to combine the error trace model with the Bayesian model (Kording 2007) so that the effect of an observed error is a memory trace that decays in time. It was previously suggested that uncertainty of a task variable is encoded in the lateral intraparietal area (Platt and Glimcher 1997
; Schall and Thompson 1999
) and acetylcholine and norepinephrine were suggested to play crucial roles in forming the context-dependent priors during learning (Dayan and Yu 2006
; Yu and Dayan 2005
). Our data do suggest that this uncertainty does not change during the delay period between trials.
| GRANTS |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: V. Huang, Johns Hopkins University School of Medicine, 720 Rutland Ave., 416 Traylor Building, Baltimore, MD 21205 (E-mail: vhuang{at}jhu.edu)
| REFERENCES |
|---|
|
|
|---|
Bhushan N, Shadmehr R. Computational nature of human adaptive control during learning of reaching movements in force fields. Biol Cybern 81: 3960, 1999.[CrossRef][Web of Science][Medline]
Bock O, Thomas M, Grigorova V. The effect of rest breaks on human sensorimotor adaptation. Exp Brain Res 163: 258260, 2005.[CrossRef][Web of Science][Medline]
Comas D, Petit F, Preat T. Drosophila long-term memory formation involves regulation of cathepsin activity. Nature 430: 460463, 2004.[CrossRef][Medline]
Commins S, Cunningham L, Harvey D, Walsh D. Massed but not spaced training impairs spatial memory. Behav Brain Res 139: 215223, 2003.[CrossRef][Web of Science][Medline]
Criscimagna-Hemminger SE, Donchin O, Gazzaniga MS, Shadmehr R. Learned dynamics of reaching movements generalize from dominant to nondominant arm. J Neurophysiol 89: 168176, 2003.
Dayan P, Yu AJ. Phasic norepinephrine: a neural interrupt signal for unexpected events. Network 17: 335350, 2006.[Web of Science][Medline]
de Haan A, Jones DA, Sargeant AJ. Changes in velocity of shortening, power output and relaxation rate during fatigue of rat medial gastrocnemius muscle. Pfluegers Arch 413: 422428, 1989.[CrossRef][Web of Science][Medline]
Donchin O, Francis JT, Shadmehr R. Quantifying generalization from trial-by-trial behavior of adaptive systems that learn with basis functions: theory and experiments in human motor control. J Neurosci 23: 90329045, 2003.
Ebbinghaus H. Memory: A Contribution to Experimental Psychology. New York: Dover, 1964.
Efron B, Tibshirani RJ. An Introduction to the Bootstrap Monographs on Statistics and Applied Probability. New York: Chapman & Hall, 1993.
Fiala JC, Grossberg S, Bullock D. Metabotropic glutamate receptor activation in cerebellar Purkinje cells as substrate for adaptive timing of the classically conditioned eye-blink response. J Neurosci 16: 37603774, 1996.
Francis JT. Influence of the inter-reach-interval on motor learning. Exp Brain Res 167: 128131, 2005.[CrossRef][Web of Science][Medline]
Fusi S, Drew PJ, Abbott LF. Cascade models of synaptically stored memories. Neuron 45: 599611, 2005.[CrossRef][Web of Science][Medline]
Gandolfo F, Mussa-Ivaldi FA, Bizzi E. Motor learning by field approximation. Proc Natl Acad Sci USA 93: 38433846, 1996.
Gormezano I. Classical conditioning. In: Experimental Methods and Instrumentation in Psychology, edited by Sidowski JB. New York: McGraw-Hill, 1966.
Han JS, Gallagher M, Holland P. Hippocampal lesions enhance configural learning by reducing proactive interference. Hippocampus 8: 138146, 1998.[CrossRef][Web of Science][Medline]
Hwang EJ, Donchin O, Smith MA, Shadmehr R. A gain-field encoding of limb position and velocity in the internal model of arm dynamics. PLoS Biol 1: e25, 2003.[Medline]
Hwang EJ, Smith MA, Shadmehr R. Adaptation and generalization in acceleration-dependent force fields. Exp Brain Res 169: 496506, 2006a.[CrossRef][Web of Science][Medline]
Hwang EJ, Smith MA, Shadmehr R. Dissociable effects of the implicit and explicit memory systems on learning control of reaching. Exp Brain Res 173: 425437, 2006b.[CrossRef][Web of Science][Medline]
Isabel G, Pascual A, Preat T. Exclusive consolidated memory phases in Drosophila. Science 304: 10241027, 2004.
Jaric S, Radovanovic S, Milanovic S, Ljubisavljevic M, Anastasijevic R. A comparison of the effects of agonist and antagonist muscle fatigue on performance of rapid movements. Eur J Appl Physiol Occup Physiol 76: 4147, 1997.[CrossRef][Web of Science][Medline]
Josselyn SA, Shi C, Carlezon WA Jr, Neve RL, Nestler EJ, Davis M. Long-term memory is facilitated by cAMP response element-binding protein overexpression in the amygdala. J Neurosci 21: 24042412, 2001.
Kettner RE, Mahamud S, Leung HC, Sitkoff N, Houk JC, Peterson BW, Barto AG. Prediction of complex two-dimensional trajectories by a cerebellar model of smooth pursuit eye movement. J Neurophysiol 77: 21152130, 1997.
Kording KP, Tenanbaum T, Shadmehr R. The dynamics of memory as a consequence of optimal adaptation to a changing body. Nat Neurosci In press.
Locatelli F, Maldonado H, Romano A. Two critical periods for cAMP-dependent protein kinase activity during long-term memory consolidation in the crab Chasmagnathus. Neurobiol Learn Mem 77: 234249, 2002.[CrossRef][Web of Science][Medline]
Maldonado H, Romano A, Tomsic D. Long-term habituation (LTH) in the crab Chasmagnathus: a model for behavioral and mechanistic studies of memory. Braz J Med Biol Res 30: 813826, 1997.[Web of Science][Medline]
Mussa-Ivaldi FA, Hogan N, Bizzi E. Neural, mechanical, and geometric factors subserving arm posture in humans. J Neurosci 5: 27322743, 1985.[Abstract]
Platt ML, Glimcher PW. Responses of intraparietal neurons to saccadic targets and visual distractors. J Neurophysiol 78: 15741589, 1997.
Raymond JL, Lisberger SG. Neural learning rules for the vestibulo-ocular reflex. J Neurosci 18: 91129129, 1998.
Savion-Lemieux T, Penhune VB. The effects of practice and delay on motor skill learning and retention. Exp Brain Res 161: 423431, 2005.[CrossRef][Web of Science][Medline]
Schall JD, Thompson KG. Neural selection and control of visually guided eye movements. Annu Rev Neurosci 22: 241259, 1999.[CrossRef][Web of Science][Medline]
Scharf MT, Woo NH, Lattal KM, Young JZ, Nguyen PV, Abel T. Protein synthesis is required for the enhancement of long-term potentiation and long-term memory by spaced training. J Neurophysiol 87: 27702777, 2002.
Scheidt RA, Dingwell JB, Mussa-Ivaldi FA. Learning to move amid uncertainty. J Neurophysiol 86: 971985, 2001.
Scheidt RA, Reinkensmeyer DJ, Conditt MA, Rymer WZ, Mussa-Ivaldi FA. Persistence of motor adaptation during constrained, multi-joint, arm movements. J Neurophysiol 84: 853862, 2000.
Shadmehr R, Mussa-Ivaldi FA. Adaptive representation of dynamics during learning of a motor task. J Neurosci 14: 32083224, 1994.[Abstract]
Smith MA, Ghazizadeh A, Shadmehr R. Interacting adaptive processes with different timescales underlie short-term motor learning. PLoS Biol 4: e179, 2006.[CrossRef][Medline]
Smith MA, Shadmehr R. Intact ability to learn internal models of arm dynamics in Huntington's disease but not cerebellar degeneration. J Neurophysiol 93: 28092821, 2005.
Staddon JE, Chelaru IM, Higa JJ. Habituation, memory and the brain: the dynamics of interval timing. Behav Process 57: 7188, 2002.[CrossRef][Web of Science][Medline]
Staddon JE, Higa JJ. Multiple time scales in simple habituation. Psychol Rev 103: 720733, 1996.[CrossRef][Web of Science][Medline]
Sutton RS, Barto AG. Toward a modern theory of adaptive networks: expectation and prediction. Psychol Rev 88: 135170, 1981.[CrossRef][Web of Science][Medline]
Thoroughman KA, Shadmehr R. Learning of action through adaptive combination of motor primitives. Nature 407: 742747, 2000.[CrossRef][Medline]
Tully T, Preat T, Boynton SC, Del Vecchio M. Genetic dissection of consolidated memory in Drosophila. Cell 79: 3547, 1994.[CrossRef][Web of Science][Medline]
Wainscott SK, Donchin O, Shadmehr R. Internal models and contextual cues: encoding serial order and direction of movement. J Neurophysiol 93: 786800, 2005.
Yu AJ, Dayan P. Uncertainty, neuromodulation, and attention. Neuron 46: 681692, 2005.[CrossRef][Web of Science][Medline]
This article has been cited by other articles:
![]() |
V. S. Huang, R. Shadmehr, and J. Diedrichsen Active Learning: Learning a Motor Skill Without a Coach J Neurophysiol, August 1, 2008; 100(2): 879 - 887. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Hadipour-Niktarash, C. K. Lee, J. E. Desmond, and R. Shadmehr Impairment of Retention But Not Acquisition of a Visuomotor Skill Through Time-Dependent Disruption of Primary Motor Cortex J. Neurosci., December 5, 2007; 27(49): 13413 - 13419. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. A. G. Mattar and D. J. Ostry Modifiability of Generalization in Dynamics Learning J Neurophysiol, December 1, 2007; 98(6): 3321 - 3329. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |