|
|
||||||||
Laboratoire Ureca, UFR de Psychologie, Université Ch. De Gaulle Lille III, Villeneuve d'Ascq, France
Submitted 8 December 2006; accepted in final form 8 August 2007
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
A number of studies have demonstrated that low-level (i.e., visual) and high-level factors influence the average time it takes to respond to a visual stimulus. For instance, visual factors such as eccentricity (Kalesnykas and Hallett 1994
), stimulus arrangement (Fischer and Ramsperger 1984
; Kingstone and Klein 1993
; Saslow 1967
), luminance or contrast (Carpenter 2004a
) have been shown to greatly affect saccade latencies. Other higher-level factors such as attention (Crawford and Muller 1992
; Hoffman and Subramaniam 1995
; Kowler et al. 1995
; Madelain et al. 2005
; Shepherd et al. 1986
), expectation (Anderson and Carpenter 2006
; Carpenter 2004a
; Carpenter and Williams 1995
), or urgency (Montagnini and Chelazzi 2005
) can control saccadic reaction time as well.
Interestingly, saccade latencies also change on a trial-by-trial basis, even though the procedure remains unchanged. This observation has been reported in a number of studies: over a series of trials, latency varies unpredictably over a wide range, resulting in typically skewed distributions. At a neural level, it has also been reported that responses of single neurons vary from trial to trial (Schall and Bichot 1998
; Thompson et al. 1996
). Moreover, most models of decision making, such as the LATER model (Carpenter and Williams 1995
; Reddi and Carpenter 2000
) or the diffusion model (Smith and Ratcliff 2004
) are based on the hypothesis that some random noise must be incorporated into the model to account for reaction-time distributions. It has been proposed that noisy decision processes reflect an evolutionary strategy allowing an adaptive randomization of behavior (Carpenter 1999
): small variations in a number of competing processes might produce large unpredictable variations in choice of response. The nature of the process underlying this variability is unknown but is assumed to result either from some dedicated mechanisms of gratuitous randomization (Carpenter and Reddi 2001
) or noisy accumulation of information (Ratcliff 2001
). An important feature of these models is that they imply a mechanical relation between the mean and the variance of the distribution: changing one single parameter in these models affects both the mean and variance of the latency distributions. This relationship is also found in empirical psychometric studies, and it has recently been proposed that there is a linear relationship between the mean and variance of reaction time-distributions (Wagenmakers and Brown 2007
; Wagenmakers et al. 2005
). Such a relation might be of importance to allow for a better mathematical description of reaction-time distributions as well as for theoretical considerations.
Despite these strong theoretical predictions, to our knowledge, there has been no systematic attempt to study the ability to control variability in saccadic latencies and to test experimentally for a possible dissociation between the mean—or median—and the variability of reaction-time distributions by changing one parameter while keeping the other constant. In this paper, we investigate the effects of reinforcement contingencies on latency variability in the saccadic system. Specifically, our experimental paradigm was designed to manipulate independently the median and variability of reaction time distributions. We also test for the possibility of extending our procedure to manual responses to ascertain that the control of variability is not specific to the saccadic system.
| METHODS |
|---|
|
|
|---|
The target and distractor stimuli consisted of luminance-defined Gaussian patches (SD = 0.93°) embedded in background noise (white uniform noise, luminance = 31 ± 23 cd/m2), the target stimulus being brighter than the mask (Fig. 1). The target and distractor stimuli were located to the right and left of a fixation point at an eccentricity of 9.22°.
|
90% correct responses (maximum luminance = 21.16, 21.16, 20.85 and 21.16 cd/m2, respectively, for subjects LC, LM, AC and CP), and the luminance of the distractor was identical for all subjects (maximum luminance = 16.06 cd/m2). We used a two-alternative forced-choice procedure, and subjects were instructed to make a saccade to the brighter luminance patch (the target stimulus).
Each trial started with a fixation period of 500–1,000 ms during which a fixation cross superimposed on a patch of random noise was displayed at the center of the screen (Fig. 1). The target and distractor stimuli then appeared simultaneously and were displayed for 1,500 ms. The target stimulus was either placed at the right or left location in a pseudorandom fashion. A fixation cross was then displayed at the center of the monitor. Experimental sessions consisted of 200 trials; each subject performed two sessions on the same day. Prior to the first session subjects performed
200 training trials in which they were asked to saccade to the brightest stimulus as fast as they could.
Experiment one: saccade latencies
The first experiment studied the ability to manipulate independently the median and SD in saccade latencies distributions. Saccade latencies were computed on-line.
Subjects received feedback at the end of each trial depending on their choice and the latency of the saccade. If the gaze landed on the target (correct choice) and the latency remained within a set of temporal criteria, the trial was successful, and an auditory tone was delivered at the end of the trial, i.e., 1,500 ms after the stimulus onset (Fig. 1). If the choice was correct but the latency was too short or too long, a gray square (lighter or darker than the background, respectively) was displayed at the center of the monitor. If the eye landed on the distractor, no feedback was given. When three of five consecutive trials were successful including the fifth one, auditory feedback (a synthetic voice saying "bravo") was delivered indicating that the subject won one point. Subjects were instructed to win as many points as they could.
To compute the upper and lower limits of the temporal criteria, we used a variation of a percentile procedure (Machado 1989
) that allows the criteria to change from trial to trial with respect to the latencies collected on the previous trials (Fig. 2). For each experimental condition and each subject, a target median latency and a probability of reward were used: the target median latency was set with respect to a baseline performance (Fig. 2A), and the probability of reward was either 0.35 (low variability) or 0.8 (high variability). Based on the cumulative latency distribution computed over the last 50 correct trials, the lower limit was defined as the lower observed latency less than the target median latency that was within the rewarded probability range. Similarly, the upper limit was defined as the upper observed latency after the target median latency by half of the probability (Fig. 2, B and C). The lower limit was automatically set to 0 (see Fig. 2D) when the minimum of the last 50 correct trials was too high, and the upper limit was set to 1,000 when the maximum was too low. For the first 50 correct trials, the limits were computed based on the previous session, allowing across sessions changes.
|
Each subject was tested in five experimental conditions defined by a target median latency and a probability of reward: baseline; baseline median, low variability (B-LV); baseline median, high variability (B-HV); fast median, low variability (F-LV); slow median, low variability (S-LV). In the first condition (baseline), no feedback was delivered to establish baseline performance. Each subject performed five sessions (1,000 trials) of these baseline trials. In a second condition (B-LV), the target median latency was the median of the baseline distribution and the probability was 0.35. These values were chosen to encourage subjects to respond with a median latency similar to the baseline one but with lower variability. In a third condition (B-HV), the target median latency remained unchanged and the probability was set to 0.8. To further encourage subjects to increase the variability of the latency distribution while keeping the median constant, the probability of receiving feedback on a successful trial was 1 minus the frequency of the latency over the last 50 correct trials. For instance, if the frequency of the binned latency X was 0.4 on the last 50 correct trials, the subjects received a feedback with a probability of 0.6 when making saccades with binned latency X, even though the latency fell within the temporal criteria. This forced subjects to vary their reaction time to maximize the probability of receiving a feedback. In the last two conditions (F-LV and S-LV), the probability was set to 0.35, and the fixed points corresponded to the latencies at which the frequency reached 0.2 and 0.8 in the cumulative baseline distributions, respectively (Fig. 2A). In each of these experimental conditions, the limits were adjusted on a trial-by-trial basis (Fig. 2E). An experimental condition was completed when both the median and SD remained mostly unchanged for the five previous sessions (1,000 trials). One subject (CP) performed the same conditions but in a different order (baseline, B-LV, S-LV, F-LV, and B-HV).
Experiment two: manual reaction time
The second experiment was similar to the first except that manual responses were used. Subjects were asked to press a left or right button depending on whether they saw the target stimulus on the left or on the right side of the screen. Manual reaction times were computed on-line by measuring the time between the stimulus and the response onset. The same five experimental conditions were used (baseline, B-LV, B-HV, F-LV, S-LV). The cumulative distributions of reaction time in the baseline condition were used to determine the target median latencies for the other four conditions. This experiment was conducted 2 months after the end of the first experiment. Only three subjects (LC, LM, and AC) performed this second experiment.
Data acquisition and analysis
Stimuli were generated on a Power Mac G4 using the Psychophysics Toolbox extensions for Matlab (Brainard 1997
; Pelli 1997
), and displayed on a video monitor (Iiyama HM204DT, 100 Hz) at a viewing distance of 60 cm. To minimize measurement errors, the subject's head movements were restrained using a head and chin rest so that the eyes in central position were directed toward the center of the screen. Presentation of stimuli, and acquisition, display, and storage of responses were controlled by a second computer using the LabView Real-Time software package (National Instruments). Trigger signals from the visual display computer to the LabView computer allowed us to synchronize data collection to stimulus presentation with 1-ms resolution. Eye movements were measured continuously with an infrared video-based eye tracking system (ISCAN, RK-726) at 240 Hz. Before each session, we calibrated the eye tracker by having subjects repeatedly fixate a set of seven horizontal locations to generate a linear function for converting raw eye tracker values to horizontal eye position.
For on-line saccade detection, an algorithm was used to filter and analyze the velocity signal, and saccades were identified by applying a fixed set of amplitude criteria values to the velocity signal at the end of each trial. The algorithm automatically detected the onset of the first saccade within 1,000 ms after the target onset and the gaze landing position was determined by taking the median position over 100 ms, 50 ms after the saccade onset. This landing position was then used to determine whether the subject chose the target or the distractor stimulus. To avoid anticipatory responses, trials in which the latency was <100 ms were discarded. Manual responses were detected on-line by monitoring changes in the buttons status with a 1-ms resolution.
For off-line analysis, an interactive analysis program was used to filter, display, and analyze the data. Horizontal eye velocities were obtained by differentiating the eye position signal using a finite impulse response (FIR) filter (–3 dB at 54 Hz). Eye acceleration was then obtained by applying the same FIR filter to the velocity signals. Saccades were detected by applying a set of velocity and acceleration criteria.
We measured only the first saccade within 1,000 ms after the target onset, and we computed latencies with respect to the target onset. We omitted trials in which the saccade was opposite in direction to the probe stimulus (incorrect responses) or had a latency <100 ms (anticipatory responses, <0.2% of the trials). Saccades were measured in the last five sessions (1,000 trials) of each condition.
| RESULTS |
|---|
|
|
|---|
As illustrated by the distributions plotted on Fig. 3 for a single subject (AC), the latency distribution in the baseline condition was broader than in the low variability conditions (B-LV, F-LV, and S-LV). Figure 3A shows the latency distribution for a single subject (AC) in the second condition (B-LV) superimposed to the baseline distribution. It can be seen that although the median latencies are similar (464 and 476 ms, respectively, in the baseline and B-LV conditions), the dispersion of the saccade latencies was greatly reduced by our procedure. In particular, the frequencies in the high and low tails of the distribution were much smaller in the B-LV condition, leading to an increase in the frequencies at the center of the distribution. A similar effect was observed in the F-LV and S-LV conditions (Fig. 3, C and D), although the median latencies were now, respectively, shorter (392 ms) or longer (572 ms) than in the baseline trials. In contrast, the distribution in the high variability condition was very similar to the baseline one (Fig. 3B) with identical medians (464 ms).
|
|
To illustrate the distinctness of the distributions, we show cumulative frequency distributions for each condition for each subject (Fig. 4, C–E). In general, the slopes were steeper in the low variability distributions, reflecting the lower dispersions, whereas the curves for the baseline and B-HV distributions were almost perfectly superimposed. It can be seen that our procedure also affected the median latencies (the point at which the cumulative frequency equals 0.5). In the baseline, B-LV, and B-HV conditions, the median latency remained almost unchanged (with a 7 ms average absolute difference). In the F-LV conditions, the median latency was slightly shorter than the point at which the cumulative frequency reached 0.2 in the baseline distributions (by 8 ms on average), and in the S-LV, it was slightly shorter than the point at which the cumulative frequency reached 0.8 in the baseline distributions (by 9 ms on average).
Overall, our procedure was able to constrain the latency distributions: the dispersions were smaller in the low variability conditions and larger in the baseline and high variability conditions, and the median latencies were close to the chosen target median latencies. One could postulate that these variations in latencies were correlated with changes in the accuracy of the task, following a classical speed-accuracy tradeoff law. However, the percent correct remained mostly unchanged during the experiment. On average, the proportions of correct responses were 84.4, 88.1, 87.8, 87.6, and 90.9% in the baseline, B-LV, B-HV, F-LV, and S-LV conditions, respectively. It is therefore not surprising that the correlation coefficients between median latencies and percent correct (0.35, P = 0.12) and between SDs and percent correct (–0.35, P = 0.13) were low. Interestingly, we did not observe any significant change in the peak velocities of saccades.
The number of trials necessary to achieve learning revealed large inter-conditions differences. On average, 4,800 trials were necessary to complete the B-LV condition 1,900 trials were collected in the B-HV condition, 1,250 in the F-LV condition, and 2,950 trials in the S-LV condition.
Learned changes in variability of manual reaction times
In the second experiment, the changes in the distributions were similar to the ones in the first experiment. Figure 5A shows the medians as well as the bootstrap percentile intervals (P < 0.05) for the three subjects. These revealed that the medians from the F-LV and S-LV conditions were systematically faster (by 65 ms on average) and slower (by 87 ms on average) than the baseline ones. For the B-LV condition, all the medians fall within the percentile intervals. For the B-HV condition one median was slower (by 10 ms for subject LC).
|
In the cumulative frequency distributions (Fig. 5, C–E), the slopes were steeper in the low variability distributions than in the baseline distributions, reflecting the lower dispersions, whereas the curves for the B-HV distributions were slightly shallower. Our procedure also affected the median latencies: in the baseline, B-LV, and B-HV conditions, the median latency remained almost unchanged (with a 4 ms average difference). In the F-LV conditions, the median latency was similar to the point at which the cumulative frequency reached 0.2 in the baseline distributions (±1 ms on average) and in the S-LV it was slightly shorter than the point at which the cumulative frequency reached 0.8 in the baseline distributions (by 3 ms on average).
The proportions of correct responses were in general high (92.8% on average). The correlation between accuracies and median latencies (0.5, P = 0.057) and between SD and percent correct (–0.09, P = 0.75) were not significant.
Again, the number of trials necessary to achieve learning revealed large inter-condition differences. On average, 3,200, 1,133, 1,333, and 3,333 trials were necessary to complete the B-LV, B-HV, F-LV, and S-LV conditions, respectively.
Assessing changes in latency variability using the LATER model
To examine whether the shapes of the latency distributions might explain the changes in saccade programming, we used the LATER model (Reddi and Carpenter 2000
). This simple model assumes that response latencies are a function of the time it takes for a monotonically rising decision signal to reach a threshold value (Fig. 6, top). The decision signal depends on the rate of information accumulation and the threshold represents the urgency. The rate of rise changes on a trial-by-trial basis following a Gaussian distribution. When the distributions of latencies are plotted on a reciprobit graph (i.e., plotting the cumulative probability on a probit scale as a function of the reciprocal of latency), a straight line results (Fig. 6, middle). According to this model, changes to the saccade trigger mechanism alter the reciprobit plots in distinctive ways (Fig. 6, top). Changes in the threshold (
) with no change in the rate of rise alter the slope, such that the line pivots about the infinite-time intercept. Changes in the rate of rise (µ) of the decision signal with no change in the threshold result in a parallel shift of the line. A third possibility, not usually included in discussions of the LATER model (but see Madelain et al. 2005
) is that increasing the variability of the rate of rise (the SD of the rate variable,
) decreases the slope and intercept of a cumulative distribution without changing its median.
|
). Second, the distributions of latencies for the baseline and B-HV conditions mostly involved a rotation around the median from the distribution in the B-LV, suggesting that the changes in distributions were mostly a consequence of an increase of the SD of the rate of rise (
). To obtain a quantitative evaluation of these observations, we contrasted the three alternative hypotheses that the distributions in the baseline, B-HV, F-LV, and S-LV might be obtain from the B-LV distribution through a swivel around the infinite-time intercept, a parallel shift, or a rotation around the median. We first computed the best fits compatible with the three hypotheses in terms of the required changes in the model and the sum of square error associated with each fit (Fig. 6, bottom). We found that the SSE was systematically smaller for the rotate model for the baseline and B-HV conditions and for the swivel model for the F-LV (except for subject CP) and S-LV conditions. This was true for both saccade and manual response latencies. This led us to compute the difference of LogLikelihood for the rotate versus swivel models. As illustrated in Fig. 7 (A), the differences in the distributions in the B-LV condition versus the baseline and B-HV conditions were more likely to be a consequence of a change in variability than a change in threshold. On the other hand, the distributions in the F-LV and S-LV conditions were more likely to come from a change in threshold when compared with the B-LV condition. We performed the same analysis for the manual responses latencies and found similar results (Fig. 7B).
|
| DISCUSSION |
|---|
|
|
|---|
We will first discuss the possible effects of the reinforcement contingencies on variability, then we will discuss what these results say about decision process models and finally, what are the implications of the ability to voluntary control the variability of reaction times.
Effects of the reinforcement contingencies
Recent experiments in monkeys have shown that latencies were shorter when making a saccade to a rewarded location than to nonrewarded locations (Ikeda and Hikosaka 2003
; Takikawa et al. 2002
; Watanabe et al. 2003
). These changes at the behavioral level were correlated with modulations at the neural level (see also Platt and Glimcher 1999
; Sugrue et al. 2004
). An interpretation is that saccades are, at least partly, under the control of their consequences. In natural situations, a saccade to a particular stimulus allows an optimization of the visual perception of the target. This optimization in return acts as a reinforcer for the saccadic system, allowing subsequent adaptive responses to changes in the environment. A recent study used this observation to build a new experimental design (Montagnini and Chelazzi 2005
). Subjects were asked to perform a letter discrimination that required a saccade to a peripheral stimulus displayed for a brief interval after the median saccade landing time such that saccade latency was the main determinant of the task accuracy. In other words, subjects were reinforced when making saccades with short latency by the ability to perform the perceptual task. Montagnini and Chelazzi (2005)
showed that saccade latencies in humans are reduced under these conditions of perceptual urgency. Together, these results strongly suggest that triggering a saccade is an operant behavior, i.e., a class of response controlled by environmental contingencies, (Skinner 1981
).
Our data revealed that consequences might act on the median as well as on the variability of saccadic reaction times. In other words, the median and variability are operant dimensions, i.e., response attributes influenced by reinforcers. Our experimental procedure was based on the assumption that given a baseline repertoire of latencies, one may modify the probability of emission of a subclass of latencies by controlling the relationship between a range of latencies and the reinforcer. In the low-variability conditions, this range of latencies was restrained to increase the probability of a narrow subclass, whereas in the high-variability condition, the range was maintained large. This results in the modifications plotted in Fig. 3. To put the matter in terms of selectionist formulation, in the B-LV condition, extreme values were selected against from the population of latencies (Fig. 3A), whereas in the F-LV and S-LV conditions (Fig. 3, C and D) fast or slow subclasses were selected, respectively. This is analogous to stabilizing selection (Fig. 3A) and directional selection (Fig. 3, C and D), two concepts used in population genetics to account for changes in populations. We are confident that contingencies are responsible for these changes, instead of simple repetition of the task, because variability was set back to its baseline level in the B-HV condition.
It is interesting to point out that our procedure was successful because subjects maximized the local probability of reward. To make that point clearer, we will discuss the B-LV condition more precisely. The probability of reinforcement was set to 0.35 and the actual upper and lower limits of the temporal criteria were first computed based on the baseline distribution. Therefore subjects could have obtain 35% of reinforced trials by maintaining this overall distribution at its baseline level. However, maximizing the local probability of reinforcement resulted in local changes in the distributions that, in return, resulted in reducing the range of reinforced latencies. This is apparent in Fig. 2E. In other words, subjects could have obtained the exact same global proportion of successful trials without changing the variability of the latencies distributions, but they did not. However, a point was given only when three of five consecutive trials were successful, an incentive to maximize the local probability of success. We think that without this incentive we might have failed to show reductions in variability.
It should be pointed out that inducing these changes in variability required a large number of trials,
10,000 on average for each subjects in each experiment. Although we did not explicitly measure the difficulty of learning, the number of trials necessary to achieve learning might reveal inter-conditions differences. Interestingly B-LV and S-LV conditions systematically required about three times as many trials as the B-HV and F-LV conditions for subjects AC and LM. This implies that it is somehow easier to increase variability than to decrease it unless the median reaction time is reduced as well. The latter observation is in conformity with a number of experiments indicating that the mean and variability in reaction time are usually tightly linked (Wagenmakers and Brown 2007
). Interestingly, the average number of trials in the first condition was smaller in the second experiment than in the first one (3,200 and 5,200, respectively). This might indicate a transfer of the learned changes, despite the fact that 2 months elapsed between the two experiments. However, our experiments were not designed to test for this facilitation, and further studies are necessary to explore this possibility.
Decision models
Visual processing, response selection, and motor processing are the three stages associated with a reaction time (Glimcher 2003
; Schall 2001
). Although each of these stages might contribute to variability in latencies, we think that our procedure mostly affected the response selection—or decision process. First, the visual stimulus was constant all through the experiments for a given subject, preventing from major changes in visual processing. The effects of perceptual learning were probably limited because subjects were trained to respond to the stimulus prior the first session. Moreover, perceptual learning should always result in faster and less variable latencies. Second, it is commonly accepted that the motor processing stage's contribution to latencies variability is only limited (Hanes and Schall 1996
). This is why we now focus on decision models.
A class of reaction-time models assumes that motor responses are triggered when an internal decision signal reaches a threshold value (Reddi and Carpenter 2000
; Smith and Ratcliff 2004
). By applying Carpenter's LATER model to our reaction-time data, we found different patterns of changes associated with changes in latency distributions. Although previous studies using this model have generally identified only changes in threshold or changes in the mean rate of decision signal (but see Kurata and Aizawa 2004
; Madelain et al. 2005
; Montagnini and Chelazzi 2005
), our data suggest that our procedure might affect either both factors or the variability of rate. As we pointed out elsewhere (Madelain et al. 2005
), the LATER model does not fully disentangle the effects in changes in these three factors. In fact, any change in the reciprobit plot attributed to a single parameter (e.g., the threshold) could also be attributed to a combination of modifications in the other two parameters (e.g., rate and variability of the decision signal). This could be view as a weakness of the model, but classical interpretations using the LATER model are usually based on the assumption that each of these parameters is related to a particular property of the situation. This allows predicting specific changes in the distributions when manipulating one of these properties. The rate of rise is related to the time it takes to accumulate information, while the initial level reflects prior expectation. The threshold corresponds to a significance level in a statistical test: given a particular case what probability of making the wrong decision does the system accept ? It should be pointed out that the threshold and initial level induce similar changes. In our experiments, the rate of rise and the initial level probably remained unchanged because the distractor and target stimuli remained constant as well as the probability of target location. Therefore we propose that only the threshold and variability of rate of rise were affected by our manipulations. In fact, the distributions in the B-LV, F-LV, and S-LV conditions were compatible with swiveling around about a common infinite-time intercept, which indicates a change in threshold. In other words, it might be that when manipulating the medians while maintaining the variability approximately constant, urgency was mostly affected. The distributions in the baseline, B-LV, and B-HV conditions were compatible with a rotation around the median, which indicates a change in variability. Therefore directly manipulating the variability of the distributions is best explained by a change in the variability of the rate of rise.
To our knowledge, only two papers have explored the possibility of altering the variability of the rate of rise in isolation (Madelain et al. 2005
; Montagnini and Chelazzi 2005
). However, this is the first direct evidence that in some specific manipulation this parameter must be included to properly use the LATER model. This maybe problematic in that the reciprobit plot allows only two parameters, the slope and infinite-time intercept. Traditional use of the LATER model assumes that two parameters are at play, the threshold (or the initial level of the rate of rise that is functionally equivalent but has a different conceptual meaning) and the average rate of rise, to capture the changes in the reciprobit plot, each fitting parameter finding its equivalent in the model. However, we now show that a third parameter (the variability of the rate of rise) must be taken into account when the experimental manipulation imposes a particular variability on the distributions.
Other decision models, for instance the diffusion model (Smith and Ratcliff 2004
) or the leaky competing accumulator model (Usher and McClelland 2001
), also assume that a reaction time results from the projection on the time axis of the intercept of a decision signal reaching a threshold level. Variability in the drift, which represents the encoded stimulus information, generates variability in the reaction time distribution: a Gaussian process (the drift variability) generates asymmetrical distributions. Here the drift rate varies randomly from trial to trial for any given stimulus because of random variations in the efficiency of visual transduction and encoding. The inclusion of drift variance in the model serves two functions. On the one hand, it allows the model to predict that correct responses will be made, on average, more rapidly than errors. On the other hand, it limits the speed-accuracy tradeoff and ensures that observers cannot attain arbitrarily high levels of accuracy (Smith and Ratcliff 2004
). In this class of models, variability is used because it generates errors and variable reaction times. Assuming variability reflects some perceptual noise requires a fixed SD of the drift variability in the model for a given experimental condition. This implies that variability grows with the mean reaction time due to the geometry of the decision process (Wagenmakers and Brown 2007
; Wagenmakers et al. 2005
). However, our data show a clear independence between the mean and variance even though the stimuli, probability of target location, and urgency remained constant. This implies that these models must also account for changes in variability without changing the drift and threshold. Again, the drift variability must be adaptable independently of the drift, the initial level, and the threshold.
Significance of reaction-time variability
A number of studies explored the possibility of controlling variability of behavior. For instance, Blough (1966)
rewarded pigeons for pecking a response key if the time between consecutive pecks had occurred less frequently in the recent past. The results were such that the distributions of inter response time were close to those of a random model, unlike the stereotyped baseline distributions. Similarly, when animals received a reinforcer for generating sequences of left or right responses, provided that such sequences had not occurred during the most recent trials, they produced high degrees of variability (Page and Neuringer 1985
; Shimp 1967
). On the other hand it has been reported that reinforcing a class of responses reduces variability (see Boulanger et al. 1987
for a review). Taken together, these results indicate that one effect of reinforcement is to reduce variability unless reinforcement is contingent to response variation itself (see Neuringer 2002
for a review).
Montagnini and Chelazzi (2005)
observed a diminution of variability when rewarding eye movements, and our data suggest that variability in reaction times might be manipulated by controlling its relationship with environmental contingencies, a pattern of results compatible with previous work on variability in behavior. Interestingly, this implies that, to some extent, the baseline level of variability in our experiment also reflects environmental constraints, even though we did not explicitly manipulate them. Because baseline variability was higher than in the low-variability conditions, one could postulate that these constraints were not very strong: a high level of variability might just reflects that all latencies are equally adapted to the context. This view departs from Carpenter's proposition that variability reflects some sort of purposeful internal randomization (Carpenter 1999
). An alternative could very well be, for instance, that the saccadic system does not always allocate all its resources to produce particular latencies because the point in time at which the gaze lands on the target is simply not relevant. However, when the situation imposes a particular range of reaction time, the system might allocate a less variable amount of resources to control the latencies. Of course this does not imply that the system is totally free of random noise, and in fact it seems impossible to completely eliminate the variability, but it appears that postulating a deliberate randomization of reaction times is not necessary.
That variability is related to particular environmental constraints is at the root of selectionist theories. The ability to select a particular class of responses is constrained by the range of variations. Whatever increases variability might contribute to the appearance of new responses (Neuringer 2002
), and it is not surprising that the relationship between the level of variability and the ability to learn new responses has been experimentally demonstrated (Grunow and Neuringer 2002
). These ideas have also been proposed in relation to the LATER model: randomness in timing might result in randomness of choice, allowing unpredictable or exploratory behaviors (Carpenter 2004b
). Therefore the functional significance of the ability to control variability of reaction time might extend beyond noise reduction and reflect adaptive abilities of living organisms.
| GRANTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: L. Madelain, Laboratoire URECA, UFR de Psychologie, Université Lille III, Domaine du Pont de Bois, B.P. 149, 59653 Villeneuve d'Ascq Cedex, France (E-mail: laurent.madelain{at}univ-lille3.fr)
| REFERENCES |
|---|
|
|
|---|
Blough DS. The reinforcement of least-frequent interresponse times. J Exp Anal Behav 9: 581–591, 1966.[CrossRef][Web of Science][Medline]
Boulanger B, Ingebos AM, Lahak M, Machado A, Richelle M. Variabilité comportementale et conditionnement operant chez l'animal; revue critique. L'Année Psychol 87: 417–434, 1987.
Brainard DH. The psychophysics toolbox. Spat Vis 10: 433–436, 1997.[Web of Science][Medline]
Carpenter RH. A neural mechanism that randomises behaviour. J Consciousness Stud 6: 13–22, 1999.
Carpenter RH. Contrast, probability, and saccadic latency; evidence for independence of detection and decision. Curr Biol 14: 1576–1580, 2004a.[CrossRef][Web of Science][Medline]
Carpenter RH. The saccadic system: a neurological microcosm. Adv Clin Neurosci Rehab 4: 6–8, 2004b.
Carpenter RH, Reddi BAJ. Putting noise into neurophysiological models of simple decision making. Nat Neurosci 4: 337, 2001.[CrossRef][Web of Science]
Carpenter RH, Williams ML. Neural computation of log likelihood in control of saccadic eye movements. Nature 377: 59–62, 1995.[CrossRef][Medline]
Crawford TJ, Muller HJ. Spatial and temporal effects of spatial attention on human saccadic eye movements. Vision Res 32: 293–304, 1992.[CrossRef][Web of Science][Medline]
Fischer B, Ramsperger E. Human express saccades: extremely short reaction times of goal directed eye movements. Exp Brain Res 57: 191–195, 1984.[Web of Science][Medline]
Glimcher PW. The neurobiology of visual-saccadic decision making. Annu Rev Neurosci 26: 133–179, 2003.[CrossRef][Web of Science][Medline]
Grunow A, Neuringer A. Learning to vary and varying to learn. Psychon Bull Rev 9: 250–258, 2002.[Web of Science][Medline]
Hanes DP, Schall JD. Neural control of voluntary movement initiation. Science 274: 427–430, 1996.
Hoffman JE, Subramaniam B. The role of visual attention in saccadic eye movements. Percept Psychophys 57: 787–795, 1995.[Web of Science][Medline]
Ikeda T, Hikosaka O. Reward-dependent gain and bias of visual responses in primate superior colliculus. Neuron 39: 693–700, 2003.[CrossRef][Web of Science][Medline]
Kalesnykas RP, Hallett PE. Retinal eccentricity and the latency of eye saccades. Vision Res 34: 517–531, 1994.[CrossRef][Web of Science][Medline]
Kingstone A, Klein RM. What are human express saccades? Percept Psychophys 54: 260–273, 1993.[Web of Science][Medline]
Kowler E, Anderson E, Dosher B, Blaser E. The role of attention in the programming of saccades. Vision Res 35: 1897–1916, 1995.[CrossRef][Web of Science][Medline]
Kurata K, Aizawa H. Influences of motor instructions on the reaction times of saccadic eye movements. Neurosci Res 48: 447–455, 2004.[CrossRef][Web of Science][Medline]
Machado A. Operant conditioning of behavioral variability using a percentile reinforcement schedule. J Exp Anal Behav 52: 155–166, 1989.[CrossRef][Web of Science][Medline]
Madelain L, Krauzlis RJ, Wallman J. Spatial deployment of attention influences both saccadic and pursuit tracking. Vision Res 45: 2685–2703, 2005.[CrossRef][Web of Science][Medline]
Martinez WL, Martinez AR. Computational Statistics Handbook with Matlab. Boca Raton, FL: Chapman and Hall, 2002.
Montagnini A, Chelazzi L. The urgency to look: prompt saccades to the benefit of perception. Vision Res 45: 3391–3401, 2005.[CrossRef][Web of Science][Medline]
Neuringer A. Operant variability: evidence, functions, and theory. Psychon Bull Rev 9: 672–705, 2002.[Web of Science][Medline]
Page S, Neuringer A. Variability is an operant. J Exp Psychol Anim Behav Processes 11: 429–452, 1985.[CrossRef][Web of Science]
Pelli DG. The VideoToolbox software for visual psychophysics: transforming numbers into movies. Spat Vis 10: 437–442, 1997.[Web of Science][Medline]
Platt ML, Glimcher PW. Neural correlates of decision variables in parietal cortex. Nature 400: 233–238, 1999.[CrossRef][Medline]
Ratcliff R. Putting noise into neurophysiological models of simple decision making. Nat Neurosci 4:336, 2001.[CrossRef][Web of Science][Medline]
Reddi BA, Carpenter RH. The influence of urgency on decision time. Nat Neurosci 3: 827–830, 2000.[CrossRef][Web of Science][Medline]
Saslow MG. Effects of components of displacement-step simuli upon latency for saccadic eye movement. J Opt Soc Am 57: 1024–1029, 1967.[Medline]
Schall JD. Neural basis of deciding, choosing and acting. Nat Rev Neurosci 2: 33–42, 2001.[Web of Science][Medline]
Schall JD, Bichot NP. Neural correlates of visual and motor decision processes. Curr Opin Neurobiol 8: 211–217, 1998.[CrossRef][Web of Science][Medline]
Shepherd M, Findlay JM, Hockey RJ. The relationship between eye movements and spatial attention. Q J Exp Psychol A 38: 475–491, 1986.[Web of Science][Medline]
Shimp CP. Reinforcement of least-frequent sequences of choices. J Exp Anal Behav 10: 57–65, 1967.[Medline]
Skinner BF. Selection by consequences. Science 213: 501–504, 1981.
Smith PL, Ratcliff R. Psychology and neurobiology of simple decisions. Trends Neurosci 27: 161–168, 2004.[CrossRef][Web of Science][Medline]
Sugrue LP, Corrado GS, Newsome WT. Matching behavior and the representation of value in the parietal cortex. Science 304: 1782–1787, 2004.
Takikawa Y, Kawagoe R, Itoh H, Nakahara H, Hikosaka O. Modulation of saccadic eye movements by predicted reward outcome. Exp Brain Res 142: 284–291, 2002.[CrossRef][Web of Science][Medline]
Thompson KG, Hanes DP, Bichot NP, Schall JD. Perceptual and motor processing stages identified in the activity of macaque frontal eye field neurons during visual search. J Neurophysiol 76: 4040–4055, 1996.
Usher M, McClelland JL. The time course of perceptual choice: the leaky, competing accumulator model. Psychol Rev 108: 550–592, 2001.[CrossRef][Web of Science][Medline]
Wagenmakers EJ, Brown S. On the linear relation between the mean and the standard deviation of a response time distribution. Psychol Rev 114: 830–841, 2007.[CrossRef][Web of Science][Medline]
Wagenmakers EJ, Grassman RPPP, Molenaar PCM. On the relation between the mean and the variance of a diffusion model response time distribution. J Math Psychol 49: 195–204, 2005.[CrossRef][Web of Science]
Watanabe K, Lauwereyns J, Hikosaka O. Neural correlates of rewarded and unrewarded eye movements in the primate caudate nucleus. J Neurosci 23: 10052–10057, 2003.
This article has been cited by other articles:
![]() |
M. R. Harwood, L. Madelain, R. J. Krauzlis, and J. Wallman The Spatial Scale of Attention Strongly Modulates Saccade Latencies J Neurophysiol, April 1, 2008; 99(4): 1743 - 1757. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |