|
|
||||||||
1Department of Physiology, Juntendo University, School of Medicine, Tokyo 113-8421, Japan; and 2Laboratory of Sensorimotor Research, National Eye Institute, National Institute of Health, Bethesda, Maryland 20892-4435
Submitted 9 March 2004; accepted in final form 18 May 2004
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Consistent with this hypothesis, we previously found that DA neurons increased or decreased their activity in a predicative manner when a cue, not reward, was presented (Kawagoe et al. 2004
). In our experiments, the subject (a macaque monkey) performed a memory-guided saccade task while a reward was given after a saccade to one particular direction out of four [one-direction-rewarded (1DR) version of a memory-guided saccade task] (Kawagoe et al. 1998
). A visual cue stimulus indicated the saccade direction and the presence or absence of reward. We found that DA neurons responded to the cue stimulus in a selective manner: a phasic excitation in response to a reward-indicating cue and a phasic suppression to a nonreward-indicating cue (Kawagoe et al. 2004
). These responses were interpreted as reflecting reward prediction error because the likelihood that the trial would be rewarded was 25% (1 of 4) before the cue came on but changed to either 100% (after a reward-indicating cue) or 0% (after a nonreward-indicating cue).
A prominent feature of our 1DR task was that the reward contingency was changed systematically across blocks of trials. There were four sets of position-reward mapping, all of which the monkey had experienced extensively, and the question was to choose one particular set of position-reward mapping in one block and switch to another in the next block. We found that, as a new block of trials started, DA neurons changed their cue responses quickly to match the altered position-reward mapping (Kawagoe et al. 2004
). That these changes in DA neuronal activity may have significant behavioral consequences has been suggested by a series of studies in the basal ganglia from our laboratory. Activity of presumed projection neurons in the caudate nucleus, to which DA neurons may project, responded to the cues depending on the current position-reward mapping (Kawagoe et al. 1998
). Neurons in the substantia nigra pars reticulata (Sato and Hikosaka 2002
) and those in the superior colliculus (Ikeda and Hikosaka 2003
), which may receive inputs from caudate projection neurons, exhibited cue responses depending on the position-reward mapping. Finally, saccadic eye movements were strongly influenced by the position-reward mapping such that the saccades to the rewarded direction had shorter latencies and higher velocities than those to the unrewarded directions (Takikawa et al. 2002
). Thus the neural circuits inside and emanating from the basal ganglia seem to play an important role in quick adaptation of goal-oriented behavior.
Having suggested the scheme for position-reward mapping, we were still puzzled by the quickness of the neural and behavioral adaptation. An obvious explanation would be long-term learning. In all of the preceding studies, we examined neural activity and behavior (saccadic eye movement) after the monkey had been trained to perform 1DR extensively. The ability to quickly adapt to the altered position-reward mapping may be acquired through long-term training on 1DR. By training monkeys on operant and classical conditioning tasks, Ljungberg et al. (1992)
, Schultz et al. (1993)
, and Mirenowicz and Schultz (1994)
showed that DA neurons in naive monkeys responded to a reward, whereas DA neurons in well-trained monkeys responded to the earliest stimulus that indicated the reward but not the reward. Their studies suggest that the pattern of DA neuronal activity changes with long-term training. However, there is a critical difference between the tasks employed by Schultz's group and our task, 1DR. In the tasks of Schultz et al., the reward contingency was fixed throughout the long-term training. In our 1DR task, the reward contingency was altered frequently; it required switches between four learned sets of position-reward mapping.
In this study, we found that overall responses of DA neurons shifted from reward to reward-predictor during a long-term training period, consistent with the findings by Ljungberg et al. (1992)
, Schultz et al. (1993)
, and Mirenowicz and Schultz (1994)
. We further found that the shift from reward to reward-predictor occurred every time the position-reward mapping was altered. Long-term training changed the speed and accuracy of the adaptation to the altered position-reward mapping. This appeared as changes in DA neuronal activity as well as changes in the reward-contingent bias of saccadic eye movements.
| METHODS |
|---|
|
|
|---|
We used two male Japanese monkeys (Macaca fuscata). The monkeys were kept in individual primate cages in an air-conditioned room where food was always available. At the beginning of each experimental session, they were moved to the experimental room in a primate chair. The monkeys were given restricted amounts of fluid during periods of training and recording. Their body weight and appetite were checked daily. Supplementary water and fruit were provided daily. All surgical and experimental protocols were approved by the Juntendo University Animal Care and Use Committee and are in accordance with the National Institutes of Health Guide for the Care and Use of Animals.
The experiments were carried out while the monkey's head was fixed and his eye movements were recorded. For this purpose, a head holder, a chamber for unit recording, and an eye coil were implanted under surgical procedures. The monkey was sedated by intramuscular injections of ketamine (4.05.0 mg/kg) and xylazine (1.02.0 mg/kg). General anesthesia was induced by intravenous injection of pentobarbital sodium (5 mg/kg/h). Surgical procedures were conducted in aseptic conditions. After exposing the skull, 1520 acrylic screws were bolted into it and fixed with dental acrylic resin. The screws served as anchors by which a head holder and a recording chamber, both made of delrin, were fixed to the skull. A scleral eye coil was implanted in one eye for monitoring eye position (Robinson 1963
; Judge et al. 1980
). The recording chamber, which was rectangular (antero-posterior: 42 mm; lateral: 30 mm; depth: 10 mm), was placed over the fronto-parietal cortices, tilted laterally by 35°. The monkey received antibiotics (sodium ampicillin 2540 mg/kg im each day) after the operation.
Behavioral tasks
We used the memory-guided saccade task (Hikosaka and Wurtz 1983
) in two different reward conditions: all-directions-rewarded condition (ADR) and 1DR (Kawagoe et al. 1998
) (Fig. 1). In both conditions, a task trial started with onset of a central fixation point on which the monkeys had to fixate. A cue stimulus (spot of light) came on 1 s after onset of the fixation point (duration: 100 ms), and the monkeys had to remember its location. After 11.5 s, the fixation point turned off, and the monkeys were required to make a saccade to the previously cued location. The target came on 400 ms later for 150 ms at the cued location. The saccade was judged to be correct if the eye position was within a window around the target (usually within ±3°) when the target turned off. The monkeys made the saccade before target onset based on memory, because, otherwise, the eyes could not reach the target window within the 150-ms target-on period; the target was presented only to give the monkeys the accurate feedback information. The next trial started after an intertrial interval of 3.54 s. In ADR, every correct saccade was rewarded with the liquid reward together with the tone stimulus. In 1DR, an asymmetric reward schedule was used such that only one of the four directions was rewarded, while the other directions were not rewarded. The rewarded direction was fixed in a block of experiments, which included 60 successful trials. Even for the unrewarded direction, the monkeys had to make a correct saccade, because the same trial was repeated if the saccade was incorrect. The amount of reward per trial was set approximately the same between 1DR and ADR. The cue was chosen pseudorandomly such that the four directions were randomized in every sub-block of four trials; thus one block of experiment (60 trials) contained 15 trials for each direction (15 sub-blocks). Other than the actual reward, no indication was given to the monkeys as to which direction was currently rewarded. 1DR was performed in four blocks, in each of which a different direction was rewarded. The order of the four blocks of 1DR was randomized. To change the task schedule (1DR or ADR) or the reward direction in 1DR, there was an interblock interval (about 30 s).
|
Each monkey was first trained to perform ADR for 4 days (about 100 blocks every each monkey). After the monkey mastered ADR, we started recordings from DA neurons (Fig. 1D). 1DR was performed only when a DA neuron was being recorded; otherwise, the monkey continued to perform ADR. The target was chosen randomly out of four locations of equal eccentricity (10 or 20°), arranged in either normal or oblique angles. Once a candidate DA neuron was isolated, we examined whether it responded to the delivery of reward (free reward). A drop of water as a reward was given with a random time interval (49 s) while the monkey was sitting in the dim experimental room. The reward consisted of a tone followed by actual delivery of water to the monkey from a spout. The water delivery was delayed by about 150 ms from the tone, largely due to the compliance of the connecting silicon tube. Only when the neuron responded to reward did we ask the monkey to perform 1DR. This was done in four blocks, in each of which one out of four directions was rewarded; in addition, we performed one block of ADR (Fig. 1C). We repeated some of the 1DR blocks to confirm the stability of recording. During this standard set of experiments, we kept recording from the same DA neuron. This procedure was repeated every time a DA neuron was recorded.
Recording procedures
Eye movements were recorded using the search coil method (Enzanshi Kogyo MEL-20U) (Robinson 1963
; Judge et al. 1980
; Matsumura et al. 1992
). Eye positions were sampled at 500 Hz. The behavioral tasks and storage and display of data were controlled by a computer (PC 9801RA, NEC, Tokyo, Japan). The unitary action potentials were passed through a window discriminator (model DDIS-1, Bak), and the times of their occurrences were stored with a resolution of 1 ms.
Before the single unit recording experiment, we obtained MR images (AIRIS, 0.3 T, Hitachi) such that they were perpendicular to the recording chamber. We then determined the recording sites in the substantia nigra based on the chamber-based coordinates (Kawagoe et al. 1998
). The recording sites were further verified by MRI of a plastic guide tube through which the electrodes were inserted.
Single unit recordings were performed using tungsten electrodes (0.25 mm diam, 15 M
, measured at 1 KHz; Frederick Haer). To introduce the electrode into the brain, we first inserted a stainless steel guide tube (OD, 0.85 mm; ID, 0.60 mm) containing the electrode. A hydraulic microdrive (MO95-S, Narishige) was used both to insert the guide tube and subsequently to advance the electrode into the brain. We sometimes implanted a plastic guide tube (OD, 1.1 mm; ID, 0.8 mm) semi-permanently at the location where DA neurons were concentrated. The location of the guide tube was visualized on MRIs and was confirmed to be directed to the substantia nigra pars compacta (SNc).
DA neurons were identified by their irregular and tonic firing around 5 spikes/s with broad spike potentials. Extracellular spikes may have an initial positive component or may be followed by prolonged positive component (Schultz and Romo 1987
). A neuron with these features was thus determined to be a DA neuron candidate. Near the end of a long-term experimental session, we made electrolytic microlesions at the recording sites of DA neurons for later histological analysis. Later histological examination showed that the presumed DA neurons were located among tyrosinehydroxylase (TH) positive neurons. They were usually in the SNc (A9) and sometimes in the area medio-dorsal to the SNc (A8) (Kawagoe et al. 2004
).
Definition of the learning stage
We divided the long-term training period of two monkeys into three stages based on the differentiation of saccade latencies between the rewarded and unrewarded conditions. First, for every sub-block, we obtained the saccade latency for the one rewarded trial and the mean saccade latency for the three unrewarded trials. We compared pair-wise the saccade latencies in the rewarded trials and the saccade latencies in the unrewarded trials using Wilcoxon signed-rank test (P < 0.05). We chose the pair-wise test because it was more tolerant to slow change in saccade latency that occurred within a block of 60 trials whether or not the saccade was rewarded (see Fig. 7). The statistical test was performed for the four blocks of 1DR while one DA neuron was recorded.
|
Data analysis
NEURONAL ACTIVITY. For each DA neuron, we calculated the spontaneous activity and its responses to the fixation point, to the cue stimulus, and to the reward as the spike frequency. The spontaneous activity (500 to 0 ms before fixation onset) was calculated for each neuron, and these were averaged for each stage. The fixation response (100200 ms after fixation onset) was calculated for each neuron for the four blocks of 1DR. The cue (100300 ms for monkey G and 150350 ms for monkey C) and reward (200400 ms after reward onset) responses were calculated for each neuron separately for rewarded and unrewarded conditions.
STATISTICAL ANALYSIS FOR THE CUE AND REWARD RESPONSES OF DA NEURON. To determine the presence of cue and reward responses, we compared the spike frequency in the control period and the spike frequency in the test period using Wilcoxon signed-rank tests (P < 0.01) separately for rewarded and unrewarded trials in 1DR for each neuron. The test period was 100300 ms for monkey G and 150350 ms for monkey C after cue onset for cue response and 200400 ms after reward onset for reward response. The control period was set just before the onset of the fixation point, and its duration was the same as that of the test period. If the cue and reward responses were determined statistically, the responses were classified into two groups: increase and decrease.
SACCADE LATENCY. We calculated the mean saccade latency separately for the rewarded and unrewarded conditions in two ways. To examine long-term changes of saccade latency, we averaged saccade latencies for all trials in each experiment that consists of four blocks of 1DR. To examine short-term (within a block) changes, we averaged saccade latencies for every sub-block in a block.
| RESULTS |
|---|
|
|
|---|
The data presented in this paper are based on the long-term training of two monkeys. During the training period of 1DR (3 mo in each monkey), we recorded 264 and 169 DA neurons in monkey C and monkey G, respectively. Among these neurons, 164 of 264 (62%) and 104 of 169 (62%) in each monkey responded to free reward by phasically increasing its discharge rate. In this experiment, we focused on neurons that responded to the delivery of free reward. We examined 53 of 164 neurons and 50 of 104 neurons in each monkey by performing four blocks of 1DR and one block of ADR. Monkey C and monkey G performed 626 and 431 blocks of 1DR, respectively, in addition to many blocks of ADR. In this paper, we will show only the data of 1DR.
Long-term changes in saccade parameters
The monkey's behavior changed during the long-term training period of the 1DR task. As shown in Fig. 2, saccade latency became shorter initially and then became longer gradually. Another clear change was the differentiation between the rewarded trials and the unrewarded trials. Saccade latency was initially not different between the two conditions, but the difference became clearer as the monkey experienced more blocks of 1DR such that saccade latencies in the rewarded condition were shorter than those in the unrewarded condition. We defined the learning stages based on the reward-dependent differentiation of saccade latency. In learning stage 1, there was no differentiation in either the first half or the second half of the 1DR block. In learning stage 2, a statistical difference appeared only in the second half of the 1DR block (Wilcoxon signed-rank test, P < 0.05). In learning stage 3, there was a statistical difference already in the first half. The rationale for the definition came from the observation that the reward-dependent differentiation appeared earlier within a block as learning progressed. This phenomenon will be one of the major results described here.
|
Long-term shift in DA neuronal activity
Along with the long-term changes in saccade latency, the responses of DA neurons changed its pattern. Figure 3 shows the population activity of DA neurons of two monkeys at each learning stage separately for rewarded (black lines) and unrewarded (gray lines) trials. DA neurons showed three types of response: the response to the onset of the fixation point (fixation response), the response to the cue stimulus that indicated an upcoming reward (cue response), and the response to reward (reward response).
|
Long-term changes were also evident in the proportion of DA neurons that showed these responses. In general, progressively fewer neurons responded to the reward (Table 1), whereas progressively more neurons responded to the cue (Table 2). Cue responses were initially nonselective, mostly positive for both the reward-indicating and nonreward-indicating cues, but became selective, positive for the reward-indicating cue and negative for the nonreward-indicating cue (Table 2).
|
|
The data in Fig. 3 show that the bi-directional reward responses of DA neuron (positive and negative responses in rewarded and unrewarded trials, respectively) were replaced with the bi-directional cue responses during the long-term training of 1DR (long-term shift). However, in stages 2 and 3, the replacement of cue and reward responses was also observed within a block that took only
7 min to perform (short-term shift). Such within-block changes are shown in Fig. 4. We plotted the mean magnitude of cue and reward responses against "sub-block number" (from 1 to 15). Since the four cue directions were chosen randomly in every block of four trials (which we call "sub-block"), each cue appeared once for each sub-block and 15 times in the entire block of 60 trials. The cue and reward responses are shown for two monkeys as population averages at each learning stage separately for rewarded (black) and unrewarded (dark gray) trials. In stage 1, DA neurons continued to respond to reward with bi-directional manner, but not to the cue. In stage 2, the reward response decreased gradually, whereas the response to the reward-indicating cue increased gradually. In stage 3, the shift of response from reward to the cue occurred quickly. This was true for both rewarded trials and unrewarded trials.
|
As indicated in METHODS, the task 1DR was done in four blocks in which four different directions were rewarded in a random order. The rewarded direction was indicated only by reward itself when the monkey experienced the particular block of 1DR. Therefore it was interesting to examine the transition of DA neuronal activity from one 1DR block to another. This is shown in Fig. 5 for two typical neurons recorded in stages 2 and 3. In stage 2, the DA neuron appeared to respond to the reward-indicating cue (indicated by a dotted circle), but the response was evident only in the later trials of a particular block. In the early trials, the neuron continued to respond to a nonreward-indicating cue that had been the reward-indicating cue in the preceding block. For example, in the left-up (LU) block (2nd column), the neuron responded to the right-down (RD) cue stimulus in the early trials (rewarded direction in the preceding block) and responded to the LU cue stimulus in the later trials (current rewarded direction). In the next block [left-down (LD) block, 3rd column], the neuron continued to respond to the LU cue (rewarded direction in the preceding block). Such transitional effects were much weaker in stage 3. The DA neuron's response to the reward-indicating cue was established quickly in one 1DR block. There was no indication that the neuron continued to respond to the same cue in the next block when it no longer indicated reward.
|
|
We have shown that, while the monkey experienced 1DR daily, both saccade latency and DA neuronal responses became differentiated proactively between the rewarded and unrewarded trials. These data raise the possibility that DA neuronal activity influences saccadic motor outputs. To examine this possibility further, we compared the within-block changes of DA neuronal responses and saccade latency.
Figure 7 shows the within-block changes of saccade latency in each stage, using the same format as used for DA neuronal responses (Fig. 6). In stage 1, saccade latency was not clearly differentiated among the three kinds of cue (i.e., R-, RN-, NN-cues). In stage 2, saccade latency became shorter gradually for the R-cue, became longer gradually for the RN-cue, and remained long for the NN-cue. In stage 3, saccade latency started from similar levels for these cues, but became differentiated such that latencies for RN- and NN-cues were longer than for R-cue. These changes in saccade latency, especially the within-block changes in stage 2, are reminiscent of those in DA neuronal activity shown in Fig. 6.
Long-term change in response to fixation onset of DA neurons
As shown in Fig. 3, DA neurons responded to the onset of the fixation point, and the response was present throughout the learning stages. The population response to the fixation point plotted against trial numbers (Fig. 8) also showed that the fixation response was weakly present throughout trials in every stages in both monkeys (i.e., increase from the spontaneous activity). However, the response changed considerably within a block of 1DR. In stage 3, the fixation response was very strong in the first trial and was diminished in the second or later trials. This prominent fixation response was common in both monkeys. Interestingly, the fixation response in the first trial was stronger than the cue or reward response (cf. Figs. 8 and 4) and was consistent across different DA neurons. This first trial effect was less clear in stages 1 and 2.
|
| DISCUSSION |
|---|
|
|
|---|
In a previous study (Kawagoe et al. 2004
), we recorded from DA neurons in well-trained monkeys. They responded strongly to reward predictor but not to reward itself except for the first or second trial. In rewarded trials, DA neurons responded to the delivery of a reward at the beginning of a 1DR block, but this response was soon replaced by a positive response to the cue stimulus that indicated an upcoming reward. In unrewarded trials, activity of DA neurons was suppressed at the time of reward at the beginning of a 1DR block, but the response was soon replaced by a negative response to the cue stimulus that indicated no reward. Every time the rewarded direction was changed, DA neurons changed their responses from reward to cue in a few trials and adapted to the altered position-reward mapping.
However, it was unknown how DA neurons adapted to the altered position-reward mapping so quickly. In this study, we found that the speed of the short-term shift of DA neuronal responses became quicker with long-term training of 1DR. In the early stage of training, the dominant activity of DA neurons was the response to reward. The short-term shift first appeared in the intermediate stage, but the shift was slow, in that activity of DA neurons transferred from reward to reward-indicating cue gradually within a block. The short-term shift became quicker in the advanced stage, and therefore the dominant activity of DA neurons was the response to the reward-indicating cue. Consequently, the overall activity of DA neurons shifted from reward to the reward-indicating cue during the long-term training (long-term shift).
Our results are in accordance with a pioneering work by Ljungberg et al. (1992)
, Schultz et al. (1993)
, and Mirenowicz and Schultz (1994)
. In their study, DA neurons in naive monkeys tended to respond to a reward, whereas those in well-trained monkeys responded to the earliest stimulus that indicated the reward. After further training, however, the response of DA neurons to the reward-predicting stimulus became weaker. In contrast, DA neurons in our experiments continued to respond to the cue stimulus even after long-term experience of 1DR. The difference may originate from a difference in reward schedule. In the study of Schultz et al., a reward was given in every trial and was fully expected, whereas in our 1DR task, a reward was given selectively for one out of four positions. The stimulus-reward mapping was fixed in the study of Schultz et al., whereas it was variable in our 1DR task. In the study of Schultz et al., DA neurons responded either only to reward itself or only to a reward-predicting stimulus, whereas in 1DR, the responses of DA neurons shifted from reward to a reward predictor within a block. In this sense, our data provided a new aspect in the function of DA neurons, namely, their relationship to behavioral switching.
Emergence of behavioral bias with short- and long-term shifts of DA neuronal activity
We previously found that the 1DR task gave rise to a strong behavioral bias: saccade latency was shorter and saccade velocity was higher when the saccade was followed by a reward (rewarded condition) than when it was followed by no reward (unrewarded condition) (Takikawa et al. 2002
). The comparison of DA neurons and caudate (CD) projection neurons in well-trained monkeys suggested that DA neurons, with their connection to CD neurons, modulate the spatially selective signals in CD neurons in the reward-predicting manner (Kawagoe et al. 2004
), and CD neurons in turn modulate saccade parameters (Itoh et al. 2003
; Watanabe et al. 2003
) with their connections to the substantia nigra pars reticulata and the superior colliculus (Hikosaka et al. 2000
). The present data are consistent with this hypothesis. Both saccade latency and DA neuronal activity changed within a block of 1DR trials, usually in the opposite directions (Figs. 6 and 7).
However, the related changes in DA neuronal activity and saccade latency may not necessarily indicate a causal relationship. An alternative possibility is that DA neuronal activity and saccade latency are modulated by a common input. For example, CD neurons could be a source of the common input. While CD neurons may influence saccade parameters, they may also influence activity of DA neurons with their direct connections or indirect connections through substantia nigra pars reticulata neurons (Parent and Hazrati 1994
; Tepper et al. 1995
). In this case, the reward-predicting activity of DA neurons would originate from CD neurons.
The two hypotheses stated above are contrasting and will remain an important topic in future studies on basal ganglia mechanisms.
Relation to behavioral switching
The performance of well-experienced monkeys in the 1DR task may be compared with behavioral switching rather than learning. For every block of 1DR, the rewarded direction was changed, and the monkey changed its behavior so that saccades were quicker and faster to the currently rewarded direction. The monkey acquired the ability of behavioral switch while DA neurons changed their cue responses quickly to adapt to the current position-reward mapping. Both behavioral and neuronal changes became faster as the monkey experienced more 1DR blocks.
It has been suggested that the basal ganglia play an important role in switching behavior (see Redgrave et al. 1999b
). A large body of evidence is derived from studies on patients with Parkinson's disease (Brown and Marsden 1990
). The basal ganglia are equipped with parallel mechanisms that may act antagonistically on target structures such as the superior colliculus and the thalamus (Albin et al. 1989
; Chevalier and Deniau 1990
). The concurrent and sequential actions of these mechanisms, respectively, would be suitable for selection (Mink 1996
; Nambu et al. 2002
) and switching (Hikosaka et al. 1993
) of motor programs or behavioral sets.
However, there have been few studies indicating that changes in neuronal activity in the basal ganglia were correlated with (or caused) behavioral switching. A series of studies in our laboratory using 1DR task have shown that, when position-reward mapping was altered, neurons in the caudate (Kawagoe et al. 1998
), substantia nigra (Sato and Hikosaka 2002
), and superior colliculus (Ikeda and Hikosaka 2003
) changed their activity, together with saccade parameters (Takikawa et al. 2002
), to match the position-reward mapping. In this study, we showed that the cue responses of DA neurons and the differentiation of saccade latency emerged after repeated experiences of different sets of position-reward mapping and became quicker jointly until they may be called switching. These results suggest that the reward-predictive activity of DA neurons may play an important role in adapting to frequent switches of position-reward mapping.
Response of DA neurons to the initiation signal
A robust response of DA neurons occurred to the onset of the fixation point which signals the start of a task trial. The fixation response also underwent long-term changes: in the advanced sage it became prominent in the first couple of trials in a block of experiments (Fig. 8). Remarkably, this response was one of the strongest responses emitted by DA neurons. It has been reported that DA neurons respond to novel sensory stimuli (Ljungberg et al. 1992
; Horvitz 2000
), suggesting that a major signal carried by DA neurons is novel (Redgrave et al. 1999a
). The fixation response may signal novelty because there was a clear break before the fixation point appeared in the first trial in a block. We also noticed that DA neurons typically responded to a visual or auditory stimulus when it was presented unexpectedly, but stopped responding if the stimulus was repeated; a subtle sound outside the monkey's view was particularly effective. The onset of the fixation point in the first trial may be regarded as a novel, unexpected stimulus, while the same stimulus in the second trial would no longer be novel. On the other hand, the fixation point in the first trial informed the monkey of a change or switch in the learned set, provided that the monkey was well trained. This is consistent with the hypothesis that DA neurons contribute to the processes of terminating current selections and opening new ones, as Redgrave et al. (1999a)
argued previously.
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: O. Hikosaka, Laboratory of Sensorimotor Research, National Eye Institute, National Institute of Health, 49 Convent Dr., Bldg. 49, Rm. 2A50, Bethesda, MD 20892-4435 (E-mail: oh{at}lsr.nei.nih.gov).
| REFERENCES |
|---|
|
|
|---|
Brown RG and Marsden CD. Cognitive function in Parkinson's disease: from description to theory. Trends Neurosci 13: 2129, 1990.[CrossRef][ISI][Medline]
Chevalier G and Deniau JM. Disinhibition as a basic process in the expression of striatal functions. Trends Neurosci 13: 277280, 1990.[CrossRef][ISI][Medline]
Dickinson A and Balleine B. Motivational control of goal-directed action. Anim Learn Behav 22: 118, 1994.
Hikosaka O and Wurtz RH. Visual and oculomotor functions of monkey substantia nigra pars reticulata. III. Memory-contingent visual and saccade responses. J Neurophysiol 49: 12681284, 1983.
Hikosaka O, Matsumura M, Kojima J, and Gardiner TW. Role of basal ganglia in initiation and suppression of saccadic eye movements. In: Role of the Cerebellum and Basal Ganglia in Voluntary Movement, edited by Mano N, Hamada I, and DeLong MR. Amsterdam: Elsevier, 1993, p. 213219.
Hikosaka O, Takikawa Y, and Kawagoe R. Role of the basal ganglia in the control of purposive saccadic eye movements. Physiol Rev 80: 953978, 2000.
Hollerman JR and Schultz W. Dopamine neurons report an error in the temporal prediction of reward during learning. Nat Neurosci 1: 304309, 1998.[CrossRef][ISI][Medline]
Horvitz JC. Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events. Neuroscience 96: 651656, 2000.[CrossRef][ISI][Medline]
Ikeda T and Hikosaka O. Reward-dependent gain and bias of visual responses in primate superior colliculus. Neuron 39: 693700, 2003.[CrossRef][ISI][Medline]
Itoh H, Nakahara H, Hikosaka O, Kawagoe R, Takikawa Y, and Aihara K. Correlation of primate caudate neural activity and saccade parameters in reward-oriented behavior. J Neurophysiol 89: 17741783, 2003.
Judge SJ, Richmond BJ, and Chu FC. Implantation of magnetic search coils for measurement of eye position: an improved method. Vision Res 20: 535538, 1980.[CrossRef][ISI][Medline]
Kawagoe R, Takikawa Y, and Hikosaka O. Expectation of reward modulates cognitive signals in the basal ganglia. Nat Neurosci 1: 411416, 1998.[CrossRef][ISI][Medline]
Kawagoe R, Takikawa Y, and Hikosaka O. Reward-predicting activity of dopamine and caudate neuronsa possible mechanism of motivational control of saccadic eye movement. J Neurophysiol 91: 10131024, 2004.
Ljungberg T, Apicella P, and Schultz W. Responses of monkey dopamine neurons during learning of behavioral reactions. J Neurophysiol 67: 145163, 1992.
Matsumura M, Kojima J, Gardiner TW, and Hikosaka O. Visual and oculomotor functions of monkey subthalamic nucleus. J Neurophysiol 67: 16151632, 1992.
Mink JW. The basal ganglia: focused selection and inhibition of competing motor programs. Prog Neurobiol 50: 381425, 1996.[CrossRef][ISI][Medline]
Mirenowicz J and Schultz W. Importance of unpredictability for reward responses in primate dopamine neurons. J Neurophysiol 72: 10241027, 1994.
Nambu A, Tokuno H, and Takada M. Functional significance of the cortico-subthalamo-pallidal "hyperdirect" pathway. Neurosci Res 43: 111117, 2002.[CrossRef][ISI][Medline]
Parent A and Hazrati L-N. Multiple striatal representation in primate substantia nigra. J Comp Neurol 344: 305320, 1994.[CrossRef][ISI][Medline]
Redgrave P, Prescott TJ, and Gurney K. Is the short-latency dopamine response too short to signal reward error? Trends Neurosci 22: 146151, 1999a.[CrossRef][ISI][Medline]
Redgrave P, Prescott TJ, and Gurney K. The basal ganglia: A vertebrate solution to the selection problem? Neuroscience 89: 10091023, 1999b.[CrossRef][ISI][Medline]
Robinson DA. A method of measuring eye movement using a scleral search coil in a magnetic field. IEEE Trans Biomed Eng 10: 137145, 1963.[Medline]
Sato M and Hikosaka O. Role of primate substantia nigra pars reticulata in reward-oriented saccadic eye movement. J Neurosci 22: 23632373, 2002.
Schultz W. Predictive reward signal of dopamine neurons. J Neurophysiol 80: 127, 1998.
Schultz W. Getting formal with dopamine and reward. Neuron 36: 241263, 2002.[CrossRef][ISI][Medline]
Schultz W and Romo R. Responses of nigrostriatal dopamine neurons to high-intensity somatosensory stimulation in the anesthetized monkey. J Neurophysiol 57: 201217, 1987.
Schultz W and Dickinson A. Neuronal coding of prediction errors. Annu Rev Neurosci 23: 473500, 2000.[CrossRef][ISI][Medline]
Schultz W, Apicella P, and Ljungberg T. Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task. J Neurosci 13: 900913, 1993.[Abstract]
Schultz W, Dayan P, and Montague PR. A neural substrate of prediction and reward. Science 275: 15931599, 1997.
Takikawa Y, Kawagoe R, Itoh H, Nakahara H, and Hikosaka O. Modulation of saccadic eye movements by predicted reward outcome. Exp Brain Res 142: 284291, 2002.[CrossRef][ISI][Medline]
Tepper JM, Martin LP, and Anderson DR. GABAA receptor-mediated inhibition of rat substantia nigra dopaminergic neurons by pars reticulata projection neurons. J Neurosci 15: 30923103, 1995.[Abstract]
Watanabe K, Lauwereyns J, and Hikosaka O. Neural correlates of rewarded and unrewarded eye movements in the primate caudate nucleus. J Neurosci 23: 1005210057, 2003.
This article has been cited by other articles:
![]() |
K. Nakamura, M. Matsumoto, and O. Hikosaka Reward-Dependent Modulation of Neuronal Activity in the Primate Dorsal Raphe Nucleus J. Neurosci., May 14, 2008; 28(20): 5331 - 5343. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. HIKOSAKA Basal Ganglia Mechanisms of Reward-Oriented Eye Movement Ann. N.Y. Acad. Sci., May 1, 2007; 1104(1): 229 - 249. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. HORVITZ, W. Y. CHOI, C. MORVAN, Y. EYNY, and P. D. BALSAM A "Good Parent" Function of Dopamine: Transient Modulation of Learning and Performance during Early Stages of Training Ann. N.Y. Acad. Sci., May 1, 2007; 1104(1): 270 - 288. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Hikosaka, K. Nakamura, and H. Nakahara Basal Ganglia Orient Eyes to Reward J Neurophysiol, February 1, 2006; 95(2): 567 - 584. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Haruno and M. Kawato Different Neural Correlates of Reward Expectation and Reward Expectation Error in the Putamen and Caudate Nucleus During Stimulus-Action-Reward Association Learning J Neurophysiol, February 1, 2006; 95(2): 948 - 959. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Watanabe and O. Hikosaka Immediate Changes in Anticipatory Activity of Caudate Neurons Associated With Reversal of Position-Reward Contingency J Neurophysiol, September 1, 2005; 94(3): 1879 - 1887. [Abstract] [Full Text] [PDF] |
||||
![]() |
W.-X. Pan, R. Schmidt, J. R. Wickens, and B. I. Hyland Dopamine Cells Respond to Predicted Events during Classical Conditioning: Evidence for Eligibility Traces in the Reward-Learning Network J. Neurosci., June 29, 2005; 25(26): 6235 - 6242. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Dommett, V. Coizet, C. D. Blaha, J. Martindale, V. Lefebvre, N. Walton, J. E. W. Mayhew, P. G. Overton, and P. Redgrave How Visual Stimuli Activate Dopaminergic Neurons at Short Latency Science, March 4, 2005; 307(5714): 1476 - 1479. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||