|
|
||||||||
J Neurophysiol (May 1, 2003). 10.1152/jn.01014.2002
Submitted on Submitted 7 November 2002; accepted in final form 21 January 2003
Institute of Physiology, University of Fribourg, CH-1700 Fribourg, Switzerland
| |
ABSTRACT |
|---|
|
|
|---|
Cromwell, Howard C. and Wolfram Schultz. Effects of Expectations for Different Reward Magnitudes on Neuronal Activity in Primate Striatum. J. Neurophysiol. 89: 2823-2838, 2003. In behavioral science, it is well known that humans and nonhuman animals are highly sensitive to differences in reward magnitude when choosing an outcome from a set of alternatives. We know that a realm of behavioral reactions is altered when animals begin to expect different levels of reward outcome. Our present aim was to investigate how the expectation for different magnitudes of reward influences behavior-related neurophysiology in the anterior striatum. In a spatial delayed response task, different instruction pictures are presented to the monkey. Each image represents a different magnitude of juice. By reaching to the spatial location where an instruction picture was presented, animals could receive the particular liquid amount designated by the stimulus. Reliable preferences in reward choice trials and differences in anticipatory licks, performance errors, and reaction times indicated that animals differentially expected the various reward amounts predicted by the instruction cues. A total of 374 of 2,000 neurons in the anterior parts of the caudate nucleus, putamen, and ventral striatum showed five forms of task-related activation during the preparation or execution of movement and activations preceding or following the liquid drop delivery. Approximately one-half of these striatal neurons showed differing response levels dependent on the magnitude of liquid to be received. Results of a linear regression analysis showed that reward magnitude and single cell discharge rate were related in a subset of neurons by a monotonic positive or negative relationship. Overall, these data support the idea that the striatum utilizes expectancies that contain precise information concerning the predicted, forthcoming level of reward in directing general behavioral reactions.
| |
INTRODUCTION |
|---|
|
|
|---|
Expectations for different
magnitudes of reward have powerful effects on learning and performance
(Black 1968
; Campbell and Seiden 1974
;
Flaherty 1996
). In discrete choice trials, animals consistently choose larger rewards compared with smaller rewards in
both food-deprived and nondeprived conditions (Collier
1982
; Tolman 1932
). Chimpanzees trained in a
choice task for rewards of varying magnitude with an important caveat
that the chosen reward is given to the passive (observer) monkey while
the remaining item goes to the selector, consistently selected the
larger reward from the array of items, thereby receiving the smaller
reward (Boysen and Berntson 2001
). This inapt
strategy suggests that an immediate, obvious magnitude difference in
primary reward has a powerful overriding influence in directing
behavioral reactions. More recently, it has been shown that details of
behavioral reactions used to obtain different liquid rewards in a
delayed response task varied dependently on the outcome predicted by
the presentation of an instruction cue (Watanabe et al.
2001
). Faster reaction times and longer anticipatory lick
durations were seen in trials for the preferred outcome. These findings
support the notion that animals vary details of their reactions within
these classic delay tasks dependent on the estimated value of the
future outcome.
This study was undertaken to search for the neural substrates involved
in preparing the animals to make different behavioral reactions
dependent on reward expectations. Candidate structures should be able
to integrate information concerning the reward with the movement plans
to produce appropriate responses according to expectations. One of the
main neural structures involved in these types of
motivational-sensorimotor integrations is the striatum (Berridge
and Cromwell 1990
; Mogenson et al. 1980
;
Robbins and Everitt 1992
). There has been ample evidence
that information concerning the reward influences striatal activity
(Aosaki et al. 1994
; Apicella et al.
1991
; Bowman et al. 1996
; Hikosaka et al.
1989b
). This influence has been noted on several different types of striatal responses, including the activity at the time of
reward reception following behavioral performance (Apicella et
al. 1991
) and an activation that precedes the reward delivery when the animal is expecting a particular reward outcome
(Apicella et al. 1992
; Schultz et al.
1992
).
Recently, a series of papers (Hassani et al. 2001
;
Hollerman et al. 1998
; Kawagoe et al.
1998
; Watanabe 1996
) has examined whether or not
these activations in striatum or prefrontal cortex incorporate details
concerning the upcoming reward. Differences between the rewarding
outcomes were either reward versus no reward (Hollerman et al.
1998
; Kawagoe et al. 1998
) or a comparison
between two different juice types (Hassani et al. 2001
;
Watanabe 1996
). Results of these studies showed that not
only did sustained activity that preceded the reward vary dependently
on the reward about to be received, but also activity that was
temporally separated from the reward, linked to the predictive
instruction picture or to the motor response, itself, showed a
dependence to the rewarding outcome to be received in the future. It
was hypothesized that this influence from the upcoming reward on this
activity contributed to expectation formation and was a critical
component of the goal-directed behavior the animal committed prior to
movement execution.
Here, we advance this theory a step farther by examining these
activations while monkeys work for different reward magnitudes. By
using reward magnitude differences, we take advantage of a precise
parametric property in reward outcome that can be shifted by a specific
value, and we can evaluate as to whether or not a similar shift is
observed in the neural activity of striatal neurons. Magnitude shifts
avoid confounding aspects of differing sensory qualities of the tastant
in influencing neural activity. In addition, it places the
neurophysiological findings in line with a large set of psychological
studies that have used shifts in reward magnitude as the independent
variable to examine the influence of reward contrast and incentive
relativity on motivated behavior (Flaherty 1996
). We
used a modified version of the spatial delayed task to test,
separately, the behavioral reaction to be completed (arm movement to
the left vs. arm movement to the right) and the amount of liquid reward
to be acquired (different magnitudes). These trial types allowed us to
examine how expectations for different reward magnitudes could modulate
neuronal activity during the decision, preparation, initiation, and
execution of behavioral reactions. Furthermore, using this task allows
us to directly compare our results with results from previous studies
that have used similar procedures (Hassani et al. 2001
;
Hollerman et al. 1998
). The results have been previously
presented in abstract form (Cromwell and Schultz 2000
).
| |
METHODS |
|---|
|
|
|---|
Subjects were two macaque monkeys (A: Macaca
fascicularis, male, 2.8 kg; B: M. fascicularis, male,
2.5 kg). The activity of single cells was recorded using moveable
microelectrodes during performance of a spatial delayed response task
for different reward magnitudes. Arm muscle activity, licking
movements, and eye movements were monitored during the task
performance. Electrode positions were reconstructed from small
electrolytic lesions on 50-µm-thick, cresyl violet-stained
histological brain sections. Most methods were similar to those
described in detail before (Hollerman et al. 1998
). All
experimental protocols conformed to National Institutes of Health
Guidelines and the Swiss Animal Protection Law; they were supervised by
the Fribourg Cantonal Veterinary Office.
Behavioral procedures
Animals performed in a spatial delayed response task for liquid reward using either their right or left hand. The monkey kept its hand relaxed on an immovable, touch-sensitive resting key. It faced a 13" computer monitor positioned behind a transparent plastic wall in which two small levers were mounted to the left and right of the midline. To start each trial, a color instruction picture (13° × 13°) appeared for 1 s on a computer screen above the left or right lever (Fig. 1; top). The instruction indicated both the target of a future arm movement by its position on the screen and the liquid reward delivered for correctly reacting to an upcoming trigger stimulus. After a randomly varying delay of 3.5-4.5 s following instruction onset, two identical red squares appeared simultaneously as movement trigger at the left and right positions of the instruction. The trigger determined the time of the behavioral response without indicating the spatial target or the specific reward. The animal released the resting key, touched the lever at the position previously indicated by the instruction, and received the liquid reward indicated by the instruction. Incorrect lever touch or failure to touch a lever went unrewarded. Both trigger squares extinguished on correct or incorrect lever touch or after 2.0 s without lever touch.
|
A liquid reward drop was dispensed 2.0 s after lever touch by a
computer-controlled liquid valve from a spout at the animal's mouth.
Drop sizes were 0.12, 0.18, and 0.24 ml for the low, medium, and high
magnitudes, respectively. Each reward amount was indicated at trial
onset by a specific instruction picture. Different liquid types (black
currant or raspberry juice for Animal A and grenadine or blackcurrant
juice for Animal B) were used; however, only one liquid type was used
within any one recording session. In this way, we did not confound the
preferences for the different reward juices (Hassani et al.
2001
) with the different reward amounts to be received. To
assess the influence of visual features on neuronal responses, we used
two to six different pairs of three instruction pictures in each animal
(Fig. 1; bottom). Only two instruction pictures with their
associated two liquid reward magnitudes were used in a given block of
trials. All reward magnitudes were used in combinations in which
animals showed reliable and persistent preferences (see Control Task below).
The two spatial targets and two liquid amounts alternated semi-randomly, with the consecutive occurrence of same trial types being restricted to three trials. Trials lasted 12 s irrespective of behavioral performance; intertrial intervals were 2-3 s. Closed-circuit video systems served to continuously supervise limb and mouth movements. Animals were partially fluid-deprived during weekdays and were returned to their home cages after each daily session.
Two additional trial types were used for control purposes. First, we assessed reward preferences in blocks of choice trials before or after recording from each neuron. Two different instructions for two reward magnitudes were shown simultaneously at semi-randomly alternating left and right target positions, allowing the animal to touch the lever of its choice following the trigger stimulus. Thus each pair of instruction stimuli contained one picture associated with the preferred liquid amount (larger magnitude) and one with the nonpreferred amount (smaller magnitude).
In the second task variation, we assessed further movement relationships of selected neurons in nonmovement trials that were semi-randomly interspersed with left and right movement trials in a delayed go-nogo task. The instruction picture was presented for 1.0 s at the center of the monitor instead of the left or right positions. The animal kept its hand on the resting key during the instruction-trigger delay and for 2.0 s during presentation of the same trigger stimulus as in movement trials to receive, after a further 2.0 s, the specific liquid reward magnitude indicated by the instruction. Thus nonmovement trials were indicated by the center position of the instruction.
Data acquisition
Following behavioral conditioning, animals were implanted with two horizontal cylinders for head fixation and a stainless steel chamber permitting vertical access with microelectrodes to the left striatum under deep sodium pentobarbital anesthesia and aseptic conditions. The dura was left intact. Teflon-coated, multistranded, stainless steel wires were implanted into the right extensor digitorum communis and biceps brachii muscles for EMG recordings. The implant was fixed to the skull with stainless steel screws and several layers of dental cement. Animals received postoperative antibiotics and analgesics.
Glass-insulated, platinum-plated tungsten microelectrodes positioned inside a metal guide cannula served to record extracellularly the activity of single neurons, using conventional electrophysiological techniques. Discharges from neuronal perikarya were converted into standard digital pulses by means of an adjustable Schmitt-trigger. EMGs and horizontal and vertical eye positions (infrared oculometer, Iscan) were collected during neuronal recordings. EMGs were converted into standard digital pulses by a Schmitt-trigger. Licking movements were recorded as standard digital pulses by tongue interruptions of an infrared lightbeam at the liquid spout.
Pulses from neuronal discharges and EMGs were sampled together with
digital signals from the behavioral task by a computer, together with
analog signals from EMGs using infrared monitoring of horizontal and
vertical eye movements (Iscan). Only data from neurons sampled by the
computer for
10 trials in each of the four trial types are reported.
All data from neurons suspected to covary with some task component, and
occasionally from unmodulated neurons, were stored on computer disks.
Data analysis
Data analysis was performed similarly to previous studies (see
Hassani et al. 2001
; Hollerman et al.
1998
). Task-related neuronal changes were assessed during
individual task periods with the nonparametric one-tailed Wilcoxon
signed-rank test incorporated into the evaluation software
(P < 0.01). Only task-related activations were
considered from neurons, showing statistically significant activity
increases in relation to at least one task event, compared with a 1- or
2-s control period that was either immediately before the instruction
as first task event or at a period of apparent lack of modulation
outside of the task sequence in cases of suspected preinstruction activations.
Task-related changes were compared in individual neurons during identical task periods and durations with the two-tailed Mann-Whitney U test (P < 0.01), separately between different magnitudes, between left and right targets, and between corresponding pictures of different instruction sets. Comparisons between these different conditions used the following standard time windows: 0-1 s after instruction, 0-3.0 s before the trigger, 0-1.0 s after trigger, 0-2.0 s before the reward, and 0-2.0 s after reward. Only data from neurons with insignificant differences in control periods were evaluated (P > 0.05). Reaction times (from trigger stimulus onset to key release) were collected during neuronal recordings. Because of these distributions being occasionally skewed, we compared reaction times with the nonparametric Wilcoxon matched pairs signed-rank test between trials with two levels of reward magnitude.
A simple linear regression was completed that took advantage of the
parametric features of the reward magnitude comparisons. The
relationship between spike rate and reward magnitude was analyzed using
a linear regression model, y = bx + a, in which y is the normalized spike rate and
x equals the different levels of reward magnitude. Reward
magnitude was measured as a percentage of the highest liquid amount
delivered (high level = 1, medium level = 0.7, and low
level = 0.4). Normalization of spiking rate was completed only on
the data used for the simple linear regression and was determined by
dividing the average spike rate at the time of increased neural
activity by the mean of the baseline spike rate. To compare between the
different magnitude conditions, the baseline spike rate was obtained as
the mean of the average firing rates across the different reward
magnitude conditions for each neuron. The different conditions included
low, medium, and high reward amount trials. Normalization was used to
transform the data from neuronal activations tested using all
three-reward magnitude levels in two different combinations (high level
vs. medium level and medium level vs. low level) and not for neuronal
activations tested in only two liquid magnitude conditions. Two levels
of analysis were completed. One level was more conservative and
included only those neurons that had a significant difference in
task-related periods for the two reward magnitude comparisons completed
(high level vs. medium level and medium level vs. low level), while the
second level was less stringent and required that only one comparison
be significantly different (high level vs. medium level or medium level
vs. low level) and combined with the second comparison with a trend in
the similar direction. The second analysis was completed due to the low
neuron number obtained using the first criterion (n = 7 neurons), while the numbers of neurons were greater in the less
stringent analysis (n = 39). The significance of the slope of the linear fit for the correlation coefficient r
being different from zero was computed using an estimate of the SE, which is distributed as t on N
2 df:
|
| |
RESULTS |
|---|
|
|
|---|
Behavioral performance
Behavioral data used to measure performance were obtained
throughout the recording period. For both monkeys, this included
3
sessions from the first third, second third, and the last third of
their recording periods. Within this sample, animals performed the task
>95% correctly for all trial types. This level of competence as an
average for all trial types shows that the animals learned the spatial
response task well, and overall, performed at a high accuracy level.
When accuracy was analyzed between different reward magnitude trial
types, the performance level varied. Animals performed significantly
more accurately in trials for large rewards (100%) versus trials for
the smaller reward (91.5% median values; P < 0.001, Wilcoxon signed-rank test). In choice trials, where animals could
choose either the larger or smaller of the reward liquid amounts, both
subjects reliably chose the larger reward drop size (>95% of the
trials). This preference was very consistent within a 2- to 3-h
recording session. We chose trials to analyze behavioral reactions that
had been preceded or followed by a choice test to better ensure that
the animal was discriminating between the different reward magnitude
conditions. In trials with a single target and a single reward, the
animals showed greater anticipatory licking and shorter reaction times
prior to receiving the larger reward amount and the inverse
relationship in trials for the smaller reward drop size (Table
1). These differences in behavioral
reactions were reliably consistent within the recording sessions for
individual neurons and are very similar to behavioral results obtained
in previous studies (Hassani et al. 2001
;
Watanabe et al. 2001
). The behavioral reactions occur
seconds prior to the delivery of the reward and suggest that the animal
has formed an expectation for the particular reward magnitude by the
time the behavioral reaction is initiated.
|
Muscle activity (extensor digitorium communis and biceps) was similar
between trials for smaller or larger rewards (Fig.
2). Gaze and eye movements did not vary
systematically between trials for larger or smaller rewards. Usually,
the instruction cue elicited a saccade to the image, unless the gaze
was already there. The trigger stimuli presentation elicited a saccade
in most cases to the target lever to be pressed. We found no instances
of activity during the delay period that could account for differences
in the neural activity seen in the striatum similar to previous
findings in very comparable task situations (Hassani et al.
2001
; Tremblay and Schultz 1999
).
|
Neuronal database
We studied 2,000 slowly discharging striatal neurons with control
rates between 0.1 and 3 impulses/s in the spatial delayed response
task. Of the 2,000 neurons, 374 showed 500 statistically significant
task-related activations (190 and 184 in animals A and B,
respectively). The number of activations is greater than the number of
neurons because neurons could show more than one task-related event.
The remaining cells failed to show task related modulations in raster
displays during the experiment and were not further examined. Five
different forms of task-related activity were uncovered during the
trial periods and consisted of the following: 1) responses
to instructions; 2) activations preceding trigger; 3) activations following the movement trigger; 4)
activations preceding rewards; and 5) responses to the
reward. The distinct group of tonically active neurons (Apicella
et al. 1997
; Kimura et al. 1984
) with discharge
rates of 3-8 impulses/s was not investigated.
A total of 204 of 374 task-related neurons (55%) showed statistically significant differences between at least two different reward magnitudes. Neurons could show more than one type of activation to an event or movement or preceding an event or movement so the total of activations (244) exceeds the total number of neurons (204). Activations were higher with either larger rewards (66% of discriminations; Figs. 5, 6, and 12) or smaller rewards (34%; Figs. 8, 10, 11, and 13) in all five types of task-related responses. In these same categories of responses (Instruction, Preceding-trigger, Following-trigger, Preceding-reward, and Reward), the proportion of neurons that had significantly different activity between different reward magnitude trials varied (27-100%: see Table 2). We determined the percent increase in activity from the baseline firing rate for the five main response types using the highest level of reward magnitude trials. Figure 3 presents the distribution of the activity changes for the different responses. The average percent increases of neural activity for the following response types were 121 ± 18% (SE) for the 61 instruction responses, 91 ± 12% for the 56 activations preceding the trigger, 69 ± 10% for the 31 activations following the trigger stimulus, 88 ± 13% for the 48 activations preceding rewards, and 92 ± 14% for the 48 reward responses (Fig. 3).
|
|
Regression analysis between spiking rate and reward magnitude
To better understand the relationship between single unit spike
rate and reward magnitude, a regression analysis was completed. Forty-eight neural responses were tested using all three liquid reward
amounts. Seven of these neural responses showed significant activity
differences for both reward combinations (High Reward vs. Medium Reward
and Medium Reward vs. Low Reward), while 41 responses had a significant
difference between one reward magnitude combination. Three types of
relationships were seen between neural activity and reward magnitude.
One population had the greatest activity to the high-level reward
magnitude and showed a monotonic positive relationship between reward
drop size and firing rate (n = 21; Fig.
4A; y =
0.67 + 2.375 × x; r2 = 0.89;
n = 21; t = 2.92; P < 0.001). Of these 21 neurons showing a descending magnitude relationship
(High > Med > Low), 5 responded at the time of instruction
picture presentation, 4 neurons responded during the delay period, 2 neurons responded during trigger presentation, 6 neurons responded
prior to the reward delivery and after the movement execution, and 4 responded during reward delivery. Of these 21 responses, 4 responses
had a significant difference between both reward magnitude combinations
while 17 of the responses had a significant difference between one
combination and an activity difference trend in the similar direction.
|
The second population of cells analyzed using regression analysis
showed the greatest activity in trials for the low level reward drop
size and the least activity in trials for the large reward drop size
(n = 18). These 18 neurons had either a significant difference in both reward comparison trials (n = 3) or
a significant difference in one reward comparison with a trend in the
similar direction in the second magnitude comparison (n = 15). Regression analysis showed that these activations had a clear
negative monotonic relationship to reward magnitude (Fig.
4B; y = 2.25
1.787 × x; r2 = 0.902;
n = 18; t = 5.57; P < 0.0005). Of these 18 neurons used in the ascending reward magnitude
comparison (Low > Medium > High), 2 were instruction
responses, 6 were responses during the extended delay period, 4 were
responses during the trigger presentation, 3 were activations prior to
the reward delivery, and 3 were responses during reward acquisition.
A final category of cells (n = 9) that were tested using all three reward magnitudes showed a nonlinear relationship between reward magnitude and neural activity. These neural responses had an activity change in one reward magnitude level compared with a similar response to the other two reward magnitude levels. This quadratic relationship was characterized by the highest neural response at the medium level reward magnitude in eight cases and with the highest activity at the largest reward amount in a single case. Four of these responses were to the instruction cue, one was prior to the trigger stimulus, two were prior to the reward outcome, and two were during the reward delivery.
Responses to instructions
A total of 84 neurons responded to the instruction cues with transient activations that subsided within 500 ms after stimulus disappearance (Table 2). Of these, 61 neurons (73%) discriminated between the trials for different reward liquid amounts. Spatial discrimination between the left and right instruction picture was seen in only three neurons, with all three cells showing differential activity between trials for differing reward magnitudes, all of them on a single side of arm movement. The remaining 58 cells discriminating between reward magnitudes were spatially nonselective. Eleven of these neurons were tested with all three levels of reward magnitude and showed consistently higher activity for a particular magnitude level (Fig. 5). A subset of seven nonspatial neurons was tested in nonmovement trials, and five of them showed greater activity in movement than nonmovement trials (Fig. 6). We tested 24 neurons in trials with different picture sets and found that responses in 18 of them maintained the same relationship to the picture-magnitude association and varied insignificantly between different pictures predicting the same reward magnitude (Fig. 7).
|
|
|
Activations during the instruction-trigger delay
A total of 91 task-related neurons showed activations that started during the instruction-trigger period and terminated >500 ms after instruction cue departure, either before or immediately after the trigger stimulus (Table 2). Activations in 56 of these 91 neurons (62%) differed between reward magnitudes. Activations in 4 of these 56 responses differed between the two spatial locations (left vs. right) of the cue (Fig. 8). Within this spatially selective group, activations either failed to discriminate between the different reward magnitudes on the alternate spatial location (2 neurons) or were entirely absent on the alternate spatial location (2 neurons). The remaining sustained activations were spatially nonselective (Fig. 9; n = 52). A subset of five of these activations was tested in nonmovement trials. Of these, three cells discriminated between reward magnitudes in movement trials as well movement trials.
|
|
Activations following the trigger stimulus
A total of 112 task-related neurons showed activations that closely followed the appearance of the movement trigger stimulus (Table 2). Activations in 31 of these neurons discriminated between rewards (28%). Ranking of trials according to the interval between the trigger stimulus and movement onset allowed us to determine the temporal relationships between these two events. Accordingly, the activations were classified as movement-related (73 neurons, 19 of them reward-discriminating; Fig. 10), trigger responses (9 neurons, 2 of them reward magnitude discriminating), or undefined (30 neurons, 10 of them reward magnitude discriminating). Four of the movement-related and reward magnitude discriminating responses (n = 19) differentiated between left and right movement targets, all of them on one side only. Activations with movement targets on the other side either failed to discriminate (2/4 neurons) or were entirely absent (2/4 neurons). None of the other responses showed spatial selectivity (n = 39). Out of these activations, 16 were tested in nonmovement trials. Fourteen showed higher responses in movement compared with nonmovement trials. Of these, four neurons discriminated between reward magnitudes in movement trials (Fig. 11).
|
|
Activations preceding rewards
A total of 109 task-related neurons showed activations that began
prior to reward delivery (Table 2). These activations remained present
until the reward was delivered and terminated <500 ms afterward. These
responses occurred in both movement and nonmovement trials. Prereward
activations in 48 of the 109 neurons (44%) discriminated between
rewards. In 28 of 48 (58%), the greater activity was to the larger
reward magnitude (Fig. 12), and in 20 of 48 (42%) neurons, the greater activity was seen to the smaller
reward magnitude. Previous work investigating licking actions has
supported the idea that many of the reward-discriminating, anticipatory
neuronal responses were not due to the differences in anticipatory
licking (Hassani et al. 2001
; Hollerman et al.
1998
). This observation was presently corroborated by the
subset of neurons that showed greater activity in trials with smaller
reward, because smaller rewards were usually accompanied by fewer
anticipatory licking.
|
Activations following rewards
A total of 104 task-related neurons showed responses that followed
the delivery of the reward drop to the mouth and subsided prior to the
instruction presentation of the subsequent trial (Table 2). Activations
showing close temporal relationships to licking movements were
discarded from the sample. These responses were characterized by phasic
activity during licking, time-locked to tongue extensions and
retractions and appeared similar to previously reported tongue-movement
related activity within the striatum (Apicella et al.
1991
). The reward responses varied insignificantly in activity
level between left and right lever target trials and occurred in both
movement and nonmovement trials. Activations in 48 of the 104 (46%)
neurons discriminated between reward magnitudes (Fig.
13), irrespective of the side of the
movement target. These 48 reward magnitude discriminating neuronal
activations could occur in trials in which there were no major
differences in licking behavior. All of the activations tested
(n = 12) were also observed in nonmovement trials.
|
Recording positions
Histological reconstructions of recording positions revealed that neurons were sampled in caudate nucleus, putamen, and ventral striatum, including nucleus accumbens between rostrocaudal levels A18 and A25 and mostly rostral to the anterior commisure (Fig. 14). Recordings were made throughout the entire dorsoventral extent of these structures and were mediolaterally concentrated around the internal capsule.
|
Reward magnitude-discriminating neurons were found in the caudate (41 of 83 task-related neurons, 49%), putamen (99 of 175 neurons, 56%),
and ventral striatum (64 of 116 neurons, 55%). Activations preceding
the trigger stimulus were more prevalent in the putamen compared with
the caudate nucleus or ventral striatum for both the reward
discriminating neurons (Fig. 15;
P = 0.006;
2 test) and the
nondiscriminating neurons (Fig. 15; P = 0.0001,
2 Test). Their distribution failed to vary
significantly among the rostrocaudal levels explored (Fig.
16; A18-A25; P = 0.19). The distribution of spatially discriminating cells varied
significantly between these structures (P = 0.0005),
being lower in the ventral regions of the striatum (0 neurons with
instruction, delay, or trigger activations) compared with dorsal
striatum (caudate = 2%, putamen = 5%). The distribution of
reward magnitude-discriminating neurons among the spatially
discriminating neurons varied insignificantly among the three
structures (P = 0.18). Very similar results were obtained in previous studies examining the neuronal discrimination between different rewards (Hassani et al. 2001
) or
between reward and no reward trials (Hollerman et al.
1998
).
|
|
| |
DISCUSSION |
|---|
|
|
|---|
The present results show that single neurons within the anterior
striatum distinguish between minute differences in reward magnitude
(<0.1 ml). These discriminatory activations were time-locked to the
predictive stimuli, during the delay periods and preceding and
following the liquid reward delivery. In addition, they were observed
preceding and following the arm movement to obtain the reward. The
percentage of neurons that showed a task-related activation in the
present study (374/2,000; 19%) is comparable to previous findings in
studies examining striatal activity during the delayed response task
(Hassani et al. 2001
; Hollerman et al.
1998
). The percentage of these task-related activations with an
activity difference between rewarding magnitudes was 55%. This number
is exceedingly higher than the percentage of striatal neurons that were
found to discriminate between two different rewarding outcomes (34%;
Hassani et al. 2001
) and substantially lower than the
proportion of neurons that discriminated between trials for reward
versus no reward (97%; Hollerman et al. 1998
). We
believe that these levels of responsiveness represent meaningful
signals that potentially have a large influence on behavioral
reactions. Recent neuroanatomical data on the convergence and
connectivity of corticostriatal inputs supports the idea that small
samples of striatal neurons can carry a heavy information processing
load. It has been shown that individual neurons in the striatum receive
a great amount of information in terms of the number and variety of
inputs from cortical and subcortical inputs. Individual cortical axons
are unlikely to synapse onto striatal neurons within the same subregion
but more likely to synapse onto neurons spatially segregated
(Kincaid et al. 1998
). This type of neuroanatomical
organization could enable low numbers of neurons to be sufficient for
information processing as the level of integration of information in
terms of the number of features being encoded becomes larger or the
degree of difficulty involved in making stimulus/outcome
discriminations becomes greater.
It is still unknown exactly how these differences in striatal neural activity influence behavioral reactions of the animals as they work for different reward outcomes. The neural activations at different timepoints discriminating between outcomes could influence behavior in several different manners. For example, neural activity surrounding the predictive cue could be more important in mediating cue-reward associations and neural activity surrounding the movement could be key for the coding of reward-dependent motor reactions. One well-known finding in behavioral work is that animals performing tasks for different reward outcomes display dissimilar general behavioral strategies in acquiring the disparate outcomes.
Behavioral discrimination
Psychologists and other behavioral scientists have long noted that
animals alter their anticipatory behavioral strategies dependent on the
expected reward outcome to be received (Crespi 1942
;
Flaherty 1996
; Tinklepaugh 1911;
Tolman 1932
). This behavioral discrimination has been
proposed to aid animals in the cost-benefit analyses that occur during
normal foraging (Kirshenbaum et al. 2000
). Tolman
(1932)
argued that animals learn expectancies that occur prior
to the outcome that guide the actions. When outcomes are acquired, the
end result either confirms or disconfirms those expectancies. Future
behavior is then guided by the degree of the discrepancy between
expectation and outcome. In this role, expectancies are much more
efficient if they contain information about the anticipated outcome
along with the memories of the past discrepancies following particular
behaviors. Between sets of trials with different outcomes, relative
reward effects can occur. Important properties of the tasks that
influence the behavioral expressions between these trials include
1) the disparity between the outcomes (as rewards become
more disparate, the behavioral reactions become discernibly different
prior to reward reception) (Bloomfield 1967
;
McSweeney 1975
); 2) the similarities between the predictive cues (as they become more or less similar, anticipatory expressions are altered) (Blough 1988
; Mackintosh
1974
); and 3) the time delays between events in the
task (as the delays change between events and rewards, the anticipatory
actions are altered) (Williams 1983
). In the present
study, temporal delays were similar between task events for all trials
and the predictive stimuli were shown to be highly discriminative from
one picture set to the next for signaling the different outcomes.
Disparity between outcomes remains as the potential main determinant of
the behavioral discrimination seen in the present study and reinforces
the idea that the observed neural discrimination potentially modulating general behavioral differences are rich in details concerning the
upcoming outcome.
Neural discrimination
We used a number of control tasks to better understand the relationship between neural activity and the reward outcome difference. These tasks were meant to help distinguish between possible influences of sensory or motor aspects of the task and the difference in magnitude.
MOTOR RESPONSES AND NEURAL DISCRIMINATION.
It could be that the discriminatory activity in the striatum reflects
differences in the movements made by the monkeys in different trials.
Striatal activity has been shown to be related to forearm and eye
movements (Delong et al. 1986
; Hikosaka et al.
1989a
). Despite the reliable movement differences between trials for smaller and larger rewards, the observed changes in striatal
neurons were not primarily due to the particular motor components. We
ranked trials according to reaction time speed and found no consequent
ranking for the activity rate of the individual neurons (Figs. 10 and
11). In many cases, we found that the reaction times were slower and
lick rate less, but the neural response was significantly increased.
Similar findings were observed when monkeys were working for two
different types of juice rewards (Hassani et al. 2001
).
In terms of the present work, we hypothesize that the expectation
generated at the level of the striatum is not directly linked with any
one motor parameter but influences a host of diverse behavioral
reactions that include eye, arm, and whole body movements.
SENSORY INPUT AND NEURAL DISCRIMINATION.
We used a set of different instruction pictures associated with each
reward amount to assess the contribution of image properties to the
reward-discriminating activations. Previous work has shown that there
is visual or object-related activity in the tail of the caudate nucleus
(Brown et al. 1995
). Neural activations related to
visual cues have been found in ventrolateral prefrontal cortex and
inferotemporal cortex (Liu and Richmond 2000
;
Miller et al. 1996
). These areas of cortex send
projections to regions of the striatum (Selemon and
Goldman-Rakic 1985
) and could be important in the production of
the reward influence associated with the visual cues seen in the
present study. Approximately 75% of our neurons tested with multiple
picture sets for the same reward magnitude maintained a similar profile
of responsiveness between trial blocks, i.e., they continued to respond
to the cue that predicted the same outcome (large or small reward)
independent of the changing image (Fig. 7). This supports the idea that
within a subset of striatal cells, the discrimination occurred on the basis of the upcoming reward magnitude rather than the sensory features
of the different images presented. In the orbitofrontal cortex, the
influence of the reward seems greater due to the fact that an even
higher percentage of neurons maintained responding across different
image presentations (Tremblay and Schultz 1999
). The
anterior striatal regions recorded from in the present study receive a
dense projection from the orbitofrontal cortex that could be very
important in the production of the observed responses (Haber et
al. 1995
).
AROUSAL AND NEURAL DISCRIMINATION.
It has been hypothesized that general arousal levels shift when
expectations for different rewards occur (Black 1968
).
These arousal differences may contribute to the changes in neural
activity by individual striatal neurons. This idea is clearly supported whenever there is a positive relationship between firing rate and
reward preference in terms of liquid type or amount, but the present
results show many cases where there is an inverse relationship between
reward magnitude and neuronal firing rate (Figs. 8, 10, 11, and 13). In
the inverse cases, the greatest activity was observed in trials when
the animal expected the smaller magnitude reward. This indicates that
arousal effects on neural activity based on a simple linear
relationship between reward magnitude and arousal could not explain our
results. Similar to our results, previous work in striatum has found
greater neural activity at either end of an established preference
hierarchy, supporting the idea that changes in arousal are not
primarily responsible for the differences in reward processing in this
forebrain structure (Hassani et al. 2001
;
Hollerman et al. 1998
). Others have found that activity of single striatal neurons is consistently related to the
predictability of the stimulus or the reward and not dependent on the
arousal of the animal (Bowman et al. 1996
; Liu
and Richmond 2000
; Shidara et al. 1998
). The
possibility still exists that arousal is not simply a unimodal
phenomenon related to either nonpreferred or preferred stimuli or
outcomes in a linear and interdependent fashion, but could possibly be
related to both types of outcomes in an orthogonal manner, which might
lead to a bimodal distribution of several arousal peaks at different
points along the preference hierarchy. If something like this
relationship exists and interacts with striatal processes, then there
is still a possibility for arousal states to modify the neural activity
observed in the present study.
Striatal afferents and reward expectancies
Where does the information concerning the reward and movement
arrive from that influences these striatal activations?
Movement-related inputs arrive from a host of motor areas in the
frontal cortex, including the supplementary motor area, premotor
cortex, and primary motor cortex (Alexander et al.
1986
). In addition, a large input arrives from the anterior
cingulate cortex in the medial wall of the prefrontal area
(Kunishio and Haber 1994
). These frontal motor areas
have been found to integrate information concerning environmental
events with motor processing (Boussaoud et al. 1995
). In
the premotor cortex, neurons have been shown to code for movement intention separate from sensory attention (Boussaoud
2001
). In the medial wall, cortical areas have been shown that
integrate motor sequence information with particular outcomes; neurons
in the anterior cingulate cortex respond during particular motor sequences used to acquire specific reward outcomes (Shima and Tanji 1998
). Recently, neural signals in this region have been shown to be related to the strength of the reward expectancy as progression toward the outcome was predictably changed between different schedules of reward responding (Shidara and Richmond 2002
). These results indicate that at the cortical level
integration occurs between motor information and other information.
Whether this level of integration occurs prior to or as a consequence of striatal processing is still an open matter. Cross-correlograms of
simultaneous medial cortical and striatal activity during an operant
task revealed ventral striatal neurons with an activity preceding
cortical activity (Chang et al. 2000
). This suggests that the striatum may be the source of certain cortical integrative activity within a well-learned behavioral paradigm.
Neurons in other frontal regions detect rewards and reward-predicting
stimuli. In particular, the orbitofrontal cortex has been shown to
contain neurons that differentiate well between different rewarding
items, possibly on the basis of relative reward value (Critchley
and Rolls 1996
; Gallagher et al. 1999
;
Hikosaka and Watanabe 2000
; Thorpe et al.
1983
; Tremblay and Schultz 1999
). Recent
functional magnetic resonance imaging (fMRI) data show that
distinct regions of human orbitofrontal cortex are activated by
rewarding stimuli with the degree of activation related to the
magnitude of the upcoming, expected reward (O'Doherty et al. 2001
). The striatum could employ this information in the
production of appropriate motor responses based on how the
orbitofrontal represented an outcome relative to other outcomes in the
immediate environment.
The dorsolateral prefrontal cortex has also been shown to be involved
in reward processing and subsets of neurons in this region show
sustained activation between rewards and their predictive stimuli
(Fuster 2000
). These activations can be dependent on the reward type (Watanabe 1996
) or magnitude (Leon
and Shadlen 1999
). Spatial information is known to be a
critical component of dorsolateral prefrontal function in spatial
delayed response tasks (Funahashi et al. 1989
), and it
could be that this region merges spatial and reward information
together to be sent to subcortical structures for further processing.
Other researchers have found this type of integration in the medial
prefrontal cortex (Pratt and Mizomori 2001
). Lesions to
this same region impair adaptive behavioral changes following reward
magnitude shifts in a spatial task (DeCoteau et al.
1997
).
The amygdala sends direct input to the striatum (Russchen et al.
1985
) and has been shown to be an important structure in reward
processing (Everitt et al. 1999). In the primate, it has been shown that single amygdala neurons respond to rewards and the
predictive signals related to those rewards (Nishijo et al. 1995; Rolls 1992
). It is thought to be involved
in the associative learning about reward-stimulus relationships in the
environment (Parkinson et al. 2001
; Shoenbaum et
al. 2000
). Inactivation of the amygdala decreases memories for
reward magnitude shifts (Salinas and McGaugh 1993
,
1996
), and amygdala neural responses to rewards and
reward-related stimuli are influenced by changes in the internal state
(Cromwell et al. 2001
; Rolls 1992
).
Amygdala neurons most likely process attributes of stimuli during
learning, and once learned, the structure may mediate general
influences involved in emotional processing. For the motivational
system of feeding and food seeking, these influences include the
cue-reward associations predicting food availability or the degree of
food deprivation.
Role of striatum in reward expectation
What roles could the striatum be performing in the processing of
rewards through neural expectancies? We hypothesize that the striatum
is critical for linking precise reward information with a diverse set
of movement outputs expressed together as a general behavioral
reaction. The idea that the reward information can be very precise is
critical to understand the ability of striatal processing in modulating
goal-directed action. This essential aspect of striatal functioning
most likely depends on the multi-leveled regions of convergence within
the corticostriatal system (Flaherty and Graybiel 1991
,
1993
; Haber and McFarland 2001
; Haber et
al. 2000
; Kincaid et al. 1998
) that allows these
cells to access and use a variety of information concerning the goal
objects to activate motor strategies that can be individually tuned to
the particular behavioral context. This potential influence can be
greater than mere selection because the output contains an invigoration
of a particular motor response in the face of multi-faceted
environmental choices (Cardinal et al. 2001
;
Everitt et al. 1999). A potential invigorating component
may arise from information regarding internal state changes or from
information about the relative value of an external event. It is this
diversity of inputs that allows these neurons to process fine details
of the stimulus or motor event. This same level of detail in processing
that spans motor and sensory realms has not been found to such a degree
in cortical or other subcortical structures.
A second critical aspect of striatum processing is the ability to link
the general and specific inputs to a wide range of motor reactions. The
striatum does not seem to be intimately involved in the production of
discrete motor acts (Albertin et al. 2000
; Cromwell and Berridge 1996
; Lesuzuk and Flaherty
2000
). Instead, it seems to be important in the integrating
diverse sets of information with general motor plans embedded inside
complex movement sequences (Cromwell and Berridge 1996
;
Lesuzuk and Flaherty 2000
). This function is enabled
through the broad, parallel streams of outputs from the striatum onto
several frontal cortical regions involved in motor processing
(Alexander et al. 1986
).
These features could allow the striatum to utilize information
in a distinct manner compared to cortical subsystems. The putative nature of this distinctiveness can be highlighted by comparing possible
differences in functional significance but not expectancies and working
memories. Sustained activations in the prefrontal cortex and parietal
cortex have been labeled as working memories involved in the decision
making process (Kim and Shadlen 1999
; Platt and
Glimcher 1999
; Wallis et al. 2001
); therefore
they should be critical for the production of the appropriate motor
response within that trial setting. Recent data have shown that
sustained activations within the dorsolateral prefrontal cortex can
code precise information of parametric value when the parametric
information must be memorized to perform the task correctly
(Braver et al. 1997
; Romo et al. 1999
).
In contrast, the influence of an expectancy does not seem to
necessarily include a role in coding the instrumental properties of the
task. The reward information utilized by striatal neurons in this
study, as well as other studies (Hassani et al. 2001
;
Hollerman et al. 1998
), is not crucial information
needed for proficient task performance. In this way, the expectancy
functions not as a key ingredient in decision-making but as a
comparator and amplifier that can invigorate behavioral reactions to
particular objects in the environment. Neural activations found in
anterior cingulate cortex have been proposed recently to function as
expectancies for particular rewards (Shidara and Richmond
2002
), and these activations are related to the actual
progression to the outcome through a predictable series of trials.
These predictive cues, similar to the ones in the present study, were
not instrumental for accurate behavioral output. Instead, they
represented noninstrumental aspects, including delay to reward
reception and the response effort needed to obtain the reward. In the
present study, the cues and the elicited expectancies seem to represent
the amount of absolute reward and possibly the relative magnitude. The
issue of relativity has been a critical aspect in the study of sensory and perceptual systems. It is now time to transfer these ideas more to
the study of the physiology of reward and other motivational systems.
Future studies examining the factors comprising expectations and how
these neural activations influence behavior will be vital to better
defining the "limbic processing" involved in the
motivational-to-motor-interface function of the striatum.
| |
ACKNOWLEDGMENTS |
|---|
We are grateful for the help and expert technical assistance from B. Aebischer, J. Corpataux, A. Gaillard, B. Morandi, and F. Tinguely.
This study was supported by Swiss National Science Foundation Grants 31.43331.95 and NFP38.4038-43997, the Biomed 2 program of the European Community via the Swiss Office of Education and Science (BMH4-CT95-0608 via 95.0313-1), and by an International Research Fellowship Award from the National Science Foundation to H. C. Cromwell (INT-9802538).
Present addresses: H. C. Cromwell, Department of Physiology and Pharmacology, Wake Forest University School of Medicine, Winston-Salem, NC 27157; W. Schultz, Department of Anatomy, University of Cambridge, Downing Street, Cambridge CB2 3DY, UK.
| |
FOOTNOTES |
|---|
Address for reprint requests: W. Schultz (E-mail: ws234{at}cam.ac.uk).
| |
REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
S. B. R. E. Brown and K. R. Ridderinkhof Aging and the neuroeconomics of decision making: A review Cogn Affect Behav Neurosci, December 1, 2009; 9(4): 365 - 379. [Abstract] [PDF] |
||||
![]() |
M. R. Roesch, T. Singh, P. L. Brown, S. E. Mullins, and G. Schoenbaum Ventral Striatal Neurons Encode the Value of the Chosen Action in Rats Deciding between Differently Delayed or Sized Rewards J. Neurosci., October 21, 2009; 29(42): 13365 - 13376. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. A. Pennartz, J. D. Berke, A. M. Graybiel, R. Ito, C. S. Lansink, M. van der Meer, A. D. Redish, K. S. Smith, and P. Voorn Corticostriatal Interactions during Learning, Memory Processing, and Decision Making J. Neurosci., October 14, 2009; 29(41): 12831 - 12838. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Lardeux, R. Pernaud, D. Paleressompoulle, and C. Baunez Beyond the Reward Pathway: Coding Reward Magnitude and Error in the Rat Subthalamic Nucleus J Neurophysiol, October 1, 2009; 102(4): 2526 - 2537. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. De Martino, D. Kumaran, B. Holt, and R. J. Dolan The Neurobiology of Reference-Dependent Value Computation J. Neurosci., March 25, 2009; 29(12): 3833 - 3842. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Gregorios-Pippas, P. N. Tobler, and W. Schultz Short-Term Temporal Discounting of Reward Value in Human Ventral Striatum J Neurophysiol, March 1, 2009; 101(3): 1507 - 1523. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Minamimoto, G. La Camera, and B. J. Richmond Measuring and Modeling the Interaction Among Reward Size, Delay to Reward, and Satiation Level on Motivation in Monkeys J Neurophysiol, January 1, 2009; 101(1): 437 - 447. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Schultz, K. Preuschoff, C. Camerer, M. Hsu, C. D Fiorillo, P. N Tobler, and P. Bossaerts Explicit neural signals reflecting reward uncertainty Phil Trans R Soc B, December 12, 2008; 363(1511): 3801 - 3811. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. N. Tobler, A. Kalis, and T. Kalenscher The role of moral utility in decision making: An interdisciplinary framework Cogn Affect Behav Neurosci, December 1, 2008; 8(4): 390 - 401. [Abstract] [PDF] |
||||
![]() |
V. Klucharev, A. Smidts, and G. Fernandez Brain mechanisms of persuasion: how 'expert power' modulates memory and attitudes Soc Cogn Affect Neurosci, December 1, 2008; 3(4): 353 - 366. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Belova, J. J. Paton, and C. D. Salzman Moment-to-Moment Tracking of State Value in the Amygdala J. Neurosci., October 1, 2008; 28(40): 10023 - 10030. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Bellebaum, B. Koch, M. Schwarz, and I. Daum Focal basal ganglia lesions are associated with impairments in reward-based reversal learning Brain, March 1, 2008; 131(3): 829 - 841. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Lau and P. W. Glimcher Action and Outcome Encoding in the Primate Caudate Nucleus J. Neurosci., December 26, 2007; 27(52): 14502 - 14514. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. H. Corbit and P. H. Janak Inactivation of the Lateral But Not Medial Dorsal Striatum Eliminates the Excitatory Impact of Pavlovian Stimuli on Instrumental Responding J. Neurosci., December 19, 2007; 27(51): 13977 - 13981. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-W. Sohn and D. Lee Order-Dependent Modulation of Directional Signals in the Supplementary and Presupplementary Motor Areas J. Neurosci., December 12, 2007; 27(50): 13655 - 13666. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Yamada, N. Matsumoto, and M. Kimura History- and Current Instruction-Based Coding of Forthcoming Behavioral Outcomes in the Striatum J Neurophysiol, December 1, 2007; 98(6): 3557 - 3567. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Ikeda and O. Hikosaka Positive and Negative Modulation of Motor Response in Primate Superior Colliculus by Reward Expectation J Neurophysiol, December 1, 2007; 98(6): 3163 - 3170. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. B. Kim, N. Huh, H. Lee, E. H. Baeg, D. Lee, and M. W. Jung Encoding of Action History in the Rat Ventral Striatum J Neurophysiol, December 1, 2007; 98(6): 3548 - 3556. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Taha, S. M. Nicola, and H. L. Fields Cue-evoked encoding of movement planning and execution in the rat nucleus accumbens J. Physiol., November 1, 2007; 584(3): 801 - 818. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. G. Sanfey Social Decision-Making: Insights from Game Theory and Neuroscience Science, October 26, 2007; 318(5850): 598 - 602. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. W. Balleine, M. R. Delgado, and O. Hikosaka The Role of the Dorsal Striatum in Reward and Decision-Making J. Neurosci., August 1, 2007; 27(31): 8161 - 8165. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. van Duuren, F. A. N. Escamez, R. N.J.M.A. Joosten, R. Visser, A. B. Mulder, and C. M.A. Pennartz Neural coding of reward magnitude in the orbitofrontal cortex of the rat during a five-odor olfactory discrimination task Learn. Mem., June 11, 2007; 14(6): 446 - 456. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. W. German and H. L. Fields Rat Nucleus Accumbens Neurons Persistently Encode Locations Associated With Morphine Reward J Neurophysiol, March 1, 2007; 97(3): 2094 - 2106. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Teagarden and G. V. Rebec Subthalamic and Striatal Neurons Concurrently Process Motor, Limbic, and Associative Information in Rats Performing an Operant Task J Neurophysiol, March 1, 2007; 97(3): 2042 - 2058. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. R. Roesch, T. A. Stalnaker, and G. Schoenbaum Associative Encoding in Anterior Piriform Cortex versus Orbitofrontal Cortex during Odor Discrimination and Reversal Learning Cereb Cortex, March 1, 2007; 17(3): 643 - 652. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. N. Tobler, J. P. O'Doherty, R. J. Dolan, and W. Schultz Reward Value Coding Distinct From Risk Attitude-Related Uncertainty Coding in Human Reward Systems J Neurophysiol, February 1, 2007; 97(2): 1621 - 1632. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Baxter and J. H. Byrne Feeding behavior of Aplysia: A model system for comparing cellular mechanisms of classical and operant conditioning Learn. Mem., November 1, 2006; 13(6): 669 - 680. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Wan and L. L. Peoples Firing Patterns of Accumbal Neurons During a Pavlovian-Conditioned Approach Task J Neurophysiol, August 1, 2006; 96(2): 652 - 660. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Galvan, T. A. Hare, C. E. Parra, J. Penn, H. Voss, G. Glover, and B. J. Casey Earlier development of the accumbens relative to orbitofrontal cortex might underlie risk-taking behavior in adolescents. J. Neurosci., June 21, 2006; 26(25): 6885 - 6892. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Hikosaka, K. Nakamura, and H. Nakahara Basal Ganglia Orient Eyes to Reward J Neurophysiol, February 1, 2006; 95(2): 567 - 584. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Haruno and M. Kawato Different Neural Correlates of Reward Expectation and Reward Expectation Error in the Putamen and Caudate Nucleus During Stimulus-Action-Reward Association Learning J Neurophysiol, February 1, 2006; 95(2): 948 - 959. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Samejima, Y. Ueda, K. Doya, and M. Kimura Representation of Action-Specific Reward Values in the Striatum Science, November 25, 2005; 310(5752): 1337 - 1340. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Galvan, T. A. Hare, M. Davidson, J. Spicer, G. Glover, and B. J. Casey The Role of Ventral Frontostriatal Circuitry in Reward-Based Learning in Humans J. Neurosci., September 21, 2005; 25(38): 8650 - 8656. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Campos, B. Breznen, K. Bernheim, and R. A. Andersen Supplementary Motor Area Encodes Reward Expectancy in Eye-Movement Tasks J Neurophysiol, August 1, 2005; 94(2): 1325 - 1335. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. I. G. Wilson and E. M. Bowman Rat Nucleus Accumbens Neurons Predominantly Respond to the Outcome-Related Properties of Conditioned Stimuli Rather Than Their Behavioral-Switching Properties J Neurophysiol, July 1, 2005; 94(1): 49 - 61. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. D. Reyes, R. Mozzachiodi, D. A. Baxter, and J. H. Byrne Reinforcement in an in vitro analog of appetitive classical conditioning of feeding behavior in Aplysia: Blockade by a dopamine antagonist Learn. Mem., May 1, 2005; 12(3): 216 - 220. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Taha and H. L. Fields Encoding of Palatability and Appetitive Behaviors by Distinct Neuronal Populations in the Nucleus Accumbens J. Neurosci., February 2, 2005; 25(5): 1193 - 1202. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Arkadir, G. Morris, E. Vaadia, and H. Bergman Independent Coding of Movement Direction and Reward Prediction by Single Pallidal Neurons J. Neurosci., November 10, 2004; 24(45): 10047 - 10056. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. R. Roesch and C. R. Olson Neuronal Activity Related to Reward Value and Motivation in Primate Frontal Cortex Science, April 9, 2004; 304(5668): 307 - 310. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Yamada, N. Matsumoto, and M. Kimura Tonically Active Neurons in the Primate Caudate Nucleus and Putamen Differentially Encode Instructed Motivational Outcomes of Action J. Neurosci., April 7, 2004; 24(14): 3500 - 3510. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. M. Nicola, I. A. Yun, K. T. Wakabayashi, and H. L. Fields Cue-Evoked Firing of Nucleus Accumbens Neurons Encodes Motivational Significance During a Discriminative Stimulus Task J Neurophysiol, April 1, 2004; 91(4): 1840 - 1865. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. M. Nicola, I. A. Yun, K. T. Wakabayashi, and H. L. Fields Firing of Nucleus Accumbens Neurons During the Consummatory Phase of a Discriminative Stimulus Task Depends on Previous Reward Predictive Cues J Neurophysiol, April 1, 2004; 91(4): 1866 - 1882. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. A. Yun, K. T. Wakabayashi, H. L. Fields, and S. M. Nicola The Ventral Tegmental Area Is Required for the Behavioral and Nucleus Accumbens Neuronal Firing Responses to Incentive Cues J. Neurosci., March 24, 2004; 24(12): 2923 - 2933. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Tindell, K. C. Berridge, and J. W. Aldridge Ventral Pallidal Representation of Pavlovian Cues and Reward: Population and Rate Codes J. Neurosci., February 4, 2004; 24(5): 1058 - 1069. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. D. Volkow, G.-J. Wang, Y. Ma, J. S. Fowler, W. Zhu, L. Maynard, F. Telang, P. Vaska, Y.-S. Ding, C. Wong, et al. Expectation Enhances the Regional Brain Metabolic and the Reinforcing Effects of Stimulants in Cocaine Abusers J. Neurosci., December 10, 2003; 23(36): 11461 - 11468. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Schoenbaum and B. Setlow Lesions of Nucleus Accumbens Disrupt Learning about Aversive Outcomes J. Neurosci., October 29, 2003; 23(30): 9833 - 9841. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |