JN Fuel your research with LabChart
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Neurophysiol 94: 2457-2471, 2005. First published June 15, 2005; doi:10.1152/jn.00373.2005
0022-3077/05 $8.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
94/4/2457    most recent
00373.2005v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (26)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Roesch, M. R.
Right arrow Articles by Olson, C. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Roesch, M. R.
Right arrow Articles by Olson, C. R.

Neuronal Activity in Primate Orbitofrontal Cortex Reflects the Value of Time

Matthew R. Roesch1,2 and Carl R. Olson1,2

1Center for the Neural Basis of Cognition, Mellon Institute; and 2Department of Neuroscience, University of Pittsburgh, Pittsburgh, Pennsylvania

Submitted 12 April 2005; accepted in final form 11 June 2005


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Neurons in monkey orbitofrontal cortex (OF) are known to respond to reward-predicting cues with a strength that depends on the value of the predicted reward as determined 1) by intrinsic attributes including size and quality and 2) by extrinsic factors including the monkey's state of satiation and awareness of what other rewards are currently available. We pose here the question whether another extrinsic factor critical to determining reward value—the delay expected to elapse before delivery—influences neuronal activity in OF. To answer this question, we recorded from OF neurons while monkeys performed a memory-guided saccade task in which a cue presented early in each trial predicted whether the delay before the monkey could respond and receive a reward of fixed size would be short or long. OF neurons tended to fire more strongly in response to a cue predicting a short delay. The tendency to fire more strongly in anticipation of a short delay was correlated across neurons with the tendency to fire more strongly before a large reward. We conclude that neuronal activity in OF represents the time-discounted value of the expected reward.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Primate orbitofrontal cortex (OF) is thought to play a critical role in the evaluation of anticipated rewards. This view is based on the fact that lesions of OF interfere with reward evaluation (Baylis and Gaffan 1991Go; Butter and Snyder 1972Go; Butter et al. 1969Go, 1970Go; Dias et al. 1996Go; Gaffan and Murray 1990Go; Iversen and Mishkin 1970Go; Izquierdo and Murray 2004Go; Jones and Mishkin 1972Go; Meunier et al. 1997Go) and is supported as well by the results of microelectrode recording studies demonstrating that neuronal activity is influenced by the value of an expected or experienced reward (Critchley and Rolls 1996Go; Hikosaka and Watanabe 2000Go; Roesch and Olson 2004Go; Rolls et al. 1996Go; Schoenbaum et al. 1998Go, 1999Go; Thorpe et al. 1983Go; Tremblay and Schultz 1999Go, 2000Go; Wallis and Miller 2003Go). Neuronal activity in OF discriminates between appetitive versus aversive outcomes (Schoenbaum et al. 1998Go, 1999Go; Thorpe et al. 1983Go), reward versus no reward (Tremblay and Schultz 2000Go), large versus small reward (Roesch and Olson 2004Go; Wallis and Miller 2003Go), and preferred versus nonpreferred foodstuff (Hikosaka and Watanabe 2000Go; Tremblay and Schultz 1999Go). In addition it is influenced by extrinsic factors that determine the value attached to a given reward, including how satiated the monkey is (Critchley and Rolls 1996Go) and what alternative rewards are available during a given session (Tremblay and Schultz 1999Go). These observations suggest that OF is part of a circuit that evaluates disparate types of future rewards, allowing for comparisons to be made on a common valuation scale (Montague and Berns 2002Go).

In light of these observations, it is reasonable to speculate that the delay expected to elapse before delivery of a reward might also affect neuronal activity in OF. Across a wide range of species, extending from pigeons to humans, value judgments are subject to time discounting. A reward of a given size is perceived as having greater or lesser value according to whether delivery is anticipated after a shorter or longer delay (Cardinal et al. 2001Go; Evenden and Ryan 1996Go; Herrnstein 1961Go; Ho et al. 1999Go; Lowenstein 1992Go; Mobini et al. 2002Go; Montague and Berns 2002Go; Thaler 1981Go). That monkeys engage in time discounting was demonstrated in a recent study from our laboratory (Roesch and Olson 2005Go). The monkeys in this study performed a variable-delay version of the memory-guided saccade task. A cue presented early in each trial indicated whether the delay intervening before the monkey could make a saccade and receive a reward would be long (2,500 ms) or short (500 ms). The essential behavioral finding was that monkeys were more motivated when working for a reward at short delay as indicated by a reduction in the frequency with which they aborted trials by breaking fixation. This indicates that they placed higher value on the reward when it was expected sooner. That monkeys engage in time discounting is also suggested by the fact that their performance improves as they progress through multiple trials in anticipation of receiving a reward at the end of the sequence (Liu et al. 2004Go; Shidara and Richmond 2002Go, 2004Go).

No previous experiment has posed the question whether neuronal activity in OF represents the time-discounted value of an expected reward. However, results obtained in other cortical areas suggest that it might. Neuronal activity in dorsolateral prefrontal cortex, the frontal and supplementary eye fields, premotor cortex, and the supplementary motor area, when monitored in the context of the variable-delay task, was found to depend on the length of the anticipated delay (Roesch and Olson 2005Go). Firing tended to be stronger in anticipation of a short delay just as it tended to be stronger in anticipation of a large reward; moreover, the tendency for a neuron to fire more strongly in anticipation of a short delay was positively correlated with its tendency to fire more strongly in anticipation of a large reward. Delay-dependent activity in these areas may simply have reflected motivational modulation of the monkey's state of motor preparation and need not have represented the time-discounted value of the reward (Roesch and Olson 2003Go, 2004Go). Nevertheless, the fact that neuronal activity increased in anticipation of a short delay, like the observation that monkeys performed better in anticipation of a short delay, encourages the view that neural representations of value somewhere in the brain—including, perhaps, OF—were enhanced by the prediction of a short delay.


    METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Subjects

Two adult male rhesus monkeys were used (Macaca mulatta; laboratory designations P and F). Experimental procedures were approved by the Carnegie Mellon University Animal Care and Use Committee and were in compliance with the guidelines set forth in the United States Public Health Service Guide for the Care and Use of Laboratory Animals.

Preparatory surgery

At the outset of the training period, each monkey underwent sterile surgery under general anesthesia maintained with isofluorane inhalation. The top of the skull was exposed, bone screws were inserted around the perimeter of the exposed area, a continuous cap of rapidly hardening acrylic was laid down so as to cover the skull and embed the heads of the screws, a head-restraint bar was embedded in the cap, and scleral search coils were implanted on the eyes, with the leads directed subcutaneously to plugs on the acrylic cap (Robinson 1963Go). After initial training, recording chambers were implanted into the acrylic. For this purpose, a 2-cm-diameter disk of acrylic and skull overlying the left hemisphere was removed. A cylindrical recording chamber was cemented into the hole with its base flush to the exposed dural membrane. The chamber was centered at approximately anterior 23 mm and lateral 23 mm with respect to the Horsley–Clarke reference frame.

Variable-delay task

The monkeys performed a memory-guided saccade task in which a cue presented early in each trial predicted a short (500-ms) or a long (2,500-ms) delay period. Essential features of the task are summarized in Fig. 1A. Each trial began with onset of a central fixation spot. At a point in time 50 ms after attainment of fixation, the spot was transformed to a cue the shape and color of which signified the length of the upcoming delay period. After 400 ms two potential targets appeared at diametrically opposed locations to the right and left of fixation. A directional cue identical to the fixation cue except in size was then presented for 250 ms in superimposition on one of the targets. After a 500-ms (or 2,500-ms) delay period, the fixation spot was extinguished, whereupon the monkey was required to make a saccade directly to the previously cued target and to maintain fixation on it for 300–450 ms after saccade completion, at which time a juice reward was delivered. There were four possible conditions representing all possible combinations of delay length (short or long) and direction (right or left). The targets were always placed at standard locations directly to the right and left of fixation because neurons in OF are not reported to possess well-localized response fields and, indeed, only rarely exhibit selectivity for response direction (Wallis and Miller 2003Go). The conditions were interleaved in pseudorandom order according to the rule that one trial conforming to each condition had to be completed successfully before initiation of the next block of four trials. To prevent confounding activity related to delay length with selectivity for the visual properties of the cues, the cue convention was reversed after each block of 40 successful trials. The collection of data from a given neuron commonly continued until 80 trials had been completed successfully. Further details of the task and stimuli are described in a previous publication (Roesch and Olson 2005Go).



View larger version (17K):
[in this window]
[in a new window]
 
FIG. 1. Sequence of events during the variable-delay task (A) and variable-reward task (B). Each panel represents the screen in front of the monkey during successive epochs of single representative trial. All items on the screen are stimuli visible to the monkey with the exception of the broken circle (which indicates direction of gaze) and the arrow (which indicates direction of saccade). Delay length was predicted by the color and shape of the image that served as fixation target (A2A5) and directional cue (A4). Two images (displayed under A5) predicted a 500- or 2,500-ms delay. Two images (displayed under B9) predicted a small (0.1 ml) or big (0.3 ml) juice reward.

 
Variable-reward task

To determine whether neurons sensitive to variable delay were also sensitive to variable-reward size we had monkeys perform the variable-reward task. Task order was random across sessions, but alternated within a recording session. Essential features of the task are summarized in Fig. 1B. It was similar to the variable-delay task except in that 1) the delay was fixed at 1,500 ms and 2) the cue at the beginning of the trial predicted a big (0.3 ml) or small (0.1 ml) juice reward. Further details of the task and stimuli are described in a previous publication (Roesch and Olson 2003Go).

Single-neuron recording

At the beginning of each day's session, a vertically oriented transdural guide tube was advanced to a depth such that its tip was approximately 1 cm above OF. A varnish-coated tungsten microelectrode with an initial impedance of several megohms at 1 kHz (FHC, Bowdoinham, ME) was then advanced through the guide tube into the brain. The guide tube could be placed reproducibly at anterior–posterior and mediolateral points forming a square grid with 1-mm spacing (Crist et al. 1988Go). The action potentials of a single neuron were isolated from the multineuronal trace by means of an on-line spike-sorting system using a template-matching algorithm (Signal Processing Systems, Prospect, Australia). The spike-sorting system, on detection of an action potential, generated a pulse the time of which was stored with 1-ms resolution.

Experimental control and data collection

All aspects of the behavioral experiment, including presentation of stimuli, monitoring of eye movements, monitoring of neuronal activity, and delivery of reward, were under the control of a Pentium-based computer running Cortex software provided by R. Desimone, Laboratory of Neuropsychology, National Institute of Mental Health. Eye position was monitored by means of a scleral search coil system (Riverbend Instruments, Birmingham, AL). The X- and Y-coordinates of eye position were stored with 4-ms resolution. Stimuli generated by an active matrix LCD projector were rear-projected onto a frontoparallel screen 25 cm from the monkey's eyes.

Analysis of the dependence of behavior on delay length and reward size

We used paired t-tests to compare, across sessions, the session means of the following measures obtained on short- versus long-delay trials (or big- vs. small-reward trials): reaction time, error rate, and fixation-break rate. Reaction time was defined as the delay from offset of the fixation spot to the moment when the eye left the central fixation window. Error rate was defined as the number of trials on which a saccade was directed to the wrong target expressed as a percentage of all trials on which a saccade was directed to either target. Fixation-break rate was defined as the percentage of all trials on which the eye left the central fixation window before offset of the fixation spot. In all tests, the criterion for statistical significance was taken as P ≤ 0.05. In all tests, the distribution of the pairwise differences did not deviate from normality.

Analysis of the dependence of firing rate on task factors

We used two-factor ANOVAs to analyze the impact of delay length (or reward size) and response direction on the firing rate of each neuron. We independently analyzed data from seven trial epochs: 1) from delay cue onset to directional cue onset (700 ms), 2) from onset to offset of the directional cue (250 ms), 3) 250 ms beginning with directional cue offset, 4) 250 ms before fixation spot offset, 5) 200 ms before saccade initiation, 6) from saccade onset to 100 ms after saccade completion, and 7) 100 ms before to 100 ms after initiation of reward delivery. These correspond to the boxes labeled I–VII at the base of Fig. 5. In all tests, the criterion for statistical significance was taken as P ≤ 0.05.



View larger version (30K):
[in this window]
[in a new window]
 
FIG. 5. Impact of delay length on the activity of 154 neurons studied in the context of the variable-delay task. A: mean firing rate as a function of time under 4 conditions defined by crossing delay length (short or long) with response direction (preferred or antipreferred as defined on the basis of each neuron's firing rate during an epoch starting with the onset of the directional cue and ending at saccade initiation). Data on the left are aligned on directional cue onset. Data on the right are aligned on saccade onset. Data from the intermediate 2,000 ms of the long-delay period are not shown. Blue, short-delay conditions; red, long-delay conditions; thick, preferred direction; thin, nonpreferred direction. Data are combined across monkeys. B: difference in population firing rate between short- and long-delay conditions as a function of time during the trial. Positive (blue) values indicate that the firing rate was higher on short-delay trials. C: frequency of cases in which there was a significant main effect of delay length on neuronal firing rate (blue = stronger firing on short-delay trials; red = stronger firing on long-delay trials) during 7 analysis epochs (I–VII) indicated at the bottom of the figure. D: difference in population directional signal between short- and long-delay conditions as a function of time during the trial. Positive (blue) values indicate that the directional signal was stronger on short-delay trials. Directional signal was taken as the firing rate on preferred-direction trials minus the firing rate on antipreferred-direction trials. E: frequency of cases in which firing rate depended significantly on the interaction between delay length and response direction (blue = stronger directional signal on short-delay trials; red = stronger directional signal on long-delay trials) during 7 trial epochs (I–VII) indicated at the bottom of the figure.

 
Localization of recording sites

To characterize the location of the recording sites relative to gross anatomical landmarks, we projected the sites onto structural magnetic resonance (MR) images. The images were collected by use of a Brükker 4.7-T magnet in which the anesthetized monkey was supported by an MR-compatible stereotaxic device. Frontoparallel slices of 2 mm thickness spanning the entire brain were collected. Projection of recording sites onto the MR images was accomplished by reference to the image of an electrode inserted into the brain near the center of the recording zone and at known coordinates relative to the recording grid.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Recording sites

We recorded neuronal activity in OF during the performance both of the variable-delay task (Fig. 1A) and the variable-reward task (Fig. 1B). All recording sites in each monkey were within 2 mm of the point indicated in the corresponding frontoparallel image in Fig. 2. The indicated zone corresponds closely to a region shown in previous studies to contain neurons sensitive to the value of a predicted reward (Roesch and Olson 2004Go; Rolls 2000Go; Thorpe et al. 1983Go; Tremblay et al 1999Go). Neurons were selected for study if, on preliminary testing, they seemed to exhibit any form of phasic activity in conjunction with task performance. In the context of the variable-delay task, we recorded from 154 neurons (83 in monkey F and 71 in monkey P). In the context of the variable-reward task, we recorded from 152 neurons (83 in monkey F and 69 in monkey P). Whenever possible (148 cases) the same neuron was studied in both tasks and is represented in both databases.



View larger version (63K):
[in this window]
[in a new window]
 
FIG. 2. Recording zones shown in frontoparallel magnetic resonance (MR) images of the brain of (A) monkey P and (B) monkey F. A black circle marks the center of the recording zone in each monkey. All recording sites were within 2 mm of the center of the circle. Scale bar: 5 mm. PS, principal sulcus.

 
Impact of delay length and reward size on behavior

LENGTH OF ANTICIPATED DELAY. As an index of the impact of anticipated delay on motivation, we measured the tendency for the monkeys to abort a trial by prematurely breaking fixation. The results indicate 1) that the tendency to break fixation declined over the course of the trial, as if the monkeys became more invested in completing the trial as it proceeded; 2) that the tendency to break fixation was reduced on short- compared with long-delay trials; and 3) that this effect was particularly strong during the first second of the trial, before the variable segment of the delay period, when the only difference between trials lay in the monkey's anticipation of a short or long delay. These points are supported by Fig. 3C, which represents the number of fixation breaks occurring at each time during the trial (in 500-ms bins) and under each condition (short or long delay), as a fraction of all fixation breaks committed by the monkeys. Overall, cases in which a trial was aborted by breaking fixation, even during the first second, were rare. Out of all long-delay trials, 2.6% were terminated by a fixation break during this period (5.6 and 0.03% in monkeys P and F, respectively). Out of all short-delay trials, 0.03% were terminated by a fixation break during this period (0.06 and 0.00% in monkeys P and F, respectively). The difference in these counts was highly significant in the data collapsed across monkeys (t-test, P < 0.001) and in monkey P (t-test, P < 0.0001). The number of fixation breaks in monkey F was so low (this monkey literally never broke fixation on a short-delay trial) that the trend fell short of significance (t-test, P = 0.15). We conclude that monkey P was more motivated to complete the trial if the anticipated delay was short and infer that the same was true of monkey F as well.



View larger version (30K):
[in this window]
[in a new window]
 
FIG. 3. Performance measures sensitive to delay length in the variable-delay task (AC) and reward size in the variable-reward task (DF). Error rate (A, D): number of trials on which a saccade was directed to the wrong target expressed as a fraction of all trials on which a saccade was directed to one target or the other. Reaction time (B, E): average interval between fixation spot offset and saccade initiation on all trials in which the monkey made a saccade to the correct target. Fixation breaks (C, F): percentage of all fixation breaks (summed across short- and long-delay trials in C and summed across big- and small-reward trials in F) committed under a given condition during each 0.5-s epoch. Asterisks: planned comparisons revealing statistically significant differences (t-test, P < 0.05). Error bars indicate SE.

 
LENGTH OF ELAPSED DELAY. Two behavioral measures taken at the end of the delay period revealed better performance on short- compared to long-delay trials. First, the error rate (percentage of trials when the incorrect target was selected relative to all trials when one target or the other was selected), although extremely low overall (amounting to around one wrong choice per recording session), was lower on short- than on long-delay trials (Fig. 3A). This effect was significant in monkey P (t-test, P < 0.05). Second, the behavioral reaction time was lower on short- than on long-delay trials (Fig. 3B). This effect was significant in data collapsed across monkeys (t-test, P < 0.0001) and in each monkey considered individually (t-test, P < 0.05). These results indicate that the monkeys tended to be in a state of heightened response preparation (reflected by a simultaneous improvement of accuracy and speed) after a short delay compared with a long delay. This could be attributable either 1) to the passage of a shorter period of time after onset of the trial or 2) to the lingering effect of a state-change induced by presentation of the short-delay cue.

SIZE OF ANTICIPATED REWARD. In data from the variable-reward task, the same behavioral measures revealed the following results. 1) The error rate was lower on big- than on small-reward trials (Fig. 3D). This effect was significant in data collapsed across monkeys (t-test, P < 0.05) and in a post hoc test of monkey P (P < 0.05). 2) The behavioral reaction time was inconsistently related to anticipated reward size (Fig. 3E). It was significantly lower on big-reward trials in monkey P (t-test, P < 0.0001) and on small-reward trials in monkey F (t-test, P < 0.0001). 3) Fixation breaks summed across the entire 2,500-ms period after presentation of the delay cue were more frequent under the small- than under big-reward condition (Fig. 3F). This effect was significant in the data collapsed across monkeys (t-test, P < 0.05) and in a post hoc test on data from monkey P (t-test, P < 0.01). The error-rate and fixation-break measures suggest that the monkeys were more motivated on big-reward trials. However, for unknown reasons, the observed effects were not as robust as in a previous study (Olson and Roesch 2003Go). In light of this fact, it is important to note that the interpretation of neuronal data does not hinge critically on there being a strong relation between the size of the predicted reward and these particular behavioral measures. It hinges only on the ability of cues predicting large and small rewards to elicit responses of different strength in OF.

Early activity correlated with anticipated delay length

EXAMPLE. In some individual neurons, the firing rate was clearly dependent on the length of the anticipated delay. An example is shown in Fig. 4, A and B. During the period after presentation of the delay-predicting cue, this neuron fired more strongly when the predicted delay was short than when it was long (Fig. 4, A vs. B, leftmost panels). Data in this figure are collapsed across response direction because the firing rate of the neuron was unaffected by direction.



View larger version (35K):
[in this window]
[in a new window]
 
FIG. 4. Data from a neuron sensitive to predicted delay length in the variable-delay task and predicted reward size in the variable-reward task. Histograms have been parcellated so as to bring data from different conditions into alignment with respect to the time when the delay- or reward-predicting cue was presented (left) and the time when the saccade was initiated (right). Middle: activity during the intervening delay period is represented. Histograms represent firing rate as averaged across response directions. Firing rate did not differ significantly as a function of response direction (ANOVA; P > 0.5). In each raster display, the first 20 lines are from trials involving a leftward saccade and the second 20 lines are from trials involving a rightward saccade.

 
POPULATION HISTOGRAMS. To characterize the impact of anticipated delay on the activity of the entire recorded population, we constructed population histograms representing mean firing rate under four conditions determined by crossing delay length (short or long) with response direction (in the neuron's preferred or antipreferred direction as defined on the basis of the firing rate during the delay period after the directional cue). It should be noted that OF neurons are not commonly reported to exhibit spatial selectivity for cues or responses and have never been described as possessing well-circumscribed response fields (Wallis and Miller 2003Go). Nevertheless, for each neuron, it was possible to define the preferred direction as the standard location, located directly to the left or right of fixation, that was associated with stronger firing. The results, shown in Fig. 5A1, demonstrate a clear enhancement of population activity on short-delay (blue curve) as compared with long-delay (red curve) trials. The enhancement appears to have been present both during the period immediately after presentation of the delay cue, when the direction of the response was not yet known, and during the period after presentation of the directional cue, when the firing varied according to the direction of the response—preferred (thick curve) or antipreferred (thin curve)—but, under each directional condition, was higher in anticipation of a short delay (blue curve) than in anticipation of a long delay (red curve).

DIFFERENCE HISTOGRAMS. As a graphic representation of the tendency for the population to fire more strongly on short- than on long-delay trials, we constructed a difference histogram representing, as a function of time during the trial, the index (SP + SA – LP – LA)/2, where SP is the firing rate under the short-delay, preferred-direction condition, LA is the firing rate under the long-delay, antipreferred-direction condition, and so on (Fig. 5B1). This shows that the strongest delay-related activity occurred during the period immediately after the delay-predicting cue. As a graphic representation of the tendency for the population directional signal (firing rate on preferred-direction trials minus firing rate on antipreferred-direction trials) to be stronger on short- than on long-delay trials, we constructed a difference histogram representing, as a function of time during the trial, the index (SP – SA – LP + LA)/2 (Fig. 5D1). This shows that there was a tendency, but only an extremely weak one, for the directional signal to be stronger on short-delay trials.

STATISTICAL ANALYSIS OF DATA FROM INDIVIDUAL NEURONS. Although the population and difference histograms indicate tendencies that were present across the neuronal population as a whole, they do not indicate how consistent or statistically significant these tendencies were. To determine in how many neurons the length of the anticipated delay significantly affected the firing rate, we carried out an ANOVA with firing rate as the dependent variable and with delay length and response direction as independent variables. This analysis focused on three epochs marked at the base of Fig. 5E1: epoch I (700 ms beginning with delay cue onset and ending with directional cue onset, epoch II (250 ms beginning with onset and ending with offset of the directional cue), and epoch III (250 ms beginning with directional cue offset). Direction was included as a factor even in the analysis of data from epoch I, before display of the directional cue, so as to maintain a parallel to the analysis of data from later epochs. Because the directional cue was not displayed until after epoch I, any main effects of direction or interaction effects involving direction could have reflected only type I errors and accordingly were not considered.

AFTER THE DELAY CUE. Out of 154 recorded neurons, 28 (18%) exhibited a significant main effect of delay during epoch I. Out of these 28 neurons, those firing significantly more strongly under the short-delay condition (n = 27; blue symbols in Fig. 5C1) dramatically outnumbered those firing more weakly under the short-delay condition (n = 1; red symbol in Fig. 5C1). This effect was not significantly different between monkeys and was highly significant in the data from the two monkeys combined ({chi}2 test, P < 0.0001).

AFTER THE DIRECTIONAL CUE. During epochs II and III, the effects observed in the two monkeys were inconsistent. In monkey P, a significant majority of neurons fired more strongly in anticipation of a short delay ({chi}2 test, P < 0.001). Furthermore, in monkey P, the strength of the delay-related signal during this period was positively and significantly correlated with its strength during the epoch after the delay cue (r2 = 0.42, P < 0.0001). In contrast, in monkey F, the number of neurons firing more strongly in anticipation of a short delay did not exceed the number expected by chance. Rather, a significant majority of neurons fired more strongly in anticipation of a contralateral response ({chi}2 test, P < 0.001). In neither monkey did neurons in which the directional signal was stronger under the short-delay condition (blue symbols in Fig. 5E1) significantly exceed in number neurons showing the opposite effect (red symbols in Fig. 5E1).

SUMMARY. A single robust and consistent effect emerged from this analysis: the phasic response to the delay cue was stronger when it predicted a short delay than when it predicted a long delay.

Late activity correlated with elapsed delay length

POPULATION AND DIFFERENCE HISTOGRAMS. In the population histograms (Fig. 5A2), it is evident that the mean firing rate began to increase around the time of the imperative cue (offset of the fixation spot) and peaked at the time of saccade initiation around 200 ms later; then it continued to climb and peaked a second time before reward delivery. Throughout the period after saccade initiation, population activity was substantially higher after a long delay (red curves) than after a short delay (blue curves). This tendency is evident in the difference histogram of Fig. 5B2, where the downward-pointing red region indicates the period of time during the trial when firing was stronger after a long delay.

BEFORE THE SACCADE. Results obtained from statistical analysis of data from epoch IV (a 250-ms period immediately preceding offset of the fixation spot) were inconsistent. In monkey F, neurons firing significantly more strongly after a short delay (n = 12) significantly outnumbered those firing significantly more strongly after a long delay (n = 2) ({chi}2 test, P < 0.01). In monkey P, the effect was reversed (n = 2 vs. 13) and was also significant (P < 0.01). During epoch V (a 200-ms period immediately preceding saccade initiation), the difference in number between neurons in the two categories achieved significance neither in the combined data nor in either monkey considered individually. During these epochs, effects involving an interaction between delay and direction did not significantly exceed the rate expected from type I errors.

AFTER THE SACCADE. During epoch VI (extending from saccade onset to 100 ms after saccade completion) and epoch VII (extending from 100 ms before to 100 ms after reward delivery), the number of neurons exhibiting a significant main effect of delay length was significantly higher than expected by chance (22/154 = 14% during epoch VI and 37/154 = 24% during epoch VII). Moreover, neurons firing more strongly after a long delay consistently outnumbered those firing more strongly after a short delay. This effect was not significantly different between monkeys and was significant in the data from the two monkeys combined ({chi}2 test, P < 0.05 and P < 0.0001 during epochs VI and VII, respectively). Effects involving an interaction between delay and direction did not significantly exceed the rate expected from type I errors during either epoch.

SUMMARY. A single robust and consistent effect emerged from this analysis: beginning with onset of the saccade, neuronal activity was markedly stronger after a long than a short delay.

Relation between early and late effects dependent on delay length

To determine whether effects occurring at the end of the delay period (elapsed-delay effects) were correlated with effects occurring at the beginning of the trial (anticipated-delay effects), we computed, for each neuron, indices reflecting the dependency of its firing rate on delay-length during 1) a predelay epoch extending from delay-cue onset to directional-cue onset and 2) a postdelay epoch extending from saccade initiation to reward delivery. The delay index, (S – L)/(S + L), where S and L are the firing rates on short-delay and long-delay trials, respectively, was positive in the case of any neuron firing more strongly when the delay length was short and negative in the case of any neuron firing more strongly when the delay was long. We also characterized each neuron as exhibiting or not exhibiting a significant main effect of delay length during the epoch in question (ANOVA with firing rate as dependent variable and delay length and direction as independent variables; Table 1).


View this table:
[in this window]
[in a new window]
 
TABLE 1. Counts of neurons exhibiting significant effects in an ANOVA

 
The distribution of delay indices for the predelay epoch (Fig. 6A) was shifted significantly above zero (t-test, P < 0.0001). The distribution of delay indices during the postdelay epoch (Fig. 6B) was shifted significantly below zero (t-test, P < 0.0001). The same trends were present and significant when consideration was confined to neurons exhibiting a significant dependency on delay length (shaded bars in Fig. 6, A and B; Table 1). To determine whether the tendency for a neuron to exhibit postdelay modulation was correlated with its tendency to exhibit predelay modulation, we plotted the indices obtained during the postdelay epoch against the delay indices obtained during the postdelay epoch (Fig. 6C). Across the entire population there was a weak but significant negative correlation (Fig. 6C, r2 = 0.085; P = 0.0003). We conclude that the tendency to fire more strongly before a short delay was weakly correlated with the tendency to fire more strongly after a long delay.



View larger version (23K):
[in this window]
[in a new window]
 
FIG. 6. Distribution of delay indices for all 154 neurons studied in the context of the variable-delay task. A: delay indices computed on the basis of firing during the predelay epoch (onset of the delay-cue to onset of the directional cue). B: delay indices computed on the basis of firing during the postdelay epoch (onset of saccade to delivery of reward). Number of observations, the mean of the distribution, and the level of significance at which it differed from zero are given by n, µ, and P, respectively. Dark gray bars represent neurons in which the dependence of firing rate on delay length achieved statistical significance (ANOVA, P < 0.05). C: delay indices from the postdelay period plotted against delay indices from the predelay period. Delay index = (S – L)/(S + L), where S and L represent firing rates on short- and long-delay trials, respectively.

 
To understand the practical impact of the correlation in terms of firing rate measures, we constructed population histograms for separate populations of neurons that did and did not exhibit significantly enhanced activity during the predelay epoch on short-delay trials (Fig. 7, A vs. B). As expected in light of the selection criterion, the population in Fig. 7A showed pronounced enhancement during the predelay period (blue curve markedly higher than red curve), whereas the population in Fig. 7B did not. During the postdelay period, both populations exhibited stronger firing after a long delay (red curve above blue curve), but the magnitude of the effect was slightly higher for the population in Fig. 7A than for that in Fig. 7B, in harmony with the existence of a weak correlation as described above.



View larger version (35K):
[in this window]
[in a new window]
 
FIG. 7. Population histograms representing firing rate as a function of time during the trial for 148 neurons studied in the context of both the variable-delay and the variable-reward task. Neurons were grouped according to whether they (A) did (n = 25) or (B) did not (n = 123) fire significantly more strongly under the short- than under the long-delay condition in the variable-delay task (ANOVA, P < 0.05) during the predelay epoch extending from onset of the delay cue to onset of the directional cue. Blue, short-delay trials; red, long-delay trials. Data are combined across monkeys and response directions. Activity of the same groups of neurons in the context of the variable-reward task is shown in Fig. 11.

 
Early activity correlated with anticipated reward size

RATIONALE. It would be reasonable to speculate that OF neurons responded more strongly to the cue predicting a shorter delay because, with time discounting taken into account, the anticipated reward held greater value. To test this idea required assessing how the same neurons responded to a manipulation of reward value achieved by a means other than varying the delay. For this purpose, we manipulated the size of the predicted reward.

EXAMPLE. Manipulating anticipated reward size had a clear effect on the firing of some OF neurons. An example is shown in Fig. 4, C and D. This neuron fired more strongly in anticipation of a big reward (Fig. 4C, leftmost panel) than in anticipation of a small reward (Fig. 4D, leftmost panel).

POPULATION HISTOGRAMS. To characterize reward-dependent effects at the population level, we constructed population curves representing mean firing rate as a function of time under the four trial conditions (Fig. 8A1). These revealed that, after presentation of the cue predicting reward size, there was a sharp phasic increase in the population activity in OF. The mean firing rate peaked at a higher level after a cue predicting a large reward (blue curves) than after a cue predicting a small reward (red curves).



View larger version (28K):
[in this window]
[in a new window]
 
FIG. 8. Impact of the magnitude of the expected reward on 152 neurons studied in the context of the variable-reward task. A: population curves representing mean population firing rate as a function of time under the 4 task conditions defined by crossing reward size (big = blue; small = red) with direction as defined on the basis of firing rate during an epoch starting with the onset of the directional cue and ending at saccade initiation (preferred = thick; antipreferred = thin). Data to the left (A1) are aligned on the onset of the reward-predicting cue; data to the right (A2) are aligned on saccade initiation. B: difference in population firing rate between big- and small-reward conditions as a function of time during the trial. Positive (blue) values indicate that the firing rate was higher on big-reward trials. C: frequency of cases in which there was a significant main effect of reward magnitude on neuronal firing rate (blue = stronger firing on big-reward trials; red = stronger firing on small-reward trials) during 7 trial epochs (I–VII) indicated at the bottom of the figure. D: difference in population directional signal between big- and small-reward conditions as a function of time during the trial. Positive (blue) values indicate that the directional signal was stronger on big-reward trials. Directional signal was taken as the firing rate on preferred-direction trials minus the firing rate on antipreferred-direction trials. E: frequency of cases in which firing rate depended significantly on the interaction between reward magnitude and response direction (blue = stronger directional signal on big-reward trials; red = stronger directional signal on small-reward trials) during 7 trial epochs (I–VII) indicated at the bottom of the figure.

 
DIFFERENCE HISTOGRAMS. As a graphic representation of the tendency for the population to fire more strongly on big- than on small-reward trials, we constructed a difference histogram representing, as a function of time during the trial, the index (BP + BA – SP – SA)/2, where BP is the firing rate under the big-reward, preferred-direction condition, SA is the firing rate under the small-reward, antipreferred-direction condition, and so on (Fig. 8, B1 and B2). This shows 1) that the strongest reward-related activity occurred during the period immediately after the delay-predicting cue and 2) that firing remained mildly elevated throughout a big-reward trial until just before delivery of reward, when the pattern reversed. As a graphic representation of the tendency for the population directional signal (firing rate on preferred-direction trials minus firing rate on antipreferred-direction trials) to be stronger on big- than on small-reward trials, we constructed a difference histogram representing, as a function of time during the trial, the index (BP – BA – SP + SA)/2 (Fig. 5D1). This shows that there was no tendency for the directional signal to be stronger on big-reward trials.

STATISTICAL ANALYSIS OF DATA FROM INDIVIDUAL NEURONS. To determine whether effects present in the population were also observable at the level of individual neurons, we analyzed data from each neuron during seven trial epochs (I–VII) defined in METHODS and depicted along the time line at the base of Fig. 8E. On data from each epoch, we carried out an ANOVA with firing rate as the dependent variable and with reward size and response direction as factors. Counts of neurons exhibiting significant main effects of reward size on firing rate are shown in Fig. 8C, where blue (or red) symbols represent the percentage of cases in which firing was significantly increased (or decreased) for big compared to small reward. Neurons firing significantly more strongly under the big-reward condition (blue symbols) significantly outnumbered those firing more weakly (red symbols) only in epoch I ({chi}2 test, P < 0.0001). This effect was not significantly different across monkeys. Counts of neurons exhibiting a significant interaction between reward size and direction are shown in Fig. 8E, where blue (or red) symbols represent the percentage of cases in which the directional signal was stronger (or weaker) for big reward. Counts indicated during epoch I must represent type I errors because it was only after this epoch that the directional instruction was delivered. With the counts of epoch I as a basis for comparison, it is clear that interaction effects did not exceed the frequency expected by chance in any epoch.

SUMMARY. A single robust effect emerged from this analysis: the phasic response to the reward cue was stronger when it predicted a big reward than when it predicted a small reward. This effect was commensurate with the one observed previously in OF by use of the same manipulation of reward size (Roesch and Olson 2004Go). However, because the behavioral signs of enhanced task engagement were small in this study (Fig. 3, DF), we must allow for the possibility that the monkeys were not as intensely aware of the cue-reward contingencies as in the previous experiment and that, had they been, neuronal sensitivity to anticipated reward size would have been even greater than observed here.

Late activity correlated with reward size

A very late effect of reward size on the population firing rate was evident in the population histograms of Fig. 8A2. From immediately before delivery of the reward until the end of the trial, neuronal activity was stronger on small-reward trials (red curves) than on big-reward trials (blue curves). This effect lay outside the period of the trial to which planned comparisons were applied. However, because of its possible relation to a similar effect occurring in the variable delay task (see Late activity correlated with elapsed delay length), we decided to analyze it further.

Relation between early and late effects dependent on reward size

To determine whether the effect occurring at the end of the delay period (an enhancement of activity on small-reward trials) was correlated with the effect occurring at the beginning of the trial (an enhancement of activity on big-reward trials), we computed, for each neuron, indices reflecting the dependency of its firing rate on reward size during 1) a predelay epoch extending from delay-cue onset to directional-cue onset and 2) a postreward epoch extending for 500 ms after the initiation of reward delivery. The delay index, (B – S)/(B + S), where B and S are the firing rates on big- and small-reward trials, respectively, was positive in the case of any neuron firing more strongly when the reward was large and negative in the case of any neuron firing more strongly when the reward was small. We also characterized each neuron as exhibiting or not exhibiting a significant main effect of reward size during the epoch in question (ANOVA with firing rate as dependent variable and reward size and direction as independent variables).

The distribution of reward indices for the predelay epoch (Fig. 9A) was shifted significantly above zero (t-test, P < 0.0001). The distribution of reward indices during the postreward epoch (Fig. 9B) was shifted significantly below zero (t-test, P < 0.0001). The same trends were present and significant when consideration was confined to neurons exhibiting a significant dependency on reward size (shaded bars in Fig. 9, A and B). To determine whether the tendency for a neuron to exhibit postreward modulation was correlated with its tendency to exhibit predelay modulation, we plotted the indices obtained during the postreward epoch against the delay indices obtained during the predelay epoch (Fig. 9C). There was no significant correlation. We conclude that the tendency to fire more strongly in anticipation of a big reward was not correlated with the tendency to fire more strongly at the time of delivery of a small reward.



View larger version (23K):
[in this window]
[in a new window]
 
FIG. 9. Distribution of reward indices for all 152 neurons studied in the context of the variable-reward task. A: reward indices computed on the basis of firing during the predelay epoch (onset of the delay-cue to onset of the directional cue). B: reward indices computed on the basis of firing during the postreward epoch (500 ms beginning with onset of reward delivery). Number of observations, the mean of the distribution, and the level of significance at which it differed from zero are given by n, µ, and P, respectively. Dark gray bars represent neurons in which the dependency of firing rate on reward size achieved statistical significance (ANOVA, P < 0.05). C: reward indices from the postreward period plotted against reward indices from the predelay period. Reward index = (B – S)/(B + S), where B and S represent firing rates on big- and small-reward trials, respectively.

 
Relation between reward-related and delay-related activity

EARLY ACTIVITY. To determine whether neurons that fired more strongly (or weakly) in response to the big-reward cue also fired more strongly (or weakly) in response to the short-delay cue, we plotted the delay index computed during the predelay epoch (Fig. 6A) against the reward index computed during the predelay epoch (Fig. 9A) for all 148 neurons studied in the context of both tasks. The results (Fig. 10A) revealed a significant positive correlation (r2 = 0.116, P < 0.0001). We conclude that OF neurons responded similarly to cues predicting a more desirable event either in the form of a short delay or in the form of a large reward.



View larger version (22K):
[in this window]
[in a new window]
 
FIG. 10. Delay-length index plotted against reward-size index for 148 neurons studied in the context of both the variable-delay and the variable-reward task. A: indices based on firing during the predelay period. B: indices based on firing during the postdelay period (in the variable-delay task) and the postreward period (in the variable-reward task). Indices were computed as described in legends to Figs. 6 and 9.

 
To analyze the practical impact of the correlation in terms of firing rate measures, we divided the neurons into two groups: those that fired significantly more strongly during the predelay epoch in anticipation of a short delay (n = 25) and those that did not (n = 123). For each group, we then constructed histograms representing population activity during performance on the variable-reward task (Fig. 11, A and B). Among neurons that fired more strongly under the short-delay condition (Fig. 11A) firing was clearly stronger when a large reward was anticipated (blue curve is above red curve). Among neurons that did not fire more strongly during anticipation of a short delay this effect of reward size was dramatically attenuated (Fig. 11B).



View larger version (32K):
[in this window]
[in a new window]
 
FIG. 11. Population histograms representing activity in the variable-reward task of neurons selected according to their pattern of dependency on delay length in the variable-delay task. Neurons were grouped according to whether they (A) did (n = 25) or (B) did not (n = 123) fire significantly more strongly under the short- than under the long-delay condition in the variable-delay task (ANOVA, P < 0.05) during the predelay epoch extending from onset of the delay cue to onset of the directional cue. Blue, big-reward trials; red, small-reward trials. Data are combined across monkeys and response directions. Activity of the same groups of neurons in the context of the variable-delay task is shown in Fig. 7.

 
LATE ACTIVITY. To determine whether neurons that fired more strongly (or weakly) on delivery of a small reward also fired more strongly (or weakly) on culmination of a long delay, we plotted the delay index computed during the postdelay epoch (Fig. 6B) against the reward index computed during the postreward epoch (Fig. 9B) for all 148 neurons studied in the context of both tasks. The results revealed no significant correlation (Fig. 10B). Although an inversion effect occurred late in the trial in both tasks (with neurons firing more strongly on culmination of a long delay in the variable-delay task and on delivery of a small reward in the variable-reward task), we must conclude that the two effects were not closely related.

Impact of reversing the cue convention

At the end of every 40 successful trials in both the variable-delay task and variable-reward task, the cues previously associated with short-delay or big-reward became associated with long-delay or small-reward and vice versa. Consequently, in each data collection session of 80 trials, there was one block conforming to each cue convention. This manipulation possessed the virtue of allowing us to consider the influence of anticipated delay length and reward size on neuronal activity independently of any selectivity neurons may have possessed for the visual attributes of the stimuli. However, it may have resulted in an attenuation of activity related to the anticipated delay and reward. This would be true if it took monkeys many trials to adjust their expectations after each switch. We addressed these concerns by asking how long it took monkeys and neuronal activity to adjust to the new contingencies after a switch.

To do so, we assessed, as a function of trial number relative to the time of the switch, the effect of reward and delay on behavioral reaction time and neuronal firing rate. It would be of interest to know how each neuron responded to the reversal, although because there was only one reversal per recording session, this was not possible. Instead, to achieve adequate analytic power, we combined reaction time measures and firing rates across all data collection sessions in both monkeys, considering only those trials that the monkey completed successfully. Analysis was performed on blocks of four consecutive correct trials, with blocks demarcated so that the time of the switch fell at a between-block boundary. Data in Fig. 12 are plotted as a function of trial number relative to the point in time at which the cue convention was reversed.



View larger version (20K):
[in this window]
[in a new window]
 
FIG. 12. Impact of the reversal of cue contingencies that occurred during each session between the first and second blocks of 40 successful trials. A: reaction time index in the variable-delay task, computed as the mean reaction time on trials involving the cue initially associated with a short delay minus the mean reaction time on trials involving the cue initially associated with a long delay. It was initially negative because the behavioral responses were swifter on short-delay trials. It became positive after reversal of the cue convention because, again, behavioral responses were swifter on short-delay trials. Shift to positivity, which occurred in 4–8 trials, reflected the monkeys' adjustment to the reversed-cue conventions. Although one might have expected that reaction time should depend only on elapsed delay, independently of what cue had predicted the delay, the fact that it took several trials for behavioral performance to adjust to the reversal of the cue convention indicates the contrary. B: firing rate index in the variable-delay task, computed for all neurons as the mean firing rate during the predelay epoch on trials involving the cue initially associated with a short delay minus the mean firing rate on trials involving the cue initially associated with a long delay. It was initially positive because firing was stronger on short-delay trials. It became negative after reversal of the cue convention because, again, firing was stronger on short-delay trials. C and D: analogous measures for the variable-reward task.

 
The reaction-time index for the variable-delay task was computed as the reaction time on trials involving the cue that initially signaled a short delay minus the reaction time on trials involving the cue that initially signaled a long delay (Fig. 12A). The index was negative before the switch because the monkeys responded faster on short-delay trials and was positive after the switch for the same reason. Inspection of the figure makes clear that the transition from negative to positive values occurred within a few trials after the switch.

The firing-rate index for the variable-delay task was computed as the firing rate on trials involving the cue that initially signaled a short delay minus the firing rate on trials involving the cue that initially signaled a long delay (Fig. 12B). The index was positive before the switch because the neuronal population fired more strongly on short-delay trials and was negative after the switch for the same reason. Inspection of the figure makes clear that the transition from positive to negative values occurred within a few trials after the switch.

The reaction-time and firing-rate indices for the variable-reward task (Fig. 12, C and D) were computed in an exactly analogous way with the following exception. Because monkey F was slower to respond on big-reward trials, we inverted the sign of the reaction-time index in considering data from this monkey, computing it as the reaction time on trials involving the cue that initially signaled a small reward minus the reaction time on trials involving the cue that initially signaled a big reward. Inspection of the figure makes clear that both the transition in behavior and the transition in firing rate occurred within a few trials of the reversal in the cue convention. An intriguing further observation is that adjustment to reversal of the significance of the reward cues occurred slightly earlier (as judged both by behavioral and neural measures) than adjustment to reversal of the significance of the delay cues (Fig. 12, C and D vs. Fig. 12, A and B).

We conclude that reversing the cue convention at the midpoint of the data collection session did not lead to a major attenuation of neuronal activity dependent on delay length or reward size.

Neuronal sensitivity to the visual properties of the cue

Some neurons might conceivably have been selective for the visual properties of the cues. To determine whether this was so, we carried out an ANOVA with the firing rate during the epoch extending from onset of the delay cue (or reward cue) to onset of the directional cue as the dependent variable and with cue identity and delay duration (or reward size) as factors. Among 154 neurons studied in the context of the variable delay task, only ten exhibited a significant (P < 0.05) main effect of cue identity and only six exhibited an interaction effect involving cue identity. These counts were no greater than expected by chance from type I errors ({chi}2 test, P > 0.05). Among 152 neurons studied in the context of the variable reward task, 16 exhibited a significant main effect of cue identity and eight exhibited an interaction effect involving cue identity. The number exhibiting a significant main effect was significantly in excess of the number expected by chance ({chi}2 test, P = 0.0033). We conclude that a few neurons may have been selective for the visual properties of the cues used in the reward task. However, because the number was small and because cue identity was counterbalanced against reward size by reversing the convention at the midpoint of each data collection session, this should not have affected the outcome of the main analysis neuronal activity dependent on reward size.


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Overview

We monitored single-neuron activity in OF of monkeys performing a variant of the ocular delayed response task in which a cue presented early in each trial predicted whether the ensuing delay would be short or long. We found that a cue predicting a short delay commonly elicited a stronger neuronal response. The strength of the response presumably represented the time-discounted value of the anticipated reward because neurons firing more strongly in response to a short-delay cue also tended to fire more strongly in response to a big-reward cue. We observed an additional incidental effect of uncertain significance at the end of the delay period: neurons fired more strongly if the preceding delay had been longer.

Activity of OF neurons in relation to anticipated time-discounted value

The fact that OF neurons respond more strongly to cues predicting a short delay than to those predicting a long delay might, in principle, arise from the involvement of OF in at least three different sets of processes: 1) affective processes embodying the monkey's emotional response to the display and representing the subjective value of the predicted outcome, 2) motivational processes underlying value-dependent modulation of the monkey's degree of engagement with the demands of the task, and 3) predictive processes associated with the monkey's preparing to respond to the imminent imperative cue on short-delay trials but deferring preparation on long-delay trials. Having considered these possibilities at length in connection with delay-related activity in other frontal areas (Roesch and Olson 2005Go), we will confine ourselves here to noting that the first interpretation, based on affective representations of value, is the one most likely to apply to OF. The predictive interpretation, involving timed preparation, can be ruled out on two grounds. First, it cannot explain the correlation demonstrated here between the tendency for a neuron to fire more strongly in response to a cue predicting a short delay and the tendency to fire more strongly in response to a cue predicting a big reward. In the variable-reward paradigm, activity related to timed preparation should be identical on big- and small-reward trials because the behavioral response occurs at the same delay after the cue on both kinds of trial. The motivational interpretation can be ruled out on the basis of the observation that when the monkey's motivational state and the value of the predicted outcome are dissociated (through manipulating motivation with threatened penalties as well as promised rewards) then neuronal responses to the outcome-predicting display are correlated with value rather than motivation (Roesch and Olson 2003Go). Thus we are left with the conclusion, consonant with a large body of literature summarized in the INTRODUCTION, that neuronal activity in OF represents the value of the predicted outcome.

If we grant that delay-related activity in OF was related to the monkey's affective response to the predictive cue and was related to the value conveyed by the cue, it still does not follow with certainty that the value was determined by time discounting. The fact that the anticipated reward had greater value on short-delay trials might be explained in any of three ways: 1) the time-discounted value of the reward was greater, 2) the probability-discounted value was greater, or 3) the effort-discounted value was greater. A probability-based account must be considered because, on trials with longer delays, there was a greater likelihood of the monkey's aborting the trial by breaking fixation (Fig. 3C) or of making a saccade to the wrong target (Fig. 3A), with the result that the probability of earning a reward was lower. The probability of receiving a reward is known to exert an impact on both behavior and neural activity related to the anticipation of reward (Fiorillo et al. 2003Go; Mobini et al. 2002Go; Platt and Glimcher 1999Go). However, it would seem that a probability-based interpretation is ruled out by the fact that monkey F, although exhibiting a neural effect whereby the response to the cue was stronger on short-delay trials, exhibited no significant difference between short- and long-delay trials with respect to either error rate or fixation break rate (t-test, P > 0.05). We must still consider whether the evaluation of the cue could have been effort based. According to this interpretation, a reward anticipated at short delay had greater value because the monkey anticipated spending less effort (measured in terms of the duration of central fixation) to obtain it. The design of our task does not allow distinguishing between effects arising from time discounting and those arising from effort discounting. To test for time discounting without contamination from effort discounting would require imposing a delay between completion of the trial and delivery of the reward rather than, as in our task, between the beginning and end of the trial. Although it is entirely plausible that the enhancement of OF activity that we observed on short-delay trials reflected the monkey's time-based evaluation of the reward, we cannot altogether rule out an effect arising from an effort-based evaluation.

Activity of OF neurons at the end of the delay period

Although characterizing neuronal activity at the end of the delay period was not a central aim of this experiment, we did make a perplexing incidental observation. Delay-dependent neuronal activity in OF underwent an inversion of sign after initiation of the saccade, with firing becoming stronger after a long than after a short delay (Fig. 5, A2 and B2). We can only speculate about the possible functional significance of this observation.

On one hand, it might be explained in terms of the idea that neuronal activity in OF is stronger under circumstances associated with more positive affect. Termination of a long delay could have been perceived as a positive event for two reasons: first, escape from the long delay must have been rewarding insofar as the delay itself was aversive; second, because of the nature of the algorithm governing the sequencing of trials (as described in METHODS), completion of a trial involving a long delay indicated with a probability of 0.625 that the next trial would involve a short delay. This line of reasoning unfortunately does not provide a clear account of why the inversion should have occurred exactly at the moment of saccade initiation. Nor does it agree particularly well with an observation made in the context of the variable-reward task. In that task, an inversion of activity dependent on the size of the reward occurred at the time of reward delivery, with firing becoming stronger on delivery of a small reward (Fig. 8, A2 and B2). Although delivery of a small reward indicated with a probability of 0.625 that a big reward would be delivered on the next trial, the small reward certainly was not, in itself a more positive event than a big reward.

The late inversion effect might, on the other hand, have been related to the monkeys' state of arousal, attention, or behavioral preparation, which was clearly affected by the duration of the antecedent delay. In this study, as in a previous one (Roesch and Olson 2005Go), speed and accuracy were enhanced if the preceding delay had been short. However, interpretation along these lines is hindered by the fact that, unlike in the previous study, signs of behavioral enhancement were weak and mixed on big-reward as compared with small-reward trials. Thus we remain unable, given data currently in hand, to provide a conclusive unitary account of the late inversion of delay-related and reward-related activity.

Comparison of delay-related activity in OF and other frontal areas

We recently characterized neuronal activity accompanying performance of the variable-delay task in several frontal areas other than OF, including dorsolateral prefrontal cortex (PFC), the frontal and supplementary eye fields (FEF and SEF), and the premotor and supplementary motor areas (PM and SMA) (Roesch and Olson 2005Go). On comparison of the results, it is clear that there are significant differences between delay-related activity in OF and in these areas. In OF, delay-related activity took the form of a marked enhancement of the strength of the phasic response to the delay-predicting cue when this cue signaled a short delay. This result stands in contrast to results obtained in other areas. In PFC, FEF, and SEF, there was little or no enhancement of firing during the period immediately after the delay-predicting cue. In PM and SMA, firing was enhanced not only immediately after the cue but throughout the delay period leading up to the response. These results fit well within a simple explanatory framework. The early phasic response in OF occurred during the period when the monkey was evaluating the significance of the delay-predicting cue; this activity could well be related to the evaluation process. The prolonged activity in PM and SMA spanned the time during which the monkey was in a state of heightened readiness on short-delay trials; this activity could well reflect motivational modulation of neural processes underlying engagement with the demands of the task including response preparation. It might be the case that early phasic delay-related activity in OF (representing time-discounted value) drove prolonged delay-related activity in PM and SMA (reflecting value-dependent motivational modulation). Because the areas are not directly connected, this would necessitate their communicating through intermediaries. Our finding that delay-related signals in PFC, FEF, and SEF are weak suggests that these areas, although interposed topologically and connectionally between OF and premotor cortex, are not likely to relay delay-related signals between them and thus leaves unresolved the identity of the intermediary structures.

The view that neuronal activity in OF represents the time-discounted value of the anticipated reward, whereas neuronal activity in PM and SMA reflects value-based motivational modulation of the monkey's preparatory state, is consonant with much that is currently known about these areas. OF is not directly involved in oculomotor and skeletomotor control. Lesions and inactivation of OF do not result in impairments of motor control but do interfere with the evaluation of rewards (Baylis and Gaffan 1991Go; Butter and Snyder 1972Go; Butter et al. 1969Go, 1970Go; Dias et al. 1996Go; Gaffan and Murray 1990Go; Iversen and Mishkin 1970Go; Izquierdo and Murray 2004Go; Jones and Mishkin 1972Go; Meunier et al. 1997Go) Furthermore, microelectrode recording studies have demonstrated the presence of neurons influenced by the value of an experienced or expected reward in OF but have demonstrated only weak activity related to the locations of stimuli or the directions of responses (Critchley and Rolls 1996Go; Hikosaka and Watanabe 2000Go; Rolls and Baylis 1994Go; Rolls et al. 1996Go; Thorpe et al. 1983