|
|
||||||||
Picower Center for Learning and Memory, RIKENMIT Neuroscience Research Center and Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139
Submitted 30 January 2003; accepted in final form 4 May 2003
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Details about relative functional specializations or overlaps between the PFC and PMC remain largely unknown. Although neurophysiological studies have reported a greater incidence of sensory-related activity in the PFC and motor-related activity in the PMC (consistent with the PMC being functionally closer to motor output) there appears to be a great deal of overlap in these and other neuronal properties (Boussaoud and Wise 1993a
,b
; di Pellegrino and Wise 1991
, 1993
; Hernandez et al. 2002
; Kalaska and Crammond 1995
; Niki 1974b
; Sakagami and Niki 1994a
,b
; Watanabe 1986a
,b
). For example, during conditional visuomotor tasks, neurons in both areas encode learned conditional visuomotor associations and show an evolution of activity that mirrors the acquisition of the associative rule (Asaad et al. 1998
; Chen and Wise 1995a
,b
, 1996
; Mitz et al. 1991
). In general, however, little is known about similarities or differences between PFC and PMC neural mechanisms; they have rarely been compared under identical behavioral conditions.
Here, we compared and contrasted neural activity in the PFC and PMC by using a task that we had previously used to study PFC activity (Wallis et al. 2001a
). It required monkeys to quickly switch between two rules, a type of behavior impaired after PFC damage in humans (Milner 1963
; Nelson 1976
; Owen et al. 1991
; Stuss et al. 2000
) and monkeys (Dias et al. 1996
; Wallis et al. 2001b
). In our task, an abstract rule (one not linked to a specific sensory input or motor output) intermediated between a stimulus and a response (Wallis et al. 2001a
). The same rule required monkeys to release a lever if two successively presented (sample and test) pictures were identical, whereas the different rule required the lever release if the two pictures were different (Fig. 1). The experimental design thus permitted a full dissociation between the rules and the motor responses because, across trials, each motor response (hold and release) had to be made for each rule (same and different). This allowed us to disentangle their effects on neuronal activity and explore their representations in the PFC and PMC. If the hierarchy described above is correct, then we would expect the PFC to be primarily involved in representing more high-level, abstract, rule-related information and PMC neurons to be more involved in generating motor commands. This would predict that rules would be more strongly encoded, and would tend to appear earlier, in the PFC than in the PMC. Here, we sought to test this hypothesis.
|
| METHODS |
|---|
|
|
|---|
The subjects were two adult rhesus monkeys (Macaca mulatta), one male and one female, weighing 5.06.0 kg. The monkeys were fitted with a head bolt for immobilization and a recording chamber was attached to the monkey's skull. We recorded from the left PFC of monkey A and both the right and left PFC of monkey B. The positions of the recording chambers were determined using a 1.5-T magnetic resonance imaging (MRI) scanner. Monkeys were aligned in the scanner using a stereotactic frame and a positioning light beam. To record from a wide expanse of PFC, the chambers were repositioned several times. For the recordings from PMC we recorded bilaterally in both animals. The chambers were positioned to record from the dorsal PMC (primarily area F2), similar to previous studies that have compared neuronal properties in PFC and PMC (di Pellegrino and Wise 1991
, 1993
). The chambers were positioned with respect to the arcuate sulcus (Figs. 2 and 3), which is considered the defining boundary between PFC and PMC. Cortex anterior to this sulcus possesses a well-developed cell layer IV (Brodmann 1909
; Petrides and Pandya 1994
; Walker 1940
) and receives projections from the mediodorsal nucleus of the thalamus (Giguere and Goldman-Rakic 1988
; Siwek and Pandya 1991
), unlike the cortex posterior to this sulcus. There was a 5-mm separation in the anteriorposterior axis between the most posterior PFC recording and most anterior PMC recordings to ensure no accidental overlap. Indeed, the bulk of the recordings were separated by almost 10 mm.
|
|
Recordings were made using arrays of 8 tungsten dura-puncturing microelectrodes (FHC Instruments, Bowdoin, ME) and a grid (Crist Instruments, Damascus, MD) with 1-mm spacing. The approximate distance to lower the electrodes was determined from the MRI images and the electrodes were advanced using custom-built, manual microdrives until they were located just above the cell layer. The electrodes were then slowly lowered into the cell layer until a neuronal waveform was obtained. We then waited 3 h for the brain to settle and thus ensure stability during the recording session, which typically lasted 2 h. Neurons were randomly sampled; no attempt was made to select neurons on the basis of responsiveness. To ensure that our recordings were of consistent quality, after each session we measured the impedance of the electrodes. Typically values were >1 megaohm (M
). If values dropped below 0.5 M
we would scrape the surface of the dura, or use dura-puncturing guide tubes constructed from 23-gauge hypodermic needles (Samuel Perkins, Quincy, MA) to enable electrodes to enter the brain with less loss of impedance.
Waveforms were digitized and analyzed off-line (Plexon Instruments, Dallas, TX). Approximately 20% of the channels were discarded, either because neuronal waveforms could not be clearly separated or because the waveforms did not remain stable throughout the entire session. Separation of neuronal waveforms was ensured by rejecting channels where more than 0.1% of the waveforms were separated by intervals of <2 ms. The modal number of electrodes (59%) had one discriminable waveform, 32% had two, and 8% had three. On average, slightly more neurons per electrode were obtained in the PMC (mean 1.6) compared with the PFC (mean 1.4, t-test, d.f. = 476, P < 0.005). PFC and PMC recordings were conducted sequentially.
Recording locations were reconstructed by measuring the position of the recording chambers using stereotaxic methods. These were then plotted onto the MRI sections (see above and Fig. 3) using commercial graphics software (CorelDraw, Ottawa, Canada). We confirmed the correspondence between the MRI sections and our recording chambers by mapping the position of the arcuate sulcus using neurophysiological recordings (i.e., a lack of signal indicating that the electrode was positioned in the sulcus). For the PFC recordings, the distance of each recording location along the cortical surface from the principal sulcus was then traced and measured. The positions of the other sulci, relative to the principal sulcus, were also measured in this way, allowing the construction of the unfolded cortical maps shown in Fig. 2.
All procedures were in accord with the National Institute of Health guidelines and the recommendations of the MIT Animal Care and Use Committee.
Behavioral task
Trials began when the monkey grasped a lever and fixated a central fixation point (Fig. 1). Eye position was monitored throughout the session using an infrared monitoring system (ISCAN, Burlington, MA). Monkeys were required to keep their gaze within 1.5 deg of a central fixation point. If the monkey's gaze deviated outside of this window the trial was immediately terminated; breaks of fixation were not included in the overall error rates. Once the monkey had maintained central fixation for 800 ms, a sample picture was presented at the center of gaze for 800 ms, followed by a 1,500-ms delay, and then by a test picture (500 ms). For the same rule, the monkey released the lever if the test picture matched the sample. For the different rule they released the lever if the sample did not match; otherwise, they held the lever through a second delay (300 ms) until the appearance of another picture (500 ms) that always required a response. Thus only the test picture required a decision as to whether to hold or release the lever. The second delay and picture were used so that a behavioral response was required on each trial, ensuring that the monkeys were engaged in the task. The rule was signified with a brief (100-ms) cue coincident with sample onset (for monkey A, a drop of juice or a low tone indicated same, whereas no juice or a high tone indicated different; for monkey B, juice or a blue background indicated same, whereas no juice or a green background indicated different). A set of four pictures was used for each daily recording session. They were chosen at random from the internet, and reduced to 1.8° in size. The use of four pictures meant that the identity of the nonmatching test object could not be predicted and thus the monkeys needed to remember both the current sample picture and rule. Trials were randomized and balanced across all relevant features (cues, samples, rules, and responses). The monkeys completed about 700 correct trials per day at a consistent level of performance. They were very experienced with this task, having learned it over the course of several months and performed tens of thousands of trials after its acquisition.
Data analysis
The PFC neuron population is the same as from our previous report on the PFC alone (Wallis et al. 2001a
). Here, we report on additional data collected from the PMC to compare and contrast neuronal activity between the areas. Only data from correct trials were used; there were insufficient incorrect trials to permit their analysis. Spike density histograms were constructed by averaging activity across the appropriate conditions, and then into 50-ms bins. There is nothing special about the order in which these two steps are performed because averaging into 50-ms bins and then across conditions is mathematically identical.
The time course of neuronal selectivity was examined by performing a sliding receiver operating characteristic (ROC) analysis. An ROC analysis measures the degree of overlap between two response distributions. For each selective neuron the preferred and unpreferred conditions were compared, giving two distributions, P and U respectively, of neuronal activity. For example, for a rule-selective neuron these distributions might be the neuron's firing rates when the same rule was in effect in comparison to the different rule. An ROC curve was then generated by taking each observed firing rate of the neuron and plotting the proportion of P that exceeded the value of that observation against the proportion of U that exceeded the value of that observation. The area under the ROC curve was then calculated. A value of 0.5 would indicate that the two distributions completely overlap (because for each value of the neuron's firing rate the proportion of P and U exceeding that value is equal), and thus the neuron is not selective. A value of 1.0, on the other hand, would indicate that the two distributions are completely separate (i.e., every value drawn from U is exceeded by the entirety of P, whereas none of the values of P is exceeded by any of the values in U) and so the neuron is very selective. This method of analysis has the advantage that it is independent of the neuron's firing rate, and so can be used to compare neurons with different baseline firing rates and dynamic ranges. It is also nonparametric and so does not require the distributions to be Gaussian. Furthermore, the ROC value can be thought of as the probability that an independent observer could identify the condition that had been presented using the neuron's firing rate.
Starting from the baseline period of the trial, the ROC value was calculated for a 200-ms epoch. This was repeated in 10-ms increments until the entire trial had been analyzed. We used a relatively large time window (200 ms) that was incremented by small time steps (10 ms). This allowed good temporal resolution, but it caused neighboring time bins to be highly correlated with one another. However, our critical comparisons were always between different conditions (i.e., when activity related to the conditions began to diverge), rather than adjacent time bins, and so these correlations did not affect the analysis. From this analysis we were able to determine the latency at which selectivity appeared. It should be noted that smoothing the data using a sliding window means that the latency values should not be interpreted as exact values; we were interested purely in terms of comparing the relative latency of effects between the PFC and PMC. The criterion for selectivity was defined as the point at which the sliding ROC curve exceeded 0.6, for three consecutive 10-ms bins. This arbitrary criterion was chosen as one that, overall, yielded the best correspondence with latencies determined by examining spike histograms (e.g., compare Figs. 5 and 6). However, a wide range of criteria yielded essentially the same results (see following text). Latency values were computed for each neuron individually. Latency values across the population were typically not normally distributed, and so the median values were computed and comparisons between the populations used nonparametric statistics. A baseline period was defined as the 500 ms preceding the presentation of the sample picture (the last 500 ms of the fixation period) for neurons that showed selectivity during the sample and delay epoch, and the 500 ms preceding the presentation of the test picture (the last 500 ms of the delay epoch) for neurons that showed selectivity during the test epoch.
|
|
The differences we observed between the PFC and PMC were robust with respect to the precise criterion that we used to define the latency. We used a wide range of criteria: different ROC values (ranging from 0.55 to 1), a value of 3SDs greater than each neuron's ROC values during the baseline period, and a value that exceeded 99% of the neuron's baseline ROC values. Although these different criteria yielded different absolute latency values, the latency differences between the PFC and PMC that we report here were similar in all cases. We also ensured that the analysis was robust with respect to the precise time window that we used. Thus we examined all time windows from 75 to 500 ms in 25-ms increments (with windows smaller than 75 ms, we could not accumulate enough spikes to compute ROC values). All windows yielded the same pattern of significant differences between the PFC and the PMC.
We performed a power analysis to determine whether sufficient neurons had reached criterion to meaningfully compare the latency of selectivity between the two areas, particularly when concluding that there were no significant differences between populations. We estimated our minimum effect size as medium, corresponding to a "distance measure" of 0.5 (Cohen 1988
). This is a measure of the difference between the mean of the sampling distribution if the null hypothesis was true and the mean of the sampling distribution if the null hypothesis was false, expressed in terms of the SD of the parent population. For comparison, the size of this "distance measure" for the difference in the latency of the rule selectivity between the PFC and PMC was 0.64. A power of 0.95 was desired, giving a required minimum sample size of 70 neurons. In comparison, for all statistical tests used in this study we evaluated Type I errors at 0.01, making a Type II error five times as likely as a Type I error, a reflection of their estimated relative importance.
To further quantify the strength of encoding of the different aspects of the task, the trial was divided into three epochs, the 800 ms of the presentation of the sample picture, the 1,500 ms of the delay, and the 500 ms of the first test picture presentation. For each trial the total number of spikes that the neuron fired in each epoch was determined, and an ROC measure for the entire epoch was calculated. Because these values were not normally distributed, to compare the ROC values between the two areas we used a Wilcoxon's rank-sum test assessed at P < 0.01.
The ROC values were calculated for the entire population of neurons (which were not prescreened for selectivity or responsiveness; see METHODS). As a result, the ROC values were often close to 0.5. To ensure that the values were significantly different from chance we performed a bootstrap analysis. For each neuron, the trials were randomly assigned to the different behavioral conditions, and the ROC value calculated. This process was repeated 1,000 times for each neuron, and the mean ROC value was determined. A Wilcoxon's rank-sum test then compared these mean ROC values derived from the shuffled data to the actual ROC values.
We also compared the incidence of neurons encoding the different aspects of the task using an ANOVA. We defined rule-selective neurons as those that showed a significant difference in their firing rates between the two different rules, regardless of either the cue that was used to instruct the monkey, or the picture that was used as the sample stimulus. To consider the effect of all 3 factors on the neuron's firing rate it was necessary to use a 3-way ANOVA to analyze activity during the sample and delay epochs. The mean firing rate of the neuron in each epoch was compared, using the factors of the modality in which the cue was presented (monkey A: taste/auditory cue; monkey B: taste/visual cue), the rule that the cue signified (same or different), and which of the 4 pictures was presented as the sample stimulus. Thus a rule-selective neuron showed a main effect of rule, and no interaction with the other 2 factors. We also used this analysis to define picture-selective neurons (those that had a main effect of picture, and no interaction with the other 2 factors). Significance was assessed at P < 0.01. Differences in the prevalence of neurons between 2 areas (either PFC vs. PMC, or anterior PMC vs. posterior PMC) were assessed using a chi-square test at an alpha level of P < 0.01, whereas differences among 3 areas (a comparison between the dorsolateral PFC, ventrolateral PFC, and orbitofrontal cortex) were assessed using multiple chi-square tests with a Bonferroni-corrected alpha level of 0.0033.
During the test epoch 3 different processes were examined. First, neurons that encoded the test picture were identified using a KruskalWallis one-way ANOVA on the firing rate of the neurons when the 4 different test pictures were presented. Second, neurons that encoded whether the test picture matched the sample picture were identified by performing a Wilcoxon's rank-sum test on the neuron's firing rate under both conditions. Finally, neurons that encoded the behavioral response were determined by performing a Wilcoxon's rank-sum test on the neuron's firing rate when the monkey had either to hold or to release the lever. In all cases significance was assessed at P < 0.01, and differences in the proportions of the neurons were determined using chi-square tests.
| RESULTS |
|---|
|
|
|---|
Both monkeys were proficient at the task (monkey A 85% correct overall, and monkey B 93%; Table 1). Performance was significantly better during those sessions in which neurons were recorded from the PMC compared with the PFC (PFC: 93%, n = 55, PMC: 87%, n = 17; Wilcoxon's rank-sum test = 911, P < 0.0005). Performance on same and different trials was identical (same: 88%, different: 89%; Wilcoxon's matched-pairs signed-ranks test = 977, n = 72, P > 0.1). The monkeys had a bias toward holding rather than releasing the lever (hold: 90%, release: 87%; Wilcoxon's matched-pairs signed-ranks test = 277, n = 72, P < 5 x 108), and performed better for the juice/no juice cues than for the visual or auditory cues (juice/no juice: 93%, visual/auditory: 85%; Wilcoxon's matched-pairs signed-ranks test = 1, n = 72, P < 1 x 1012).
|
The most consistent difference in reaction times was between the hold and release trials (Table 2). The monkeys released an average of 20 ms later to the first test picture (on release trials) compared with the second picture (on hold trials). This presumably reflects the fact that the first test picture requires a decision as to the correct behavioral response, unlike the second picture, which always requires a release. The reaction times on release trials, across the same and different rules, were virtually identical (median 374 and 373 ms, respectively), and there was also no difference in reaction times on release trials between the juice/no juice cues and the visual or auditory cues (median 373 and 374 ms, respectively). Monkey A tended to be faster on release trials during the sessions in which the PFC data were recorded (median 372 ms) compared with the PMC data (median 389 ms), whereas the opposite pattern was observed in monkey B (PFC: median 389 ms; PMC: median 357 ms).
|
For recording sessions, we used 4 novel pictures. This raised the possibility that the monkeys solved the task, not by using abstract, general rules, but by rapidly learning stimulusresponse or stimulusstimulus associations. For example, the monkey could quickly learn an association between a picture of a man on a blue background, and the requirement to release to that same picture when it appears as a test picture, rather than learning that a blue background means "choose same" for all pictures. To test this possibility, we conducted separate behavioral tests after the recording sessions in the PMC, in which new pictures were used on each and every trial. Each monkey was run on a single probe session consisting of 364 trials. During these sessions, the monkeys' performance was virtually identical to that during the recording sessions (monkey A: 89% correct during the probe session vs. 85% correct during the recording sessions; monkey B: 96 versus 93%), suggesting that the monkeys had indeed learned general, abstract rules rather than specific pictureresponse associations.
We also examined performance on trials early in the recording session, those in which monkeys saw a given picture for the very first time (i.e., before trial-and-error learning could occur). The monkeys performed significantly above chance (50%) on these trials (67% correct, or 194 correct responses out of a total of 288 pictures [4 pictures x 72 sessions], P < 1 x 107, binomial test). Examining performance across the session, in 100 trial blocks, revealed that the monkeys did not perform as accurately over the first 100 trials as the remainder of the session (F = 8.2, d.f. = 6, 486, P < 5 x 108, one-way ANOVA and post hoc analysis using Bonferroni-corrected t-test; see Fig. 4). This might suggest that they learned additional information over the course of the session (e.g., perhaps they became familiar with the stimuli and were better able to discriminate them). However, across a wide range of cognitive tasks we have found that monkeys typically take a number of trials to perform accurately on any given day. This, coupled with the fact that the monkeys could perform the task just as accurately with trial-unique stimuli, suggests that the most parsimonious explanation is that the monkeys were not using stimulusresponse or stimulusstimulus associations to solve the task.
|
Neuronal properties
A total of 750 neurons were studied. Some (31) of the neurons were excluded from the analysis because they lay on the boundary of the PMC and PFC (within the arcuate sulcus), and thus could not be accurately ascribed to either area. This left 461 neurons from the PFC (161 from monkey A and 300 from monkey B) and 258 neurons from the PMC (114 from monkey A and 144 from monkey B; see Figs. 2 and 3). Our previous report focused on the activity of the PFC neurons during the sample and the delay epochs (Wallis et al. 2001a
). Here, we compare that activity to that of PMC neurons. We also add a detailed examination of activity in the PFC and PMC during the test epoch, which is when the match/nonmatch judgment is made and the behavioral response selected.
To obtain sufficient statistical power, the neurons were pooled across monkeys. For the PFC and PMC comparisons we made certain that there were no qualitative differences in the data obtained from each monkey individually by ensuring that all main effects were in the same direction in both monkeys and approached significance (assessed as P < 0.1). For the comparisons between different PFC regions, or different PMC regions, the smaller sample of neurons precluded such an assessment (given that in each monkey alone the effects may have been nonsignificant, but were significant when pooled) and so we ensured that the effects were at least in the same direction for both monkeys.
Rule selectivity
Figure 5 shows an example of a "rule-selective" PMC neuron: it showed differential activity depending on which rule was currently behaviorally relevant. It had a higher firing rate throughout the trial when the different rule was in effect than when the same rule was in effect, and this activity was unaffected by which particular cue signaled the rule and which sample picture the monkey was holding in memory.
To examine the time course of the rule effect, we performed a sliding ROC analysis using a 200-ms time window that was incremented across the trial in 10-ms steps (see METHODS). The result of this analysis for the single rule-selective neuron shown in Fig. 5 is illustrated in Fig. 6. It further demonstrates that this neuron began showing an effect of rule shortly after onset of the rule-signifying cue and then maintained this information throughout the trial until it waned at the end of the test epoch.
We applied this analysis to each and every recorded neuron. The results are shown in Fig. 7A. Plotted are every neuron's ROC values for every time step. ROC values are derived from the absolute difference in firing rate between same rule and different rule trials, so every value ranges from 0.5 (no rule information) to nearly 1.0 (perfect discrimination between rules). We sorted the values according to the mean ROC value across the sample, delay, and test epochs. These plots suggest that the effect of rule was stronger, more sustained, and appeared sooner in the PMC than in the PFC. To quantify the strength of the rule effect we also performed an ROC analysis using the mean firing rates of the neurons across the sample epoch and across the delay epoch. This confirmed that rule selectivity was significantly stronger in the PMC than the PFC (Table 3).
|
|
We next calculated the latency for the rule effect to appear in neuronal activity after the onset of the rule-signifying cue. To determine latency, we used the sliding ROC analysis, and determined the point at which the ROC values exceeded a threshold of 0.6 for 3 consecutive time bins. The latency was then defined as the center of the first time bin. For the single neuron shown in Fig. 6, the circle at 140 ms indicates the point at which it reached this criterion. During the sample epoch, 114 PFC and 98 PMC neurons reached this criterion. The distributions of the latencies are shown in Fig. 8. It shows that, across this neuron population, rule selectivity appeared significantly earlier in the PMC (median = 280 ms) than in the PFC (median = 415 ms; Wilcoxon's rank-sum test = 8175, P < 5 x 107). This difference was not dependent on the threshold we used for defining the latency. We examined a range of thresholds, from 0.55 to 1.0. For all thresholds
0.70, the difference in latencies was significant (at P < 0.01), with rule selectivity in the PMC always preceding the PFC (above 0.7, too few neurons reached the criterion to permit a meaningful statistical comparison of the two areas). Despite the fact that the monkeys' performance during the first 100 trials of the recording session was significantly worse than the remainder of the session (Fig. 4), the strength of rule-selectivity remained constant across the session. We calculated the ROC value in each epoch for 100 trial blocks and found no difference in the ROC values across these blocks in any epoch, or either area (P > 0.1 for all comparisons).
|
Next, we identified neurons that showed significant effects of task variables by applying a 3-way ANOVA to each neuron's mean firing rate across the sample epoch and across the delay epoch independently (see METHODS). The factors were the current rule, sample picture, and modality of the instructional cue (evaluated at P < 0.01). Rule-selective neurons were defined as those that showed a main effect of the rule factor, excluding those neurons that showed a significant interaction with one of the other two factors (see METHODS and below). The results are summarized in Table 4, and show that rule encoding was somewhat more prevalent in the PMC relative to the PFC, although the difference was significant for monkey A only when the data from both monkeys were not pooled.
|
The rule effect could result from higher activity on same rule trials or higher activity on different rule trials; we found both types of effects. In the PFC, there was a similar number of neurons that had higher activity to the different rule as the same rule for both the average sample epoch activity (53/99 or 54% of rule-selective neurons preferred the different rule, binomial test, P > 0.1) and average delay epoch activity (45/107 or 42% preferred the different rule, binomial test, P > 0.05). This was also the case for the PMC for the sample epoch (36/67 or 54% of the rule-selective neurons preferred the different rule, binomial test, P > 0.1), but for average delay epoch activity, there was a significant bias toward PMC neurons showing higher activity to the different rule (63/92 or 68%, binomial test, P < 0.0005).
Finally, some neurons were most (or least) active after a specific cue, leading to a significant Cue x Rule interaction in the 3-way ANOVA (see Table 4). This activity could reflect the physical properties of the cue, although, in principle, it could also carry some rule informationfor example, by encoding rule information but only when signaled from a single modality.
Picture selectivity
The requirement to identify and remember the sample picture was also reflected in neuronal activity. An example of a picture-selective PFC neuron is shown in Fig. 9. This neuron did not discriminate between the different rules or cues, but had a higher phasic burst of firing shortly after sample onset for one of the 4 pictures ("sample picture 2"). We used the sliding ROC analysis to examine the time course of picture selectivity for each and every recorded neuron. Overall, picture selectivity was weaker than rule selectivity (Fig. 7B).
|
We calculated the strength of the picture selectivity by computing an ROC value using the mean firing rate of each neuron across the sample and delay epochs. We compared activity to each neuron's "preferred" picture (the picture that elicited the highest mean firing rate in the neuron) and "nonpreferred" picture (the picture that elicited the lowest mean firing rate). There was a weak tendency for picture selectivity to be stronger in the PFC than in the PMC during the sample epoch (Table 3). We also compared the incidence of neurons whose mean sample and/or delay epoch activity was picture selective according to the 3-way ANOVA (described above, assessed at P < 0.01). This revealed a weak tendency toward more PFC neurons showing significant effects, but only in the sample epoch, and the difference was significant in monkey A only when the data from both monkeys were not pooled (Table 4).
We calculated the latency for picture selectivity to appear in neuronal activity after the sample onset using the sliding ROC values and the same criterion as we did for rule selectivity (3 consecutive 10-ms time steps where the ROC value exceeded 0.6; see above and METHODS). During the sample epoch, 76 PFC neurons and 11 PMC neurons reached this criterion. The low number of PMC neurons that reached criterion precluded a meaningful statistical comparison of the latencies, although the median latency for picture selectivity to appear in the PFC (median 205 ms) was earlier than in the PMC (median 390 ms).
We also examined picture selectivity for the test stimulus that appeared at the end of the trial. The sliding ROC analysis for all recorded neurons (see above and METHODS) is pictured in Fig. 10A. It suggests stronger test picture selectivity in the PFC than in the PMC, just as we found for the sample picture. This was confirmed by performing an ROC analysis comparing the neuron's mean activity to its "preferred" and "nonpreferred" test pictures (see Table 3). We also identified neurons selective for the test picture through a KruskalWallis one-way ANOVA on the average test epoch activity using the 4 different test pictures as a factor (assessed at P < 0.01). This revealed that more neurons in the PFC (106/461 or 23%) than in the PMC (10/258 or 4%) were test picture selective (chi-square = 43.3, d.f. = 1, P < 5 x 1012).
|
Selecting the behavioral response: activity during the test epoch
When the test picture was presented, monkeys needed to determine whether it matched the sample picture and then choose the appropriate behavioral response (hold or release) depending on the match/nonmatch status of the test picture and the current rule. So, we looked for their neuronal correlates during the test epoch.
Some neurons showed an effect of the match/nonmatch status of the test picture. An example of a single PFC neuron is shown in Fig. 11A. It showed higher activity to matches than to nonmatches, largely irrespective of which behavioral response the match or nonmatch required. Figure 10B shows the sliding ROC analysis of the match/nonmatch effect (see above and METHODS) for all recorded neurons. Although the match/nonmatch effect appeared to be somewhat stronger in some of the PFC neurons than in the PMC neurons, an ROC analysis on the average activity across the test epoch did not reveal a significant difference between the strength of this effect between the entire PFC and PMC neuronal populations (Table 3). The latency for the match/nonmatch effects (defined as the point at which the sliding ROC analysis exceeded 0.6 for 3 consecutive time bins) also did not differ between the 2 areas (PFC median latency = 290 ms, n = 19, PMC median latency = 325 ms, n = 8, Wilcoxon's rank-sum test = 137, P > 0.1), although the test was underpowered because of the small number of neurons reaching the criterion.
|
We also identified neurons with match/nonmatch effects using a Wilcoxon's rank-sum test (assessed at P < 0.01) that compared average test epoch activity on match versus nonmatch trials. This revealed that a similar proportion of neurons in the PFC (63/461 or 14%) and the PMC (27/258 or 10%; chi-square = 1.3, d.f. = 1, P > 0.1) showed a significant effect. In the PFC, the majority of neurons with a match/nonmatch effect showed higher activity to nonmatches than to matches (45/63, or 71%, binomial test, P < 0.0005). There was no such bias in the PMC: 13/27 or 48% of the neurons with a match/nonmatch effect had a higher firing rate to nonmatching test pictures, a proportion that did not significantly differ from 50% (binomial test, P > 0.1).
Some of the neurons that showed a significant match/nonmatch effect also encoded the current rule during the delay epoch that preceded the test picture (according to the 3-way ANOVA described above). An example of a PFC neuron with this property is shown in Fig. 11B. It reflected the rule during the delay epoch (it had higher activity when the different rule was in effect) but shortly after the onset of the test picture, its selectivity changed to reflect the match/nonmatch status of the test picture (it showed higher activity when the test picture was a nonmatch) and no longer reflected the current rule. The proportion of neurons encoding the match/nonmatch status of the test picture that also encoded the rule was not significantly different between the areas (PFC: 14/63 or 22%; PMC: 11/27, or 41%; chi-square = 2.4, d.f. = 1, P > 0.1).
Next, we examined neurons whose activity reflected the behavioral response. An example of a PMC neuron showing this effect is illustrated in Fig. 12A. This neuron had a higher test epoch firing rate on trials when the monkey released the lever as opposed to those trials when the monkey held the lever, irrespective of which combination of rule and test picture match/nonmatch status cued that response. The sliding ROC analysis (Fig. 10C; see METHODS and above) on all recorded neurons revealed that the behavioral response was apparent in both PFC and PMC neuronal activity. However, it seemed to appear sooner and was stronger in the PMC. An ROC analysis comparing average test epoch activity for each neuron on trials when the monkey held versus released the lever revealed that the effect of the behavioral response was indeed stronger in the PMC (Table 3).
|
In the sliding ROC analysis pictured in Fig. 10C, behavioralresponse selectivity appeared to arise earlier in the PMC than in the PFC. To confirm this, we computed the latency for selectivity for each neuron (defined as the point at which the sliding ROC analysis exceeded 0.6 for 3 consecutive time bins, as above; also see METHODS). During the test epoch, 128 PFC and 134 PMC neurons reached this criterion. Selectivity for the behavioral response appeared significantly earlier in the PMC population (median latency = 270 ms) than the PFC population (median = 330 ms, Wilcoxon's rank-sum test = 20,255, P < 5 x 108; see Fig. 13). This difference in latencies was not dependent on the threshold that we used. All thresholds from 0.55 to 0.80 produced a significant difference between the two areas in the distribution of latencies, and in all cases the PMC encoded the behavioral response before the PFC.
|
We also identified neurons that showed an effect of behavioral response using Wilcoxon's rank-sum test (assessed at P < 0.01) on average test epoch activity. Such behavioral response-selective neurons were significantly more prevalent in the PMC (170/258 or 66%) than in the PFC (159/461 or 35%; chi-square = 64.5, d.f. = 1, P < 1 x 1016). In the PFC, there was an even split between the selective neurons that had a higher firing rate on release trials (87/159 or 55%) and those that had a higher firing rate on hold trials (72/159 or 45%, binomial test, P > 0.1). In the PMC, however, more behavioral response-selective neurons had higher activity on release trials (102/170 or 60%, binomial test, P < 0.01) than on hold trials. Many of these neurons also reflected the rule during the preceding delay epoch. For example, the neuron in Fig. 12B has a higher firing rate on same rule trials as opposed to different rule trials during the delay. However, about halfway through the test epoch, its activity switched; it then reflected the behavioral response (with more activity on release trials) and no longer reflected the rule. Similar numbers of behavioral response-selective PFC (46/159 or 29%) and PMC (62/170 or 36%) neurons also encoded the rule during the delay epoch (chi-square = 1.8, d.f. = 1, P > 0.1).
Both monkeys held and released the lever with their left hand. However, there was no difference in the incidence of neurons encoding the behavioral response between the two cerebral hemispheres. In the PFC, 120/326 (37%) of the neurons recorded in the right hemisphere reflected the behavioral response compared with 39/135 (29%) of the neurons recorded in the left hemisphere (chi-square = 0.28, d.f. = 1, P > 0.1). In the PMC, 123/180 (68%) of neurons in the right hemisphere reflected the response compared with 47/78 (60%) in the left hemisphere (chi-square = 0.05, d.f. = 1, P > 0.1).
Comparison to behavioral performance
The main difference we observed between the two areas was a stronger and earlier encoding of both the rule and the behavioral response in the PMC relative to the PFC. However, the monkeys' performance was also better when we recorded from the PMC versus the PFC. To ensure that this did not account for the differences in neuronal activity, we ranked the recording sessions in terms of performance for each monkey, and compared the more poorly performed half of the PMC sessions to the better-performed half of the PFC sessions. There was no difference in the monkeys' performance between these two groups of sessions (PFC: 90%, n = 29, PMC: 91%, n = 9, Wilcoxon's rank-sum test = 202, P > 0.1).
Our analysis of this reduced data set revealed that the PMC still encoded the rule before the PFC (median PFC latency = 420 ms, n = 73, median PMC latency = 270 ms, n = 51, Wilcoxon's rank-sum test = 2,283, P < 5 x 106), as well as the behavioral response (median PFC latency = 355 ms, n = 66, median PMC latency = 380 ms, n = 76, Wilcoxon's rank-sum latency = 4,530, P < 1 x 106). The rule selectivity was also stronger in the PMC than in the PFC during the delay epoch (median ROC value in the PFC = 0.541, n = 264, median ROC value in the PMC = 0.578, n = 143, Wilcoxon's rank-sum test = 35229, P < 1 x 107), although with the reduced data set the difference between the areas did not reach significance during the sample epoch (median ROC value in the PFC = 0.536, n = 264, median ROC value in the PMC = 0.548, n = 143, Wilcoxon's rank-sum test = 31,607, P > 0.01). Finally, selectivity for the behavioral response was also still stronger in the PMC than in the PFC (median ROC value in the PFC = 0.535, n = 264, median ROC value in the PMC = 0.571, n = 143, Wilcoxon's rank-sum test = 36,233, P < 5 x 1010).
Comparison of "switch" and "stay" trials
The impairments that patients with PFC damage exhibit on rule-following tasks are most profound when the patient is required to switch from one rule to another. This phenomenon is termed "perseveration" (Milner 1963
; Nelson 1976
; Owen et al. 1991
; Stuss et al. 2000
). In our task, the rules varied randomly from trial to trial. Thus there was no discrete point at which monkeys had to switch from a currently familiar rule to a new one. Nonetheless, we compared the monkeys' behavior and rule-related activity from trials when the rule had switched from the previous trial ("switch" trials) to that from trials when the rule was the same as the previous trial ("stay" trials).
There was no difference between the monkeys' level of performance on switch trials and stay trials (switch trials: 89%; stay trials: 88%), and no difference in their median behavioral reaction times (373 ms for both types of trials). Accordingly, we also found that neural activity was largely similar between these trials.
To examine rule selectivity between stay and switch trials, we computed ROC values using each neuron's average activity across the sample epoch and across the delay epoch (see METHODS). During the sample epoch, neither PFC nor PMC neurons showed a difference in magnitude of rule selectivity between stay and switch trials (PFC neurons: median ROC value on "same" trials = 0.543, median ROC value on "switch" trials = 0.542, Wilcoxon's matched-pairs signed-ranks test = 52,054, n = 461, P > 0.1; PMC neurons: median ROC value on "same" trials = 0.554, median ROC value on "switch" trials = 0.555, Wilcoxon's matched-pairs signed-ranks test = 50,708, n = 258, P > 0.1). During the delay epoch, there was no difference in magnitude of rule effect between switch and stay trials for the PFC neurons (median ROC value on "same" trials = 0.548, median ROC value on "switch" trials = 0.544, Wilcoxon's matched-pairs signed-ranks test = 14,824, n = 461, P > 0.1). However, in the PMC there was a slightly, but significantly, higher value on "switch" trials (median ROC value on "same" trials = 0.570, median ROC value on "switch" trials = 0.578, Wilcoxon's matched-pairs signed-ranks test = 12,503, n = 258, P < 0.001). This may be attributed to the fact that significantly more PMC neurons (154/258 or 60%) showed stronger selectivity on "switch" trials than on "same" trials (binomial test, P < 0.00005). To examine the time course of rule selectivity on "switch" versus "stay" trials, we performed a sliding ROC analysis (as above and see METHODS) but there were no differences in the latency of selectivity on these two types of trial in either area (P > 0.1 in all cases).
Prefrontal regional specialization: neuronal properties of three prefrontal subregions
We recorded neural activity over a wide expanse of the PFC, which afforded an opportunity to examine whether there was any regional specificity of neuronal properties (Figs. 2 and 3). We compared 3 major PFC subregions: the dorsolateral prefrontal cortex (DLPFC, defined as the cortex dorsal to the ventral lip of the principal sulcus), the ventrolateral PFC (VLPFC, defined as the cortex from the ventral lip of the principal sulcus to the lateral lip of the lateral orbital sulcus), and the orbitofrontal cortex (OFC, defined as the cortex medial to the lateral lip of the lateral orbital sulcus). We used the same analyses that we had used to compare the PFC and the PMC to examine the incidence, strength of encoding and latency of the different neuronal properties.
We previously reported (Wallis et al. 2001a
) that there were more neurons in the VLPFC than in the DLPFC and the OFC whose activity varied significantly with sample picture and that rule-selective neurons were equally distributed throughout all 3 areas. An analysis of test epoch activity also revealed regional differences. In the VLPFC, 65/175 (38%) of neurons were selective for the identity of test picture (as determined by a KruskalWallis ANOVA), compared with 17/182 (12%) of neurons in the DLPFC and 24/104 (23%) of neurons in the OFC (chi-square = 37.4, d.f. = 2, P < 1 x 109). A post hoc analysis, using multiple chi-square tests and a Bonferroni-corrected alpha level of 0.0033, showed that the incidence of picture-selective neurons in the DLPFC was significantly lower than the incidence in either the VLPFC (P < 1 x 1010) or the OFC (P < 0.0033). The incidence of picture selectivity in the VLPFC and OFC did not significantly differ (P > 0.01). There was also a difference between the areas in the incidence of neurons encoding the match/nonmatch status of the test pictures, as revealed by Wilcoxon's rank-sum tests on each neuron's average activity across the test epoch (VLPFC 39/175 or 22%, DLPFC 11/182 or 6%, OFC 13/104 or 13%; chi-square = 18.7, d.f. = 2, P < 0.0001). Post hoc analysis revealed that the incidence of such neurons in the VLPFC was higher than that in the DLPFC (P < 0.00005), whereas the incidence in the OFC did not differ from either the VLPFC (P > 0.05) or the DLPFC (P > 0.05). Finally, there was not a significant difference in the incidence of neurons that reflected the behavioral response, as determined by a Wilcoxon's rank-sum test on average activity across the test epoch (DLPFC 49/182 or 26%, VLPFC 64/175 or 38%, OFC 46/104 or 47%, chi-square = 8.48, d.f. = 2, P > 0.01).
With regard to the strength of the encoding (as determined by an ROC analysis on each neuron's average test epoch activity) there was a bias for VLPFC neurons, relative to DLPFC and OFC neurons, to show a stronger effect of the identity of the sample picture (during the sample epoch) and the test picture (during the test epoch) and of the match/nonmatch status of the test picture (Table 5). During the sample epoch, there was also a tendency for the DLPFC to encode the rule more strongly than in the OFC, and during the delay epoch the VLPFC encoded the identity of the sample picture more strongly than in the DLPFC, but not the OFC. The only difference in latencies among the 3 areas was a slight tendency for the OFC to encode the rule earlier than the DLPFC (median DLPFC latency = 585 ms, n = 40, VLPFC = 480 ms, n = 51, OFC = 370 ms, n = 19, KruskalWallis one-way ANOVA, chi-square = 14.9, d.f. = 2,107, P < 0.001).
|
PMC regional specializations
In comparison to our recordings from the PFC, the PMC recordings were restricted to a smaller cortical area. The area we recorded from was bounded by the superior arcuate sulcus and the arcuate sulcus spur at its ventral extent, the end of the arcuate sulcus spur at the posterior extent, and the superior precentral dimple at the dorsal extent (Figs. 2 and 3). The recordings extended no further than 5 mm anterior to the genu of the arcuate sulcus, dorsal and superior to the superior limb of the arcuate sulcus. The recording locations were intended to match those of previous studies that have compared neuronal properties in the PFC and PMC (di Pellegrino and Wise 1991
, 1993
), and as such the recordings were largely restricted to area F2 (Matelli et al. 1985
).
Within this area neuronal properties were largely homogeneous, although there was a slight clustering of neurons that were rule selective during the delay in the more posterior extent of the recording area. To quantify this effect we compared the incidence of selective neurons anterior to the genu of the arcuate sulcus (n = 128) to those posterior to the genu (n = 130).
There were no differences in the distribution of neurons that showed a significant rule effect in their average activity across the sample epoch according to the 3-way ANOVA (posterior PMC: 41/130 or 32%, anterior PMC: 25/128 or 20%; chi-square = 4.27, d.f. = 1, P > 0.01). For the delay epoch, however, the posterior PMC had a greater proportion of neurons that were rule selective (57/130 or 44%) than in the anterior PMC (34/128 or 27%, chi-square = 7.70, d.f. = 1, P < 0.01). Furthermore, the rule was more strongly encoded in the posterior PMC during both the sample epoch (median ROC value in posterior PMC = 0.568, median ROC value in anterior PMC = 0.538, Wilcoxon's rank-sum test = 14083, P < 0.00005) and the delay epoch (median ROC value in posterior PMC = 0.612, median ROC value in anterior PMC = 0.554, Wilcoxon's rank-sum test = 13498, P < 5 x 107).
During the test epoch, there was no difference in the incidence of neurons encoding the identity of the test picture (posterior PMC: 7/130 or 5%, anterior PMC: 3/128 or 2%, chi-square = 0.88, d.f. = 1, P > 0.1), its match/nonmatch status (posterior PMC: 17/130 or 13%, anterior PMC: 10/128 or 8%, chi-square = 1.38, d.f. = 1, P > 0.1), or the behavioral response (posterior: 90/130 or 69%, anterior: 80/128 or 63%, chi-square = 1.02, d.f. = 1, P > 0.1). However, the behavioral response was encoded more strongly in average test epoch activity in the posterior PMC (median ROC value = 0.599) compared with that in the anterior PMC (median ROC value = 0.563, Wilcoxon's rank-sum test = 14,481, P < 0.0005).
Finally, there were no significant differences between the two regions in the latency to encode any of the above information (P > 0.1 in all cases).
| DISCUSSION |
|---|
|
|
|---|
At the same time, however, there was evidence for some degree of specialization: there were differences in the incidence, strength, and latency of effects. Encoding of both the rule and the behavioral response occurred earlier and was stronger and more abundant in the PMC than PFC. In contrast, the PFC contained some neurons that encoded picture identity, but they were rare in the PMC.
Below, we discuss these results in more detail.
Abstract rules
The abstraction of higher-order rules permits a shortcut in an organism's learning. It is a form of generalization that allows the organism to maximize flexibility and the amount of reward. For example, the monkey could have potentially solved our task quickly learning, on each day, a set of 16 associations (4 pictures x 4 cues). The monkey might learn that picture A on a blue background is associated with picture A (same), whereas picture A on a green background is associated with anything but picture A (different). This is inefficient because learning these associations tells nothing about the correct response to picture B on a blue background. Unless the monkey extracts a general rule that a blue background means "same," it would have to learn 16 new associations by trial and error each day, wasting many reward opportunities through error. Our monkeys apparently did make this abstraction; they performed well above chance when viewing pictures for very first time and could even perform the task with a new picture on every single trial.
The economy that this abstraction permits was reflected in the neural activity. It would have been possible to solve the task without abstracting the rule on the single neuron level. For example, there could have been two populations of neurons, one for encoding the rules when instructed by one modality (e.g., vision) and another populat