|
|
||||||||
1Department of Physiology, College of Medicine, The University of Arizona, Tucson, Arizona; 2Graduate School of Neurosciences Amsterdam, Faculty of Science, Swammerdam Institute for Life Sciences, Center for Neuroscience, University of Amsterdam, Amsterdam, The Netherlands; 3Helen Wills Neuroscience Institute, University of CaliforniaBerkeley, Berkeley; and 4Department of Psychiatry and Behavioral Sciences, The California National Primate Research Center and The Music Intelligence Neural Development Institute, University of CaliforniaDavis, Sacramento, California
Submitted 12 July 2006; accepted in final form 2 November 2006
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Further support for the role of the amygdala in face processing is provided by neuroimaging studies that demonstrate reliable activation of the amygdala in response to passive viewing of faces in both humans (Breiter et al. 1996
; Morris et al. 1996
, 1998
; Sato et al. 2004
; Vuilleumier et al. 2002; Whalen et al. 1998
; Wright and Liu 2006
) and in monkeys (Logothetis et al. 1999
; KL Hoffman, unpublished data). A larger activation of the human amygdala to fearful and angry faces than to neutral or happy faces (for a review see Zald 2003
) together with the observation that negative facial affect is the least discernible expression for patients with amygdala damage (Adolphs et al. 1994
; Sprengelmeyer et al. 1999
; but see Rapcsak 2003
) led to the proposal that the amygdala is preferentially involved in processing negative facial expressions and fear-related stimuli in general.
Face selectivity and the proposed processing bias in the amygdala were previously assessed at a neuronal level in both humans (Fried et al. 1997
, 2002
) and monkeys (Leonard et al. 1985
; Nakamura et al. 1992
; Sanghera et al. 1979
; Wilson and Rolls 1993
). Thus far, single-neuron recordings have not confirmed a bias in favor of threatening faces or in favor of aversive stimuli in general (Fuster and Uyeda 1971
; Nakamura et al. 1992
; Nishijo et al. 1988a
,b
; Ono et al. 1983
; Sanghera et al. 1979
). Furthermore, it is unclear whether the face-selective neurons in the monkey amygdala respond to face identity, facial expression, or some other dimension of faces, such as direction of gaze, age, sex, or perceived status of the monkey in the stimulus image.
Neural signals that may contribute to face selectivity in the amygdala originate in multiple areas of the temporal cortex where face-responsive neurons have been found (Bruce et al. 1981
; Desimone 1991
; Desimone et al. 1984
; Gross et al. 1972
; Hasselmo et al. 1989
; Perrett et al. 1982
, 1984
; Tsao et al. 2003
, 2005
; Wang et al. 1996
). Neurons in the superior temporal sulcus (STS) and in the polysensory area dorsal to area TE respond mainly to facial expressions and to direction of gaze (Hasselmo et al. 1989
; Perrett et al. 1985
), whereas neurons in the inferior temporal gyrus are more likely to respond to face identity (Eifuku et al. 2004
; Rolls and Tovee 1995a
,b
; Sugase et al. 1999
; Young and Yamane 1992
). The anatomical convergence of inputs from the STS and TE in the amygdala leads to the hypothesis that these inputs may be further processed in the amygdala, leading to the selection of social responses based on a combination of identity and facial expression.
The present study had two major objectives: 1) to determine the specificity of neural responses to faces in monkey amygdala, i.e., to investigate whether faces and other complex objects are processed by distinct groups of neurons with stimulus-specific or category-specific responses; and 2) to determine whether face identity and facial expression are encoded independently or jointly in the firing properties of the face-responsive neurons. The first objective was addressed by recording neural responses to large image sets that contained monkey faces, human faces, and objects. For the second objective, which required independent assessment of the contribution of face identity and facial expression; appeasing, neutral, and aggressive facial expressions from each stimulus monkey were shown to the experimental animals (Gothard et al. 2004
). These stimuli allowed a distinction between neurons in the monkey amygdala that respond to 1) facial expressions regardless of identity, 2) face identity regardless of expression, or 3) specific combinations of expression and face identity.
| METHODS |
|---|
|
|
|---|
All surgical procedures were carried out in compliance with National Institutes of Health guidelines and were approved by the IACUC at the University of Arizona. Two adult male rhesus monkeys (Macaca mulatta) (monkeys S and H) were surgically prepared for multielectrode recordings from the amygdala using a two-step surgical procedure. For both surgical procedures the monkeys were preanesthetized with ketamine (1015 mg/kg), administered intramuscularly, and brought to surgical levels of anesthesia with isoflurane (11.5%) supplemented with fentanyl (710 µg · kg1 · h1). During the first procedure, monkeys were implanted with three titanium tripod plates, custom-manufactured by Thomas Recording (Giessen, Germany). Each tripod plate had an elevated center with a threaded screw hole and three flat radial support arms emerging at 120° intervals. The three support arms were 10 mm long, 2 mm wide, and were perforated at the end to accommodate a 2-mm-diameter bone screw. The arms were bent to match the curvature of the skull and affixed to the bone with screws. Two plates were affixed posteriorly, 15 mm lateral from midline and approximately 20 mm posterior from the interaural line. The third plate was affixed straddling the midline 2 cm anterior to the interaural line. The temporalis muscle and the scalp were closed over the plates and the bone was allowed to heal and grow around the plates (about 412 mo). Once substantial bone growth had occurred around the screws and the arms of the plates (verified by CT scan), the second procedure was undertaken in which the scalp above the plates was perforated and small threaded head posts (3-mm-diameter shaft, 15-mm length, 5-mm-diameter spherical head) were screwed into the center of each of the tripod plates. The posts served as anchors for a removable stainless steel ring that was secured in a rigid frame during experiments. Such an arrangement distributed the torque generated by head immobilization across the three head posts.
During the second surgical procedure, a recording chamber was secured with bone screws and bone cement to the skull above the amygdala. The chamber was placed at stereotaxic coordinates calculated from a structural MRI carried out before surgery. Magnetic search coils were also implanted in the left eye of one of monkey (S) following standard procedures (Judge et al. 1980
; Robinson 1963
). After a 10-day recovery period, monkeys were trained to tolerate head immobilization and to fixate on objects presented on a computer monitor. When behavioral training was complete, a 6- to 8-mm-diameter craniotomy was performed within the chamber.
Behavioral training
The monkeys were trained to fixate on a white square that subtended 0.5 degree of visual angle (dva). The eye movements of monkey S were tracked with a resolution of 0.25 dva and were digitized at 500 Hz using an analog eye tracker (DNI, Newark, DE). Eye movements of monkey H were monitored with the infrared eye tracker (Iscan, Burlington, MA) with a resolution of 0.5 dva. Both eye trackers were connected to the CORTEX experimental control system (NIMH-supported freeware from the website: http://www.cortex.salk.edu). When the eyes were fixated for
150 ms at the white square, the fixation icon was removed from the monitor and a stimulus image, subtending 12 dva, was displayed. The monkeys were allowed to freely scan the image, with the requirement to maintain gaze within the boundaries of the image. If this requirement was met for the entire duration of the display (500 ms or 1 s) the monkeys received a 0.5- to 1-ml reward droplet. The reward consisted of a paste of mashed granola, rice cereal, and fruit juice or simple fruit juice mixed with applesauce. To disambiguate the effect of the reward from the effect of the stimuli, monkeys received a reward on only 50% of the trials, so that each image was followed an equal number of times by reward or no reward. The duration of the intertrial interval (ITI) was 1 s. When the monkey's eyes moved outside the boundary of the image, the trial was terminated by removing the image from the monitor. Error trials were not rewarded and were followed by a 2-s time-out period. Error trials were excluded from the analysis. To prevent habituation and blunting of the emotional response to face stimuli, the sets of images used for training or used during the period when we searched for neurons in the amygdala did not contain monkey faces or images intended for presentation during neurophysiological recordings.
Stimulus sets
The stimuli were digitized images that subtended 12 x 12° of visual angle and fell into the following categories: 1) monkey faces, 2) human faces, and 3) objects/abstract images. Images of 25 unfamiliar monkeys were selected from an extended library of digitized monkey faces (Gothard et al. 2004
). Each monkey was depicted with three facial expressions: an affiliative expression (lipsmack), a neutral expression, and an aggressive expression (open-mouth threat) (
Fig. 2). A lipsmack is an appeasing expression used by monkeys during a friendly approach, preceding grooming, or as part of a complex of gestures used to beg for food or grooming from other monkeys or from caretakers (Hinde and Rowell 1962
; van Hoof 1962). The open-mouth threat is a typical aggressive facial expression. High-ranking monkeys use this expression to threaten lower-ranking individuals, which, in response, display submission or retreat from the threatening monkey (Chevalier-Skolnikoff 1973
; Hinde and Rowel 1962
; Redican 1975
).
|
|
The images of objects were of two general types: pictures of objects familiar to the monkeys (laboratory equipment, fruits, food items, toys, and objects used for behavioral enrichment) and pictures of objects unfamiliar to the monkeys (abstract images, pictures of animals, nature scenes, and man-made objects (Fig. 2).
These three classes of images were assembled in different proportions into eight stimulus sets. Six of the eight stimulus sets contained 15 images of monkey faces, five images of human faces, five images of familiar objects, and five images of abstract or unfamiliar objects, totaling 30 images (Fig. 2). The 15 monkey face images depicted five monkeys with a lipsmack, a neutral face, and a threatening face each. In the other two stimulus sets, the ratio of monkey faces, human faces, and objects remained the same (3:1:2, respectively) but the number of images was either eight monkeys or ten monkeys with three expressions each. Each stimulus set contained different monkeys, humans, and objects, with the exception of three humans that appeared in more than one set but in a different photograph (e.g., with or without masks and safety goggles).
Electrodes and electrode delivery system
A custom-built, seven-channel Eckhorn drive, manufactured by Thomas Recording, was used to deliver seven electrodes (80- to 100-µm diameter, tungsten/platinum core, quartz glass coated) to a depth 2535 mm below the surface of the brain. The drive contains seven precision motors that tense or relax a rubber tube attached to the back of the electrode, advancing or withdrawing the electrode in 1- to 3-µm increments (Eckhorn and Thomas 1993
; Mountcastle et al. 1991
). The electrodes were delivered into the brain by 30-gauge stainless steel sharp cannulae that penetrate the dura and were advanced into the brain 5 mm. The target coordinates of the amygdaloid nuclei were calculated for each monkey using an MRI-based method developed by Saunders et al. (1990)
, Rebert et al. (1991)
, and Zola-Morgan et al. (1991)
and adapted to the amygdala by Amaral and colleagues (1992)
. The electrodes were connected to a headstage amplifier (gain = 20), built into the drive, and from there the signals were directed to a Lynx-8 (Neuralynx, Tucson, AZ) amplifier (gain of 2,000, band-pass 0.6 to 6 kHz). Neural data were digitized at 30 kHz, recorded continuously using a Power 1401 data-acquisition system [Cambridge Electronics Design (CED), Cambridge, UK], and stored on a disk for off-line spike sorting.
Neural recordings
At the beginning of a recording session, the electrode drive was placed in the recording chamber and aimed primarily at the lateral and basal nuclei of the amygdala. For the vast majority of recordings the electrode array was positioned to reach the dorsal and lateral aspects of the amygdala, the target of projections from and to cortical visual areas (Amaral and Price 1984
; Amaral et al. 1992
), particularly the rostral half of the amygdala where the lateral and basal nuclei are bordered superiorly by the anterior amygdaloid area (AAA) and the substantia innominata (SI); only a few recordings were made from the caudal half of the amygdala where the lateral and basal nuclei lie ventral to the central nucleus. The sharpened cannulae containing the electrodes were advanced through the dura 5 mm into the brain. The electrodes were then advanced at a speed of 30100 µm/s to a depth associated with the stereotaxic coordinates of the dorsal border of the amygdala. When the electrodes reached the dorsal border of the amygdala, background activity was modulated by the onset and offset of images. The electrodes continued to be advanced in small increments until neural signals with a high signal/noise were observed. Some neurons appeared to be responsive to the fixation icon or images displayed during electrode advancement, although this was not a selection criterion. When the majority of the electrodes registered well-isolated and stable spikes, one of the stimulus sets was displayed and the test stimuli were delivered in repeated blocks that contained the entire sets of images. The recording session continued until the monkey looked 10 to 20 times at each image in a stimulus set.
We used a template-matching algorithm for off-line spike sorting (Spike2, CED). Only well-isolated neurons that could be monitored for at least five presentations of each image in the stimulus set were included in the analysis.
Histological analyses were performed for monkey S. After euthanasia, the calvarium was opened and the head was submerged in 4% phosphate-buffered (pH 7.2) formaldehyde. The block containing the amygdala was sectioned in the coronal plane at 40-µm thickness and two systematic series through the amygdala were mounted on microscopic slides and stained either with the Nissl method to define nuclear boundaries or with the Pearl technique to identify the location of the electrode track.
Monkey H is currently involved in other studies, thus precluding histological confirmation of the electrode tracks. The reconstruction of intraamygdala recording sites based on histological and MRI analysis is shown in Fig. 1.
Data analysis
Analyses were carried out with custom-designed programs in MATLAB (The MathWorks, Natick, MA). Only trials in which the monkey maintained its gaze within the boundary of the stimulus images for the required time were included in the analysis. For each image presentation, two time intervals were considered: a baseline interval, 1 s immediately before the onset of the fixation icon, and a response interval, beginning 100 ms after stimulus-image onset and ending when the image was removed from the display. Images from the six stimulus sets that contained 30 stimuli were displayed for 500 ms, whereas images from the other two stimulus sets were displayed for 1,000 ms. The onset of the response interval was chosen to exclude nonimage-specific responses to the fixation spot. The numbers of spikes generated by the cell during the response interval, normalized by interval duration, were computed to characterize the response of each cell for each image presentation.
Category selectivity was assessed with one-way ANOVAs with three levels: "Monkey," "Human," and "Object." Significant results are reported based on P < 0.05 level, unless stated otherwise. BonferroniDunn post hoc tests (also at P < 0.05) were used to determine which category of stimuli for which each cell was selective.
Identity and expression selectivity were assessed using two-way ANOVAs where the two factors were "Identity" (with five, eight, or ten levels depending on the number of monkeys in the stimulus set) and "Expression" (with three levels: Threat, Neutral, and Lipsmack). Selectivity is reported at the P < 0.05 level. The same BonferroniDunn post hoc test was used to determine which of the three expressions for which the neuron was selective.
To determine whether the same three categories of stimuli (monkey faces, human faces, and objects) would emerge from the responses of small groups of neurons, a multidimensional scaling (MDS) procedure was applied to the neural data. For each image in a stimulus set, a vector was compiled, containing the normalized average firing rate of all the recorded neurons in response to an image. To avoid excessively weighting cells with high firing rates, the average firing rates were normalized. Cells that did not differentiate among stimuli or had an extremely low discharge rate (i.e., cells for which the SD of the average firing rates across stimuli was <0.3 Hz) were not included in the analysis. Nonmetric MDS (using the Kruskal stress criterion) was applied to the matrix of the Euclidean distances between the resulting population vectors for each stimulus. The first two components of the resulting scaling were retained, to create a two-dimensional representation of the stimulus space (Fig. 3 B and Fig. 8D).
|
|
| RESULTS |
|---|
|
|
|---|
|
Category-selective responses
Neural firing rates during stimulus presentation (excluding the first 100 ms after stimulus onset) for three categories of images ("Monkey," "Human," "Object") were compared for each neuron using a one-way ANOVA. Of the 196 neurons, 102 (52%) showed a significant effect of category (P < 0.05) (chance level = 9.8 neurons, binomial sign test, z = 30.2, P < 0.0001).
A typical stimulus set and examples of category-selective neurons are shown in Fig. 2. The stimulus set (Fig. 2A) contained 15 monkey faces (three facial expressions for each of the five monkeys used in this image set), five human faces, and ten objects. Mean (±SE) firing rate responses to each of the stimuli in this set are shown for three category-selective neurons in Fig. 2, BD. The neuron depicted in Fig. 2B discharged with significantly higher rates during the presentation of monkey faces, compared with the presentation of human faces or objects (P < 0.0001). This neuron showed particularly robust responses to threatening monkey faces. In contrast, the neuron in Fig. 2C responded with significantly higher rates for human faces than for monkeys faces or objects (P < 0.0001), whereas the neuron in Fig. 2D was selective for objects (P < 0.0001), particularly familiar objects (last five bars, Fig. 2D). These examples illustrate the remarkable selectivity of some neurons in the amygdala for broad categories of stimuli. A small subset of object-selective neurons (including the example neuron in Fig. 2D) appeared to respond differentially to familiar versus unfamiliar stimuli. The number of such neurons, however, was too small and their behavior too inconsistent to draw reliable statistical conclusions.
To determine which category was preferred by each neuron, a post hoc test was performed in which the average firing rate of a neuron for each category was compared with the firing rate for the other two categories combined (BonferroniDunn post hoc test). In this test, a significant differential response to a category can be manifested as either an increase or decrease in firing rate relative to the combined firing rate for the other two categories. This test thus identified the neurons that differentiated among categories by responding either with higher or lower firing rates to images from a particular category. Monkey faces elicited differential responses from 83 neurons, human faces from 48 neurons, and objects from 76 neurons (Fig. 3A). Many of the 102 neurons differentiated between more than one category of image by increases or decreases of firing rate that were specific for each category. Thus a category-selective neuron could be listed as showing multiple differential responses (e.g., a monkey selective neuron might show increased firing rates for monkey faces compared with human faces and objects and decreased firing rate for objects compared with monkey and human faces). Of the 83 neurons selective for images in the "Monkey" category, 32 showed a higher firing rate and 51 showed a lower firing rate for monkey faces than for images from the other two categories. Human faces elicited category-selective responses from 48 neurons (31 with higher and 17 with lower firing rates for humans faces than for the other image categories). Finally, 76 neurons were selective for objects, of which 47 neurons had higher and 29 neurons had lower firing rates for objects than for the other categories of images. When all stimuli are taken together, there was no significant bias in the overall population of amygdala neurons for responsiveness to any particular category [one-way repeated-measures ANOVA, F(2,194) = 0.9, P > 0.05].
To determine whether the same three image categories would emerge from the analysis of population responses, an MDS analysis was carried out on all the neurons recorded with the same stimulus set. Figure 3B shows the relative distances between 30 images computed from the similarity of the discharge pattern of 22 neurons for each image. In this two-dimensional stimulus space, images of monkey faces are clustered and distributed separately from images of objects and from images of human faces, indicating that neural population activity in the amygdala is different for different categories but comparatively more similar for images within the same category. The larger distribution of distances in stimulus space between categories than within categories of images (KolmogorovSmirnov, P < 0.0001) is shown in Fig. 3C.
An example of a category-selective cell is shown in Fig. 4. This neuron responded weakly to images of monkey faces but showed a robust (6070 Hz) response to all the objects in the stimulus set. The monkeys shown in Fig. 4 display an appeasing facial expression to illustrate that monkeys and object with positive valence (the banana and the reward spout) do not elicit similar firing rates. The firing rate variation across objects with positive or negative affective significance was smaller than the variation between the categories of faces versus objects.
|
In addition to the tests for category selectivity, each recorded neuron was further analyzed to determine response selectivity for monkey facial expression or monkey identity. Two-way ANOVAs were performed where the two factors were facial expression (levels: "Threat," "Neutral," "Lipsmack") and monkey identity (five, eight, or ten levels depending on the number of individual monkeys depicted in a stimulus set). About half of the neurons (95 of 196, or 48%) responded selectively (P < 0.05) to facial expression and or face identity. This number slightly exceeds the number of cells that showed category selectivity for monkey faces because it includes neurons that responded to only a small fraction of the monkey faces and therefore did not reach significance when analyzed for category selectivity. For example, a neuron that responded with increased firing rate only to lipsmacks may not have been identified as category selective as a result of the modest firing rate for the other two expressions.
Eighty-one neurons (41% of the total) responded selectively to face identity. Only 39 of these responded to face identity regardless of facial expression; the remaining 42 neurons showed a significant interaction between face identity and facial expression. An example of an identity-selective neuron is shown in Fig. 5. This cell responded to all three facial expressions displayed by the first two monkeys (although with considerable difference in firing rate and duration of response) and did not respond to the faces of the other two monkeys. This identity-selective response was not modulated by facial expression [F(7,562) = 2,487, P < 0.0001].
|
|
|
Overall, the number of responses that showed significant interactions exceeded the number of responses that showed a significant response only for identity or expression. These results are summarized in Fig. 8A. Furthermore, 34 neurons showed a significant main effect for both identity and expression (e.g., a neuron responded with higher firing rates to all three facial expression of monkeys 1 and 2 compared with monkeys 3, 4, and 5, but also showed expression selectivity by responding with higher firing rates to the threatening expressions of all five monkeys). Taken together, 64% of the recorded neurons responded to both expression and identity (half of which also showed significant interaction between these factors) indicating that face-selective neurons in the amygdala are more likely to respond to both identity and expression than to only one of these factors.
An additional post hoc analysis (two-tailed BonferroniDunn test) was performed to determine which expression was the most likely to elicit significant changes in firing rates. Differential responses to facial expressions included both increases and decreases of firing rate for a given expression compared with the average firing rate for the other two expressions combined. Of the 48 expression-selective neurons, 21 responded to threats, 11 responded to neutral faces, and 16 responded to appeasing faces. The majority of neurons that responded selectively to threats increased their firing rate (19 of 21) (Fig. 8C). Neurons that responded to neutral faces were equally likely to decrease or increase their firing rate. Finally, neurons that responded to lipsmacks were more likely to "select" for lipsmacks by virtue of a lower firing rate compared with the responses to threats and neutral faces combined.
Population analyses
When the entire population of amygdala neurons was considered, the differences in firing rates for threatening (3.3 ± 5.7 Hz), neutral (3.2 ± 5.7 Hz), and lipsmacking (3.2 ± 5.6 Hz) expressions were not significant (P > 0.05). When only the expression-selective neurons were considered, however, a small but significant difference was observed (threat: 4.2 ± 4.7; neutral: 3.8 ± 4.8; lipsmack: 3.6 ± 4.5 Hz; repeated-measures ANOVA, P < 0.01). This tendency of the expression-selective neurons to respond with higher firing rates to threatening faces is shown in the peristimulus time histogram (PSTH) constructed from all expression-selective neurons in Fig. 8B. The firing rates elicited by threatening faces exceeds the firing rates elicited by neutral or appeasing facial expressions for only a limited period of time ranging between 200 and 300 ms after stimulus presentation. This effect arises primarily from neurons recorded from one of the two monkeys (monkey S). Proportions of neurons with different response properties in the two subject monkeys are shown in Table 2.
|
| DISCUSSION |
|---|
|
|
|---|
We examined the response properties of amygdala neurons to a large array of images that belong to three main categories: monkey faces, human faces, and objects. The majority of neurons responded differentially to images from these three categories. Neurons that responded to monkey faces were further examined to determine the contribution of face identity and of facial expression to the observed changes in firing rates. We found that the majority of face-selective neurons responded to one or more combinations of identity and expression and only a small fraction of the neurons responded to identity irrespective of expression or expression irrespective of identity. The fraction of threat-specific neurons was not larger than the fraction of neurons that responded to neutral or appeasing faces. However, the global (summed across all expression-selective neurons) neural activity was higher for threatening faces than for neutral or appeasing faces. This effect was observed for only a brief period of time between 120 and 250 ms after stimulus display.
Category-selective neurons
The majority of amygdala neurons responded with significantly different firing rates to images of monkey faces, human faces, or objects. However, for the ANOVA, the image categories were predetermined, leaving open the possibility that the effect is dictated by the structure of the stimulus set and the method of analysis (e.g., a neuron that responds to half of the 15 monkey faces, but only to a few human faces and objects might appear category selective for monkey faces). The MDS analysis distinctly classified the stimuli in the same three categories. Comparable clustering patterns were observed for all image sets; i.e., the relative distances between categories were larger than distances within a category (Fig. 3C).
In general, no processing bias was observed in favor of conspecific face stimuli. On the contrary, some of the most selective responses were elicited by images with no obvious significance for the monkey (e.g., fractals, junk objects). Similar observations by Nishijo et al. (1998a
,b) support the hypothesis that the amygdala participates in the representation and evaluation of all the stimuli encountered by an organism.
According to a functional scheme proposed by Paré and colleagues (Collins and Paré 1999
; Paré and Smith 1993
; Royer et al. 1999
; reviewed by Davis and Whalen 2001
), the lateral, basal, and accessory basal nuclei of the amygdala, evaluate the emotional valence of stimuli, whereas the central nucleus is the effector for an appropriate behavioral and autonomic response. Neurons in the central nucleus are involved in the initiation of somatic, autonomic, and endocrine responses to emotional stimuli (Kaada 1967
; Kapp et al. 1979
; Moga and Gray 1985
). The central nucleus would thus be activated when the output of the evaluation process that takes place in the basolateral complex signals that the stimuli are emotionally important or require further exploration (Davis and Whalen 2001
). The information transfer from the lateral and basal nuclei to the central nuclei is gated by the intercalated neurons that are under prefrontal control (Likhtik et al. 2005
; McDonald 1998
; Quirk et al. 2003
). The intercalated neurons are assumed to block the activation of the central nucleus in response to neutral stimuli or stimuli that do not require immediate action (Davis and Whalen 2001
). In this framework, the basal and lateral nuclei, where the evaluation is taking place, are expected to process all stimuli, whether emotional or neutral. Our data support this functional scheme. The majority of neurons (167 of 196) reported here were recorded from the lateral, basal, and accessory basal nuclei. It is thus not surprising that images with little or no species-specific significance elicited category-specific or image-specific responses.
Previous single-unit studies in the monkey amygdala also showed neural responses selective for monkey faces (Leonard et al. 1985
; Rolls 1984
; Sanghera et al. 1979
), human faces (Nakamura et al. 1992
; Rolls 1992
), images of meaningful objects, such as food items (Nishijo et al. 1989a
,b
; Ono et al. 1989
; Wilson and Rolls 2005
), and even to abstract images (Fuster and Uyeda 1971
; Paton et al. 1006). These studies emphasized the selectivity of amygdala neurons for individual stimuli rather than for categories of images, although these conclusions are based on responses to relatively few stimuli. We used eight image sets, each containing 3060 images, that allowed more refined evaluation of response selectivity but also increased the difficulty to control for all the stimulus variables.
The images used in the present study fall short of an ideal stimulus set that should contain negative, neutral, and positive variants of all categories of images. This kind of symmetry in the stimulus set might result in a different type of category and stimulus selectivity than the observations reported here. However, in designing this study we believed that an entirely balanced set of stimuli was not achievable. In particular, it is difficult to match the affective and social meaning of human and monkey facial expressions. In our stimulus set, the images within each category certainly appear more similar in terms of shape, color, and features than the stimuli between categories. Yet the category-selective neurons showed large variations of firing rates across the images from the same category (see Fig. 2). Consequently, we cannot dismiss the possibility that the category selectivity reported here is accounted for by the shared visual components of stimuli (Fujita et al. 1992
; Kobatake and Tanaka 1994
; Wang et al. 1996
) and not by cognitive categorization resulting from experience with the stimuli (Erickson et al. 2000
) or task demands (Erickson and Desimone 1999
; Logothetis et al. 1995
; Miller et al. 1996
; Sakai and Miyashita 1991
).
The observed neural responses in the amygdala do not seem dramatically different from the responses reported in IT. This is to be expected given that the amygdala receives the processed output of these areas and returns massive feedback projections to these same areas. It was unexpected, however, that images with known emotional valence and objects with no obvious emotional significance are equally likely to elicit strong, stimulus-selective neural responses. One possible cause for this effect is that monkeys performed a passive viewing task where the stimuli were not associated with reinforcers. Stimulusreinforcer associations were previously shown to broadly facilitate neural responses in the amygdala (Paton et al. 2006
; Quirk et al. 1977
; Sugase-Miyamoto and Richmond 2005
).
Face-selective neurons
The majority (55%) of the neurons in the amygdala responded to monkey face stimuli. This percentage is comparable to previous studies that reported face cells in the amygdala (e.g., Nakamura et al. 1992
). Compared with previous studies, however, we independently manipulated face identity and facial expression expecting that facial expressions, especially threatening faces, will be the primary determinant of neural activity. The largest fraction of face-selective neurons (64%) responded to both expression and identity, half of which also showed significant interaction between these factors, suggesting that expression and identity are processed jointly rather than separately in the amygdala. Moreover, the neurons that showed a significant main effect for identity were more numerous than the neurons that showed a significant main effect for expression. Within these classes, the neurons that showed a significant main effect for identity only (no interaction with expression) were also more numerous than the neurons that showed a significant main effect for expressions only, suggesting that the monkey amygdala is equally or more concerned about identity than about facial expression. Indeed, recent fMRI studies in humans reached similar conclusions (Wright and Liu 2006
). Overall, the largest population of neurons showed significant interaction between identity and expression. Taken together, these observations suggest that the amygdala can specify unique combinations of individuals and facial expression. The ethological relevance of this observation rests with the fluid dominance hierarchy of macaque troupes where facial expressions gain or lose emotional significance as the displaying monkey ascends or descends in social rank. In this scenario, the identity of the displaying individual might carry as much emotional significance as the expression itself, such as the threatening face of a high-ranking adult is more dangerous than the threatening face of a juvenile.
Previously many important properties of face cells were determined, both in the cortex (in STS: Bruce et al. 1981
; Desimone 1991
; Desimone et al. 1984
; Hasselmo et al. 1989
; Perrett et al. 1982
; Wang et al. 1996
; in TE: Eifuku et al. 2004
; Rolls 2000
; Rolls and Tovee 1995a
; Sugase et al. 1999
; Tsao et al. 2003
; Young and Yamane 1992
) and in the amygdala (Leonard et al. 1985
; Nakamura et al. 1992
; Sanghera et al. 1979
), yet little evidence has been available for either clearly separate or joint processing of facial identity and facial expressions (Calder and Young 2005
). Using an image set that contained three monkeys with three expressions each, Hasselmo and colleagues (1989)
recorded neural activity from STS and from area TE of the temporal cortex. They found that the majority of neurons in the upper bank of STS responded to facial expression regardless of identity, whereas the neurons in area TE responded to identity regardless of facial expression. Only 6.7% of their sample showed a combined effect of these two factors. In the present study, 64% of the neurons responded to such combinations. Joint processing of expression and identity is further supported by our MDS analysis where the three facial expressions displayed by five to eight monkeys fall in the same cluster (see Fig. 8D).
A potential confound for the image-selective responses in the amygdala is that in monkeys, eye movements appear to be different for each facial expression (Gothard et al. 2004
). Although the scanpaths show regularities in the amount of time spent exploring each facial feature, there is no evidence for a stereotypical sequence of fixations and saccades. For example, the ratio of time spent looking at the eyes relative to the mouth is approximately 80/20 for lipsmacks and 55/45 for threats, although the targets and the durations of fixations are highly variable. Eye movements start at 200250 ms after stimulus display and, by visual inspection, scanpaths do not exhibit any fixed temporal structure. The peak in the differential neural response precedes the onset of the first saccade by 80120 ms (see Fig. 8B). Therefore a simple oculomotor explanation for expression-selective responses can be reasonably excluded. A more detailed analysis of the relationship between eye movements and neural responses in the amygdala might uncover mechanisms by which the amygdala directs attention toward face areas that carry information about emotion (Adolphs et al. 2005
).
Differential processing of aggressive and appeasing/neutral facial expressions
In support of the prediction that negative facial expressions elicit stronger responses in the amygdala, we found a small but significant increase in firing rate for threatening faces compared with neutral or appeasing faces. This effect was accounted for by the population of neurons sampled from monkey S. Moreover, the higher population firing rate in response to threatening faces is short lived (restricted to an interval of 120250 ms after stimulus presentation) compared with the overall response profile to face images (see population histogram in Fig. 8D). This small difference could account for some of the neuroimaging results obtained by subtraction analyses. An equivalent subtraction method applied to the data presented here would allow us to propose a higher global output of the amygdala for threatening faces compared with other facial expressions, yet this conclusion would ignore the large fraction of neurons that responded selectively to neutral and appeasing faces (Fitzgerald et al. 2006
; Wright and Liu 2006
). It thus appears that in the context of a passive viewing task of conspecific facial expressions, threatening faces have a small but distinct advantage of engaging the monkey amygdala. This observation is consistent with conclusions from the majority of primate neurophysiology studies that the primate amygdala evaluates stimuli with both negative and positive valence but draws a sharp difference between them (Fuster and Uyeda 1971
; Nakamura et al. 1992
; Nishijo et al. 1988; Ono et al. 1983
; Paton et al. 2006
; Sanghera et al. 1979
; Wilson and Rolls 2005
).
It thus appears that the monkey amygdala contains neurons that transmit information about a large array of complex visual stimuli and their category membership regardless of emotional or species-specific significance. Concomitantly, the face-selective neurons respond to multiple, socially relevant dimensions of faces with both positive and negative valences.
| GRANTS |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: K. M. Gothard, The University of Arizona, College of Medicine, Department of Physiology, ARL-NSMA 327 LSN building, Tucson, AZ 85724 (E-mail: kgothard{at}email.arizona.edu)
| REFERENCES |
|---|
|
|
|---|
Adolphs R, Gosselin F, Buchanan TW, Tranel D, Schyns P, Damasio AR. A mechanism for impaired fear recognition after amygdala damage. Nature 433: 6872, 2005.[CrossRef][Medline]
Adolphs R, Tranel D, Damasio AR. The human amygdala in social judgment. Nature 393: 470474, 1995.
Adolphs R, Tranel D, Damasio H. Impaired recognition of emotion in facial expressions following bilateral damage to the human amygdala. Nature 372: 669672, 1994.[CrossRef][Medline]
Aggleton JP. The contribution of the amygdala to normal and abnormal emotional states. Trends Neurosci 16: 328333, 1993.[CrossRef][ISI][Medline]
Amaral DG, Price JL. Amygdalo-cortical projections in the monkey (Macaca fascicularis). J Comp Neurol 230: 465496, 1984.[CrossRef][ISI][Medline]
Amaral DG, Price JL, Pitkänen A, Carmichael ST. Anatomical organization of the primate amygdaloid complex. In: The Amygdala: Neurobiological Aspects of Emotion, Memory, and Mental Dysfunction, edited by Aggleton JP. New York: WileyLiss, 1992, p. 166.
Breiter HC, Etcoff NL, Whalen PJ, Kennedy WA, Rauch SL, Buckner RL, Strauss MM, Hyman SE, Rosen BR. Response and habituation of the human amygdala during visual processing of facial expression. Neuron 17: 875887, 1996.[CrossRef][ISI][Medline]
Brown S, Schäfer EA. An investigation into the functions of the occipital and temporal lobes of the monkey's brain. Philos Trans R Soc Lond B Biol Sci 179: 303327, 1888.
Bruce C, Desimone R, Gross CG. Visual properties of neurons in a polysensory area in the superior temporal sulcus of the macaque. J Neurophysiol 46: 369384, 1981.
Calder AJ, Young AW. Understanding the recognition of facial identity and facial expression. Nat Rev Neurosci 6: 641651, 2005.[CrossRef][ISI][Medline]
Chevalier-Skolnikoff S. Facial expression of emotion in nonhuman primates. In: Darwin and Facial Expression, edited by Ekman P. New York: Academic Press, 1973, p. 1190.
Collins DR, Paré D. Spontaneous and evoked activity of intercalated amygdala neurons. Eur J Neurosci 11: 34413448, 1999.[CrossRef][ISI][Medline]
Davis M, Whalen PJ. The amygdala: vigilance and emotion. Mol Psychiatry 6: 1334, 2001.[CrossRef][ISI][Medline]
Desimone R. Face-selective cells in the temporal cortex of monkeys. J Cogn Neurosci 3: 18, 1991.[Medline]
Desimone R, Albright TD, Gross CG, Bruce C. Stimulus-selective properties of inferior temporal neurons in the macaque. J Neurosci 4: 20512062, 1984.[Abstract]
Dicks D, Myers RE, Kling A. Uncus and amygdala lesions: effects on social behavior in the free-ranging rhesus monkey. Science 165: 6971, 1968.[CrossRef][ISI][Medline]
Eckhorn R, Thomas U. A new method for the insertion of multiple microprobes into neural and muscular tissue, including fiber electrodes, fine wires, needles and microsensors. J Neurosci Methods 49: 175179, 1993.[CrossRef][ISI][Medline]
Eifuku S, De Souza WC, Tamura R, Nishijo H, Ono T. Neuronal correlates of face identification in the monkey anterior temporal cortical areas. J Neurophysiol 91: 358371, 2004.