|
|
||||||||
Department of Physiology, Faculty of Medicine, Toyama Medical and Pharmaceutical University, Toyama, Japan
Submitted 13 September 2004; accepted in final form 27 March 2005
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Neurons that are selectively responsive to the sight of faces, i.e., "face neurons," were originally identified in the anterior STS (Bruce et al. 1981
; Perrett et al. 1982
). To date, these neurons have been recorded in various regions of the monkey brain (Baylis et al. 1985
; Desimone et al. 1984
; Hasselmo et al. 1989
; Nakamura et al. 1992
; O'Scalaidhe et al. 1997
; Perrett et al. 1985
; Rolls 1984
; Yamane et al. 1988
). The responses of face neurons in the anterior STS of macaque monkeys are tuned to facial views and/or to gaze direction in the faces of other individuals (Perrett et al. 1985
, 1992
). The so-called "face space" composed by the population of face neurons in the anterior STS represented facial views (Eifuku et al. 2004
). It has been reported that when recognizing familiar objects (or faces), humans usually use "view-specific representations" rather than "structural descriptions" (Bulthoff and Edelman 1992
; Edelman and Bulthoff 1992
; Logothetis et al. 1995
; Tarr 1995
). The selectivity for facial views among the face neurons in the anterior STS, already discussed in previous studies, is consistent with the notion that view-specific representations are preferentially employed for the recognition of objects.
Previous studies have shown that anatomical architecture and connectivity differ between the rostral and caudal regions of the anterior STS. The rostral region of the lower bank of the anterior STS receives input primarily from the anterior inferior temporal gyrus (ITG), which is known to be related to short-term and long-term visual memory (Naya et al. 2001
). The caudal region of the anterior STS receives input from the posterior ITG, as well as from the intraparietal sulcus (IPS), posterior parahippocampal areas (areas TF and TH), and prestriate areas, including area V4 (Barnes and Pandya 1992
; Saleem et al. 2000
; Seltzer and Pandya 1978
, 1984
, 1994
). The IPS is involved in the processing of three-dimensional structures (Sakata et al. 1997
; Taira et al. 2000
; Tsutsui et al. 2002
).
In contrast to our current understanding of this anatomical heterogeneity, the functional differences between the rostral and caudal regions of the anterior STS are not well understood. The purpose of this study was to investigate the various characteristics of face neuron responses in the rostral and caudal regions of the anterior STS; a related aim of this study was to elucidate the functional heterogeneity of these responses within the anterior STS. Thus we analyzed the view-tuning curves of face neuron responses; here, we focused on optimal views for each neuron, as well as on right-left symmetry or asymmetry, and certain differences were observed between the rostral and caudal regions. In addition, we studied the modulation of face neuron responses according to the direction of gaze in the observed face, i.e., the modulation of neuronal responses to simulated eye contact directed toward the subject, or to gazes averted away from the subject. This approach was used because the direction of a gaze is an important determinant of the biological significance of an observed face. The results, taken together, might help achieve an understanding of the type of information processing specific to the anterior STS.
| METHODS |
|---|
|
|
|---|
Three adult female macaque monkeys (Macaca fuscata) weighing 4.57.5 kg were used in this experiment. The neuronal activity detected in four hemispheres was recorded. All experimental protocols were approved by the Animal Care and Use Committee of Toyama Medical Pharmaceutical University, and all protocols conformed to the National Institutes of Health guidelines for the human care and use of laboratory animals.
Behavioral task
The monkeys were trained to perform a version of a sequential delayed matching-to-sample task that requires the identification of a face (I-DMS task; Fig. 1 A); this behavioral task was the same as that described in our preceding paper (Eifuku et al. 2004
). In the I-DMS task, a sample (480 ms) stimulus was presented to the animal after fixation on a small point, and test (match or nonmatch, 480 ms) stimuli were subsequently presented to the subject after a period of interstimulus delay (992 ms). Eye position was monitored using the scleral search coil technique, and the size of the eye control window was ±2.0° (Judge et al. 1980
). The sample face was always presented in the frontal view (0°). The match stimuli consisted of images of the faces of people with whom the monkeys were already familiar; these people were laboratory staff involved in the daily care of the subjects (Fig. 1B). Two types of facial stimuli (regular set) were used as the match stimuli in the initial experiments. In the first type (Fig. 1, Ba and Bc), the stimulus was one of seven faces viewed from one of seven different angles (from left to right profile: 90, 45, 22.5, 0, +22.5, +45, and +90°). Furthermore, the direction of the gaze shown on the viewed face matched that in which the face pointed (Face = Gaze). The second type of facial stimuli used for the regular set was employed for the investigation of the effects of the direction of the depicted gaze (Fig. 1, Bb and Bd). In this second type of stimulus, one of five views was used (from left to the right: 45, 22.5, 0, +22.5, and +45°), and the gaze was always projected toward the subject; thus the point of view presented to the subject gave the impression of eye contact with the subject (simulated eye contact). The frontal faces with simulated eye contact (stimuli 4 and 15) were common to the two types of facial stimuli. Each monkey was required to identify the same person who had been shown in the sample; if the test stimulus was a match, the monkey was to push a lever. A number of intervening (nonmatch) stimuli was presented until a match finally appeared (03 intervening stimuli); all correct trials were rewarded by the administration of juice (0.2 ml). Various geometric patterns served as the control stimuli (Fig. 1C). All visual stimuli were presented within the center of the receptive field of each recorded neuron, each of which had been mapped in advance of the experiment. In addition, the stimuli were typically centered on the fixation point, and the size of the images was 1015 x 1015°.
|
Electrophysiological procedures
The procedures used for electrophysiological recording and data analysis have been described in detail in our preceding paper (Eifuku et al. 2004
). In brief, these procedures were carried out as follows.
During the experiment, a grid was placed within the recording cylinders (Crist et al. 1988
) to facilitate the insertion of stainless steel guide tubes through the dura to a depth of
10 mm above the STS. At the beginning of each recording session, the guide tube stylet was removed and an epoxy-coated tungsten microelectrode (FHC, 1.01.5 M
at 1 kHz) was inserted. The electrode was advanced using a stepping microdrive, while neuronal activity was monitored to establish the relative depth of the landmarks, including the layers of gray and white matter, and to determine the properties of the neuronal responses. For all monkeys, we used 3D-MRI rendering to place an electrode into the anterior STS (Asahi et al. 2003
). The positions of the anterior STS and of the recording sites were checked by MRI during the experiment, and these MR images included a marker (tungsten, 500 µm diam); we verified the calculated recording sites with reference to the coordinate of the marker. It was confirmed by 3D-MRI rendering that all of the tracks of the inserted microelectrodes were almost vertical to the orbito-meatal (OM) plane for each monkey.
While conducting the I-DMS task, we recorded neuronal activity from the anterior STS. In this study, we analyzed the neuronal activity of single neurons during a period of 64496 ms after the onset of a match (the lag time of 64 ms was based on the minimum response latency of the neurons). Control firing was measured in the 208-ms period before when the sample was presented to each monkey. Because an overtrained monkey could perform the I-DMS task correctly when test stimuli were presented for <240 ms, it appeared that the entire period of 480 ms might not have been required for solving the task. However, to reduce variation in neuronal activity across different trials, we presented the monkeys with test stimuli for a duration of 480 ms and analyzed the neuronal activity that occurred during the 64- to 496-ms period after the onset of a match.
Histological procedures
After the final recording session, several small marking lesions were created in the anterior STS by passing a 20- to 30-µA anodal current for 40 s through a tungsten microelectrode; this procedure was monitored by MR imaging. Each animal was deeply anesthetized with an overdose of pentobarbital sodium (50 mg/kg, im) and perfused transcardially with heparinized 0.9% saline followed by 10% buffered formalin. The brains were removed and cut into 50-µm coronal sections through the target areas with a freezing microtome. Sections were stained with cresyl violet, and all sites marked by an electrical lesion were carefully verified microscopically. The location of each recording site was calculated by comparing the stereotaxic coordinates of the recording sites with those of the lesions. MR images obtained during the experiment were compared with those showing the marking electrodes to verify the calculated recording sites. The reconstruction of the recording sites based on histological investigation and MRI confirmed that all of the responses of the face neurons used for this analysis were recorded from the anterior STS in the range of 1024 mm anterior to the interaural line; most of these face neurons were located in the lower bank and fundus of the anterior STS.
| RESULTS |
|---|
|
|
|---|
The behavioral performance of the subjects on the I-DMS task was better than 95% correct during the neuronal recording experiments; the analysis of neuronal activity included only the data obtained from correct trials. In our preceding paper (Eifuku et al. 2004
), we behaviorally characterized this I-DMS task. Consistent with the results of that study, the behavioral reaction times during the correct trials in this study ranged between 350 and 680 ms. In addition, behavioral reaction times tended to be proportional to the difference between the sample and match stimuli with respect to the angle of the facial view, as was also observed in our previous report.
A total of 255 neurons exhibited significantly increased activity in response to the presentation of match stimuli during the I-DMS task (visually responsive neurons; paired t-test, P < 0.05); the match stimuli were either faces or geometric patterns, as those shown in Fig. 1, B and C. Neurons that exhibited both significant responses during the match period, as well as significantly larger responses to faces than to geometric patterns, were defined as face neurons in this study; neuronal responses to an optimal face were statistically compared with those to an optimal geometric pattern (t-test, P < 0.05). According to this definition, 152 (59.6%) of 255 visually responsive neurons were classified as face neurons. The selectivity of neuronal responses to faces was further analyzed by two-way ANOVA (factors: facial view and facial identity, i.e., the facial view included 7 views in the 1st set of regular faces, whereas the facial identity included 2 different identities; significance level: P < 0.05). Of these 152 face neurons, 131 (86.2%) exhibited responses to facial view (as opposed to facial identity). Thirty-two (21.1%) and 49 (32.2%) of the 152 face neurons showed significant responses to facial identity (as opposed to facial view), as well as showing significant interactions between facial views and facial identity, respectively. The 131 face neurons showing a significant effect in response to facial views were further analyzed.
In this paper, we focused on neuronal responses to match stimuli. However, the majority of responses of face neurons to the presentation of nonmatch faces did not significantly differ from those to match faces [126 (82.9%), t-test, P > 0.05] when identical faces were used as nonmatch or match stimuli. Only 15 (9.9%) and 11 (7.2%) neurons showed an enhancement (nonmatch > match) or a suppression (nonmatch < match) of response, respectively (t-test, P < 0.05). These results were in contrast to our earlier findings regarding the anterior ITG face neurons (Eifuku et al. 2004
), and they were also in contrast to the results of other previous studies (Miller et al. 1991
, 1993
, 1996
; Riches et al. 1991
) of the anterior ITG.
Optimal facial views for face neuron responses
Figure 2 provides an example of the responses detected in a single face neuron, the activity of which was recorded from a relatively caudal portion of the anterior STS; the rostro-caudal coordinate of this face neuron was 12 mm anterior to the interaural line. Neuronal responses to match faces, when the facial view and the direction of the gaze in the presented face were equivalent, are shown in Fig. 2A. Figure 2B summarizes the facial-view tuning of responses of this particular face neuron. Moreover, in the case of this particular face neuron, the effect of facial views was significant [2-way ANOVA factors: facial view and facial identity; F(6,144) = 30.252, P < 0.0001], whereas neither facial identity [F(1,144) = 1.027, P = 0.3126] nor the interaction between facial view and facial identity [F(6,144) = 1.203, P = 0.3079] was found to be significant. This neuron responded best to profiles within a range of 90 to +90° (Newman-Keuls test, P < 0.05), and the responses showed right-left symmetry across the midline (0°), as if the facial-view tuning curve were the mirror image. There was no difference between neuronal responses to profiles (90 or 90° view) of different facial identities (Fig. 2C; t-test, P > 0.05). It should be noted that the activity of the neuron described in Fig. 2 offers a representative example of a neuron located in the relatively caudal portion of the anterior STS.
|
|
16. According to this grouping, 71 face neurons were included in the rostral group, and 60 neurons were included in the caudal group. It should be noted that the posterior disappearance of the AMTS was used as a marker to distinguish between areas TEp and TEa or between areas CIT and AIT in the inferior temporal gyrus of monkeys (Saleem et al. 2000
We then retrospectively compared the percentages of the optimal facial views for particular face neuron responses in the rostral group and the caudal group. The optimal facial view was the view that elicited the largest magnitude of responses to different facial views. It should be noted that in this analysis, the data obtained for two different identities were averaged. In the pie charts shown in Fig. 3B, it can be seen that the proportion of optimal facial views differed significantly between the rostral and the caudal groups (
2 test, P < 0.01). In the caudal group, the face neurons showed selectivity to all views, including profiles, and there were no significant differences in the proportion of cells tuned to different views (
2 test, P > 0.05). However, in the rostral group, a significant difference was observed in the proportion of cells tuned to different views (
2 test, P < 0.05). The percentage of neurons tuned to oblique views of the face was largest compared with the percentages obtained in response to other views.
All of the face neurons that showed significant effects in response to facial views were used for the analysis summarized in Fig. 3B. Thus face neurons that were broadly tuned to facial views were also included in the sample. We analyzed the relationship between the optimal facial views and selectivity in terms of facial-view tuning. To quantify the selectivity in terms of tuning, we used a selectivity index (SI) defined by the following equation: SI = (Rmax Rmin)/(Rmax + Rmin), where Rmax and Rmin are the maximum and minimum responses in the facial-view tuning curve, respectively. For both values, the control firing value was subtracted from the neuronal firing value obtained in response to a match face. The SI assumes values in the range of 01, where 1 represents the highest selectivity to facial views and 0 the lowest. The scatter plot in Fig. 3C shows the relationship between the optimal facial views and the SIs of all face neurons tested, indicating that the distribution of SIs was independent of the optimal facial view. There were no significant differences in the mean SI between the rostral and caudal populations (t-test; P > 0.05). The mean of the SIs for the rostal population (n = 70) was 0.533 ± 0.029 (SE), whereas that for the caudal population (n = 60) was 0.499 ± 0.031. In Fig. 3D, we provide another comparison of the percentages of the optimal facial view between the rostral group and the caudal group of the anterior STS; only face neurons for which the SI exceeded 0.33 (i.e., Rmax > 2Rmin) are included in this figure. Thus this figure exclusively depicts the behavior of face neurons with a relatively higher level of selectivity in terms of facial views. Again, in the caudal group, the face neurons showed selectivity for all views, including profiles, and no differences were observed with respect to neuronal responses to different views of faces (
2 test, P > 0.05). However, in the rostral group, a significant difference was observed in neuronal responses to different views of faces (
2 test, P < 0.05). The proportion of optimal facial views differed significantly between the rostral and the caudal groups (
2 test, P < 0.01). In addition, as regards optimal facial views, the difference between the rostral and caudal groups of face neurons in the anterior STS was consistently observed in all of the subjects.
Right-left symmetry/asymmetry of face neuron responses
We frequently recorded responses of face neurons, the facial-view tuning of which was right-left symmetrical across the midline (0°; see the example shown in Fig. 2). In Fig. 4, the frequency of right-left symmetry is indicated, together with the frequency of asymmetry among face neuron responses along the rostro-caudal axis in the anterior STS. In this analysis, face neurons that appeared to preferably respond to oblique views or profiles were included (n = 53, rostral; n = 43, caudal), and the magnitude of responses to the first or second type of facial stimuli in the original set were compared. For the quantification of right-left symmetry among face neuron responses, we used a tuning symmetry index (TSI), which was defined as follows: TSI = (Rmax Ropp)/(Rmax + Ropp), where Rmax is the maximum response in the facial-view tuning curve, i.e., the response to the optimal facial view, and Ropp is the response to the facial view shown from a perspective opposing that of the optimal view. For both values, the control firing value was subtracted from the neuronal firing value obtained in response to the match face. The TSI accepts values in the range of 01; when the TSI approached 1, a greater degree of asymmetrical shaping in the facial-view tuning curves was observed, and when the TSI approached 0, a greater degree of symmetrical shaping of the facial-view tuning curves was observed. In the scatter plot shown in Fig. 4A, the relationship between the rostro-caudal coordinates and the TSIs of face neurons in the sample is given; each dot represents an individual neuron in this plot. The TSI and the rostro-caudal coordinate of the face neurons revealed a significant positive correlation (r = 0.319, P < 0.005), indicating that the facial-view tuning curves of the caudal population of face neurons tended to show right-left symmetry, whereas those of the rostral population tended to show right-left asymmetry.
|
0.2. A significant dominance of right-left symmetry among the facial-view tuning curves was shown in the caudal group (binominal test, P < 0.005). However, right-left asymmetry was found to dominate significantly in the rostral group (binominal test, P < 0.001); the smaller number of face neurons in the rostral region showed such symmetrical tuning curves. The proportion of right-left symmetrical responses differed significantly between the rostral and the caudal groups (
2 test, P < 0.001). This trend for right-left asymmetry/symmetry in face neuron responses in the anterior STS was consistently present in all subjects. Modulation of face neuron response according to gaze direction
Figure 5, AC, provides an example of face neurons belonging to the rostral group; the rostro-caudal coordinate of this face neuron was 22 mm anterior to the interaural line. Neuronal responses to all of the 22 match faces in the original set are displayed in Fig. 5A. For this particular face neuron, the effect of facial views was significant [2-way ANOVA, factors: facial view and facial identity, F(6,109) = 8.111, P < 0.0001], but neither the effect of facial identity [F(1,109) = 0.366, P = 0.5465] nor the interaction between facial view and facial identity [F(6,109) = 0.362, P = 0.9013] was significant. Figure 5B summarizes the facial-view tuning of the face neuron depicted in Fig. 5A. The face neuron shown here responded best to a 45° view of the face (Newman-Keuls test, P < 0.05). Two-way ANOVA (factors: facial view and eye contact, the facial view included ±45 and ±22.5° views, whereas the eye contact included two subordinate conditions, i.e., Face = Gaze and simulated eye contact; significance level: P < 0.05) revealed that the effect of facial view [F(3,119) = 11.701, P < 0.0001] and the effect of eye contact [F(1,119) = 8.003, P < 0.01], were significant, but the interaction between facial view and eye contact [F(3,119) = 1.611, P = 0.1903] was not significant for the particular neuron shown in Fig. 5, AC. It should be noted in this context that the face neuron response to the optimal facial view (45°) was significantly facilitated by simulated eye contact (t-test, P < 0.01); we therefore referred to the type of neuron responding to such facilitation as an excitation type of neuron. The best response of this particular face neuron was observed in association with the left oblique view simulating eye contact. There were no significant differences between neuronal responses to the 45° views of two different identities (Fig. 5C; t-test, P > 0.05). The activity of the neuron described in Fig. 5, AC offers a representative example of that recorded in the relatively rostral part of the anterior STS. Other examples of face neurons in the rostral group that were influenced by simulated eye contact are shown in Fig. 5, D and E. The activity of the neuron in Fig. 5D responded best to a 45° view of the face. For this particular neuron, two-way ANOVA (factors: facial view and eye contact) revealed that the effect of facial view [F(3,125) = 3.315, P < 0.05], the effect of eye contact [F(1,125) = 7.868, P < 0.01], and the interaction between facial view and eye contact [F(3,125) = 5.426, P < 0.05] were all significant. Responses to the optimal facial view were significantly suppressed by simulated eye contact (t-test, P < 0.005); we referred to this type of neuron responding to such suppression as an inhibition type of neuron. The neuron in Fig. 5E showed an asymmetrical modulation of responses under the simulated eye-contact condition; significant facilitation of responses occurred in association with right views of the face (t-test, P < 0.05), whereas significant suppression of responses occurred in association with left views (t-test, P < 0.05). We thus referred to this type of neuron showing such facilitation and suppression as an asymmetrical-effect neuron. Moreover, in the case of this particular face neuron, the effect of facial views [2-way ANOVA factors: facial view and eye contact; F(3,121) = 27.819, P < 0.0001] and the interaction between facial view and eye contact [F(3,121) = 5.426, P < 0.005] were significant, but the effect of eye contact was not significant [F(1,121) = 0.126, P = 0.7228].
|
16) groups. It was found that the proportion of the three types of effect in association with eye contact differed significantly between the rostral and the caudal groups (
2 test, P < 0.01); moreover, the effects were more evident in the rostral group than in the caudal group. The difference between the rostral group and the caudal group of face neurons in the anterior STS was consistently observed in all subjects.
|
Influence of averted gaze
In the examinations carried out up to this point, gaze modulation was studied only in cases involving oblique faces such as those depicted in Figs. 5 and 6. However, the presence of gaze modulation still needed to be examined in terms of faces presented from views other than the oblique view. To this end, we carried out additional experiments using an additional set of facial stimuli and additional samples of face neurons in the anterior STS.
In this series of experiments, the faces of two persons (identities) were used as match test stimuli; the facial stimuli consisted of images of 18 faces that differed with respect to facial view or gaze direction, such as those depicted in Fig. 7, AF (hyper set). As shown in Fig. 7, A and D, the stimuli set of gazes was pointed in the same direction as that in which the face was pointed (Face-Gaze); the faces were viewed from five different angles (from left to right profile: 90, 45, 0, +45, and +90°). This stimuli set was similar to that depicted in Fig. 1, Ba and Bc. As shown in Fig. 7, B and E, the stimuli set of faces depicted the gaze as consistently projected toward the subject (simulated eye contact); these faces were viewed from three different angles (from left to right: 45, 0, and +45°). This stimuli set was similar to that depicted in Fig. 1, Bb and Bd. In Fig. 7, C and F, the stimuli set of faces was always pointed toward the subject; under this condition, the direction of the gaze was pointed in three different angles (from left to right: 45, 0, and +45°). We referred to this latter facial stimulus set as the averted gaze condition. The frontal faces with simulated eye contact (stimuli 3 and 12) were common to all three types of facial stimuli.
|
Figure 8Aa summarizes the facial view/gaze direction-tuning of a face neuron in the caudal group (14 mm anterior to the interaural line). The data for two different identities were averaged, because there were no differences between the neuronal responses to these two identities. The effect of simulated eye contact as well as the effect of averted gaze on face neuron responses was further analyzed by two-way ANOVA (factors: view condition and face/gaze condition; the view condition included ±45° views, whereas the face/gaze condition included 3 subordinate conditions, i.e., Face = Gaze, simulated eye contact, and averted gaze; significance level: P < 0.05). In the case of the particular neuron depicted in Fig. 8A, the effect of view [F(1,92) = 58.256, P < 0.0001], the effect of face/gaze [F(2,92) = 155.666, P < 0.0001], and the interaction between the view condition and the face/gaze condition [F(2,92) = 6.168, P < 0.005] were all found to be significant. This neuron responded to the frontally presented faces, regardless of whether they simulated eye contact or showed an averted gaze; moreover, the presentation of averted gazes did not diminish the intensity of responses of this neuron. Responses to the frontal views showing simulated eye contact were compared with those showing an averted gaze in the columnar graph provided in Fig. 8Ab, and the results did not indicate any significant effects associated with gaze (t-test: P > 0.05). However, oblique views of faces with simulated eye contact significantly diminished the magnitude of response (t-test: P < 0.005).
|
In the caudal region, only 5 of 12 neurons tested (41.7%) significantly changed their responses to the averted gaze series of facial stimuli, whereas in the rostral region, 9 of 11 neurons tested (81.8%) changed their responses to this series. The percentage of gaze modulation was larger in the rostral region than in the caudal region (
2 test, P < 0.05).
Figure 8C summarizes the facial view/gaze direction-tuning of yet another face neuron in the rostral group (22 mm anterior to the interaural line). Two-way ANOVA (factors: view and face/gaze) revealed that the effect of the face/gaze condition [F(2,90) = 59.876, P < 0.0001] and the interaction between the view condition and the face/gaze condition [F(2,90) = 179.643, P < 0.0001] were significant, but the effect of view [F(1,90) = 3.035, P = 0.0849] alone was not significant for this particular neuron. This neuron responded to two combinations of facial views and gaze directions; the first was a combination of the frontal view (0°) and the +45° averted gaze, and the other was a combination of the 45° facial view and simulated eye contact. It should be noted that the relative spatial configuration between the facial views and the gaze directions were the same between the two face-gaze configurations. The results suggested that the relative spatial configuration was encoded in the responses of this face neuron. Four of 14 neurons tested (1 of 5 in the caudal region and 3 of 9 in the rostral region) responded to different facial view/gaze direction combinations, the relative spatial configurations of which were the same with regard to the respective facial views and the gaze directions.
| DISCUSSION |
|---|
|
|
|---|
Neuronal responses in the rostral region of the anterior STS
Oblique facial-view neurons (Fig. 5) were typically found among the rostral population of neurons located relatively anterior to the neurons discussed in previous reports (Hasselmo et al. 1989
; Perrett et al. 1982
, 1985
). A major question concerning the processing of facial representations focuses on determining which view, if any, is used as the "canonical view" for faces (Bruce 1988
; Perrett et al. 1991
; Valentin et al. 1997
). The psychological data suggest that the ±45° (3/4) view is advantageous (Bruce et al. 1987
; Fagan 1979
; Krouse 1981
; Logie et al. 1987
), whereas the physiological data thus far suggest that higher numbers of face neurons are tuned to frontal or profile views than to the 3/4 view (Perrett et al. 1985
, 1991
, 1992
). A previous report suggested that the advantage of the 3/4 view, according to physiological results, derived from the partial activation of face neurons tuned to various views. To confirm this explanation, a scheme using a distributed representation was proposed, i.e., the simultaneous partial activation of frontal and profile neurons might mediate the advantage of the 3/4 view (Valentin et al. 1997
). Some reports have suggested the presence of neurons selectively responsive to a particular view across the entire 360° range of views (Perrett et al. 1992
). However, our results indicated simply the dominance of the 3/4 view as the optimal view for face neurons in the rostral region of the anterior STS; in this region, the responses clearly showed the best direction for the identification of a face, i.e., these neurons preferred the 3/4 view. Because the 3/4 view reveals a relatively large number of features, it may prove to be more useful for recognition tasks than either frontal or profile views. Furthermore, provided that the 3/4 view maximizes the number of encoded features, it is more likely to promote an enhanced recognition of faces presented from this perspective. The neuronal preference for 3/4 facial views and the lack of right-left symmetry in the view-tuning curves among the face neuron responses in the rostral region both imply that this region represents either the +3/4 face (right oblique view) or the 3/4 face (left oblique view), and the information stored might be used at later stages for face recognition or identification. Because the proportion of optimal facial views did not differ significantly between the +3/4 (right) and the 3/4 (left) views, it is likely that both facial views were equally represented in the population of face neurons in the rostral regions.
Gaze direction and orientation of the face toward or away from the viewer are important social and communicative signals in both humans and monkeys (Argyle and Cook 1976
). In contrast to the corresponding responses in the caudal region, in the rostral region, the neuronal responses to faces appeared to be frequently modulated according to the direction of the gaze shown in the viewed face. Moreover, simulated eye contact with the subject easily changed the tuning of the face neurons. These results were observed not only in cases involving faces presented in an oblique manner (Fig. 5), but similar results were also obtained when frontal views of the face were used (Fig. 8). Direction of gaze is a determinant of the biological significance of faces as communicative signals. In other words, the biological significance of a face either making eye contact with a subject or avoiding such contact appears to be differentially reflected by the effective modulation of the response of face neurons in the rostral, but not caudal, region. These findings suggest a plausible functional hierarchy in information processing based on the biological significance of faces.
Gaze modulation in some face neurons, whether it is a case of facilitation, as in the examples in Fig. 5, AC, or a case of inhibition, as in the example in Fig. 5D, implies that facial view signals and gaze direction signals are not independent of each other in the rostral region. This line of reasoning suggests that face neuron responses are selective for a particular combination of a particular facial view and a particular direction of the gaze of another individual. However, in the case of other neurons, the curves for facial-view tuning differed from those for gaze direction tuning, as seen in the example in Fig. 5E (asymmetrical effects); facial view signals and gaze direction signals therefore appear to be independent of each other in the case of these neurons. Previous studies of evoked potential using humans have suggested an early separation of face and gaze signals in the processing streams used for face perception (Allison et al. 1999
; McCarthy et al. 1999
; Puce et al. 1999
; Shibata et al. 2002
).
Neuronal responses to a particular facial view/gaze direction combination
Some face neurons, as those depicted in Fig. 5, AC, showed responses to a particular combination of a particular facial view and a particular gaze direction shown the faces presented to the subjects. The responses of the majority of these face neurons did not reflect the relative spatial configuration between the facial views and the gaze directions (Fig. 8B), because the observed neuronal responses differed significantly between the two face-gaze configurations that were in the same relative spatial configuration with respect to the facial views and the gaze directions. However, it was also found that a small number of face neurons (Fig. 8C) were able to encode the relative spatial configuration between the facial views and the gaze directions shown in these faces, because the neuronal responses to the two face-gaze configurations that were in the same relative spatial configuration with respect to face and gaze were similar to each other. Despite the finding that face neurons with sensitivity to the relative spatial configuration were few in number, the results suggested that an "object-centered reference frame" was used in the case of some face neurons in the anterior STS. Similar neuronal responses to three-dimensional objects in the object-centered reference frame have been reported in the IPS (Sakata et al. 1997
), although the afferent connectivity from the IPS to the rostral region of the anterior STS is relatively weak. It is of note that the rostral region of the lower bank and fundus of the anterior STS have a strong reciprocal connection with the area TEav (Barnes and Pandya 1992
; Saleem et al. 2000
; Seltzer and Pandya 1978
, 1984
, 1994
); face neurons in the area TEav have been shown to represent facial identity, which is closely related to the eventual decision regarding the determination of facial identity (Eifuku et al. 2004
). Thus these results imply that the observed combinatorial neuronal responses to facial views and gaze directions might frequently serve as major bottom-up signals in the process of face identification. However, the involvement of the facial area in the anterior STS in determining the identity of observed faces remains under debate (Heywood and Cowey 1992
; see also Eifuku et al. 2004
).
Neuronal responses in the caudal region of the anterior STS
Although previous studies have suggested that many, but not all, face neurons in the anterior STS prefer frontal views and profiles, other reports have indicated that an optimal view might be identified across the entire 360° range of views (Perrett et al. 1985
, 1991
, 1992
). These results were quite similar to those obtained with our sample of caudal neurons (Fig. 2). In addition, in these regions, the neuronal responses of many face neurons were found to be right-left symmetrical. Given that right and left views of faces share many common facial components, and given that the viewing of the components depends on the angle of the view to the midline, these results imply that these responses tended to be related to the spatial layout of facial components that cannot simply be ascribed to the mere visibility or invisibility of particular components (Eifuku et al. 2003
).
In our previous report (Eifuku et al. 2003
), we investigated brain representations of familiar and unfamiliar faces using reaction time (RT) measurements in humans; in that study, we used a pair association paradigm using facial stimuli [pair association task based on an identification (I-PA) task]. In the I-PA task, subjects learned four paired associates in advance of performing the test (prelearning): the pair consisted of a neutral geometric pattern and the face of a person viewed from 45 or +45°. In the test, a stimuli pair was presented to the subject, and the subject was required to judge whether or not the pair was correct. The correct pair was the pattern and the face of the person who was associated with the pattern during prelearning, which could have been one of six images of the person viewed from one of six different angles. While conducting that series, we found that RTs were influenced by prelearning in the case of both unfamiliar and familiar faces, but the shaping of the RT curves differed markedly between cases involving familiar and unfamiliar faces; the RT curve for unfamiliar faces had two volleys, which were right-left symmetrical to the midline (0°), whereas the RT curve for familiar faces had only a single volley, i.e., those showing right-left asymmetry. The results revealed a significant difference between the mental representations of familiar and unfamiliar faces. The results of our previous report are in agreement with those obtained in this study in terms of right-left symmetry and/or asymmetry. The right-left symmetry reflected in the RT curves for unfamiliar faces might have been associated with the behavior of face neuron responses frequently observed in the caudal region of the anterior STS, where the majority of face neuron responses had two peaks that were right-left symmetrical to the midline (0°).
Functional heterogeneity and hierarchical organization within the anterior STS
Previous anatomical studies have indicated that the rostral region of the lower bank and the fundus of the anterior STS have a strong reciprocal connection with area TEav (Barnes and Pandya 1992
; Saleem et al. 2000
; Seltzer and Pandya 1978
, 1984
, 1994
), which also has a strong reciprocal connection with medial limbic structures such as the perirhinal cortex. It is already known that there are only weak connections between the parietal and occipital areas and the rostral region of the anterior STS. On the other hand, the caudal region of the lower bank and fundus of the anterior STS has abundant afferent connections stemming from the intraparietal (area POa), preoccipital (area V4), and posterior parahippocampal areas (areas TH and TF) (Barnes and Pandya 1992
; Saleem et al. 2000
; Seltzer and Pandya 1978
, 1984
, 1994
). Moreover, it was recently reported that the caudal region of the anterior STS has a strong reciprocal connection with the dorsal portion of the anterior inferior temporal gyrus (area TEad), which also has a strong reciprocal connection with the posterior parahippocampal regions (areas TF, TH) (Saleem et al. 2000
). It is thus likely that the anatomical connectivity differs in the rostral and caudal regions of the anterior STS.
We separated two groups of face neurons in the anterior STS based on their rostro-caudal coordinates: the rostral group consisted of neurons that were located in the rostral region of the anterior STS, and the caudal group consisted of neurons that were located in the relatively caudal region of the anterior STS. Three types of functional difference between the rostral group and the caudal group of face neurons were identified in this study. In the caudal group, optimal facial views for the face neuron responses were distributed among all types of views as is shown in Fig. 3. The majority of face neurons in the caudal group responded symmetrically to right and left views, as those seen in mirror images (Fig. 4). In contrast, face neurons in the rostral group responded best to a single oblique view; moreover, right-left symmetry among the responses was less evident in the rostral group, as summarized in Figs. 3 and 4. These results imply the possibility that face neurons in the rostral region represent certain canonical faces, whereas no such specialization was apparent in the caudal region. The modulation of face neuron response by gaze direction was more evident in the rostral group than in the caudal group (Fig. 6). These findings suggest that biological significance is a more important factor for determining the face neuron responses in the rostral region than in the caudal region.
It is well known that in the domain of visual information processing related to "object vision," the inferior temporal cortical areas of primates are hierarchically organized along the rostro-caudal (or antero-posterior) axis. In particular, single neuronal recording studies using monkeys have thus far shown the presence of a functional hierarchy in the inferior temporal gyrus along the rostro-caudal (or antero-posterior) axis. Moreover, recent reports on neural organization in the anterior superior temporal gyrus of rhesus monkeys have indicated a hierarchical organization in the domain of auditory information processing; the rostral region of the left anterior superior temporal gyrus was shown to be selectively activated by species-specific vocalizations (Poremba et al. 2003
, 2004
).
Our results revealed differences between the rostral and caudal regions of the anterior STS in terms of the response characteristics of face neurons, and these results are consistent with previous findings of anatomical connectivity. In addition, these findings were suggestive of a plausible functional hierarchy in the anterior STS, a notion that is also in agreement with the findings of previous reports regarding the organization of the anterior superior/inferior temporal gyri.
| GRANTS |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
|
|
|---|