The Journal of Neurophysiology Vol. 80 No. 1 July 1998, pp. 324-330
Copyright ©1998 by the American Physiological Society
1 RIKEN Brain Science Institute, Wako-shi, Saitama 351-01; 2 Electrotechnical Laboratory, Tsukuba-shi, Ibaraki 305; 3 Department of Physiology II, Faculty of Medicine, Kagoshima University, Kagoshima-shi, Kagoshima 890; and 4 Core Research for Evolutional Science and Technology, Japan Science and Technology Corporation, Wako-shi, Saitama 351-01, Japan
ABSTRACT
Kobatake, Eucaly, Gang Wang, and Keiji Tanaka. Effects of shape-discrimination training on the selectivity of inferotemporal cells in adult monkeys. J. Neurophysiol. 80: 324-330, 1998. Through extensive training, humans can become "visual experts," able to visually distinguish subtle differences among similar objects with greater ease than those who are untrained. To understand the neural mechanisms behind this acquired discrimination ability, adult monkeys were fully trained to discriminate 28 moderately complex shapes. The training effects on the stimulus selectivity of cells in area TE of the inferotemporal cortex were then examined in anesthetized preparations. Area TE represents a later stage of the ventral visual cortical pathway that is known to mediate visual object discrimination and recognition. The recordings from the trained monkeys and untrained controls showed that the proportion of TE cells responsive to some member of the 28 stimuli was significantly greater in the trained monkeys than that in the control monkeys. Cell responses recorded from the trained monkeys were not sharply tuned to single training stimuli, but rather broadly covered several training stimuli. The distances among the training stimuli in the response space spanned by responses of the recorded TE cells were significantly greater in the trained monkeys than those in the control monkeys. The subset of training stimuli to which individual cells responded differed from cell to cell with only partial overlaps, suggesting that the cells responded to features common to several stimuli. These results are consistent with a model in which visual expertise is acquired through the development of differential responses by inferotemporal cells to the images of relevant objects.
The human ability to discriminate between similar object images appears to depend on visual circumstances. It is said that Inuits can discriminate many different kinds of snow, and mounted people can discriminate many different kinds of horses. This capacity also depends on profession. Shepherds can distinguish individual sheep, and neuroscientists can distinguish individual experimental animals. What is the neural basis of this phenomenon?
Training
Four adult macaque monkeys (Macaca fuscata) served as subjects. Two of these received training on a visual recognition task with the 28 shape stimuli shown in Fig. 1. One trained monkey (female) weighed 5.2 kg, and the other (male) weighed 5.5 kg at the beginning of the shape training. At that time their ages were estimated to be between 4 and 5 yr. Each trial of the task began with the presentation of a sample chosen from among the 28 stimuli on a computer display equipped with a touch screen. As soon as the monkey touched the stimulus, it disappeared from the screen. After a delay period, the sample stimulus reappeared with four distracters chosen from the same set. The monkey obtained a drop of juice as reward for touching the sample on the screen. The delay was initially set to 1 s and gradually increased to 16 s. Both the sample and distracters were randomly selected, and the position of the sample among the four distracters was randomized. Training was automated, with the apparatus placed in front of the monkey's home cage for 8 h per day, 6 days a week. The monkey had free access to the apparatus and could perform the task ad libitum. In the final stage of the shape training, the monkeys performed 500 successful trials per day with a success rate of over 75%. We began recordings of cell activity in area TE 3 or 5 mo after the task had been mastered at the longest delay (16 s). We imposed the interval due to the possibility that some cortical reorganization might continue even after the task had been mastered.
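The trial structure described above can be sketched in a few lines. This is only an illustration, not the authors' training software: the names `make_trial`, `N_STIMULI`, and `N_DISTRACTERS` are hypothetical, and the sketch assumes that the four distracters differ from the sample and from each other, which the text does not state explicitly.

```python
import random

N_STIMULI = 28      # size of the training set (from the text)
N_DISTRACTERS = 4   # distracters shown with the sample (from the text)


def make_trial(rng, delay_s=16.0):
    """Build one delayed matching-to-sample trial.

    Assumption: distracters are drawn without replacement from the
    27 stimuli other than the sample.
    """
    sample = rng.randrange(N_STIMULI)
    others = [s for s in range(N_STIMULI) if s != sample]
    distracters = rng.sample(others, N_DISTRACTERS)
    choices = distracters + [sample]
    rng.shuffle(choices)  # randomize the sample's position among the choices
    return {"sample": sample, "delay_s": delay_s, "choices": choices}


trial = make_trial(random.Random(0))
chance_level = 1 / len(trial["choices"])  # 5 alternatives, so 0.2
```

Under these assumptions, guessing would succeed on 20% of trials, which gives a baseline for the reported success rate of over 75%.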
Recording
The monkeys were prepared for repeated recordings with initial aseptic surgeries. Under anesthesia with pentobarbital sodium (35 mg/kg ip, supplemented when necessary by 10 mg/kg), a brass block for head fixation was attached to the top of the skull, two stainless steel screws for electroencephalogram recording were implanted into the skull, the zygoma was removed, and the lateral surface of the skull was exposed and covered with resin for later recording of cell activity. Before the first recording session, eye optics were measured to select appropriate contact lenses, and photographs of the fundus were taken to determine the position of the fovea.
Visual stimulation and procedure on individual cells
To evaluate the magnitudes of the responses to the training stimuli, we used a reference set of 75 object stimuli consisting of animal and plant imitations and laboratory junk objects [see Kobatake and Tanaka (1994)
The data set used in this paper consisted of 131 cells recorded from the 2 trained monkeys and 130 cells recorded from the 3 untrained control monkeys. These cells were selected according to the criteria that 1) at least one stimulus (either a training stimulus or a reference object stimulus) evoke statistically significant responses (P < 0.05); 2) the cells be located in TE; and 3) they be removed by >200 µm from the last studied cell along the penetration.
We trained adult monkeys to discriminate among a class of shape stimuli and found that the proportion of inferotemporal cells responsive to some of the training stimuli in the trained monkeys was greater than that in the control untrained monkeys. Because consistent results were obtained in two trained monkeys and three control monkeys, we take the results as evidence that the proportion of such cells increased in the inferotemporal cortex through the training. Individual cells responded to different subsets of the training stimuli. This change in responsiveness fulfilled the requirements of the task: the development of diverging responsiveness increased the distances among the representations of the training stimuli in the feature space spanned by the responses of the inferotemporal cell population, which in turn made the discrimination easier. Single cells responded to multiple members of the training stimuli, which suggests that the discrimination was based on the activity of the cell population. The increase in distances would contribute to either the passive or active mechanism, which has previously been proposed to solve the delayed matching to sample task (Miller and Desimone 1994
INTRODUCTION
; Logothetis and Sheinberg 1996; Miyashita 1993; Tanaka 1996).
; Desimone et al. 1984; Perrett et al. 1982). It is often suggested that these cells have developed so that the monkey can distinguish individual faces and facial expressions. Indeed, it has been reported that the cells differentially respond to different faces, although their selectivity is broad (Baylis et al. 1985; Yamane et al. 1988; Young and Yamane 1992). These face-selective cells may have developed over generations or through early development. The present study was designed to determine what kind of changes occur in TE of adult monkeys that are extensively trained to discriminate among members of a shape class.
, 1994) and Logothetis et al. (1995) trained adult monkeys to discriminate among fractal patterns or wire-frame objects and found that many inferotemporal cells responded to the learned stimuli after training. We performed the identical recording procedures on a population of cells in both trained and untrained control monkeys, under conditions of anesthesia and separate from training, and found that the proportion of cells responsive to some of the training stimuli in the trained monkeys was greater than that in the control untrained monkeys. Portions of the present results have been previously reported in abstract form (Kobatake et al. 1992, 1993).
METHODS
FIG. 1.
Shown are the 28 shape stimuli that the monkeys were trained to discriminate and the associated responses of 1 TE cell in a trained monkey. These stimuli are referred to as the "training stimuli." The most effective reference-object-stimulus with its evoked response is shown at the top. Responses were averaged over 10 repetitions of the stimulus presentation. Statistically significant responses (P < 0.05) are labeled with their relative response magnitudes. Underlines indicate the duration of stimulus presentation.
but are not explained in this paper because we have only one monkey trained for the discrimination of the color stimuli. The responses to the shape stimuli in the first set of recordings were taken as a part of control data (control 2) for the training with the shape stimuli. The other monkey was trained only with the shape stimuli.
⁻¹·h⁻¹ im), and the anesthesia was maintained by artificial ventilation with a mixture of N₂O and O₂ (70:30). The depth of anesthesia was assessed by monitoring the electrocardiogram and electroencephalogram, with isoflurane added to the gas mixture when necessary. Atropine sulfate (0.5 mg) was subcutaneously administered every 3 h to reduce salivation.
at 1 kHz). The electrodes were advanced from the lateral side through a pinhole made in the dura mater with a needle. The exposed dura mater was covered with paraffin to prevent it from drying and to reduce movements of the brain caused by pulsation and respiration. The position of penetration was determined with reference to a point marked on the resin-coated skull. The hole in the skull was filled with resin after the recording was completed. All recording procedures were conducted under aseptic conditions. Within a few hours after the last injection of muscle relaxant, spontaneous respiration resumed and became normal. The monkey was returned to its home cage after the injection of an antibiotic (Pentcillin, 40 mg/kg im; Sankyo, Tokyo). Recordings were also made from area TEO, and the border between TE and TEO was determined based on the size of the receptive fields (Kobatake and Tanaka 1994). Data from TEO cells were excluded from this paper. The experimental protocol had been approved by the Experimental Animal Committee of the RIKEN Institute. Monkeys were regularly monitored by a veterinarian and cared for in accordance with the Guiding Principles for the Care and Use of Animals in the Field of Physiological Science of the Japanese Physiological Society.

FIG. 2.
Extent of recording sites in the inferotemporal cortex is indicated by the shading on the lateral view of the brain (left) and the ventral half of a frontal section (right). sts, superior temporal sulcus; amts, anterior middle temporal sulcus; rs, rhinal sulcus.
for the list of objects]. Once activity in a cell was isolated, all of the object stimuli in the set were successively hand-presented to the monkey, and the two to four most effective object stimuli were determined by listening to the evoked activity on an audiomonitor. Images of these object stimuli were then taken with a video camera and stored on a computer. The background of the stimuli was filled with a homogeneous gray. Unlike our previous studies (reviewed in Tanaka 1996), in the interest of time, we did not determine which features of the stimulus images were critical for activation. Finally, the images of the object stimuli were presented on the television display in combination with the training stimuli to evaluate the relative magnitude of the responses to the training stimuli.
; Fujita et al. 1992; Ito et al. 1994, 1995; Kobatake and Tanaka 1994; Sheinberg and Logothetis 1997; Tanaka et al. 1991). A fixed set of stimuli of medium size (e.g., 20) might fail to hit the effective stimulus range of many recorded cells. The reference stimuli will work properly only if most cells are activated by at least some of them. The whole set of object images contains a larger set of partial features; therefore the chances are higher that some view of an object will contain the features effective for the activation of a recorded cell. We also considered the possibility that the experimenters inadvertently searched for effective reference stimuli more extensively in cells recorded in the control monkeys. This possibility was dismissed by inspecting the distribution of the magnitude of responses to the selected reference object stimuli between the two groups of cells recorded from the trained and control monkeys (Fig. 4).

FIG. 4.
Distribution of the absolute magnitudes of the responses to the most effective reference-object-stimuli for the 131 cells recorded from the 2 trained monkeys (top) and for the 130 cells recorded from the 3 control monkeys (bottom).
RESULTS
FIG. 3.
Distribution of the normalized magnitude of the individual cells' strongest responses to the training stimuli. The overall distributions for 131 cells recorded from the 2 trained monkeys and for 130 cells recorded from the 3 control monkeys are shown at left, whereas the distributions in individual monkeys are shown at right. The magnitude of the response, after subtracting the spontaneous firing rate, was normalized with respect to the maximal response of the cell (the larger of the strongest response of the cell to the reference object stimuli and the strongest response of the cell to the training stimuli).
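The normalization described in this legend can be illustrated with a short sketch. The function name `normalized_strongest_training_response` and the example firing rates are hypothetical, and the sketch assumes that the spontaneous rate is subtracted from every response before the cell's maximal response is taken.

```python
def normalized_strongest_training_response(training_rates, reference_rates,
                                           spontaneous_rate):
    """Strongest training-stimulus response relative to the cell's maximum.

    Assumption: rates are mean firing rates (spikes/s), and the spontaneous
    rate is subtracted from all of them before normalization.
    """
    training = [r - spontaneous_rate for r in training_rates]
    reference = [r - spontaneous_rate for r in reference_rates]
    best_training = max(training)
    # Maximal response of the cell: the larger of the strongest
    # reference-object response and the strongest training response.
    max_response = max(best_training, max(reference))
    if max_response <= 0:
        raise ValueError("cell has no excitatory response to normalize by")
    return best_training / max_response


# Hypothetical cell: spontaneous rate 10, best training response 30,
# best reference-object response 50 (all in spikes/s).
value = normalized_strongest_training_response([12, 30, 8], [50, 20], 10)
```

In this made-up example the strongest net training response (20 spikes/s) is half of the strongest net reference response (40 spikes/s), so the normalized value is 0.5.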

FIG. 5.
Distributions of the absolute magnitudes of the strongest responses to the training stimuli for the 131 cells recorded from the 2 trained monkeys (top) and for the 130 cells recorded from the 3 control monkeys (middle), and the difference between them (bottom). Note that the ordinate scale on the bottom is twice that in the top and middle.

FIG. 6.
Comparison of the distribution of response magnitudes between responses of the same rank-order in individual cells recorded from the trained (filled bars) and control (open bars) monkeys.
; Young and Yamane 1992). We let the responses of one cell represent one dimension of the space. The number of dimensions was thus equal to the number of cells, and one training stimulus was represented by one point in this space. The distance between two stimuli in this space was calculated by 1) taking the difference between the responses of one particular cell to the two stimuli, 2) squaring the difference, 3) summing the squared values across cells, and 4) taking the square root of the sum. To compare the distances between the trained and control monkeys, from which different numbers of cells were recorded, the distances were normalized by the square root of the number of cells (√131 in the trained monkeys and √130 in the control monkeys). The distances in the trained monkeys were significantly larger than those in the control monkeys in both the space spanned by the relative responses normalized by the maximal responses of individual cells (Fig. 7, left; P < 0.001 with K-S test) and that spanned by the absolute magnitudes of the responses (Fig. 7, right; P < 0.001 with K-S test). These larger distances could make the discrimination among the training stimuli easier, and thus likely underlay the learned discrimination in the trained monkeys.
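The four-step distance calculation and its normalization by the square root of the number of cells can be written compactly. The following is a minimal sketch, not the authors' analysis code: the function names and the toy response matrix are hypothetical, and `responses[c][s]` is assumed to hold the response of cell `c` to stimulus `s`.

```python
import math
from itertools import combinations


def stimulus_distance(responses, i, j):
    """Normalized Euclidean distance between stimuli i and j.

    responses[c][s] is the response of cell c to stimulus s (assumed layout).
    """
    n_cells = len(responses)
    # Steps 1-3: per-cell difference, squared, summed across cells.
    sum_sq = sum((cell[i] - cell[j]) ** 2 for cell in responses)
    # Step 4 plus normalization by the square root of the cell count.
    return math.sqrt(sum_sq) / math.sqrt(n_cells)


def all_pairwise_distances(responses):
    """Distances for every unordered pair of stimuli."""
    n_stimuli = len(responses[0])
    return {(i, j): stimulus_distance(responses, i, j)
            for i, j in combinations(range(n_stimuli), 2)}


# Toy data: 2 cells x 3 stimuli (made-up values, not from the paper).
toy = [
    [1.0, 0.0, 0.5],
    [0.0, 1.0, 0.5],
]
distances = all_pairwise_distances(toy)
```

With 28 training stimuli there are 378 unordered stimulus pairs, so each distribution compared between the trained and control groups would contain 378 distances.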

FIG. 7.
Distributions of the distances between 2 training stimuli in the space spanned by responses of recorded TE cells. Top: distances calculated for the 131 cells recorded from the trained monkeys. Bottom: distances calculated for the 130 cells recorded from the control monkeys. The distances were calculated from the responses normalized by the maximum responses of individual cells on the left, and from the absolute magnitudes of the responses on the right. The distance between 2 stimuli was calculated as the square root of [the sum across cells of (the squared difference between the responses of 1 cell to the 2 stimuli)], divided by the square root of the number of cells.

FIG. 8.
Independence of responsivity to different training stimuli among cells recorded in the trained monkeys. A: 2 examples of the scatter diagrams showing the correlation between the responses of 2 cells to the 28 training stimuli. The x-value of an individual dot represents the magnitude of the response elicited by 1 training stimulus in 1 cell of the pair, and the y-value of the dot represents the magnitude of the response elicited by the same stimulus in the other cell of the pair. There are 28 dots corresponding to the 28 training stimuli. The values of r represent Pearson's correlation coefficient for the distribution. B: distribution of Pearson's correlation coefficients among 309 cell pairs in which at least 1 training stimulus evoked responses exceeding 50% of either cell's maximal response.
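The pairwise correlation analysis in this legend can be sketched as follows. This is an illustration only: the names `pearson_r` and `pairwise_correlations` are hypothetical, each cell is assumed to be a list of its responses to the training stimuli in a fixed order, and the criterion used to select the 309 pairs is not implemented here.

```python
import math
from itertools import combinations


def pearson_r(x, y):
    """Pearson's correlation coefficient between two response profiles."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("profiles must have equal length of at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return sxy / math.sqrt(sxx * syy)


def pairwise_correlations(cells):
    """One coefficient per unordered pair of cells.

    cells[c] lists the responses of cell c to the training stimuli, in the
    same stimulus order for every cell (assumed layout).
    """
    return [pearson_r(a, b) for a, b in combinations(cells, 2)]


# Toy profiles over 3 stimuli (made-up values, not from the paper).
toy_cells = [[1.0, 2.0, 3.0], [2.0, 4.0, 6.0], [3.0, 2.0, 1.0]]
coefficients = pairwise_correlations(toy_cells)
```

Coefficients near zero across many pairs would indicate that cells respond to largely independent subsets of the training stimuli, which is the property panel B summarizes.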
DISCUSSION
; Miller et al. 1991).
, 1994) trained adult monkeys to discriminate among many fractal patterns and found that many inferotemporal cells responded to the learned patterns. Logothetis et al. (1995) trained adult monkeys to discriminate many wire-frame objects from each other and found many inferotemporal cells responding to the images of the learned objects. The present results are consistent with these previous findings; quantitatively, the finding that 25% of inferotemporal cells responded maximally to the learned stimuli agrees well with the result of Logothetis et al. (1995) that 28.5% of inferotemporal cells responded to some of the learned object stimuli more strongly than the control stimuli. A unique contribution of the present study is the demonstration that training increases the proportion of inferotemporal cells that respond to particular stimuli as measured against untrained controls. Because the present results were obtained in anesthetized preparations, and cells in the perirhinal cortex, which is one step downstream from TE, scarcely respond to visual stimuli in anesthetized preparation (H. Tamura and K. Tanaka, unpublished observation), the changes in selectivity were likely due to changes in the neuronal network up to TE. Vogels and Orban (1994) trained adult monkeys to discriminate between gratings of a limited range of orientations but did not find an increase in inferotemporal cells responsive to the range of orientations used in training despite an improvement in the monkeys' discrimination performance for the trained range of orientations. As Vogels and Orban (1994) argued, it is likely that the change of responsiveness in the inferotemporal cortex occurs only in the domain of features more complex than the orientation of gratings.
. The task in which the monkeys were trained used a fixed set of 28 shapes and thus did not require generalization. Nevertheless, the ability to generalize must certainly hold selective advantage in nature and may therefore constitute a cortical operating principle that is always in effect. The present results are more consistent with the hypothesis that inferotemporal cells develop responses to the learned stimuli by coding partial features, or aspects, common to multiple exemplars. (Note that in referring to "partial features" we do not exclude holistic features.)
; Miller et al. 1991; Rolls et al. 1989).
found that TEO and V4 contain cells that selectively respond to complex features, although the proportion of such cells is smaller in these earlier areas than in TE. It is possible that the increase in cells responsive to the training stimuli might be observable in these earlier areas. The change might first occur in TE and later pervade the earlier areas. It is also possible that changes first occurred in areas downstream from TE, e.g., the perirhinal cortex, and were then transferred to TE.
, 1979). Although specific experiments might rule out this possibility, it is not very likely that passive exposure could effect the changes we observed in the selectivity of inferotemporal cells.
ACKNOWLEDGEMENTS
This work was supported by the Frontier Research Program of the Institute of Physical and Chemical Research.
FOOTNOTES
Address for reprint requests: K. Tanaka, Laboratory for Cognitive Brain Mapping, RIKEN Brain Science Institute, 2-1 Hirosawa, Wako-shi, Saitama 351-0198, Japan.
Received 25 August 1997; accepted in final form 6 April 1998.