|
|
||||||||
The Journal of Neurophysiology Vol. 85 No. 5 May 2001, pp. 2245-2266
Copyright ©2001 by the American Physiological Society
1Neuroscience Section, Electrotechnical Laboratory, Ibaraki 305, Japan; 2Laboratory of Sensorimotor Research, National Eye Institute, National Institutes of Health, Bethesda, Maryland 20892; and 3Dipartimento di Elettronica, Elettrotecnica ed Informatica, Universitá degli Studi di Trieste, 34100 Trieste, Italy
| |
ABSTRACT |
|---|
|
|
|---|
Takemura, A., Y. Inoue, K. Kawano, C. Quaia, and F. A. Miles. Single-Unit Activity in Cortical Area MST Associated With Disparity-Vergence Eye Movements: Evidence for Population Coding. J. Neurophysiol. 85: 2245-2266, 2001. Single-unit discharges were recorded in the medial superior temporal area (MST) of five behaving monkeys. Brief (230-ms) horizontal disparity steps were applied to large correlated or anticorrelated random-dot patterns (in which the dots had the same or opposite contrast, respectively, at the two eyes), eliciting vergence eye movements at short latencies [65.8 ± 4.5 (SD) ms]. Disparity tuning curves, describing the dependence of the initial vergence responses (measured over the period 50-110 ms after the step) on the magnitude of the steps, resembled the derivative of a Gaussian, the curves obtained with correlated and anticorrelated patterns having opposite sign. Cells with disparity-related activity were isolated using correlated stimuli, and disparity tuning curves describing the dependence of these initial neuronal responses (measured over the period of 40-100 ms) on the magnitude of the disparity step were constructed (n = 102 cells). Using objective criteria and the fuzzy c-means clustering algorithm, disparity tuning curves were sorted into four groups based on their shapes. A post hoc comparison indicated that these four groups had features in common with four of the classes of disparity-selective neurons in striate cortex, but three of the four groups appeared to be part of a continuum. Most of the data were obtained from two monkeys, and when the disparity tuning curves of all the individual neurons recorded from either monkey were summed together, they fitted the disparity tuning curve for that same animal's vergence responses remarkably well (r2: 0.93, 0.98). Fifty-six of the neurons recorded from these two monkeys were also tested with anticorrelated patterns, and all showed significant modulation of their activity (P < 0.005, 1-way ANOVA). Further, when all of the disparity tuning curves obtained with these patterns from either monkey were summed together, they too fitted the disparity tuning curve for that same animal's vergence responses very well (r2: 0.95, 0.96). Indeed, the summed activity even reproduced idiosyncratic differences in the vergence responses of the two monkeys. Based on these and other observations on the temporal coding of events, we hypothesize that the magnitude, direction, and time course of the initial vergence velocity responses associated with disparity steps applied to large patterns are all encoded in the summed activity of the disparity-sensitive cells in MST. Latency data suggest that this activity in MST occurs early enough to play an active role in the generation of vergence eye movements at short latencies.
| |
INTRODUCTION |
|---|
|
|
|---|
Vergence eye movements function to align both
eyes on the same object and facilitate the binocular fusion of visual
images. An important cue in this process is the slight difference in
the locations of the two retinal images that arises from the slight difference in the viewpoints of the two eyes: binocular disparity. Most
studies of disparity-induced vergence have examined the transfer of
fixation between small targets presented at different distances and
have reported latencies ranging from 150 to 200 ms in humans (Jones 1980
; Mitchell 1970
;
Rashbass and Westheimer 1961
; Westheimer and
Mitchell 1956
) and from 135 to 177 ms in monkeys
(Cumming and Judge 1986
). However, it has been shown
that step changes in the horizontal disparity of a large pattern result
in machine-like vergence responses with very short latencies: <60 ms
in monkeys and <0.80 ms in humans (Busettini et al.
1996
; Masson et al. 1997
). It has been suggested
that these disparity vergence responses are important for the rapid
automatic correction of residual (i.e., small) vergence errors
(Busettini et al. 1996
). In line with this suggestion,
these short-latency vergence responses have disparity tuning curves
that resemble the derivative of a Gaussian, are well fit by odd (sine
phase) Gabor functions and have a roughly linear servo region that
extends only a degree or so on either side of zero disparity
(Masson et al. 1997
). It has recently been reported
(Masson et al. 1997
) that horizontal disparity steps applied to dense anticorrelated random-dot patterns (in which the
patterns seen by the two eyes have opposite contrast so that each black
dot in one eye is matched to a white dot in the other eye) also elicit
short-latency vergence responses that are very similar to those
observed with normal correlated patterns except that they are in the
opposite direction. In line with the earlier observations of
Cogan et al. (1993)
, Masson et al. also showed that all
of their human subjects were able to discriminate between 1.2°
crossed and uncrossed disparity steps applied to dense correlated patterns, but none was able to make these discriminations with dense
anticorrelated patterns, which evoked strong binocular rivalry. Cumming and Parker (1997)
have reported that monkeys too
can make such discriminations with correlated patterns but not with
anticorrelated patterns. These anticorrelated patterns therefore
provide an interesting disparity stimulus in that they can support
vergence eye movements but not depth perception.
It has often been suggested that disparity-induced vergence utilizes
disparity-selective neurons to sense vergence errors, and such neurons
have been recorded in various regions of the monkey cortex, including
striate and extrastriate visual areas (Burkhalter and Van Essen
1986
; Cumming and Parker 1999
, 2000
; Felleman and Van Essen 1987
; Hubel and
Livingstone 1987
; Hubel and Wiesel 1970
;
Poggio and Fischer 1977
; Poggio and Talbot
1981
; Poggio et al. 1988
; Prince et al.
2000
; Smith et al. 1997
; Trotter et al.
1996
), as well as the middle temporal area (MT) (Bradley and Andersen 1998
; Bradley et al. 1995
;
DeAngelis and Newsome 1999
; DeAngelis et al.
1998
; Maunsell and Van Essen 1983a
), MST (Eifuku and Wurtz 1999
; Roy et al. 1992
),
the posterior parietal area (Sakata et al. 1983
), the
lateral bank of the intraparietal sulcus (LIP) (Gnadt and Mays
1995
), and the frontal eye fields (FEF) (Ferraina et al.
2000
; Gamlin et al. 1996
). Most of the earlier
studies grouped cells according to the shapes of their disparity tuning
curves using the classification scheme of Poggio and Fischer
(1977)
, which recognized two general groupings of disparity-selective neurons, later termed "tuned" and
"reciprocal" by Poggio et al. (1988)
. Tuned neurons
had narrow tuning curves with either a peak ("tuned-excitatory"
neurons) or a trough ("tuned-inhibitory" neurons) centered on the
plane of fixation. Reciprocal neurons had asymmetric tuning curves and
responded to a broad range of disparities in front ("near" neurons)
or behind ("far" neurons) the plane of fixation. Subsequently,
Poggio et al. (1988)
recognized two additional tuned
groups with peaks in front ("tuned near" neurons) or behind
("tuned far" neurons) the plane of fixation and this led to the
renaming of the "tuned excitatory neurons" as "tuned zero
neurons." Cumming and Parker (1997)
recently showed that most disparity-selective neurons in cortical area V1 of monkeys also respond to the disparity of anticorrelated random-dot patterns, often with the opposite sign, e.g., neurons that had "tuned
excitatory" disparity tuning curves with correlated patterns had
"tuned inhibitory" tuning curves with anticorrelated patterns. Such
responses are in line with the suggestion that these neurons act as
purely local filters (Cumming and Parker 1997
, 2000
;
Nomura et al. 1990
; Ohzawa 1998
;
Ohzawa et al. 1990
).
Our purpose in the present study was to see if there are neurons in
area MST of the macaque monkey that have the necessary properties to
initiate the short-latency disparity-vergence responses described by
Busettini et al. (1996)
and Masson et al.
(1997)
. A strong motivating factor came from preliminary
evidence that bilateral lesions in MST cause a significant reduction in
these vergence responses (Takemura et al. 1999
, 2000
).
Our study was restricted to the initial (open-loop) neuronal responses
that occur before the associated vergence responses have had time to modify the central neuronal responses via the disparity feedback loop.
We here report that MST neurons can be activated by horizontal disparity steps applied to either correlated or anticorrelated random-dot patterns at latencies that are probably short enough for
these neurons to have a causal role in producing even the earliest
vergence responses. However, comparatively few of these neurons had
disparity tuning curves that resembled the tuning curves for vergence
and, when sorted using a fuzzy clustering algorithm, the curves fell
into four major groups, corresponding roughly to classes of
disparity-selective neurons described in the visual cortex
(Poggio et al. 1988
). Thus qualitatively at least, most
individual tuning curves resembled those at earlier, overtly sensory,
stages. Interestingly, when these curves were simply summed together,
they fitted the tuning curves for the vergence responses elicited by
both correlated and anticorrelated stimuli, indicating that the
associated motor responses are encoded at the population level.
Nonetheless, using a genetic algorithm, it was possible to identify
subsets of neurons whose tuning curves, when summed together, gave an
even better fit to the vergence tuning curves, though these subsets
invariably included cells from all four groups. Additional analyses of
the spike trains elicited by disparity steps revealed considerable
variation across cells in the latency, amplitude, and time course of
the changes in discharge rate. When all of the spike trains elicited by
a given disparity step were summed together to give an average
discharge profile for the whole population of cells, many were rather
noisy, but others that were less so matched the temporal profile of the motor response, vergence velocity, quite well. Based on these findings,
we hypothesize that the disparity-sensitive cells in MST each encode
only some aspect(s) of the sensory input and/or motor output, but that
the population of cells as a whole encodes the complete motor output
(vergence velocity).
Preliminary results have been presented elsewhere (Takemura et
al. 1997
, 1998
, 1999
).
| |
METHODS |
|---|
|
|
|---|
We recorded single unit activity in the MST area of five
adolescent Japanese monkeys (Macaca fuscata, 6-8 kg) in
response to disparity steps applied to large random-dot patterns. Prior to any surgery, animals were trained to fixate small target spots on a
tangent screen for a liquid reward using the dimming task of
Wurtz (1969)
. After the completion of training, animals
were anesthetized with pentobarbital sodium for the surgical
implantation (under aseptic conditions) of 1) a
pedestal, secured to the skull to permit the head to be fixed in the
standard stereotaxic position during the experiments, 2) a
cylinder, secured to the skull over the superior temporal sulcus for
the chronic recording of single neuron activity, and 3)
scleral search coils, around both eyes for recording eye movements
(Judge et al. 1980
). All experimental procedures were
approved by the Electrotechnical Laboratory Animal Care and Use
Committee and have been fully described elsewhere (Kawano et al.
1994
).
Disparity stimuli
During recording sessions, which were each several hours long, the animal sat in a primate chair with the head secured and faced a translucent tangent screen onto which two identical random-dot patterns were back-projected. The screen was 50 cm in front of the eyes and subtended 90° along the vertical and horizontal meridia. Orthogonal polarizing filters in the two projection paths and in front of each eye ensured that each pattern was visible to only one eye. Mirror galvanometers in the two light paths were used to control the horizontal positions of the two images (binocular disparity). Using fluid reinforcement, animals were rewarded for fixating stationary red target spots, which were projected onto the patterns on the screen. Binocular disparity stimuli were applied to random-dot patterns that were either exactly matched at the two eyes (standard correlated patterns) or of opposite contrast (anticorrelated patterns).
Standard paradigm using correlated patterns.
At the beginning of each trial, the patterns seen by the two eyes were
identical, overlapped exactly (zero binocular disparity), and filled
the screen. The patterns consisted of white dots on a black background
(dots had a luminance of 0.8 cd/m2, a diameter of
1.5° and covered 50% of the image space). Horizontal disparity steps
(crossed and uncrossed, ranging in amplitude from 0.5 to 6.0°) were
applied by displacing the two images equally in opposite directions.
Because previous studies had shown that the vergence responses were
subject to transient postsaccadic enhancement (Busettini et al.
1996
), these steps were applied 50 ms after 10° leftward
centering saccades guided by target spots projected onto the screen.
The experimental situation was the same as that used by
Busettini et al. (1996)
in all essentials. Because we
were interested only in the initial vergence responses, the disparity
steps lasted only 230 ms, and, if there were no saccades during this
time, then the data were stored on a hard disk and the animal was given
a drop of water; otherwise, the trial was aborted and fluid was
withheld. At this point, both images were blanked for 500 ms by
mechanical shutters and then reappeared once more for the start of the
next trial. Note that all experiments included control trials in which
no steps were applied (saccade-only trials). By applying the disparity
steps in the immediate wake of centering saccades, we ensured that the animal was alert during the steps, the stimulus pattern was always centered on the retina at the onset of the steps, and the vergence responses were subject to postsaccadic enhancement.
Paradigm using anticorrelated patterns.
For this paradigm, the right eye saw the same white dots on a black
background as when the binocular patterns were correlated, and the left
eye saw a matching negative image (black dots on a white background).
However, trials started with the screen a featureless gray (with the
same space-averaged luminance as for the patterns), until 50 ms after a
10° leftward centering saccade (again, guided by projected target
spots), at which time the anticorrelated random-dot patterns suddenly
appeared with a fixed horizontal disparity. Here too the disparity
stimuli were presented only briefly (230 ms) before the screen was
blanked, ending the trial. This procedure for applying disparate
anticorrelated stimuli was the same as that used by Masson et
al. (1997)
and was necessitated because disparity steps applied
directly to anticorrelated patterns at best elicit only weak vergence
eye movements (unpublished observations).
Data collection
The techniques for recording unit activity in MST were the same
as previously described (Kawano et al. 1992
,
1994
) and will only be given in brief here. A hydraulic
microdrive (Narishige Mo-9) was mounted on the recording cylinder, and
glass-coated tungsten microelectrodes were used for the initial
identification and mapping of the MT/MST region. Subsequently, a fixed
grid system (Crist et al. 1988
) was used to position a
guide tube through which a flexible tungsten microelectrode was
introduced into the MST area for single-cell recordings. The tip of the
guide tube was positioned 3-5 mm above the MST. Neuronal activity was
recorded using standard extracellular techniques. Spikes were detected with a time-amplitude window discriminator with a resolution of 1 ms.
We selectively isolated neurons whose discharge was modulated by
disparity steps applied to correlated patterns, and only after obtaining
40 samples of the responses to the complete set of disparity steps did we record responses to similar "steps" applied to anticorrelated patterns. Eye velocity signals were sampled at 500 Hz
and all other analog signals at 250 Hz. All data were transferred to a
work station (SunSparc) for quantitative analysis.
Data analysis
Horizontal vergence was computed by subtracting the horizontal
position signal for the right eye from that of the left eye. We used
the convention that rightward positions and velocities are positive,
hence, crossed disparities and convergence were also positive. The
latency of the vergence eye movements (and associated neuronal
responses) was taken to be the time when the mean vergence acceleration
(and the mean discharge rate) first exceeded the baseline noise by 2 SD. Vergence responses were quantified by measuring the change in
vergence position over the 60-ms time period beginning 50 ms after
stimulus onset, and disparity tuning curves were constructed by
plotting these measures against the amplitude of the disparity step. To
quantify the neuronal responses, we first constructed peristimulus time
histograms (binwidth, 1 ms) from the responses to multiple
presentations of each disparity step, computing a spike density
function by convolving each spike with a Gaussian pulse whose standard
sigma was 3 ms (MacPherson and Aldridge 1979
;
Richmond et al. 1987
). These histograms were then used
to compute the (average) instantaneous discharge frequency over time.
The mean discharge frequency over the 60-ms period starting 40 ms after
stimulus onset was computed for the responses to each disparity step,
and these values were then plotted against the amplitude of the step:
these plots will be referred to as disparity tuning curves. All data
shown in the figures, including both eye movements and neuronal
discharges, have had the responses on saccade-only trials (i.e., no
disparity step applied) subtracted to eliminate any postsaccadic
vergence drift and postsaccadic neuronal activity. This has the effect
of forcing all the disparity tuning curves through the origin.
Fuzzy clustering analysis.
It has been usual to group disparity-selective neurons according to the
shape of their disparity tuning curve using a classification scheme
described by Poggio and coworkers (Poggio and Fischer
1977
; Poggio et al. 1988
). This scheme has been
very successful but relies on a subjective assessment. In an effort to
classify the cells according to the shape of their tuning curve in an
objective manner, we turned to clustering methods. These methods
attempt to achieve an optimal partitioning of the data into groups.
When visual inspection of the data reveals the presence of relatively tight and separate clusters, classic clustering algorithms successfully assign each datum to one of several discrete groups. But when a human
observer is uncertain about the separation between groups, as was the
case with our data set, these algorithms may not find any structure in
the data. In this case, one can try fuzzy clustering algorithms, which
retain much more information about the distribution of the data being
clustered. We implemented the fuzzy c-means clustering algorithm of
Bezdek (1981)
, and the technical details are given in
APPENDIX A. Here, we provide only a brief outline of the
general operations performed.
Genetic algorithm. In examining the correlation between the single-unit activity and the vergence eye movements elicited by disparity steps, we attempted to identify the subset of cells whose disparity tuning curves when summed together best matched the disparity tuning curve for vergence. However, if the number of units recorded in a given animal is n, then 2n subsets are possible. In the present study, for example, we obtained disparity tuning curves for 49 units from one animal, and we estimate that our current computers would require hundreds of years to examine all 249 subsets. After reviewing the options, we decided to use a genetic algorithm (GA). The rationale behind this choice is that for problems in which there is a large number of binary variables (as here), GAs are known to outperform all other methods (computation time being equal): unlike other optimization algorithms, GAs sample several regions of the solution space in parallel. The technical details are given in APPENDIX B and only a brief outline of the general operations performed is provided here.
The data from each animal were treated separately because of small differences in the shapes of their disparity tuning curves for vergence. In our implementation of the GA algorithm, each "chromosome" in effect represents one of the 2n subsets of tuning curves. We started with 5,000 chromosomes, each having the same complement of n "genes," one for each of the n units recorded in the animal under scrutiny. All genes were randomly assigned a value of either 1, indicating that they made a contribution ("were expressed"), or 0, indicating that they made no contribution ("were not expressed"). For each chromosome, the disparity tuning curves of the units/genes that had been assigned a value of 1 were summed together and then fitted to the disparity tuning curve for the vergence response of that monkey. The task of the GA was to then "evolve" a chromosome (or chromosomes) whose subset of "expressed" tuning curves when summed together best fit the vergence data. The mean squared error (MSE) was used to assess the goodness of fit, and a histogram showing the distribution of MSEs among the first generation of 5,000 chromosomes invariably indicated a wide range of values. New generations of chromosomes (each having 5,000 chromosomes) with progressively smaller MSEs were created using a set of standard evolutionary rules. The algorithm ran for 50 generations, during which the minimum MSE for the population gradually diminished, usually stabilizing after ~30 generations. When the last (50th) generation was reached, the vast majority of the "chromosomes" had the same string of "expressed genes," and further evolution was virtually impossible. This surviving string of expressed genes represents the algorithm's estimate of the subset of units whose summed disparity tuning curves best correlate with the disparity tuning curve for vergence.Histology
At the conclusion of recordings in a given monkey, that animal
was deeply anesthetized with pentobarbital and perfused through the
heart with saline followed by 10% Formalin. The animal's brain was
removed, and frozen sections were cut at 50 µm in the sagittal plane,
mounted on microscope slides, and stained with cresyl violet for cell
bodies and with a modified silver stain (Gallyas 1979
) for myelinated fibers. Electrolytic lesions facilitated histological reconstruction of the electrode tracks, and recording sites were verified using both Nissl and myelination. Sample electrode tracks were
verified to pass through the MST using X-rays, and neurons were assumed
to be in the MST based on their physiological characteristics (such as
preferred speed and receptive field size).
| |
RESULTS |
|---|
|
|
|---|
Initial vergence responses to disparity steps (correlated patterns)
Horizontal disparity steps applied to large-field, correlated,
random-dot patterns elicited vergence eye movements at short latency
that were, in all essentials, like those described by Busettini
et al. (1996)
and Masson et al. (1997)
. Latency
estimates for the five monkeys, based on the responses to the optimal
disparity steps, were 73 ms (monkey H, +2°), 68 ms
(monkey L, +2°), 65 ms (monkey M,
2°), 58 ms (monkey N, +1°), and 59 ms (monkey Q,
+1°).1 Although
no attempt was made to obtain formal estimates of latencies to
nonoptimal stimuli, the vergence velocity temporal profiles suggested
that for a given animal, latency was largely independent of the
stimulus amplitude: see, for example, the sample profiles from
monkey H in Fig. 1, those in
A showing responses to crossed disparity steps, and those in
B responses to uncrossed disparity steps, the stimuli
ranging in amplitude from 0.5 to 6°. The profiles in Fig.
1A reach a peak and then decline, often before the closing of the disparity feedback loop (twice the response latency), whereas the profiles in Fig. 1B have a more varied time course and
some fail to reach a peak within the time window shown (150 ms after the disparity step). The initial responses to small (<2°) steps were
always compensatory in that they operated to reduce the seen disparity:
small crossed steps elicited increases in the vergence angle and small
uncrossed steps the converse. Responses were generally maximal with
steps of 1-3° and declined with larger steps, sometimes showing
reversal (Fig. 1B). These trends are evident from the disparity tuning curves in Fig. 1C, in which the change in
vergence position (measured over the time period of 50-110 ms) is
plotted against the amplitude of the disparity step. These plots
indicate that the system showed appropriate servo-like behavior only
for small disparity steps. That is, increases in the input resulted in
roughly proportional increases in the output (in the compensatory direction) only for steps of less than a degree or so. (Note that the
amplitude and direction of the responses of any given monkey to the
largest steps were generally independent of whether the steps were
crossed or uncrossed and showed considerable variation from one animal
to another.) The disparity tuning curves resembled the derivative of a
Gaussian and were well fit by a Gabor function: the parameters for the
least squares best fits are listed for all five monkeys in Table
1. To evaluate the goodness of the fit,
we computed the fraction of the disparity-induced variation in the data
accounted for by the fit, r2,
as previously done by others (e.g., Cumming and Parker
1999
). We found that r2
ranged from 0.88 to 0.99, indicating that 88-99% of the variation in
the data were captured by the fit. (Note that the Gabor functions for
monkeys N and Q are plotted as the continuous
lines in Fig. 6, A and B.)
|
|
Initial neuronal responses to disparity steps (correlated patterns)
We recorded the activity of 586 MST neurons in seven hemispheres
of five monkeys while horizontal disparity steps were applied to
correlated large-field random-dot patterns. About 20% of the neurons
(122) responded to disparity steps at short latency, and the changes in
discharge rate ranged from very transient
some lasting little more
than 20 ms
to tonic. This is evident from the sample responses in Fig.
2, which shows the discharge rate temporal profiles of 20 units recorded from monkey N in
response to 2° crossed-disparity steps. If a neuron has a response
latency of Ln ms, and the associated
vergence has a latency of Lv ms, the
earliest time at which the neuronal response could be affected by the
decrease in retinal disparity secondary to the vergence response would
be Ln + Lv ms (after the disparity step). The
estimated latency of the vergence response (see METHODS) in
Fig. 2 was 59 ms (indicated by the dashed vertical line), which
means that only neurons with a latency of <51 ms could have been
affected by the closing of the disparity-feedback loop within the time
window shown (110 ms after the disparity step). Seven of the neurons in
Fig. 2 had a short enough latency to have been so affected (see *), but
even in those cases, the loop wouldn't have closed until long after
the neurons' initial burst of activity had ended. This is apparent
from the estimated time of closure of the disparity feedback loop for
those seven cells (indicated by
on the relevant traces in Fig. 2).
A similar scrutiny of the entire population of cells indicated that the
discharges of ~60% had a phasic component, which in every case was
independent of the closure of the disparity feedback loop. The
histograms in Fig. 3 show the
distributions of the latency estimates for the neurons that met the
response-onset criterion (2 SD above the mean control level: see
METHODS). The estimates in Fig. 3A are given
with respect to the onset of the disparity steps (n = 71; median, 55 ms; range 43-86 ms), whereas those in Fig.
3B are given with respect to the (measured) onset of the
vergence responses. The onset of the neuronal responses preceded the
onset of the vergence eye movements in 43/50 units (86%) by
24 ms
(median lead, 9 ms). Note that the measures in Fig. 3 were all obtained
from the response profiles elicited by the stimulus that was optimal
for the cell (see METHODS).
|
|
DISPARITY TUNING CURVES OF SINGLE CELLS.
Neuronal responses were quantified by measuring the mean discharge
frequency over the 60-ms period starting 40 ms after the disparity
step. Of the 122 MST neurons that showed sensitivity to horizontal
disparity steps at short latency, 102 yielded sufficient data to allow
complete disparity tuning curves to be plotted. These curves showed a
variety of forms and, after normalization, we used the fuzzy c-means
algorithm to organize them into groups based on their shapes (see
METHODS). This algorithm sorted the curves into four
clusters, and we then assigned each curve to one of four groups based
on the cluster in which each curve had its largest membership. We chose
to partition the curves into four groups because this was the largest
number to yield a consistent grouping when the algorithm was run
multiple times (
50).
1° disparity and skewed to the left (toward uncrossed
disparities), c.f., the "tuned far" cells of Poggio et al.
(1988)
|
near
tuned inhibitory
tuned far,
some gradual trends are evident: the responses to crossed steps start
with a peak at small disparities, then the peak flattens and shifts to
higher disparities before largely disappearing, while the responses to
uncrossed disparities show a similar trend but in the reverse order. If
we are dealing with a "smooth" continuum then these trends should
also be apparent within the groups.
Figure 5 (top) plots the
memberships in each of the four clusters for all 102 cells. The cells
are subdivided into the four groups and, within each group, are ranked
according to their memberships in the defining cluster of an adjacent
group.2 Figure 5
(bottom) shows all of the disparity tuning curves in a
color-coded form arranged along the abscissa in the order in which they
are plotted in Fig. 5, top, with the disparity step represented along the ordinate (crossed steps upward); increases in
activity are shown in red, zero activity in white, and decreases in
activity in blue. The general impression is that the tuned far
group 1 is distinct, with a sharp discontinuity at the
boundary with group 2 (see particularly the responses to
uncrossed disparities ranging from 0 to
2°) whereas the other three
groups lie along a continuum.
|
DISPARITY TUNING CURVES FOR (SUMMED) POPULATIONS OF CELLS.
A characteristic feature of the disparity tuning curves for vergence in
Fig. 1C is that they are roughly odd functions, that is,
f(x) = 
f(
x),
with a linear segment passing smoothly through zero disparity defining
the critical servo range over which changes in the input (binocular
disparity) elicit roughly proportional changes in the output (vergence
eye movement). It is significant that very few cells had disparity
tuning curves with this exact same feature: The tuned inhibitory
group 2 cells have rather flat curves in the vicinity of
zero, conveying little information about disparity in this range, and
the tuning curves for most other cells undergo a substantial change in
slope near zero disparity. Exceptions are small numbers of cells in the
tuned near group 4 and tuned far group 1, with
positive and negative slopes, respectively, extending on either side of
zero disparity. (Given appropriate excitatory or inhibitory
connections, neurons with either positive or negative slopes could make
a positive contribution to the vergence responses. That the tuning
curves for vergence had a positive slope around zero disparity was
determined solely by our sign convention.) To obtain an estimate of how
well the vergence responses were encoded in the discharges elicited in
individual MST cells, we fitted the disparity tuning curves of the
individual cells to the tuning curves for vergence, the only free
parameters being gain and y offset. We assumed that all
neurons made a positive contribution to the vergence responses elicited
by disparity steps in the important servo range, ±1° and so reversed
the sign of those curves that had negative slopes over that range: this
involved 29/102 cells, including all 28 in the tuned-far group
1 and 1 in the tuned inhibitory group 2. The
least-squares best fits ranged widely
(r2: mean, 0.51; range,
0.009-0.92), but the very best of the fits were good enough to raise
the possibility that some cells might be more properly regarded as
"vergence-related" rather than "disparity-related." We will
revisit this issue later when we describe the responses of these cells
to anticorrelated stimuli.
(summed activity) and * (vergence) plotted in Fig.
6A (monkey N,
r2 = 0.98) and Fig. 6B
(monkey Q, r2 = 0.93). Inverting the
sign of the curves with negative slopes around zero disparity was
critical for achieving such good fits. Thus failure to invert those
curves (before fitting) decreased the
r2 to 0.12 for monkey
N and to 0.21 for monkey Q, and the best fits now
showed little resemblance to the tuning curves for vergence: see Fig.
6, C and D. None of the single units had a
tuning curve that matched the vergence as well as these population
responses did: even with sign inversion, the highest
r2 for any individual unit was
0.92 for monkey N (mean
r2: 0.49) and 0.86 for
monkey Q (mean r2:
0.48). It is of interest that the disparity tuning curves for the
summed neuronal activity, like those for the vergence eye movements,
were well fit by Gabor functions,
r2 being 0.95 and 0.94 for
monkeys N and Q, respectively: see - - -
plotted in Fig. 6. The parameters for these fits are included in Table
1.
|
|
18 ms
for monkey N and
23 ms for monkey Q. In
both cases, the peak in the relationship between
r2 and the time delay was
sharply convex (Fig. 8, C and D), indicating that
these time delays provide a reliable estimate of the time interval by
which the neuronal population response preceded the vergence response.
|
15 to
22 ms (mean ± SD, 17.6 ± 2.3 ms): see Table 2 (values listed
under "all cells"). The fits were even worse when we summed only
the activity of the cells that had been selected with the genetic
algorithm: see Table 2 (values listed under "GA"). Visual
inspection revealed that the summed discharge profiles giving poor fits
were generally noisy, often before the onset of the disparity-driven
response, indicating the existence of appreciable noise unrelated to
the disparity stimulus. To our surprise, the summed discharge profiles
that gave good fits always included individual profiles that varied
widely. For example, the summed discharge profile that included the
data in Fig. 2 accounted for >93% of the stimulus-induced variation
in the associated vergence velocity profile
(r2 = 0.933), despite the
obvious variability among the individual cells in the latency and time
course of the (averaged) discharge profiles. Thus we suggest that the
paucity of good fits was due in large part to the inadequacy of our
data samples: on the one hand, the relatively small number of responses
averaged for each stimulus condition and, on the other, the relatively
small numbers of cells recorded from any given monkey. Another
important factor here was that not all units were active for all
stimuli, further limiting the number of discharges contributing to a
given temporal profile. In summary, although noise problems restricted
the amount of useful data, the temporal profile of the summed activity
associated with some disparity steps showed a reasonably good fit to
the vergence velocity profile.
|
Initial vergence responses to disparity steps (anticorrelated patterns)
The vergence responses elicited by disparity steps applied to
anticorrelated patterns were recorded from two animals (monkeys N and Q). As previously reported by Masson et
al. (1997)
, these vergence responses were comparable in latency
with those produced by the same steps applied to correlated patterns
but were often in the opposite direction. This is apparent from the
sample data from monkey N shown in Fig.
9, which has a layout identical to Fig.
1. Thus the disparity tuning curves showed a negative slope in the
immediate vicinity of zero disparity, though it is clear that the curve
for monkey Q is shifted appreciably to the left of the curve
for monkey N. The peak-to-peak amplitudes of the (interpolated) disparity tuning curves for the anticorrelated data were
smaller than those for the correlated data: by 33% for monkey N and by 44% for monkey Q. The tuning
curve data were well fit by a Gabor function
(r2 was 0.99 and 0.98 for
monkeys N and Q, respectively), and the parameters of the least-squares best fits are listed in Table 1: see
also the continuous lines plotted in Fig. 12, A and
B. Of particular interest among the parameters of the Gabor
functions is the difference in the phase of the cosine terms for the
correlated and anticorrelated data: 177° for monkey N and
169° for monkey Q. This reinforces the impression that, to
a first approximation, the disparity tuning curves obtained with
anticorrelated stimuli were inverted versions of those obtained with
correlated stimuli. The above-mentioned shifts, however, are evident in
the x shifts of the best-fit Gabor functions, which differed
for the correlated and anticorrelated data, especially for monkey
Q: this parameter was always zero for the correlated data and
negative for the anticorrelated data (Table 1). In fact, the tuning
curves obtained with anticorrelated stimuli were quite well fitted by
the tuning curves obtained with correlated stimuli when the latter were
inverted provided that the x shift was a free parameter (in
addition to gain and y offset): r2 values for the least-squares
best fits for monkeys N and Q were 0.86 and 0.97, respectively, and the corresponding x shifts were 0.05 and
0.61°.
|
Initial neuronal responses to disparity steps (anticorrelated patterns)
Neurons that were still well isolated after we had finished recording their responses to disparity steps applied to correlated patterns were then recorded while the same steps were applied to anticorrelated patterns. The activity of 56 MST neurons in three hemispheres of two monkeys was so recorded (25/49 units from monkey N and 31/31 units from monkey Q), and all gave significant responses to the anticorrelated stimuli (P < 0.005, 1-way ANOVA). Neuronal response latencies to anticorrelated stimuli were roughly comparable to those obtained with correlated stimuli.
DISPARITY TUNING CURVES OF SINGLE CELLS. Figure 10 shows the normalized disparity tuning curves for all 56 units whose responses to anticorrelated stimuli were recorded, and is organized like Fig. 4, with cells placed in the same four groups. That is, cells that were in group 1 in Fig. 4 were also placed in group 1 in Fig. 10, and so forth. Again, we fitted the disparity tuning curves of the individual cells to the tuning curves for vergence (gain and y offset, free parameters) and assumed that the contribution of any given cell to the vergence response would always have the same sign regardless of the stimulus used to drive it. Accordingly, cells whose contributions had been inverted for the earlier analysis of the correlated data were again inverted here. (For monkey N, 7/25 curves were inverted and for monkey Q, 7/31 curves were inverted, all in group 1.) As for the correlated data, the least-squares best fits for the anticorrelated data ranged widely (r2: mean, 0.39; range, 0-0.97) and some were clearly good enough to be considered vergence-related rather than disparity-related. However, no single cell had responses that fitted the vergence responses obtained with both correlated and anticorrelated stimuli with an r2 value >0.67: Fig. 11 shows a plot of the individual r2 values obtained with correlated stimuli against those obtained with anticorrelated stimuli, and there is a relative paucity of cells in the upper right quadrant, which is where pure vergence-encoding cells would be expected.
|
|
DISPARITY TUNING CURVES FOR (SUMMED) POPULATIONS OF CELLS.
We saw earlier that, when correlated patterns were used, the disparity
tuning curves for the summed activity of the population of cells
matched the tuning curves for the associated vergence responses quite
well. We now sought to determine if the same was true for the data
obtained with anticorrelated stimuli. We again summed all of the
raw
that is, nonnormalized
tuning curves for each of the two monkeys
separately and then determined how well these population responses
fitted their respective tuning curves for vergence when gain and
y offset were the only free parameters. We assumed that the
contribution of any given cell to the vergence response would always
have the same sign regardless of the stimulus used, and cells whose
contributions had been inverted for the earlier analysis of the
correlated data were again inverted here. Figure
12, A and B,
shows that the disparity tuning data for the summed activity (
)
again matched those for the associated vergence responses (*) quite
well (r2: 0.96 and 0.95 for
monkeys N and Q, respectively). Once again, failure to invert the contributions of the relevant cells before fitting led to significantly worse fits
(r2: 0.77 and 0.57 for
monkeys N and Q, respectively), though the effects on the fits were not as dramatic as reported above for the data
obtained with correlated patterns. In the case of monkey N,
this is perhaps in part because a somewhat smaller proportion of the
curves obtained with anticorrelated stimuli were inverted: 28% (7/25),
compared with 35% (17/49) of those obtained with correlated stimuli.
There was one cell from monkey N whose tuning curve matched the vergence as well as the curve for the summed activity did (r2 = 0.97) but the mean
r2 for all cells from this
monkey was only 0.47. None of the single units from monkey
Q had tuning curves matching its vergence responses as well as its
summed activity did and the highest
r2 for any individual unit from
this monkey was 0.88 (mean r2:
0.32).
|
| |
DISCUSSION |
|---|
|
|
|---|
About 20% of the cells that we recorded in MST responded to
disparity steps, their discharges often commencing at short latency and
preceding the associated vergence eye movements. The depth tuning of
many cells resembled that previously reported for cells in various
regions of the visual cortex: only a few cells in our sample shared the
depth tuning of the associated vergence responses and then only for the
data obtained with one of the two kinds of disparity
stimuli
correlated or anticorrelated patterns
and never for both.
Thus we were surprised to find that, for both stimuli, merely summing
the disparity tuning curves of all the cells sampled in a given animal
resulted in a curve that closely approximated the disparity tuning
curve for that animal's vergence responses. Further, for some
disparity stimuli, the time course of the initial changes in the summed
activity closely mirrored the time course of the associated changes in
vergence velocity: there was little hint of this in the discharge
profiles of the individual cells. These data suggest that the
discharges of the individual cells each encode some limited aspect(s)
of the disparity stimulus and/or vergence motor response, whereas the
summed activity of the population (faithfully?) encodes the entire
vergence velocity response. We will discuss the information carried by
single cells and by the population separately.
Information coding at the single-cell level
DISCRETE GROUPS OR A CONTINUUM?
Most of the previous studies of the disparity-selective neurons in the
monkey cortex used the classification schemes of Poggio and
Fischer (1977)
or Poggio et al. (1988)
, whose
recordings were all made in the visual cortex (V1, V2, and V3). The
later study divided disparity tuning curves into two major groups,
tuned and reciprocal, each with subgroups based on their specific
preferences for images located in front of (tuned near, near), within
(tuned zero, tuned inhibitory), or behind (tuned far, far) the plane of
fixation. These groupings have proven very useful for describing the
disparity-selectivity of cortical neurons especially in the various
regions of the visual cortex for which they were originally devised,
although their applicability to higher levels of cortex is less clear.
For example, in the study of Maunsell and Van Essen (1983a)
on MT, which projects directly to MST (Maunsell
and Van Essen 1983b
; Ungerleider and Desimone
1986
), most of the disparity-sensitive cells fell into one of
the four categories, near, far, tuned zero, and tuned inhibitory, but
almost 12% "were on a border between two classes." Roy et
al. (1992)
reported that most of the disparity-sensitive cells
in the dorsomedial region of MST (referred to as, MSTd) were in the
near or far categories, a few were tuned, and 17% had hybrid curves
with features of both tuned and reciprocal curves. Eifuku and
Wurtz (1999)
reported that disparity-sensitive cells in the
lateral region of MST (referred to as, MSTl) were broadly tuned and did
not fit the classification scheme of Poggio et al. These workers also
suggested that there was "a continuum of disparity tuning
functions" in MSTl, some curves having clear peaks and others not.
Poggio et al. (1988)
had earlier suggested that four of
the classes in visual cortex (near, tuned near, tuned far, far) might
form a continuum, and DeAngelis and Newsome (1999)
reported that the disparity preferences of cells in MT ranged from near
to far without any major discontinuities, mapping smoothly across the
surface of the cortex. Further, Gnadt and Mays (1995)
, in a study of the depth dependence of saccade-related neurons in the
lateral bank of the intraparietal sulcus (LIP), which has reciprocal
connections with MST (Andersen et al. 1990
; Blatt
et al. 1990
; Ungerleider and Desimone 1986
),
also found that not all of the tuning curves matched one of the groups
of Poggio et al. and concluded that there was "little basis for
categorical parcellation into separate functional classes."
those in
groups 1 and 4
were likened to the cells with
the narrowest (nonzero) peaks in the grouping of Poggio et al.
the
tuned far and tuned near categories
the fact is that the narrowest
peaks in our study were often so broad and skewed that these cells
would probably have been placed in the far and near categories had they been recorded in striate cortex. It is also important to remember that
we decided on four clusters only after finding that this was the
largest number that gave consistent groupings when the algorithm was
repeated. This should not be taken to imply that there are four
discrete groups of cells: The membership values indicated that, with
the possible exception of group 1, our groups were fuzzy and
that groups 2-4 might well form a continuum. Indeed, using
the membership values in the nondefining clusters to rank order the
tuning curves within each group indicated that there was little
evidence of discontinuities at the boundaries between the three groups
(Fig. 4). In addition, the hierarchical fuzzy algorithm, which attempts
to find the maximum number of discrete clusters that can be justified
by the data and continues to merge clusters until a firm criterion for
separation between clusters is met, merged our groups 2-4
and hence recognized only two clusters as discrete. However, it is
still possible that, with more data, the hierarchical fuzzy algorithm
would recognize more clusters, and so the question of whether
groups 2-4 are discrete entities or parts of a continuum
must remain open for the present. Regardless, we do feel that it is
useful to recognize four groups for descriptive purposes. Thus the
tuned inhibitory cells of group 2, for example, are clearly
very different from the tuned near cells of group 4.
INITIAL OPEN-LOOP RESPONSES.
A somewhat unusual aspect of our study is that it dealt exclusively
with the so-called open-loop neural responses that occur at short
latency (40-100 ms after stimulus onset), before closure of the
disparity-feedback loop. In a recent study of neurons in the frontal
eye fields (FEF), which receive direct projections from MST
(Schall et al. 1995
), disparity stimuli were presented suddenly against a previously blank background, and disparity tuning
curves based on the responses during an early period (50-200 ms after
stimulus onset) could have a very different shape from those based on
the responses during a later period (300-1,000 ms after stimulus
onset) (Ferraina et al. 2000
).
SENSORY AND/OR MOTOR?
In previous studies, the disparity-selective activity in MT and MST was
assumed to be purely visual in origin, and, insofar as it was linked to
vergence eye movements at all, it was solely as a possible source of
visual feedback guiding binocular alignment (Maunsell and
Van Essen 1983a
; Roy et al. 1992
). In our
experiments, unfortunately, the sensory and motor events were so
time-locked that we were unable to determine whether the responses were
more closely synchronized to one or to the other. Thus in our
experiments, the discharges of individual neurons might encode some
aspect of the stimulus (disparity) and/or the associated motor response (vergence).
0.08). Thus the type of disparity stimulus used
to elicit the vergence responses also influenced the shape of the
individual tuning curves independent of any effect on the vergence
itself. It is entirely possible that some (or all) cells carry mixed
sensory and motor signals, an arrangement for which there is a
precedent in MST: some neurons that discharge in relation to optic flow
and pursuit tracking show both retinal and extraretinal influences, the
latter encoding some aspect of the tracking eye movements
(Bradley et al. 1996Information coding at the population level
The summed activity of the disparity-sensitive cells in MST carries considerable information about the initial vergence eye movements associated with a sudden change in disparity. We examined this population coding of vergence from a spatial viewpoint by plotting disparity-tuning curves and from a temporal viewpoint by plotting the time course of the initial (open-loop) events. In both cases, the vergence information conveyed by the summed activity became apparent only after inverting the contributions of those cells whose tuning curves had negative slopes around zero disparity, which is roughly at the center of the servo range of the disparity-vergence system (Figs. 6 and 12). Given that the tuning curves for vergence had a positive slope around zero disparity, this inversion meant that all neurons would make a positive contribution to the population sum over the important servo range. Of course, for the brain to achieve a similar result would require appropriate excitatory and inhibitory connections. (Note that the sign of the disparity stimuli and the vergence responses was solely a matter of convention.)
SPATIAL ASPECTS. It is apparent from above that the single-cell data gave little hint that summing the disparity tuning curves for all of the units recorded from a given monkey would yield a curve strongly resembling the tuning curve for that animal's vergence response (r2 > 0.93). This close match between summed activity and vergence was equally true for the data obtained with correlated and anticorrelated stimuli in spite of the fact that the associated vergence responses were very different. Further, the summed activity even reproduced the slight differences in the depth tuning of the vergence responses of the two monkeys from which we recorded most of our data (see Fig. 12, A and B). In fact, these idiosyncrasies were significant enough that the summed activity from one monkey fitted the vergence data for the other monkey only very poorly (r2: 0.29, 0.35).
TEMPORAL ASPECTS.
Our attempts to demonstrate the existence of temporal coding of
vergence at the population level, by summing together all of the
discharge profiles elicited by a given disparity step, were less
successful. Visual inspection of the summed discharge profiles suggests
that low signal to noise was a major factor here, emanating from
irregularities in the spike trains coupled with inadequate sample sizes
(Gomi et al. 1998
). (Such noise factors were less
evident in the analysis of spatial coding, presumably because it
involved response measures that were averaged over time before
summing.) Nonetheless, for 40% of the data sets obtained with
correlated stimuli, the summed temporal profiles accounted for more
than 90% of the stimulus-induced variation in the temporal profile of
the associated vergence velocity. This is remarkable in that many of
the unit response profiles had appreciable phasic components and showed
little resemblance to the vergence response profiles: see, for example,
the unit response profiles in Fig. 2, which formed part of a data set
that when summed together fitted the associated vergence velocity
profile with an r2 value of
0.933. Clearly, the initial phasic components seen at the single-cell
level largely disappeared at the population level, presumably because
of the latency differences among the individual units (temporal summation).
COMPARISON WITH OTHER STUDIES.
Our data suggest that the magnitude, direction, and time course of the
initial vergence velocity responses associated with disparity steps are
all encoded in the summed activity of the disparity-sensitive cells in
MST. This brings to mind the vector addition model of motor cortex
proposed by Georgopoulos and colleagues in which the direction of arm
movement is reconstructed by summing the preferred direction vectors of
the individual cells weighted by their firing rates
(Georgopoulos et al. 1983
, 1986
, 1988
; Kettner et
al. 1988
).
Neural mediation of short-latency disparity-vergence eye movements
It is known that the initial vergence eye movements elicited by a
step of disparity, which we have shown to be encoded in the summed
activity of the disparity-sensitive cells in MST, are significantly
reduced in amplitude by bilateral lesions in MST (Takemura et
al. 1999
, 2000
). We suggest that these two observations are
causally linked and that the vergence responses result at least in part
from the population activity in MST. The genetic algorithm indicated
that subsets of MST cells could reproduce the vergence responses better
than our entire recorded population of cells, raising the possibility
that only a subpopulation of MST cells contributes to the vergence
responses. That the subsets of cells selected by the genetic algorithm
always included cells from all four groups suggests that if only a
subset of MST cells contributes to vergence, this subset is randomly
selected from the entire population. There is strong evidence, based on
lesions and single-unit recordings, that MST is critically involved in the generation of another type of short-latency eye movement, ocular
following (OFR), elicited by large-field motion (see Kawano et
al. 2000
for review) and the associated activity in MST
precedes the motor responses by a time interval that is closely
comparable with that found in the present study. Thus the estimated
latency of the MST discharges preceded the estimated latency of the
associated motor responses on average by 8.6 ms for OFR (Kawano
et al. 1994
) and by 8.9 ms for the vergence responses in the
present study.
There are a number of putative pathways by which MST might produce
vergence eye movements, including direct, subcortical projections. One
of the latter is a projection to the dorsolateral pontine nuclei
(Boussaoud et al. 1992
; Glickstein et al. 1980
,
1985
), which have been shown to contain cells that discharge in
relation to vergence (Zhang and Gamlin 1997
) and project
in turn to regions of the cerebellum (paraflocculus and vermis) that
are known to be concerned with eye movements: for review, see
Leigh and Zee (1999)
. There have been some preliminary
reports that MST projects to the superior colliculus (Colby and
Olson 1985
; Lock et al. 1990
), a structure that
has recently been implicated in the production of vergence: see
Chaturvedi and Van Gisbergen (2000)
for recent review.
Another direct subcortical projection from MST, to the nucleus of the
optic tract, is thought to mediate OFR and optokinetic responses but,
to date, is not known to have any involvement with vergence eye
movements: see Inoue, Takemura, Kawano, and Mustari (2000)
for recent review. MST is also known to project to two cortical areas that are interconnected and have been shown to contain
neurons that discharge in relation to disparity stimuli and/or vergence
eye movements: LIP (Gnadt and Mays 1995
) and FEF (Ferraina et al. 2000
; Gamlin et al.
1996
). Neurons in LIP that carry depth-related information have
been shown to project directly to the superior colliculus (Gnadt
and Beyer 1998
), and FEF neurons project directly to the medial
part of the nucleus reticularis tegmenti pontis (Huerta et al.
1986
; Leichnetz et al. 1984
; Stanton et
al. 1988
), which shows vergence-related activity (Gamlin
and Clarke 1995
) and projects in turn to the premotor neurons
for vergence (in the supraoculomotor and adjacent reticular formation) via the posterior interposed and fastigial nuclei of the cerebellum: see Gamlin, Yoon, and Zhang (1996)
and Gamlin
(1999)
for review.
To the extent that the summed discharges of the disparity-selective
neurons in MST carry a complete description of the associated vergence
eye movements, they would have the potential to generate the entire
vergence response, assuming appropriate dynamical processing in the
projection pathways (Mays et al. 1986
; Patel et
al. 1997
). The vergence deficits that follow lesions of MST are
only partial (Takemura et al. 1999
, 2000
), leaving the
possibility that other regions make a significant contribution (though
it also seems very likely that the lesions were only partial). Of
course, we do not know if the vergence system actually utilizes the
population coding in MST, but were it to do so, then any attempt to
correlate single-unit activity in MST with motor behavior would be a
largely meaningless exercise.
Other data have implicated MST in perception, especially of self motion
(Bradley et al. 1996
; Britten and Wezel
1998
; Duffy 1998
; Duffy and Wurtz
1991a
,b
, 1995
, 1997a
,b
; Graziano et al. 1994
;
Orban et al. 1995
; Page and Duffy 1999
;
Roy et al. 1992
; Saito et al. 1986
;
Tanaka et al. 1986
, 1989
). In our study, all of the MST
neurons that responded to disparity steps applied to correlated
patterns also responded when the same steps were applied to
anticorrelated patterns even though the latter do not support depth
perception and are seen as rivalrous (Cogan et al. 1993
; Cumming and Parker 1997
; Cumming et al.
1998
; Masson et al. 1997
). In fact, gradual
changes in the disparity of large correlated patterns (similar to the
ones we have used) have also been shown to produce little sensation of
motion in depth (Erkelens and Collewijn 1985
;
Regan et al. 1986
). This is in accord with the finding
that stereopsis is much better for relative than for absolute disparity (Westheimer 1979
), where absolute disparity is given by
the difference in the locations of the two retinal images of a given
object and relative disparity refers to the differences in the absolute
disparity of different objects. Our stimuli contained only absolute
disparity cues, and changes in the vergence angle between the two eyes
affect only the absolute disparity of the seen images. The implication is that the activity in MST that we have studied is linked to motor,
rather than perceptual, processes.
| |
APPENDIX A |
|---|
|
|
|---|
Fuzzy clustering
The major advantage that fuzzy clustering algorithms have over
classic (often referred to as hard or crisp) clustering algorithms is
that they provide much more information about the data being clustered.
More precisely, while crisp algorithms simply assign each data point to
one out of a set of clusters, fuzzy algorithms compute, for each data
point, a measure of how close that point is to the "center" of each
cluster. This measure is termed the "fuzzy membership" of the point
in a particular cluster: the higher the membership value, the closer
the point to the center of that cluster. The choice of fuzzy clustering
algorithms is very large (Hoppner et al. 1999
), but we
have decided to use the simplest, i.e., the fuzzy c-means clustering
algorithm developed by Bezdek (1981)
, which assumes that
the clusters are hyper-spheres having approximately the same diameter.
The concept behind the algorithm is simple. It minimizes the following
functional
|
(A1) |
The most important parameter to be defined is then the number of clusters, c. Unfortunately, there is no golden rule for choosing it even though many so-called cluster validation measures have been proposed to solve this problem. These measures try to establish, in an objective way, which value of c is best for a given data set, but unfortunately they work well, and are in agreement with each other, only when the groups are fairly well separated, in which case a fuzzy algorithm is not necessary. Thus we decided to use the stability of the clustering solution (i.e., the convergence to the same solution regardless of the initial conditions) as an indication of the validity of the number of clusters selected, even though this is not a standard procedure. The detailed rationale behind our particular choice is indicated in RESULTS.
| |
APPENDIX B |
|---|
|
|
|---|
Genetic algorithm
In examining the correlation between single-unit activity in MST
and disparity vergence, we attempted to identify the subset of cells
whose aggregate behavior came closest to representing the vergence
behavior. This was done by searching for the subset of cells whose
disparity tuning curves, when summed together, best matched the
disparity tuning curve for vergence. A mathematical formulation of the
problem helps in deciding how to approach it. If
VGi is the vergence at different
disparities (i = 1 ... 11) and UNij (j = 1 ... n) is the discharge rate of unit j for disparity i, for each subset of cells, T, we can compute
their summed tuning curve as
|
(B1) |
|
(B2) |
Rules 1 and 2 keep some diversity in the population, whereas rules 3 and 4 exert evolutionary pressure. The algorithm ran for 50 generations, during which the scatter, the mean and the minimum MSE for the population gradually diminished, usually stabilizing after ~30 generations. After 50 generations, the vast majority of the chromosomes were identical. This surviving chromosome represents the algorithm's best estimate of the optimal solution, i.e., the subset of units whose summed disparity tuning curves best correlates with the disparity tuning curve for vergence. Following standard practice, we ran the algorithm numerous (100) times for each data set, both to validate our finding and to give the algorithm the opportunity to sample as many regions of the solution space as possible. Of course this does not allow us to conclude that we have found the optimal solution (as that would require testing all the possible solutions), but it is the best estimate given the technologies available at this time.
| |
ACKNOWLEDGMENTS |
|---|
We thank Drs. L. M. Optican, Y. Kodaka, M. Shidara, and S. Yamane for valuable advice; M. Okui, A. Kameyama, and T. Takasu for technical assistance; and Y. Yaguchi and S. Inoue for secretarial help.
This research was supported by grants from the Japanese Agency of Industrial Science and Technology, Core Research for Evolutional Science and Technology (CREST) of Japan Science and Technology Corporation, the Human Frontier Science Program, Japan Society for the Promotion of Science Research Fellowships for Young Scientists, and the National Eye Institute.
| |
FOOTNOTES |
|---|
Address for reprint requests: A. Takemura, Neuroscience Section, Electrotechnical Laboratory, 1-1-4 Umezono, Tsukubashi, Ibaraki 305, Japan (E-mail: atakemur{at}etl.go.jp).
1 A lower luminance level (0.5 vs. 0.8 cd/m2) was used for the first three monkeys (H, L, M) and might account for the longer latencies in these animals.
2 The membership values are unsigned scalar entities, indicating only the similarity/proximity of the curves to each of the four center curves (see METHODS), and hence the cells within a group cannot be ranked by their memberships in their own defining cluster. However, they can be ranked by their memberships in any other, nondefining cluster because the curves within a given group must all lie on the same side of the 12-dimensional plane that passes through the center of the nondefining cluster and that is orthogonal to the line that goes through the center of the two clusters.
3 The medians provided a fairly good representation of the correlated data: mean r2 values between individual tuning curves and the median for groups 1-4 were 0.74, 0.79, 0.82, and 0.78, and r2 exceeded 0.4 for 100/102 cells.
Received 3 August 2000; accepted in final form 3 January 2001.
| |
REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
K. Miura, Y. Sugita, K. Matsuura, N. Inaba, K. Kawano, and F. A. Miles The Initial Disparity Vergence Elicited With Single and Dual Grating Stimuli in Monkeys: Evidence for Disparity Energy Sensing and Nonlinear Interactions J Neurophysiol, November 1, 2008; 100(5): 2907 - 2918. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. J. Preston, S. Li, Z. Kourtzi, and A. E. Welchman Multivoxel Pattern Selectivity for Perceptually Relevant Binocular Disparities in the Human Brain J. Neurosci., October 29, 2008; 28(44): 11315 - 11327. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. W. Roe, A. J. Parker, R. T. Born, and G. C. DeAngelis Disparity Channels in Early Vision J. Neurosci., October 31, 2007; 27(44): 11820 - 11831. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Umeda, S. Tanabe, and I. Fujita Representation of Stereoscopic Depth Based on Relative Disparity in Macaque Area V4 J Neurophysiol, July 1, 2007; 98(1): 241 - 252. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Takemura, Y. Murata, K. Kawano, and F. A. Miles Deficits in Short-Latency Tracking Eye Movements after Chemical Lesions in Monkey Cortical Areas MT and MST J. Neurosci., January 17, 2007; 27(3): 529 - 541. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. V. Barthelemy, I. Vanzetta, and G. S. Masson Behavioral Receptive Field for Ocular Following in Humans: Dynamics of Spatial Summation and Center-Surround Interactions J Neurophysiol, June 1, 2006; 95(6): 3712 - 3726. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Uka, S. Tanabe, M. Watanabe, and I. Fujita Neural Correlates of Fine Depth Discrimination in Monkey Inferior Temporal Cortex J. Neurosci., November 16, 2005; 25(46): 10796 - 10802. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Tanabe, T. Doi, K. Umeda, and I. Fujita Disparity-Tuning Characteristics of Neuronal Responses to Dynamic Random-Dot Stereograms in Macaque Visual Area V4 J Neurophysiol, October 1, 2005; 94(4): 2683 - 2699. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Akao, M. J. Mustari, J. Fukushima, S. Kurkin, and K. Fukushima Discharge Characteristics of Pursuit Neurons in MST During Vergence Eye Movements J Neurophysiol, May 1, 2005; 93(5): 2415 - 2434. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Neri A Stereoscopic Look at Visual Cortex J Neurophysiol, April 1, 2005; 93(4): 1823 - 1826. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Tanabe, K. Umeda, and I. Fujita Rejection of False Matches for Binocular Correspondence in Macaque Visual Cortical Area V4 J. Neurosci., September 15, 2004; 24(37): 8170 - 8180. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Krug, B. G. Cumming, and A. J. Parker Comparing Perceptual Signals of Single V5/MT Neurons in Two Binocular Depth Tasks J Neurophysiol, September 1, 2004; 92(3): 1586 - 1596. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. E. Angelaki Eyes on Target: What Neurons Must do for the Vestibuloocular Reflex During Linear Motion J Neurophysiol, July 1, 2004; 92(1): 20 - 35. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Uka and G. C. DeAngelis Contribution of Middle Temporal Area to Coarse Depth Discrimination: Comparison of Neuronal and Psychophysical Sensitivity J. Neurosci., April 15, 2003; 23(8): 3515 - 3530. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. C. DeAngelis and T. Uka Coding of Horizontal Disparity and Velocity by MT Neurons in the Alert Macaque J Neurophysiol, February 1, 2003; 89(2): 1094 - 1111. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. P. Bichot and J. D. Schall Priming in Macaque Frontal Cortex during Popout Visual Search: Feature-Based Facilitation and Location-Based Inhibition of Return J. Neurosci., June 1, 2002; 22(11): 4675 - 4685. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. Churchland and S. G. Lisberger Shifts in the Population Response in the Middle Temporal Visual Area Parallel Perceptual and Motor Illusions Produced by Apparent Motion J. Neurosci., December 1, 2001; 21(23): 9387 - 9402. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Takemura, Y. Inoue, H. Gomi, M. Kawato, and K. Kawano Change in Neuronal Firing Patterns in the Process of Motor Command Generation for the Ocular Following Response J Neurophysiol, October 1, 2001; 86(4): 1750 - 1763. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |