|
|
||||||||
J Neurophysiol (February 1, 2003). 10.1152/jn.00717.2002
Submitted on Submitted 22 August 2002; accepted in final form 30 September 2002
Department of Anatomy and Neurobiology, Washington University School of Medicine, Saint Louis, Missouri 63110
| |
ABSTRACT |
|---|
|
|
|---|
DeAngelis, Gregory C. and Takanori Uka. Coding of Horizontal Disparity and Velocity by MT Neurons in the Alert Macaque. J. Neurophysiol. 89: 1094-1111, 2003. We performed the first large-scale (n = 501), quantitative study of horizontal disparity tuning in the middle temporal (MT) visual area of alert, fixating macaque monkeys. Using random-dot stereograms, we quantified the direction tuning, speed tuning, horizontal disparity tuning, and size tuning of each neuron. The vast majority (93%) of MT neurons were significantly tuned for horizontal disparity. Although disparity tuning was generally quite robust, the average disparity sensitivity of MT neurons was significantly weaker than their direction or speed sensitivity as quantified using both an index of response modulation and an index of signal-to-noise ratio. Disparity tuning was not correlated with direction or size tuning but tended to be broader and weaker for neurons that preferred faster speeds of motion. By comparison with recent studies, we find that disparity selectivity in MT is substantially stronger than that seen in either primary visual cortex (V1) or area V4. In addition, MT neurons are more broadly tuned for disparity than V1 neurons at comparable eccentricities. Disparity tuning curves are very well described by Gabor functions for >80% of MT neurons. The distribution of Gabor phases shows clear bimodality, indicating that MT neurons tend to have odd-symmetric disparity tuning (unlike neurons in V1). The preferred disparities were more strongly correlated with the phase parameter of the Gabor function than with the positional offset parameter. In fact, for neurons with preferred disparities close to zero, the positional offset tended to oppose the phase shift in specifying the disparity preference. We suggest that this result reflects a strategy used to finely distribute the disparity preferences of MT neurons, given the predominance of odd-symmetry and broad tuning.
| |
INTRODUCTION |
|---|
|
|
|---|
The middle temporal (MT) visual
area (also known as V5) is a central processing stage along the
"dorsal" visual stream from primary visual cortex (V1) to the
parietal lobe (for review, see Albright 1993
;
Andersen 1997
; Van Essen and Gallant
1994
). Many studies show that MT is intimately involved in the
processing of visual motion information, both for motion perception
(Britten et al. 1992
, 1996
; Newsome and Pare
1988
; Orban et al. 1995
; Pasternak and
Merigan 1994
; Salzman et al. 1992
) and for
guiding eye movements (Born et al. 2000
; Groh et
al. 1997
; Komatsu and Wurtz 1989
;
Lisberger and Movshon 1999
; Newsome et al.
1985
). Accordingly, the selectivity of MT neurons for stimulus
velocity (direction and speed) has been extensively studied in both
anesthetized and alert animals (Albright 1984
;
Cheng et al. 1994
; Felleman and Kaas
1984
; Lagae et al. 1993
; Maunsell and Van
Essen 1983b
; Mikami et al. 1986
; Rodman
and Albright 1987
; Snowden et al. 1992
;
Zeki 1974b
).
It has recently become clear that MT also plays a role in processing
binocular disparity signals for depth perception as evidenced by the
finding that electrical microstimulation of MT can bias depth judgments
(DeAngelis et al. 1998
) and the finding that responses of MT neurons are correlated with perceptual reports of
three-dimensional (3D) structure in ambiguous stimuli (Bradley
et al. 1998
; Dodd et al. 2001
). Previous
physiological studies of disparity processing had focused mainly on the
contributions of V1, which is the first stage in the visual pathway
where neurons integrate signals from the two eyes to encode binocular
disparity. However, recent work shows that activity in V1 alone cannot
account for stereoscopic depth perception (Cumming and Parker
1997
, 1999
, 2000
), suggesting that MT (and other extrastriate
visual areas) could be a major source of disparity signals for
stereopsis. It is therefore important to have a detailed, quantitative
account of disparity tuning in MT and to examine how the representation
of disparity information in MT differs from that in V1.
The disparity selectivity of V1 neurons has been extensively studied in
both alert and anesthetized animals (see Cumming and DeAngelis
2001
; DeAngelis 2000
; Gonzalez and Perez
1998
for review). Much of the early work on disparity
selectivity in V1 was performed using anesthetized cats (e.g.,
Barlow et al. 1967
; Nikara et al. 1968
),
whereas more recent work has been focused on alert primates for which
vergence posture can be precisely controlled and absolute retinal
disparities can be accurately specified (e.g., Poggio et al.
1988
; Prince et al. 2002b
). By comparison to V1,
relatively little is known about disparity selectivity in MT.
Zeki (1974a
,b
; 1979
) reported the existence of a small
fraction of MT neurons that responded to continuous changes in
binocular disparity and image size of the sort that would normally
accompany motion of an object toward or away from the observer. He did
not measure the response of MT neurons to stimuli at different fixed
disparities, however. This was later achieved by Maunsell and
Van Essen (1983c)
, who performed the first detailed study of
disparity tuning in MT using bar stimuli. Although their study
accurately revealed many features of disparity selectivity in MT, it
had a couple of limitations. Because anesthetized animals were studied,
the absolute retinal disparity of the stimuli was known to an accuracy of "about ±1°" (p. 1149). Also, the sample consisted of 52 disparity-selective MT neurons, and there is not much population data
in the paper.
Four subsequent studies have measured disparity tuning of MT neurons in
alert, fixating monkeys using random-dot stereograms as stimuli (to
eliminate monocular cues). Characterization of disparity-tuning
properties was not the main purpose of three of these studies
(Bradley et al. 1995
, 1998
; Dodd et al.
2001
), and the fourth study examined the organization of
disparity selectivity in MT based primarily on responses of multi-unit
clusters (DeAngelis and Newsome 1999
). Thus the
literature lacks a large-scale quantitative study of disparity
selectivity in MT, and this study aims to fill that void. We focus on
the responses of single MT neurons to systematic variations in
horizontal disparity, the metric that is most directly related to depth
perception (responses of MT neurons to vertical disparities will not be
addressed here). More specifically, we quantify the strength, range,
and shape of disparity tuning curves for a large population of MT
neurons, we compare our population data to similar measurements
published recently for V1 (Prince et al. 2002a
,b
) and V4
(Watanabe et al. 2002
), and we analyze the relationships
between disparity tuning and other response properties of MT neurons
(e.g., direction and speed tuning). We discuss how the coding of
disparity in MT differs from that in V1 and whether the disparity
tuning of MT neurons could be accounted for by inputs from V1. This
work provides a detailed database with which to compare past and future
studies of disparity selectivity in other visual areas.
| |
METHODS |
|---|
|
|
|---|
Experiments were performed on three male rhesus monkeys (Macaca mulatta) weighing 5-7 kg. All experimental procedures were approved by the Institutional Animal Care and Use Committee at Washington University and conformed to National Institutes of Health guidelines.
Surgical preparation
Standard surgical procedures were used to prepare animals for
daily training and recording sessions (see Britten et al.
1992
; DeAngelis and Newsome 1999
for details).
Briefly, a post for head restraint and a recording chamber were affixed
to the skull using a combination of titanium screws and cranio-plastic
cement (Plastics One). The recording chamber was beveled at an angle of
32 or 36° and centered over occipital cortex at a location roughly 17 mm lateral and 14 mm dorsal to the occipital ridge. An eye coil was implanted under the conjunctiva in each eye, allowing us to monitor both conjugate eye position and vergence angle. To reduce coil slippage
and deformation in the eye, each eye coil was sutured to the sclera at
three to four locations using either a permanent (8-0 Nylon) or
long-lasting dissolvable (7-0 Dexon) suture.
Visual stimuli and task
Random-dot stereograms (RDSs) were generated by an OpenGL accelerator board (3Dlabs Oxygen GVX1) and presented on a flat-faced 22-in color display (Sony GDM-F500) subtending 40 × 30° at a viewing distance of 57 cm. Dot density was 64 dots per square degree per second, each dot subtended ~0.1°, and the starting position of each dot within the receptive field was newly randomized for each trial. Precise disparities and smooth motion were achieved by plotting dots with sub-pixel resolution using the hardware anti-aliasing capabilities of the OpenGL board.
Half-images for the left and right eyes were presented alternately at a refresh rate of 100 Hz, and stereoscopic presentation was achieved using ferro-electric liquid crystal shutters (DisplayTech) that were synchronized to the video refresh. The RDS consisted of red dots presented on a black background (Fig. 1) to minimize stereo crosstalk between the two eyes (crosstalk was <3%). Note that the 100-Hz refresh rate (50-Hz refresh for each eye) limited the speeds of motion that we could present. A maximal speed of 32°/s was chosen because motion was still quite smooth at this speed. Importantly, the position of each dot in a moving stimulus was updated every video frame with an appropriate compensation for the fact that left and right half-images were presented on alternate frames. Thus we assured that the binocular disparity of the stimulus was accurate for each direction and speed of motion.
|
Monkeys viewed the random-dot stimuli while maintaining fixation on a small yellow spot (0.15°). The fixation window was typically 1.5° wide. Stimuli were presented for a period of 1.5 s, and monkeys received a liquid reward for maintaining fixation throughout this period (see Fig. 1). If the monkey's conjugate eye position left the fixation window during the trial, the visual stimulus was terminated, data were discarded, and the monkey was not rewarded. Monkeys were also trained to maintain their vergence angle to within ±0.25° of the plane of fixation. A background of stationary dots presented at zero disparity helped to anchor vergence (Fig. 1). After the initial training, we found that all three monkeys accurately maintained their vergence posture as the disparity of the RDS was varied over the neuron's receptive field. In most recording sessions, a vergence criterion was not enforced, but vergence data were always collected and factored into the analysis of disparity tuning curves (see following text).
Recording procedures and data acquisition
Tungsten microelectrodes (FHC) were introduced into the cortex through a transdural guide tube and typically passed through extrastriate visual areas in the anterior bank of the lunate sulcus prior to entering area MT. MT was recognized based on extensive experience interpreting the patterns of gray and white matter transitions along electrode penetrations, the response properties of single units and multi-unit clusters (direction, speed, and disparity tuning), the relationship between receptive field size and eccentricity, topography within MT, and the subsequent entry into gray matter with response properties typical of area MST. All data included in this study were derived from recordings taken in stretches of gray matter that were confidently assigned to area MT.
Behavioral control and data acquisition were accomplished using a commercially available software package (TEMPO, Reflective Computing). Raw neural signals were amplified, band-pass filtered (500 to 5,000 Hz), and discriminated using conventional electronic equipment (Bak Electronics). Only well-isolated action potentials were counted as single-unit responses. Times of occurrence of spikes, along with behavioral event markers, were stored to disk with 1-ms resolution. Horizontal and vertical eye position signals from each eye were sampled at 1 kHz and stored to disk at a rate of 250 Hz. In most experiments, the raw analog signal from the electrode was digitized at 25 kHz and streamed to hard disk using Spike2 software (Cambridge Electronic Design).
Experimental protocol
After the action potential of an MT neuron was isolated using a dual voltage-time window discriminator (Bak Electronics), we explored the receptive field and tuning properties of the neuron using a receptive-field mapping program. A small patch (typically 1-3°) of moving or flickering dots was moved around the screen using a mouse, and instantaneous firing rate was plotted on a graphical user interface. This allowed us to carefully center the stimulus over the receptive field and adjust the size of the stimulus to be optimal. Next, we estimated the neuron's preferred velocity of motion by searching through a polar representation of direction and speed. Finally, we adjusted the horizontal disparity of an optimized patch of moving dots to achieve the largest response from the neuron.
After these qualitative tests, four quantitative measurements were
taken for each neuron in our sample: a direction tuning curve, a speed
tuning curve, a size tuning (area summation) curve, and a disparity
tuning curve. Each of these measurements was performed in a separate
block of randomly interleaved trials, with each unique stimulus
presented three to seven times (typically 5). In all cases, a circular
aperture of moving dots was centered on the MT receptive field, and the
remainder of the screen was filled with stationary dots having zero
disparity to help anchor vergence in the plane of fixation (Fig. 1).
Direction tuning was measured by presenting random-dot patterns that
drifted in eight different directions, 45° apart. For the occasional
narrowly tuned cell, the spacing between directions was reduced
appropriately and the test was repeated. Speed tuning was measured (at
the preferred direction) by presenting dot patterns that drifted at 0, 0.5, 1, 2, 4, 8, 16, and 32°/s. Next, a size-tuning (area summation) curve was obtained by presenting dots within circular apertures of the
following diameters: 0, 1, 2, 4, 8, 16, and 32°. Note that large
stimulus patches would often overlap the fixation point, which could
elicit tracking eye movements or changes in vergence posture. To avoid
this, a small (2° diam) patch of stationary zero-disparity dots
always surrounded the fixation point in the size tuning run. Finally,
we measured a disparity-tuning curve at the optimal stimulus direction,
speed, and size. In most cases, horizontal disparities were tested from
1.6 to 1.6° in steps of 0.4°; however, these parameters were
adjusted as necessary based on our initial qualitative assessment of
the breadth of disparity tuning.
Data analyses
The response of the neuron for each trial was computed as the mean firing rate over the 1.5-s stimulus duration. Tuning curves were constructed by plotting the mean response (across repetitions) to each stimulus along with the SE of the mean (see Fig. 2). The time-averaged position of each eye was also computed for each trial, and the horizontal vergence angle was computed as the left eye position minus the right eye position. Averaged across all experiments, the inter-trial variations in (time-averaged) vergence angle had a SD of 0.10°. By comparison, the average within-trial SD of vergence angle was 0.06°. This greater variance of vergence angle across trials versus within trials presumably reflects a small amount of eye-coil slippage or distortion that occurs from trial to trial.
|
Each tuning curve was fit with a function that was chosen because it
describes the data well with a relatively small number of parameters.
The best fit of each function was achieved by minimizing the
sum-squared error between the responses of the neuron and the values of
the function, using the constrained minimization tool, "fmincon,"
in Matlab (Mathworks). We fit all of the single-trial responses for
each stimulus condition not just the mean response. Although this
generally yields the same result as fitting the mean response to each
different stimulus, fitting all of the single-trial responses allows us
to test the goodness-of-fit by comparing residuals around the mean
response with residuals around the value of the fitted curve
(
2 goodness-of-fit test, discussed further in
the following text). To homogenize the variance of the neural responses
across different stimulus values, we minimized the difference between
the square root of the neural responses and the square root of the
function (see Prince et al. 2002b
). Table
1 gives the equations for the functions
fit to each tuning curve along with a description of the free
parameters. Because neuronal responses must be positive, all fitted
curves were half-wave rectified, though this was seldom necessary. The
amplitude parameter, A, of each fit was constrained to be no
larger than 1.5 times the difference between the maximum and minimum
responses.
|
Direction tuning curves were fit with a Gaussian, and speed tuning
curves were fit with a Gamma distribution. The choice of a Gamma
distribution was simply motivated by the variety of shapes of speed
tuning curves that we observed empirically; there was no theoretical
basis for this choice. The denominator term in the Gamma formulation
normalizes the curve to have an amplitude specified by A.
Size-tuning curves were fit with two functions: a single error function
(erf, the integral of a Gaussian) and a difference of error (DoE)
functions. The single error function provided good fits to size tuning
curves for neurons that lacked surround inhibition, whereas the DoE
function was necessary for neurons with surround inhibition (e.g., Fig.
2C). The errors of these two fits were compared for each
neuron using a sequential F test (Draper and Smith
1966
). If the DoE function provided a significantly better fit
than the single error function (P < 0.05), the neuron
was considered to have significant surround inhibition and the optimal
stimulus size was taken as the peak of the DoE fit. Otherwise, the
optimal size was taken from the single error function as 1.163
,
which defines the size at which the curve reaches 90% of its maximal value.
Disparity tuning curves were fit with a Gabor function, as done
previously for V1 neurons (e.g., Ohzawa et al. 1997
;
Prince et al. 2002b
). Because the disparity frequency,
f, is often poorly constrained by the data at the
low-frequency end of the spectrum, this parameter was only allowed to
vary within ±10% of the peak frequency determined from a Fourier
transform of the raw tuning curve (after subtracting the DC response).
We found that this constraint considerably improved the convergence of
the optimization (see also Prince et al. 2002b
) with
minimal increase in the overall error of the fits. We also constrained
the center of the Gaussian component,
d0, to lie within the range of
disparities that was tested. To test whether the six-parameter Gabor
function was necessary to fit our disparity tuning curves, we also fit
each curve separately with both a Gaussian and a sinusoid. Results of
these comparisons will be discussed later.
The functions described in Table 1 generally provided excellent fits to
the tuning curves of MT neurons as evidenced by the median
R2 values given for each function. We
also tested the quality of the fits by performing a
2 goodness-of-fit test on each data set with
significant tuning (ANOVA, P < 0.05). This is a rather
stringent test that assesses whether the residuals around the values of
the fitted curve have a significantly larger variance than the
residuals around the mean responses. As seen in Table 1, 64-81% of MT
neurons pass this test for the different tuning functions, indicating
that most of the fits were very good. Moreover, most of the neurons that failed the
2 test still had
R2 values in excess of 0.9. None of
our population analyses differed when these neurons were removed from
the sample, thus we did not exclude neurons from study based on the
2 test alone.
From each direction, speed, and disparity tuning curve, we extracted
two different measures of tuning strength. A modulation index
characterized the amount of response modulation due to stimulus variations, relative to the maximal response
|
(1) |
|
(2) |
For each size-tuning curve, we computed the percentage of surround
inhibition as
|
(3) |
Population statistical analyses
Our data set consisted of single-unit recordings from three different monkeys. Because differences in the distribution of parameters among monkeys could produce misleading results in correlation analyses, all such analyses were done using a within-cells regression in the context of an analysis of covariance (ANCOVA), with monkey identity as an independent factor. Thus all correlation coefficients and P values reported here are corrected for differences between subjects.
| |
RESULTS |
|---|
|
|
|---|
Neuronal database
The sample for this study consists of 501 MT neurons that were
recorded from three monkeys (230 units from monkey B, 166 from monkey J, and 105 from monkey R). For each
of these neurons, we obtained a complete set of data consisting of a
direction tuning curve, a speed tuning curve, a size tuning (area
summation) curve, and a horizontal disparity tuning curve. An example
data set for one of the MT units is shown in Fig. 2. The smooth curves
(
) fitted to the data in Fig. 2 are described by the equations given in Table 1. This example neuron had broad direction tuning, preferred far dots moving at slow speeds and exhibited substantial surround inhibition. This example was chosen for illustration because the R2 values of the fits were average or
slightly below average (compare with median values in Table 1). Thus
this neuron is truly representative of the quality of the curve fits
that we obtained for most MT neurons.
Figure 3A shows a polar
representation of the preferred directions and speeds for 481/501 MT
neurons that had significant tuning for both direction and speed of
motion. It is clear that the population represents all directions of
motion across a broad range of speeds. Figure 3B presents a
summary of the size-tuning properties in which the percentage of
surround inhibition is plotted against the optimal stimulus size.
Surround inhibition was statistically significant for 42% of MT
neurons (P < 0.05, sequential F test, see
METHODS), with surround inhibition in excess of 20%
generally reaching statistical significance (
). These data are
roughly compatible with previous results from MT (e.g., Born
2000
; Raiguel et al. 1995
). Together, the data
of Fig. 3 show that we sampled MT neurons with a broad range of
stimulus preferences.
|
Most of the cells in our sample were significantly tuned (ANOVA,
P < 0.05) for direction of motion (96%), speed of
motion (99%), and horizontal disparity (93%). As our percentage of
disparity-selective neurons was higher than the 68% reported by
Maunsell and Van Essen (1983c)
in the anesthetized
monkey, we worried that eye movements could be a factor. We thus
performed an analysis of covariance on the responses of each neuron
with disparity as the independent factor and vergence angle as a
covariate. Only 59/501 (11%) MT neurons exhibited a significant
correlation between response and vergence angle (ANCOVA within-cells
regression, P < 0.05), and the significance of the
main effect of disparity was unchanged for all but 15 neurons when
vergence angle was included as a covariate in the analysis. Thus
vergence posture was under reasonably tight control, and small
variations in vergence angle from trial to trial did not have a
significant impact on our disparity tuning curves. The lower percentage
of disparity-selective neurons reported by Maunsell and Van
Essen (1983c)
may reflect a different criterion for
significance, which is not stated in their study.
We assessed the strength of tuning for direction, speed, and disparity
using both a modulation index (Eq. 1) and a discrimination index (Eq. 2). Figure
4A shows the distribution of
the modulation index for direction, speed, and disparity tuning
measurements across the population of MT neurons. As numerous studies
have shown previously (e.g., Albright 1984
;
Britten et al. 1992
; Maunsell and Van Essen
1983b
; Mikami et al. 1986
; Snowden et al.
1992
), the distribution of modulation indices for direction
tuning (
in Fig. 4A) is centered at ~1.0 (mean = 0.97), indicating that the difference in response between the preferred
and null directions is typically equal to the difference between the
preferred response and the spontaneous activity level. The distribution
of modulation indices for speed tuning (
) is similarly distributed
around ~1.0, and the mean modulation index for speed (0.96) is not
significantly different from that for direction (paired
t-test, P = 0.21). In contrast to direction
and speed, the modulation indices for horizontal disparity tuning are
not distributed around 1.0; rather the mean modulation index (0.73) is
significantly lower than that for direction or speed (paired
t-test, P
0.0001 for both comparisons). Thus although 93% of MT neurons are significantly tuned for disparity, the
response modulations elicited by varying disparity are somewhat smaller
than those elicited by varying direction or speed.
|
A similar pattern of results is observed for the discrimination index
(Fig. 4B), which takes into account the variability of
responses across repetitions of a given stimulus. The median discrimination indices for direction (0.819) and speed (0.818) are
indistinguishable (sign test, P > 0.5), whereas the
median disparity discrimination index (0.74) is significantly lower
(sign test, P
0.0001 for both comparisons). Thus
individual MT neurons tend to have slightly lower discriminative
capacity for horizontal disparity than for direction or speed of motion.
Quantitative description of disparity tuning curves of MT neurons
We now consider the variety of disparity tuning in MT and describe
methods for parametric characterization of disparity tuning curves. MT
neurons have preferred disparities that cover a large range of crossed
and uncrossed disparities. Figure 5 shows
disparity tuning curves (along with Gabor fits) for 13 MT neurons that
typify the variety of responses that we see across the population. Some of these neurons could be described as near (units 1 and
2) or far (units 11 and 12) cells in
the terminology used by Poggio and colleagues (1988)
.
Units 6 and 7 could be described as tuned-zero cells, whereas the remaining neurons would be classified as tuned-near (units 3-5), tuned-far (units 8-10), or tuned
inhibitory (unit 13). Alternatively, the 13 units in Fig. 5
could well be described as having a continuous range of disparity
preferences and shapes rather than forming discrete classes. As
Prince et al. (2002a)
reported for V1, we see no
evidence that disparity-tuning curves in MT form discrete classes.
|
The smooth curves (
) in Fig. 5 are the best-fitting Gabor functions
for each neuron. Table 2 gives the
parameters of the fits, as well as other relevant information, for each
of these example units. To facilitate interpretation of Gabor fits here and throughout the paper, Fig. 6 shows a
graphic depiction of each of the six parameters of the Gabor function.
Note, in particular, that the phase of the sinusoidal component is
specified relative to the center of the Gaussian envelope (Fig.
6B), such that neurons with phases of 0 and 90° have even-
and odd-symmetric curves, respectively.
|
|
As discussed in the preceding text, Gabor functions provide an
excellent description of the data for ~80% of MT neurons
(
2 test, Table 1). However, the question
arises as to whether a six-parameter Gabor function is
necessary to describe these data. To address this question,
we also fit each disparity-tuning curve with a Gaussian and a sinusoid
(the 2 components of a Gabor function), and we compared the quality of
the fits statistically using a sequential F test
(Draper and Smith 1966
). Figure
7A shows the R2 value of the Gabor fit plotted
against the R2 value for the
sinusoidal fit; one data point is shown for each of 471 MT neurons that
had significant disparity tuning (ANOVA, P < 0.05). In
most cases, the Gabor function accounted for substantially more of the
variance in the data than the sinusoid, and the Gabor function provided
a significantly better fit for 313/471 neurons (
; sequential
F test, P < 0.05). Figure 7B
shows the analogous comparison between Gabor fits and Gaussian fits;
202/471 tuning curves were significantly better fit by a Gabor
function. These comparisons show that many MT tuning curves can be well
fit by either the Gaussian or sinusoidal component of the Gabor
function and do not require both. Thus in a later section when we
analyze the position and phase components of the disparity tuning
curve, we shall focus on a subset of 185 neurons for which the Gabor fit was significantly better than both the Gaussian and
sinusoidal fits.
|
It is important to note that there are a number of factors that affect the relative quality of the Gabor, Gaussian, and sinusoidal fits. For example, units 1 and 3 of Fig. 5 may have been well fit by a sinusoid because we did not test a wide enough range of disparities to observe the response plateau at large disparities. In fact, it seems likely that many of the 158 neurons that were well fit by a sinusoid would have been better fit with a Gabor function over a larger disparity range. This explanation does not account for most of the cells that were adequately fit with a Gaussian, however. Many of these cells simply lacked clear side lobes in their disparity tuning curves (e.g., units 6 and 13 of Fig. 5) or had shallow side lobes that were clipped off because the response to uncorrelated dots (and consequently the baseline level of the Gabor function) was rather low (e.g., unit 10 of Fig. 5). Gabor fits for these cells might have been superior if the responses were averaged across more repetitions of each stimulus. Thus our estimates of the proportion of MT cells that require a Gabor fit is probably quite conservative.
Population analyses and comparisons to other visual areas
When comparing our results to disparity-tuning data from other
areas, it is important to consider the range of receptive field eccentricities studied. For example, most studies of disparity tuning
in primate V1 have focused on the central 5° of the visual field
(e.g., Poggio et al. 1988
; Prince et al.
2002b
), whereas previous studies in MT have generally focused
on larger eccentricities (Maunsell and Van Essen 1983c
;
DeAngelis and Newsome 1999
). Figure 8 shows how a few key parameters of
disparity tuning in MT vary with receptive field eccentricity. Figure
8A shows the preferred disparity of each neuron with
significant disparity tuning (ANOVA, P < 0.05) as a
function of eccentricity. As others have reported (Bradley and
Andersen 1998
; Maunsell and Van Essen 1983c
),
there are more MT neurons tuned to near than far disparities, and this tendency is maintained across the range of eccentricities tested. The
mean preferred disparity across the sample is
0.22, which is
significantly less than zero (t-test, P
0.0001). There is a weak positive correlation between the magnitude of
the preferred disparity and eccentricity (R = 0.09, P = 0.048).
|
Figure 8B shows how the disparity frequency, which is a
measure of the coarseness of disparity tuning, varies with
eccentricity. There is a highly significant negative correlation in
this plot (R =
0.20, P
0.0001) as
expected if the spatial scale of disparity selectivity increases along
with receptive field size at large eccentricities. Finally, Fig.
8C shows the phase of the Gabor function plotted against
eccentricity. Positive values of the Gabor phase denote phase shifts
that move the peak of the Gabor function toward Near disparities (e.g.,
units 3-5 of Fig. 5). The distribution of Gabor phases is
clearly bimodal, such that there are many MT neurons with tuning curves
close to odd-symmetric (phases near ±
/2), whereas relatively few MT
neurons have even-symmetric tuning curves (phases near 0 and ±
).
This pattern contrasts with that seen in V1, where there is a
preponderance of even symmetry (Cumming and DeAngelis
2001
; Prince et al. 2002a
). This difference between areas is not due to different ranges of eccentricities because
the bimodal pattern of spatial phases is seen for MT neurons at all
eccentricities tested.
It is notable, and perhaps somewhat confusing, that the distribution of preferred disparities in Fig. 8A is unimodal, whereas the distribution of Gabor phases (Fig. 8C) is bimodal. This indicates that some other parameter of the Gabor function must counterbalance the phase to distribute the preferred disparities unimodally. We shall see later (Fig. 15) that the position parameter (d0) of the Gabor function accounts for this apparent inconsistency.
We now compare our results from MT with some recently published data
from areas V1 and V4. Figure
9A shows distributions of the
disparity discrimination index (DDI) for MT, V1 (data from Prince et al. 2002b
), and V4 (data from Watanabe
et al. 2002
). Although all three distributions overlap
considerably, it is quite clear that the average MT neuron is more
sensitive to disparity than the average V1 or V4 neuron. The median
values of DDI are 0.74 for MT, 0.54 for V1, and 0.50 for V4. Note that
the most sensitive neurons in V1 and (to a lesser extent) V4 are
comparable to the most sensitive MT neurons. However, relatively few V1
and V4 neurons have DDI values >0.7, whereas the majority of MT
neurons exceed this criterion. It should be noted that there are
considerable stimulus differences between the V4 study and ours.
Responses of MT neurons were generally well sustained throughout the
entire 1.5-s stimulus period. In contrast, Watanabe et al.
(2002)
measured responses to stationary bar stimuli presented
for 1s, and their responses were generally much more transient than
ours. To gauge how much these differences affect the comparison of Fig.
9A, we recomputed DDI values for our MT neurons based on the
first 300 ms of responses following stimulus onset. The median DDI for
MT declined by 14% from 0.74 to 0.64, but is still substantially larger than that seen for V4. Thus it seems clear that MT neurons are
generally more sensitive to horizontal disparity than neurons in V1 and
V4.
|
This difference in disparity sensitivity between areas is unlikely to be the result of differences in the range of eccentricities tested because we found no significant dependence of DDI on eccentricity for our sample of MT neurons (R = 0.05, P = 0.28). Another possible explanation for the difference between V1 and MT in Fig. 9A is that the V1 neurons were tested with dynamic random-dot stereograms in which dots were randomly replotted every few video frames, whereas our MT neurons were tested with stereograms containing dots moving coherently in the preferred direction. We have tested a handful of MT neurons with moving and dynamic stereograms and have found DDI values to be very similar for the two types of stimuli, but our sample is not large enough for a rigorous statistical analysis. Nevertheless, this factor seems unlikely to account for the large difference in DDI values between MT and V1.
Although responses of MT neurons are more strongly modulated by
disparity than responses of V1 neurons, Fig. 9B shows that MT neurons are more broadly tuned. Disparity frequency is plotted against eccentricity for populations of V1 neurons recorded by Prince et al. (2002b)
and B. G. Cumming
(unpublished results), along with our MT data. Over the range of
eccentricities from 2 to 8°, where the data overlap, the median
disparity frequency for V1 (0.89) is significantly higher than the
median value (0.30) for MT (Mann-Whitney U test,
P
0.0001). This difference is confirmed by an analysis
of covariance on the entire data set, which reveals a highly
significant difference between areas (P
0.0001) as well as a significant dependence on (log) eccentricity (partial R =
0.30, P
0.0001). MT neurons are
clearly more broadly tuned for disparity than V1 neurons at comparable eccentricities.
There is a somewhat abrupt falloff in the number of MT neurons with disparity frequencies below ~0.2 cycles/° (see histogram in Fig. 8B). Because the disparity frequency of the Gabor fit was constrained using the Fourier transform of the disparity tuning data (see METHODS), the range of disparities tested imposes a lower bound on the disparity frequency estimate. Thus some broadly tuned neurons would have a lower disparity frequency if a wider range of disparities were tested. Note, however, that any errors introduced by the limited disparity range are in the wrong direction to account for the difference in disparity frequency between V1 and MT.
Relationships between disparity tuning and velocity tuning in MT
To understand population coding of disparity and velocity in MT, it is of considerable interest to know how disparity tuning correlates with the other response properties of MT neurons, namely direction, speed, and size tuning. We thus performed a series of multiple regression analyses with the following parameters as independent variables: direction discrimination index, preferred direction, direction tuning width, speed discrimination index, preferred speed, optimal size, percentage of surround inhibition, eccentricity, and monkey identity. Because direction is a circular variable, we wrapped preferred directions into the range from 0 to 90°, so that we could test for differences between neurons coding horizontal and vertical directions. Variables were log transformed before analysis whenever this improved the normality of the distributions, and all significant effects reported in the following text were verified using nonparametric statistics separately. Because the data set is large and very small effects can reach marginal significance, we adopted a significance criterion of P < 0.01.
We first examined how the DDI depends on other tuning parameters. Only
two variables were significant predictors of DDI: the speed
discrimination index and the preferred speed. Figure
10A shows the positive
correlation between DDI and speed discrimination index (partial
R = 0.21, P = 0.00014), indicating that
neurons with strong speed tuning also tend to have strong disparity
tuning. This effect is not simply due to the fact that the two metrics share a common variability term (see Eq. 2), as a
significant correlation is also found between modulation indices for
speed and disparity (R = 0.18, P < 0.0001). Figure 10B shows that DDI also depends
significantly on the preferred speed of MT neurons (partial
R =
0.19, P = 0.00085), such that
neurons preferring fast speeds tend to have weaker disparity
selectivity. This confirms the result previously reported by
DeAngelis and Newsome (1999)
based on multi-unit
recordings in MT. The lack of any significant correlation between DDI
and direction tuning parameters is also noteworthy and is consistent
with findings of the previous multiunit study as well.
|
We also examined the dependence of disparity frequency on other tuning
parameters. Figure 11A shows
that disparity frequency depends on the preferred speed of MT neurons,
such that neurons preferring fast speeds tend to have lower disparity
frequencies (partial R =
0.25, P < 0.0001). This relationship is sensible if the disparity frequency of MT
neurons is correlated with their monocular spatial frequency
preferences as is the case in V1 (Ohzawa et al. 1997
;
Prince et al. 2002b
). Neurons that prefer low spatial frequencies would be expected to have faster speed preferences, given
that velocity is inversely proportional to spatial frequency in the
Fourier domain. Disparity frequency is not significantly correlated
with direction or size tuning parameters. Figure 11B shows
that disparity frequency is negatively correlated with the absolute
value of the preferred disparity (partial R =
0.53, P
0.0001), indicating that neurons with large disparity
preferences tend to have broad tuning. This is expected if nonzero
disparity preferences are mainly the result of phase shifts in the
disparity-tuning curve, rather than position shifts. The relative
contributions of phase and position components to the disparity
preference will be analyzed further in the following text.
|
For a fixed physical speed of an object through the environment, the
speed of retinal image motion will be much larger when the object is
close to the observer. Thus one might expect to observe a correlation
between the preferred speed and preferred disparity of MT neurons.
Specifically, the expectation would be that cells preferring near
(crossed) disparities might be tuned to faster speeds than cells
preferring far (uncrossed) disparities. Moreover, it is possible that
this behavior would be specific to neurons with surround inhibition, as
these have been proposed to code object motion rather than self-motion
(Born and Tootell 1992
; Born et al.
2000
). To examine this possibility, we performed a two-way
factorial ANOVA across the population of neurons with speed preference
as the dependent variable. Sign of preferred disparity and significance
of surround inhibition were factors. We find no significant tendency
for preferred speeds to depend on either the disparity preference
(crossed vs. uncrossed) or the presence of surround inhibition
(P > 0.15 for both). There was also no significant
correlation between disparity preference and speed preference in a
regression analysis. Thus we see no evidence to support the idea that
the MT population incorporates the ecological relationship between
retinal image speed and object distance. Of course, we might miss such
a relationship because we have measured disparity selectivity not
distance selectivity. If MT neurons only code retinal disparity (a
hypothesis that we are currently testing), then they will be activated
by objects at many different distances, depending on the viewing
distance. Thus ecological considerations might predict little or no
relationship between speed and disparity preferences over the range of
speeds that we have tested. It should also be noted that our findings do not rule out the possibility that individual MT neurons show a
disparity preference that varies with the speed of the stimulus. Because we did not systematically explore the interactions between disparity and speed for single neurons, we cannot address this possibility rigorously. However, in exploring the stimulus space qualitatively during receptive field mapping, we didn't noticed any
clear interaction between disparity and speed in determining the
response of MT neurons.
Relationships between disparity tuning and responses to monocular and uncorrelated stimuli
Previous studies have reported links between disparity tuning and
the monocular response properties of neurons in V1 and V2 (Ferster 1981
; LeVay and Voigt 1988
;
Poggio and Fischer 1977
). We therefore examined how the
sensitivity and shape of disparity tuning depends on monocular
responses of MT neurons. We computed an ocular dominance index (ODI) as
|
(4) |
) and ipsilateral (
) monocular responses against the response obtained with binocularly uncorrelated dots. In general, the monocular responses are about equal to the uncorrelated response. Figure 12B also shows the baseline level of the fitted Gabor
function (R0,
) plotted against the
response to binocularly uncorrelated dots. As can be seen by the tight
clustering around the unity-slope diagonal, these two values are nearly
identical for most neurons. Thus the uncorrelated response corresponds
to the baseline level around which responses are modulated by crossed
and uncrossed disparities.
|
Other studies have reported a correlation between the ocular dominance
of cortical neurons and the shape of disparity tuning. Specifically,
near and far cells were found to be more monocularly driven than
tuned-excitatory cells (Ferster 1981
; LeVay and
Voigt 1988
; Poggio and Fischer 1977
; but see
Prince et al. 2002a
). To examine this relationship for
MT, we computed a monocularity index as
|
(5) |
/4 can be considered as
tuned-excitatory, neurons with phases between ±
/4 and ±3
/4 as
near or far, and neurons with phases between ±3
/4 and ±
as tuned-inhibitory. It is clear from this plot that there is no tendency
for near and far cells to be more monocular than tuned-excitatory neurons. In fact, there is a statistically significant negative correlation between monocularity and Gabor phase (R =
0.13, P = 0.004), such that tuned-excitatory neurons
tend to be slightly more monocular than the other types.
Figure 13A shows that the
DDI is also significantly correlated with Gabor phase, such that
tuned-excitatory neurons tend to have larger values of DDI than the
other types (R =
0.18, P < 0.0001).
This is not due to differences in response variability among tuning
types, as the variability term of the DDI (Eq. 2) is not
correlated with Gabor phase (R =
0.03,
P = 0.55). Rather, the mean responses of
tuned-excitatory neurons are more strongly modulated by horizontal
disparity, and this pattern can be understood in terms of response
modulations around the level of response to binocularly uncorrelated
dots. We assessed the positive and negative response variations around
this level by computing a facilitation index
|
(6) |
0.56, P
0.0001). Neurons with
phases near zero (tuned-excitatory) have large values, indicating that
response modulation due to disparity variation consists mainly of
increased responses above the level of
Runcorr, with relatively little
suppression below the uncorrelated response. In contrast, for
tuned-inhibitory neurons with phases near ±
, the facilitation index
is generally close to zero. For these units, response modulations due
to disparity consist almost entirely of suppression below the level of
the uncorrelated response (e.g., unit 13 in Fig. 5). Because
the amount by which responses can be suppressed depends on the level of
Runcorr, this limits the range of
response modulations that can be exhibited by tuned-inhibitory neurons
and accounts for the phase dependence of the DDI (Fig. 13A).
A similar result was found by Prince et al. (2002a)
|
Phase and position components of disparity tuning in MT
Work done over the past decade or so in V1 has revealed that
disparity selectivity depends on interocular differences in both the
position and phase of monocular receptive fields (Anzai et al.
1997
, 1999
; DeAngelis et al. 1991
, 1995
;
Ohzawa et al. 1996
; Prince et al. 2002a
).
For V1 simple cells, the phase difference between receptive fields for
the two eyes determines the shape (or phase) of the disparity-tuning
curve, and the position disparity between the receptive fields shifts
the disparity-tuning curve horizontally along the disparity axis
(see DeAngelis et al. 1995
; Prince et al.
2002a
). The parameters of the Gabor fit to a disparity-tuning curve are thus directly related to the position and phase disparities of V1 receptive fields (see Prince et al. 2002b
).
Clearly, we should not expect any simple relationship between the
disparity-tuning curve and the monocular receptive fields of MT neurons
because MT receptive fields are much larger than those in V1 and must be the result of a considerable convergence of inputs. Thus the position and phase components of Gabor fits to MT data cannot be simply
interpreted in terms of the underlying receptive field structure, as is
the case for V1 simple cells. Nevertheless, we shall see that it is
useful to analyze the disparity tuning curves of MT neurons in terms of
their position and phase components to better understand how the
disparity preference of MT neurons is determined, and to gain some
insight into the population coding of disparity in MT (see
DISCUSSION).
Figure 14 plots the phase of the Gabor
function (in radians) against the position of the Gaussian envelope for
our population of MT neurons. Neurons for which the disparity tuning
curve is significantly better described by a Gabor function than either a Gaussian or sinusoid are denoted (
, sequential F-test,
P < 0.05 for both). For these 185 units, both the
position and phase parameters of the Gabor fit were well constrained by
the data. We therefore analyzed this subset of neurons to examine the
position and phase components of disparity tuning. Regarding Fig. 14,
it is worth noting that the envelope locations (i.e., position
disparities) are distributed evenly around zero disparity, whereas the
Gabor phases are bimodally distributed away from zero. This tendency toward odd-symmetry is especially clear for the subset of neurons that
require Gabor fits (
).
|