|
|
||||||||
The Journal of Neurophysiology Vol. 87 No. 1 January 2002, pp. 191-208
Copyright ©2002 by the American Physiological Society
University Laboratory of Physiology, Oxford OX1 3PT, United Kingdom
| |
ABSTRACT |
|---|
|
|
|---|
Prince, S.J.D., A. D. Pointon, B. G. Cumming, and A. J. Parker. Quantitative Analysis of the Responses of V1 Neurons to Horizontal Disparity in Dynamic Random-Dot Stereograms. J. Neurophysiol. 87: 191-208, 2002. Horizontal disparity tuning for dynamic random-dot stereograms was investigated for a large population of neurons (n = 787) in V1 of the awake macaque. Disparity sensitivity was quantified using a measure of the discriminability of the maximum and minimum points on the disparity tuning curve. This measure and others revealed a continuum of selectivity rather than separate populations of disparity- and nondisparity-sensitive neurons. Although disparity sensitivity was correlated with the degree of direction tuning, it was not correlated with other significant neuronal properties, including preferred orientation and ocular dominance. In accordance with the Gabor energy model, tuning curves for horizontal disparity were adequately described by Gabor functions when the neuron's orientation preference was near vertical. For neurons with orientation preferences near to horizontal, a Gaussian function was more frequently sufficient. The spatial frequency of the Gabor function that described the disparity tuning was weakly correlated with measurements of the spatial frequency and orientation preference of the neuron for drifting sinusoidal gratings. Energy models make several predictions about the relationship between the response rates to monocular and binocular dot patterns. Few of the predictions were fulfilled exactly, although the observations can be reconciled with the energy model by simple modifications. These same modifications also provide an account of the observed continuum in strength of disparity selectivity. A weak correlation between the disparity sensitivity of simultaneously recorded single- and multiunit data were revealed as well as a weak tendency to show similar disparity preferences. This is compatible with a degree of local clustering for disparity sensitivity in V1, although this is much weaker than that reported in area MT.
| |
INTRODUCTION |
|---|
|
|
|---|
Selectivity for binocular
disparity was initially demonstrated using elongated bar stimuli in cat
area 17 (Barlow et al. 1967
; Pettigrew et al.
1968
) and V1 of the awake monkey (Poggio and Fischer
1977
). Subsequently, Poggio and colleagues (Poggio
1995
; Poggio et al. 1985
, 1988
) examined the
sensitivity to horizontal disparity in random-dot stereograms (RDS) in
macaque V1. None of the studies using RDS has attempted to describe the
disparity tuning quantitatively. Consequently, there has been no
quantitative analysis of the relationship between disparity selectivity
to RDS and other fundamental properties of V1 neurons, such as
orientation tuning and ocular dominance.
There are several reasons why it is important to study these issues
with RDS. First, a change in the disparity of a bar stimulus also
generates changes in the monocular images, which by themselves may
influence neuronal firing. By contrast the monocular images of
random-dot stimuli are spatially homogeneous. There is nothing that can
be discovered about the disparity of RDS by inspecting one eye's image
alone. Second, random-dot patterns contain a complete spectrum of
orientations, which permits horizontal disparities to be explored
regardless of the neuron's orientation preference. With bars or
gratings, only the component of disparity orthogonal to the
stimulus orientation can influence the neuron regardless of
the receptive field properties. Remarkably, there are no published data
that compare selectivity for the horizontal disparity of orientation
broadband stimuli against orientation preference in area V1. Third,
ever since the initiative of Julesz (1964
,
1971
), many psychophysical studies of stereopsis have
used RDS to isolate binocular from monocular processes. To understand
the physiological substrate of such behavior, it is important to use
equivalent stimuli in a species whose psychophysical performance
approaches that of human observers (Harwerth and Boltz
1979
; Harwerth et al. 1995
; Prince et al.
2000
; Siderov and Harwerth 1995
) and in a brain
area where the neuronal performance can potentially account for the
precision of psychophysical performance (Prince et al. 2000
).
We therefore undertook a quantitative survey of the responses to RDS in
a large population of V1 neurons (n = 787) recorded from awake behaving monkeys. This provides a detailed description of
the prevalence, type, and range of disparity tuning in macaque V1 as
well as the relationship between disparity selectivity and other RF
properties. We also tested quantitatively models of the underlying
mechanisms of disparity tuning. Specifically, we sought to determine
whether the observed data are compatible with the "energy" model of
disparity selective neurons (Ohzawa et al. 1990
), which
was developed to describe data from area 17 of the cat. In this model,
binocular simple cells are modeled as linear filters followed by a
static output nonlinearity
a half-squaring operation (Albrecht
and Geisler 1991
; Heeger 1992
; Jagadeesh
et al. 1993
; Movshon et al. 1978
;
Tolhurst and Dean 1987
, 1990
). Disparity selectivity
arises because the response of the linear filter for one eye is summed
with that of a similar filter for the other eye before the
half-squaring operation. Hence the expansive nonlinearity provides an
increase in firing rate when both left- and right-eye filters match the
image. Ohzawa et al. (1990)
suggest that a complex cell
may be constructed by combining four simple cells that have different
monocular phase profiles but are all tuned to the same disparity.
According to the energy model, the shape of the disparity tuning curve
is determined by the shapes of the monocular receptive fields. The
strongest test of the model is therefore to perform a quantitative
comparison of the shape of the monocular subunits and the disparity
selectivity. Such an analysis has recently been performed for simple
cells in anesthetized cats (Anzai et al. 1999b
). The
analysis has not been performed for complex cells because the spatial
nonlinearity of these cells makes it difficult to determine the
receptive field (RF) profile of the subunits. In awake animals, the
analysis is difficult even for simple cells, owing to the complications
created by fixational eye movements (Livingstone and Tsao
1999
).
Fortunately, there are many other tests of the energy model that can be
applied in the absence of direct measurements of monocular RF
structure. These are based on assuming a specific functional form for
the underlying monocular subunits. A suitable function is the Gabor
function, which has been extensively evaluated as a suitable function
for describing both monocular RF profiles of simple cells
(Daugman 1985
; Jones and Palmer 1987
;
Marcelja 1980
) in cortical area V1. Under the assumption
that the Gabor is a correct description for the monocular profiles, the
shape of the disparity tuning curve should be well described by a Gabor function, whose parameters should be related to the spatial properties of the neuron (for example, its orientation and spatial frequency tuning). Some deviations from the Gabor model for V1 cortical neurons
have been previously observed (Hawken and Parker
1987
). Although similar deviations occur within the
present data set, they are slight.
The energy model is commonly combined with the assumption that its
subunits are described by Gabor functions. This has been used widely
(e.g., Fleet et al. 1996a
,b
; Prince and Eagle
2000
; Qian 1994
; Qian and Zhu
1997
) since its inception and will be referred to here as
"Gabor energy model." All experimental data used to assess its
validity (Anzai et al. 1999a
,b
, 1997
; DeAngelis et al. 1995
; Ohzawa and Freeman 1986a
,b
;
Ohzawa et al. 1990
, 1996
, 1997
) have been gathered from
V1 neurons in the anesthetized cat using one-dimensional stimuli (bars
or gratings). The present paper gives a quantitative summary of the
responses of cortical neurons in V1 of the awake monkey to RDS patterns
and examines how well the energy model describes these responses.
The accompanying paper (Prince et al. 2002
)
concentrates on those neurons that show strong disparity selectivity.
The parameters of the curves fitted to the disparity tuning functions
are used to address four questions concerning the mechanism of
disparity selectivity. 1) Is there any evidence for a
distinct grouping into different types of disparity tuning curve?
2) Are interocular differences in RF position or phase used
to generate selectivity for nonzero disparities? 3) What
range of disparities is signaled by V1 neurons? 4) Is the
disparity encoding limited by the periodicity of the tuning curve
(size-disparity correlation)? Together, these two papers provide a
comprehensive, quantitative account of the properties of
disparity-selective neurons in primate V1.
| |
METHODS |
|---|
|
|
|---|
General methods
The methods employed in this experiment for recording from V1 of
the awake behaving monkey have been described in full in Cumming
and Parker (1999)
. All of the procedures carried out complied with the United Kingdom Home Office regulations on animal
experimentation. In brief, extracellular recordings were made from the
striate cortex of two adult monkeys (Macaca mulatta), which
had been trained to perform attentive fixation while viewing visual
stimuli in a Wheatstone stereoscope for fluid rewards. Single-unit
sensitivity to dynamic random-dot stereograms was measured as a
function of horizontal disparity. Sinusoidal and bar stimuli were used
to characterize a variety of other parameters.
Apparatus and single-unit recording
Binocular stimuli were presented on two monochrome monitors
(Textronix GMA 201) driven by a split-color signal from a Silicon Graphics Indigo computer and viewed using a Wheatstone stereoscope. Mean luminance was 188 cd.m
2, the maximum
contrast was 99%, and the frame rate was 72 Hz. The screens were at a
distance of 89 cm from the eyes, such that each pixel subtended 0.98 arc min. For a small number of the later experiments, EIZO FlexScan F78
monitors were used with a mean luminance of 42 cd.m
2. The positions of both of the animals'
eyes were monitored using a magnetic scleral search coil system (C-N-C
Engineering). To initialize a stimulus presentation, the animals were
required to fixate to within either 0.4° (monkey
Rb) or 0.6° (monkey Hg) of a binocularly
presented spot. If the animal failed to maintain fixation within this
window for the trial duration of 2 s, the trial was abandoned and
a brief time-out period ensued. For the majority of trials, oculomotor
control was much tighter than these limits.
Tungsten-in-glass microelectrodes (Merrill and Ainsworth
1972
) were passed transdurally into the opercular cortex.
Extracellular measurements of electrical activity in cortical area V1
were made from the left hemisphere for monkey Hg, and both
hemispheres for monkey Rb. On isolation of a single unit,
the classical minimum response field was determined, and its
orientation preference was measured with a sweeping bar stimulus (see
following text). Ninety-five percent of the RF centers were at
eccentricities between 0.99 and 4.93°.
Measurement of disparity tuning functions
The disparity sensitivity of single units was assessed using
dynamic random-dot stereogram patterns. Each stereo-half consisted of
equal numbers of black and white dots (usually 0.08 × 0.08°) presented against a midgray background with an
overall density of 25%. A new pattern of random dots was used on each
video frame to construct the stereograms. Thus in a 2-s presentation,
there were 144 frames. As an absolute minimum, every measurement of a
disparity-tuning curve was based on at least two trials for each
disparity, which means that
288 different random-dot patterns were
presented to each neuron for each disparity tested. For practical purposes, the sum of this stimulation is spatially homogeneous in each
monocular image. In practice, many more trials were acquired for data
that were subjected to detailed quantitative analysis (see following text).
The stereogram stimuli consisted of a central circular region that
varied in disparity and a surround region that was held at a constant
disparity. The central region was sufficiently large to cover the
monocular receptive fields of the cell even at the largest disparity
tested. The surround region was present to mask the monocular shifts in
stimulus position that accompany the introduction of disparity, and to
provide a reference for simultaneous psychophysical judgments (see
Prince et al. 2000
for details). The disparity of the
surround has been demonstrated not to influence the mean firing rate of
units in V1 (see Cumming and Parker 1999
). An initial test of disparity selectivity was carried out using five stimuli with
disparities varying from
0.4 to 0.4°. If the mean firing rate was <10 spikes/s for all disparities, then the data were discarded. If the cell did not modulate its firing rate with disparity, then measurements of spatial properties were made and another unit was
sought. For a subset of a cells, a wider range of disparities was
sampled at the outset to ensure that cells tuned only to large disparities were not missed. In all cases where disparity tuning was
found, there was some modulation in the range ± 0.4°, even if the maximum response was for a greater disparity.
If the cell modulated its firing rate with disparity, further measurements of disparity sensitivity were made, and the stimulus disparities were adjusted to cover the range over which modulation occurred. In general, the disparity tuning curves analyzed here were used for a variety of other studies, which influenced the choice of sampling. Thus the quantity and range of data gathered varied widely from cell to cell. The number of disparity levels sampled varied from 5 to 34, and the total number of trials varied from 10 to 958. For some cells, responses to binocularly uncorrelated random-dot stereograms were also measured. Disparity tuning functions were recorded from a total of 787 V1 cortical cells, of which 489 were from monkey Rb and 298 were from monkey Hg.
Sensitivity to disparity was first characterized with a binocular
interaction index or BII (Ohzawa and Freeman 1986b
;
Smith et al. 1997b
), which measures the degree to which
the firing modulates with disparity. A BII of near 1 indicates that
disparity variations can modulate the firing rate from zero to the
maximum rate on the disparity tuning curve. A BII of near 0 indicates
that disparity hardly changes the firing rate at all. The binocular
interaction index is defined as
|
(1) |
The Gabor energy model predicts that disparity tuning curves will have
the form of either a one dimensional Gabor function or a Gaussian curve
(see APPENDIX B). Well tuned cells were fit with both of
these models using a nonlinear least squares algorithm (Numerical
Algorithms Group, Oxford). A one-dimensional Gabor function may be
described by the equation
|
(2) |
is the width of the function.
The frequency and phase of the Gabor function are controlled by the
parameters f and
, respectively. Note that phase is
defined relative to the disparity offset. Hence the phase parameter
describes the symmetry of the tuning profile relative to the mean
position of the Gaussian envelope. The operation Pos denotes half-wave
rectification. For the energy model, the binocular baseline firing rate
about which modulations occur (Rmean) is the response of the neuron to dots that are binocularly uncorrelated.
Three constraints were placed on the Gabor fitting. First, the amplitude parameter A was restricted to be less than the observed range of firing rates. This ensured that the fitted curve was limited to a plausible range of firing rates. Note that for the Gabor function, the largest possible range of firing rates is Rmean ± A, so the restriction on A allows for the fitted modulation to be up to twice the experimentally observed range. Second, an upper limit on the frequency of the sinusoidal component (f) was set so that it did not exceed the limit constrained by the data sampling. Third, the disparity offset (d0) was constrained to be within the range of the data samples. For each curve, a Gaussian function was also fit. This is defined identically to the Gabor with the cosine term omitted. A sequential F test was carried out to test whether the Gabor function explained significantly more of the variation in the curves than the Gaussian function. Because these regressions are nonlinear in their parameters, a separate numerical simulation was carried out to confirm that the F test had an appropriate rejection rate.
Many statistical procedures, including regression analysis, rely on an
assumption of homogeneity of variance. This poses a problem for
neuronal firing data, in which the variance is known to be
approximately proportional to mean firing rate (e.g., Dean 1981
; Tolhurst et al. 1981
). Taking the square
root of measured firing provides a variance stabilizing transformation.
This should remove the mean:variance dependency and de-skew the data
(see Armitage and Berry 1994
; Snedecor and
Cochran 1989
), and APPENDIX A shows that the
transform achieves this for our dataset. All statistical analysis and
curve fitting in this paper was hence performed on

Measurement of spatial properties
Orientation preference was assessed using binocular sweeping bar stimuli. Mean firing rate was measured at a number of orientations and a Gaussian curve was fitted to the resulting tuning profile. The position of the peak of this curve was taken to be the preferred orientation. For many units, orientation preference was also tested with binocular, drifting sinusoidal gratings of the optimal spatial, and temporal frequency. A Gaussian curve was also fitted to this orientation response curve, and the peak position was taken as a measure of the preferred orientation. Where both bar and grating stimuli were used to measure the cell's orientation preference, results were generally in close agreement. In these cases, the sinusoidal grating data were used. For both stimuli, the half-width at half height of the fitted Gaussian curve was taken as a measure of the orientation bandwidth.
Preferred spatial frequency was assessed by presenting a drifting grating patch at the preferred orientation. The neuronal response was measured at a number of spatial frequencies (usually 5), and a Gaussian in log frequency was fit to the resulting data. The peak of this Gaussian was taken as a measure of the spatial frequency preference of the cell. When the peak position of the fitted Gaussian was above or below the data range, the data were deemed not to permit the designation of a preferred spatial frequency. It should be noted that spatial frequency sampling was usually sparse (typically at 1, 2, 4, 8, and 16 cpd), and hence our estimates of the preferred spatial frequency for luminance gratings are less precise than estimates of other parameters. Examples of disparity, orientation and spatial frequency tuning curves are shown in Fig. 1. The spatial frequency preference was usually measured after characterizing disparity selectivity. In many cases, the unit isolation was lost before this stage was reached so there are many units for which no spatial frequency tuning is available.
|
Ocular dominance was determined by presenting monocular drifting
gratings of the preferred spatial frequency, orientation, and direction
to each eye alone. For some cells, monocular random-dot stimuli were
interleaved in the main disparity tuning measurement, and a further
estimate of ocular dominance was produced from these. For both stimuli,
the eye that was not being tested viewed a blank, dark screen. The
ocular dominance index (ODI) was defined by LeVay and Voigt
(1988)
as the response of the ipsilateral eye to a monocular stimulus divided by the sum of the ipsi- and contralateral responses (see Eq. 3). Hence, cells that have an ocular
dominance index near 1 have a large ipsilateral response, cells with an
ocular dominance index near 0 have a large contra-lateral response, and cells with an ODI of near 0.5 are well balanced. This can be
re-expressed as a monocularity index MI, where 0 is totally binocular
and 1 is totally monocular (see Eq. 4). It should be noted
that, unlike LeVay and Voigt (1988)
, spontaneous rates
were not measured explicitly in the present work and have not been
subtracted from these measures. However, examination of the mean firing
rate in the prestimulus period suggests that spontaneous rates were
almost always small compared with the responses to random-dot patterns
|
(3) |
|
(4) |
|
(5) |
| |
RESULTS |
|---|
|
|
|---|
Prevalence of disparity tuning
In this section we address two questions. First, we consider whether the degree of tuning for horizontal disparity is distributed continuously or whether there is evidence for a distinct population of disparity selective cells. Second, we examine whether disparity selectivity is correlated with other neuronal properties. To answer these questions, disparity selectivity must be adequately characterized.
The most common way to assess disparity selectivity has been to employ
a "relative modulation" index. For example, Ohzawa and
Freeman (1986a)
and Smith et al. (1997b)
measured disparity tuning using drifting sinusoidal grating stimuli.
They fitted sinusoidal functions to the tuning curves and defined the
BII to be the ratio of the amplitude to the mean firing level. In this
paper, we use a related measure, also referred to as the BII, which is
suitable for use with data from random-dot stereograms (see Eq. 1 in METHODS).
There are several potential difficulties with the BII because it takes
no account of the variability in firing, nor the dependence of this
variability on the firing rate. For these reasons we developed a
different, statistical index that estimates the discriminability of the
maximum and minimum points on the disparity tuning profile. We call
this the disparity discrimination index or DDI
|
(6) |

The DDI essentially compares the difference in firing to the preferred
and least-preferred disparities to the within-stimulus variation in
neuronal firing. If the disparity tuning curve modulates a great deal
and the response to each particular disparity on the curve is
statistically reliable, then this index will be near one. If the firing
rate is not modulated by disparity, then the fluctuations in the
disparity tuning curve will be due to noise and this index will be
small. Better estimates of the term RMSerror are
of course achieved by increasing the number of stimulus presentations. However, increasing the duration over which rates are measured systematically reduces RMSerror, leading to
large values of the DDI. It is therefore important that comparisons of
this type of measure are made between datasets with the same the
interval of time over which firing rates are measured, since changes in
RMSerror alter the value of the DDI, even if
Rmax and
Rmin do not change. The DDI index is
related to the ability of an ideal observer to perform a disparity
discrimination task, given only the measured firing rates of the neuron
at the preferred and least-preferred disparities. See Prince et
al. (2000)
for a further discussion of neuronal discrimination
of horizontal disparities.
Figure 2 compares these two measures of disparity selectivity in a way that reveals three advantages of the DDI. First, the BII is negatively correlated with the mean firing rate, which the DDI is not. The reason is that neurons with low firing rates can spuriously acquire a high value of BII simply due to random fluctuations of an otherwise weak response. Second, and conveniently, the DDI is more or less normally distributed close to a Gaussian in its frequency distribution. Third, and most importantly, the DDI is a better indicator of whether disparity-induced modulation is statistically reliable. For all these reasons, it is more appropriate to use the DDI when examining the correlation of disparity selectivity with other neuronal properties. Figure 3 presents examples of disparity tuning curves with low (A), medium (B), and high (C) disparity tuning indices. Note that error bars on these plots represent the SDs of the firing rate. Across the entire population, the order of DDI values accorded well with judgments by eye of the strength of disparity tuning.
|
|
Figure 2 shows the distributions of both measures of disparity
selectivity (BII and DDI) for the whole population (787 neurons). There
is no evidence of two distinct populations despite the large sample.
Rather there is a continuum in the strength of disparity tuning. A
similar result using gratings has been reported both in the cat
(Ohzawa and Freeman 1986a
,b
) and the monkey
(Smith et al. 1997b
). We also quantified disparity
tuning in a number of other ways, including the maximum rate of change
of firing with disparity and the F ratio from a one-way
ANOVA. None of these showed a separation into two populations.
Because the distribution of disparity selectivity is unimodal, the proportion of neurons that are deemed to be disparity selective will depend on the criterion used. For example, 378/787 (48%) neurons showed significant modulation at the 5% level on a Kruskal-Wallis test. On the other hand, if a one-way ANOVA is used, 431/787 (55%) of our V1 neurons were significant at the 5% level. Figure 2 shows that the DDI is quite closely related to the statistical classification: values of DDI smaller than 0.4 are very rarely the result of significant modulation, and values of DDI >0.6 are almost invariably the result of significant modulation. Even so, some cases where the DDI is >0.6 and statistically significant actually represent very weak tuning (see Fig. 3B). With a more stringent criterion that accepts values of DDI >0.8, most tuning curves were strongly modulated by disparity and could be reliably quantified at later stages.
Comparing selectivity for disparity with other neuronal properties
The preceding section establishes the DDI as a valid measure of the strength of disparity tuning. Figure 4 shows the relationship between the DDI and other neuronal properties.
|
Figure 4A shows a weak, but significant, positive
correlation (rs = 0.23, P
0.002) between the DDI and direction selectivity. Inspection of the plot indicates that it is uncommon for neurons to
show a combination of strong direction selectivity and weak disparity
sensitivity. Figure 4B plots the disparity discrimination index as a function of the degree of monocularity. There is no tendency
for neurons that respond equally to monocular stimulation in each eye
to exhibit a greater sensitivity to disparity, again in agreement with
earlier quantitative data from anesthetized animals. Although
Smith et al. (1997b)
found that simple cells with ocular
imbalance tended to have lower disparity sensitivity than those that
were balanced, they found no relationship for complex cells. In area 17 of the cat, Ohzawa and Freeman (1986b)
reported that the
degree of binocular interaction in simple cells did not depend on their
ocular dominance.
Figure 4D shows no correlation between disparity selectivity
and preferred orientation
cells that are highly sensitive to disparity
are found at all orientations. In practice, only a modest relationship
between orientation preference and disparity sensitivity in random-dot
stereograms is predicted by the binocular energy model (see
APPENDIX A and Fig. 12). Even this is not observed in our
dataset. The correlation between orientation bandwidth and DDI in Fig.
4C is weak compared with that found by Smith et al.
(1997b)
and not statistically significant
(rs=
0.09, P < 0.07). This difference may reflect the fact that Smith et al. (1997b)
used the BII, a measure that depends on mean firing
rate. Indeed in our data, a stronger, statistically significant
correlation was found between the BII and orientation bandwidth
(rs =
0.13, P < 0.012). However, we also found that orientation bandwidth was
negatively correlated with the mean firing rate.
Figure 4E indicates that there is no tendency for disparity selectivity to vary with the preferred stimulus spatial frequency (rs = 0.07, n.s.). Figure 4F shows that there is no tendency for the strength of disparity selectivity to change as a function of eccentricity (rs = 0.03, n.s.). Indeed, the general lack of structure in these data sets is a little surprising at first glance. As a precaution, we re-examined all of these relationships after setting a tighter criterion on the average firing rate achieved at the most preferred disparity on the tuning curve. Previously, this had been 10 impulses/s (see METHODS). Raising this value to 40 impulses/s did not substantially alter the conclusions.
We also examined the relationship between disparity sensitivity and
classification as simple or complex. Cells were classified by the
method of Cumming et al. (1999)
(see
METHODS). Of 226 neurons to which this analysis could be
applied, 57 were classified as simple. It has previously been claimed
(e.g., Poggio et al. 1985
) that simple cells do not
respond to random-dot stereograms and also rarely show disparity
selectivity to RDS. It is important to consider these two issues
separately. There is a significant negative correlation between mean
firing rate to RDS patterns and F1:F0 ratio
(rs =
0.377, P
0.0001), confirming that simple cells tend to have lower firing rates
on average than complex cells in response to RDS. This is unsurprising
because simple cells can only respond to those dot patterns that happen
to match their monocular phase preferences. Complex cells may be
stimulated by all dot patterns. As a consequence of the relationship
between the F1:F0 ratio and the response rates to RDS, it is important to use a measure of disparity selectivity that is not influenced by
mean firing rate when comparing simple and complex cells. The DDI has
this property, and we found no relationship between the F1:F0 ratio and
DDI (rs = 0.026, NS). Fig.
5C shows an example tuning
curve from one simple cell that is strongly selective for disparity in
these random-dot stereograms. We conclude that both simple and complex
cells respond to dynamic random-dot stereograms and vary their response
as a function of the disparity of such patterns.
|
For many cells, disparity tuning was also measured for drifting
sinusoidal gratings as a function of the interocular phase difference.
The spatial frequency, temporal frequency, orientation and drift
direction of these gratings were matched to the preferred values for
each unit. The disparity discrimination index for sinusoidal gratings
was significantly correlated with the disparity discrimination index
for random-dot stereograms (rs = 0.28;
P
0.00015, n = 176). One reason why
this correlation might be less than perfect is that if the spatial
properties of the grating or the RDS pattern are not optimal for the
neuron, this may limit the DDI. A second reason is that mis-sampling in
either tuning curve inevitably affects the value of DDI that is
measured. Nonetheless, neurons that exhibit disparity tuning to dynamic
random-dot patterns also generally exhibit tuning to sinusoidal grating stimuli.
Description of disparity tuning curves
To summarize the population of disparity tuning curves, it is
useful to fit analytic functions to the disparity tuning data and
examine the resulting parameters. The energy model predicts that the
disparity tuning profile for dynamic random-dot stereograms will take
the form of the horizontal cross-correlation between the left- and
right-eye receptive field shapes (see APPENDIX B). We did
not measure directly the shape of the monocular receptive fields.
Previous measures of this property have concluded that a reasonable
description of the monocular receptive fields can be delivered by Gabor
functions, which consist of a sinusoid multiplied by a Gaussian
envelope (Daugman 1985
; Marcelja 1980
;
but see also Hawken and Parker 1987
). Under
these circumstances, the energy model predicts that the disparity
tuning functions should be well described by a Gabor function, which is
the function we therefore fit to the data (as described in
METHODS).
The requirements for this stage of the analysis were that neurons
should be strongly modulated by disparity and their modulation should
have been reliably characterized. The DDI provides a good estimate of
the extent to which neuronal activity is modulated by disparity, but it
provides no statistical assurance that there was reliable
stimulus-related variance within the tuning profile as a whole. We used
another metric to select neurons for further analysis, the
Findex, given by
|
(7) |

Six disparity tuning profiles and their associated Gabor fits are
illustrated in Fig. 5. The Gabor function described the disparity
tuning data extremely well: for 163/253 neurons the fit accounted for
90% of the variance attributable to disparity and for 233/253
neurons the fit accounted for
75% of the variance. Only a few
disparity tuning functions were not well fit by Gabors. For most of
these cases, the disparity tuning curves appeared to lack any coherent
form. The only systematic deviation from the Gabor model that was noted
is illustrated in Fig. 5, E and F. For these
curves, the side lobes of the disparity tuning function are noticeably
wider than the central peak. This is the same form of deviation from
the Gabor model noted by Hawken and Parker (1987)
. However, its effect in this dataset of disparity tuning curves is
slight. Within this data set, the examples in Fig. 5, E and F, are relatively poor fits: 68% of all fits accounted for
a larger fraction of the variance than that accounted for by
the fit in Fig. 5F. Nonetheless the best-fitting Gabor
functions in Fig. 5, E and F, still capture the
essential features of these tuning curves. In particular, the fitted
phase provides an accurate reflection of the degree of symmetry in the
tuning curve, and the fitted position of the Gaussian envelope
describes well the center of the disparity range over which modulation occurs.
One parameter that needs to be interpreted with care is the fitted
frequency. Figure 6 shows an example with
two different fits to the same disparity tuning data. The sinusoidal
and Gaussian components of the fitted Gabors are shown separately in
the bottom two panels. Although the fitted frequencies are
very different, these combine with equally different Gaussians to
produce very similar looking Gabor functions. To provide a fitted
measure that fairly reflected the spatial scale of the modulation in
disparity, we adopted a procedure in which the frequency of the Gabor
function was derived directly from the data. We examined the
Fourier spectra of the disparity tuning curves after the DC component
had been removed (Fig. 6). The frequency of the sinusoidal component of the Gabor was set to be the equal to Fourier component with the greatest energy, which we term the "disparity frequency" of the tuning curve (following Ohzawa et al. 1997
). The
remaining five parameters of the Gabor curve were then refit. In all
cases the shape of the new fit was extremely similar to the previous
fit. However, the "disparity frequency" corresponded much better
with the scale of the tuning curves as assessed "by eye."
|
We also investigated whether a more economic model than a Gabor
function would suffice. Many curves could be well described by a
four-parameter Gaussian model. We found that for 142/253 cells, the
addition of the frequency (f) and phase
(
) components for the Gabor model did not provide a
significant improvement in the fit (sequential F test,
P
0.05
see METHODS for details). Fig.
7, A and B,
demonstrates two curves that are well described by Gaussian functions.
There is no suggestion of a sinusoidal component in these disparity
tuning curves. Figure 7C presents an example of a borderline
case, in which the Gabor model is significantly better than a Gaussian
model but only at the 5% level. The disparity tuning curves in Figs.
1A and 5A provide examples where Gaussian models
are insufficient. Note that a failure to demonstrate statistically that
a Gabor fit is necessary does not guarantee that the underlying tuning
is truly Gaussian, only that we cannot reject that possibility. If the
underlying Gabor shape had relatively shallow side lobes (i.e., the
frequency is low relative to the SD), then our sampling may not have
been fine enough to detect these reliably.
|
For neurons that are adequately described by Gaussian tuning profiles, the frequency term in Eq. 4 is poorly constrained: provided the period of the cosine term is large relative to the SD of the Gaussian, it has little influence. This does not mean that the measure of "disparity frequency" is uninterpretable. Somewhat paradoxically, Gaussian tuning curves give rise to a peak "disparity frequency" in the continuous Fourier transform (see Fig. 6D) of the disparity tuning data, because the DC component is removed prior to the transform. A similar issue arises with neurons whose disparity tuning is odd symmetric and broadly tuned for disparity, for which a low-frequency, odd-symmetric Gabor is the most appropriate functional form. The disparity frequency gives a consistent measure of the spatial scale of the disparity modulation that can be applied to both Gabor-shaped and Gaussian tuning curves. For this reason, all subsequent analysis is performed on Gabor fits in which the frequency term was not a free parameter, but was set to the disparity frequency.
For those cells where the Gaussian model is sufficient, the Gabor still
yields an equally good description even though some of the parameters
of the Gabor are poorly constrained. Two important parameters are still
well constrained, even in these cases. The first is the horizontal
position term. The curve in Fig. 7A is both Gaussian in
shape and requires a nonzero position term. The second is the phase
term, which is inevitably near zero or
, because the
Gaussian shape is symmetrical. When applying population analyses to the
shape of tuning curves, the shape of the fitted Gabor is used even for
cells where a Gaussian would have been adequate.
Relating disparity tuning to spatial properties
The preceding section demonstrates that Gabor functions provide highly accurate descriptions of the shape of disparity tuning functions in primate V1. Given that many underlying monocular tuning curves are often well described by Gabor functions, this represents a successful prediction of the energy model. It would be difficult to reconcile the energy model with disparity tuning curves that looked very different from Gabor functions. Of course, this does not demonstrate that the shape of each disparity tuning function is explained by the monocular RF structure of that particular cell. Because we did not measure monocular line-weighting functions, we rely on other measures to test the link between the spatial properties of the RF and the disparity tuning function.
Before attempting such an analysis, it is important to restrict the sample to neurons that are well tuned and well described by the Gabor fits. As described in the preceding text, we initially fit Gabor functions only to the 253 neurons with Findex > 0.8 that had been sampled at seven or more disparities. Next, we rejected 13 cells for which the Gabor fit accounted for <75% of the between-disparities variance. Also, we excluded 48 cells for which our measurements of disparity tuning covered <2 SDs of the fitted Gabor (which is to say the sampled disparities did not adequately constrain the fit). All of these cases also had a minimum of three samples per period of the disparity frequency (i.e., the sampling was always above the Nyquist limit). Finally we removed a further 12 cells by hand for which the Gabor did not provide a realistic description of the variation in the curve. After these refinements, 180 tuning curves remained, for which the fit of the Gabor function was both extremely good and adequately constrained. Note that this group includes tuning curves that could be described by Gaussians, as the Gabor function still provides a good fit to the data with a Gaussian form.
The Gabor energy model makes clear predictions about how the form of the disparity tuning curve should depend on preferred orientation and spatial frequency (see APPENDIX B for a detailed discussion). First, the sinusoidal component of the Gabor curve should increase in scale as the preferred orientation of the neuron moves from vertical to horizontal (when preferred spatial frequency is held constant). When the orientation approaches horizontal, the period of the sinusoid becomes very broad, and the tuning profile either becomes Gaussian or flat, depending on the original symmetry of the curve (see APPENDIX B, Fig. 12).
This prediction was tested by taking the ratio of the wavelength of the
sinusoid,
= 1/f, to the SD parameter,
.
This provides an estimate of the number of cycles in the tuning curve.
As this ratio decreases, the curve becomes more nearly Gaussian. Figure 8 shows the ratio plotted as a function
of the preferred orientation. There is a marked absence of points in
the upper left quadrant, indicating that cells with orientations near
horizontal tend to have more Gaussian tuning curves as predicted by the
energy model. The relationship is statistically significant: for cells
with orientations within 45° of horizontal, the spread of
values for this parameter
/
was significantly smaller than for
those with orientations within 45° of vertical
(F test, P < 0.005). Nonetheless, some
cells with vertically oriented receptive fields also exhibit disparity
tuning curves that are well described by a Gaussian. This is what might
be expected if the spatial frequency tuning of these cells was very
broad.
|
In fact, it is notable that the mean number of cycles per SD of the
Gabor envelope is only 0.25 for the whole population. Hence, the
Fourier amplitude spectra of the majority of the tuning curves are
effectively low-pass. This is surprising because the Gabor energy model
predicts that the bandwidth of the tuning curves should strictly be
narrower than that of the underlying RFs measured monocularly with
sinusoidal gratings (APPENDIX B). For sinusoidal luminance
gratings, primate V1 neurons recorded under anesthesia typically have
spatial frequency bandwidths (full width at half height) of ~1.5
octaves (DeValois et al. 1982
). This corresponds to 0.38 cycles/SD, which is substantially larger than the majority of values in
Fig. 8, even for vertically oriented cells. These data are highly
suggestive of a discrepancy between the bandwidth of the disparity
frequency and the tuning for sinusoidal luminance gratings. However,
directly comparable data of sufficient quality were not available on
sufficient neurons to evaluate this hypothesis fully.
The second prediction of the model is that the frequency at which the
disparity tuning function modulates (disparity frequency) should equal
the spatial frequency of a horizontal section through the RF (or RF
subunits, for complex cells). This is closely related to the spatial
frequency of a horizontal cross section through the preferred grating
stimulus (the "horizontal frequency"). Figure 9 compares the disparity frequency and
the horizontal frequency. The data are clustered around the identity
line, indicating that the spatial scale of the disparity tuning curves
is on average similar to that predicted by spatial properties. The
correlation between disparity frequency and the horizontal frequency is
also significant, but it is weak (Spearman's rank correlation
co-efficient, rs = 0.36, P
0.01, n = 52). Note that the
predicted relationship does not necessarily follow the identity line:
the estimated disparity frequency never reaches very low values because
the Gaussian envelope of the tuning curve places a lower limit on this
parameter. This explains why some of the points on the left of the
graph lie above the identity line. Nonetheless this phenomenon cannot
entirely explain the weakness of the correlation. Even when the
analysis was restricted to tuning curves that were not Gaussian in
shape, the correlation was similar. Ohzawa et al. (1997)
compared the disparity frequency perpendicular to the RF orientation
with the preferred spatial frequency of complex cells in cat area 17. They also found only a weak relationship, and the linear regression had
a slope that was considerably less than unity.
|
Relation between monocular responses and disparity tuning
The energy model makes several predictions about the relationship
between monocular and binocular responses. First, purely monocular
cells, that lack excitatory input from one eye, should not be disparity
selective. In fact, the extent of disparity selectivity was found to be
unrelated to ocular dominance (Fig. 4F). Second, the
response to binocular uncorrelated RDS should equal the sum of the
monocular responses to RDS (APPENDIX B). We took the
baseline firing of the fitted Gabor as a measure of the response to
uncorrelated RDS. This baseline firing reflects the binocular response
to stimuli of large disparity: because V1 RFs are small, large
disparities mean that the stimulus within the RF is binocularly uncorrelated. In 18 cases, we measured the responses to binocularly uncorrelated dot patterns. These independent measures of activity were
closely correlated with the fitted baseline of the Gabor function
(rs = 0.917, P
0.0001, n = 18). Thus V1 neurons calculate binocular
correlation over a finite area. Moreover, the range of disparity
samples was broad enough in our experiments to determine the response
to uncorrelated binocular stimuli from the flanks of the disparity
tuning curves. Taking the baseline firing of the fitted Gabor as a
measure of the binocular response, this response was well correlated
with strength of the monocular responses, but was usually closer to
their mean rather than their sum (see Fig.
10A).
|
The third relationship predicted by the energy model is that the ability of changes of disparity to cause changes in the firing rate of the neuron should be determined by the strength of monocular responses (see APPENDIX B). Once again, although there is a significant correlation (Fig. 10B), the observed amplitude of modulation tends to be smaller than predicted. Both this observation and the previous one show that the monocular responsiveness to random-dot patterns does play an important role in determining the binocular response to RDS. This is only in partial agreement with the energy model because some additional factor (perhaps response normalization) leads to lower than predicted activity in response to binocular stimuli.
Architecture of disparity tuning
In this section, we address the question of whether there is a
"functional architecture" for disparity tuning in macaque V1. Other
properties of cells in the striate cortex are known to be organized in
a systematic fashion, such as the columnar organization of orientation
preference (Hubel and Wiesel 1977
). For cat visual cortex, Blakemore (1970)
proposed the existence of
"constant depth" columns, in which the preferred disparity was the
same for all units. However, LeVay and Voigt (1988)
found that neurons with similar disparity preferences were only weakly
clustered together.
A recent method used to examine the functional architecture of
disparity in MT has been to compare single- and multiunit data recorded
at the same site (DeAngelis and Newsome 1999
). During many experiments, we recorded both a clearly isolated spike from the
neuron under investigation and a mixture of unisolated spikes from
nearby neurons (multiunit activity). We used this approach to compare
disparity tuning curves for the multiunit data with that for the
isolated single spikes. This was performed for 195 sites where we had
recorded substantial multiunit activity, the distinction between the
isolated unit and the multiunit spikes was extremely clear, and there
was no slow drift in the number of multiunit events over time.
Figure 11, left, shows a
plot of the disparity discrimination index for single-unit data as a
function of the same parameter for the multiunit data. These are
significantly correlated (r = 0.36, P
0.0001, n = 195), suggesting that disparity-selective neurons are to some extent clustered together as if there is a columnar
organization. This is not merely a consequence of the functional
architecture for ocular dominance: the disparity discrimination index
was found to be independent of the ocular dominance for both the
single-unit data (see Fig. 4F) and the multiunit data (not
shown). The plot also shows data from a study of disparity tuning in
visual area MT (DeAngelis and Newsome 1999
), which has been re-analyzed using the same method. Cells in MT generally exhibit
much stronger disparity tuning. Moreover, the correlation between the
degree of disparity tuning is considerably stronger in MT
(r = 0.61, P
0.0001) than in V1.
|
Given that disparity tuning tends to be present in the multiunit
data when the single unit is itself disparity tuned, it is appropriate
to consider whether the disparity tuning profile of nearby neurons
tends to be similar. To maintain compatibility with the work of
DeAngelis and Newsome (1999)
, we selected cells for
which both the single- and multiunit responses were significantly modulated by disparity (ANOVA, P < 0.05). We then fit
cubic splines to each response profile and took the position of the
maximum as a measure of the preferred disparity. Figure 11,
right, shows the preferred disparity for the single-unit
data plotted as a function of the preferred disparity of the multi-unit
data. There is a weak correlation between the preferred disparities,
demonstrating that the organization of disparity sensitivity in V1 is
not random. However the correlation is much weaker than the one found
in MT (DeAngelis and Newsome 1999
).
As a check on the validity of our procedures, we carried out a similar
analysis that compared the orientation preferences of single units in
V1 against multiunit data from the same site. As expected, the
relationship here was much stronger than that for disparity. For 106 recording sites tested with sinusoidal gratings, the correlation was
0.972, and for 102 recording sites tested with sweeping bar stimuli,
the correlation was 0.870 (both values highly significant). We conclude
that if there is a correlation between the preferred disparity of
nearby neurons, it is considerably weaker than the correlation in area
MT (DeAngelis and Newsome 1999
) and also much weaker
than the organization for orientation in V1.
| |
DISCUSSION |
|---|
|
|
|---|