|
|
||||||||
Laboratory of Sensorimotor Research, National Eye Institute, National Institutes of Health, Bethesda, Maryland 20892-4435
Submitted 11 December 2002; accepted in final form 10 July 2003
|
|
ABSTRACT |
|---|
|
|
|
INTRODUCTION |
|---|
|
The current best description of the operation of these cells is provided by the energy model (Adelson and Bergen 1985
; Fleet et al. 1996
; Ohzawa 1998
; Ohzawa et al. 1990
; Qian 1994
), sketched in Fig. 1A and described more fully below. This elegant model has been extremely successful in explaining qualitatively the properties of disparity-tuned neurons in V1, for example, the shape of the binocular receptive field obtained with disparate bar stimuli and the shape of the disparity tuning curve obtained with random-dot stereograms (Anzai et al. 1999a
; Cumming and Parker 1997
; Ohzawa et al. 1996
, 1997
; Prince et al. 2002b
).
|
Although the energy model was originally intended as a qualitative description, its success to date suggests that it may be possible to elaborate the model so as to provide a good quantitative description of neuronal behavior. Extending the energy model requires identifying those quantitative discrepancies that have not so far been reconciled with the original structure of the model. First, the response to anticorrelated random-dot stimuli (when contrast polarity is inverted in one eye) must be accounted for. In real cells, anticorrelation inverts the disparity tuning curve and also reduces the amplitude, whereas the energy model predicts inversion only (Cumming and Parker 1997
; Ohzawa et al. 1997
). The amplitude reduction can be explained if we modify the energy model by incorporating threshold nonlinearities before binocular combination (Read et al. 2002
). This modified version of the energy model is shown in Fig. 1B.
Second, the energy model predicts that monocular stimulation in either eye should always have an excitatory effect. In this study, by making a quantitative comparison of monocular and binocular responses, we confirm that there are many cells in which input from one eye always seems to suppress the cell's response, as previously reported by others (Ohzawa and Freeman 1986a
; Prince et al. 2002a
). Such behavior is inconsistent with linear binocular combination, but is predicted by our modified version (Read et al. 2002
).
Third, the energy model predicts that the response to binocularly uncorrelated random-dot patterns should equal the sum of the responses to monocular random-dot patterns. In fact, it is generally much closer to their mean. It has been suggested (Prince et al. 2002b
) that this is a consequence of a contrast-normalizing mechanism that tends to boost the response to monocular. We show here that the relative size of the monocular and binocular response can be explained by our modification to the energy model, without the need to invoke a normalization process.
Fourth, a key prediction of the energy model is that the shape of monocular receptive fields determines the shape of the disparity-tuned response. Although this has been verified for simple cells in the cat (Anzai et al. 1999b
), the situation for complex cells is less clear because it is difficult to estimate the receptive fields of the subunits. Fortunately, the model can also be tested in the frequency domain: the Fourier power spectrum of the disparity-tuning curve should match the shape of the monocular spatial frequency-tuning curves. Preliminary testing of this prediction has indicated a conflict with the energy model prediction (Ohzawa et al. 1997
; Prince et al. 2002b
); however, for a number of reasons, the seriousness of this conflict is hard to assess. Prince et al. (2002b
) found that their disparity tuning curves often had spatial frequency bandwidths substantially larger than those estimated from luminance gratings in other studies (de Valois et al. 1982
). However, Prince et al. measured tuning for horizontal disparity, so these data are not directly comparable with selectivity for the spatial frequency (SF) of luminance gratings at the preferred orientation. Ohzawa et al. (1997
) found that the frequency of the disparity-tuning curve tended to be lower than the preferred spatial frequency revealed with monocular luminance gratings in the dominant eye, apparently contradicting the energy model. However, their definition of disparity frequency could potentially obscure an underlying agreement with the energy model (see below); also, confidence intervals were not presented. Most important, neither of these studies reported measures of spatial frequency tuning in both eyes. The original energy model assumes that spatial frequency tuning is identical in the two eyes, so it is possible that the discrepancies could be attributable to binocular differences in spatial frequency tuning. If this were the case, it would be easy to extend the energy model to take account of this difference, by allowing different spatial frequency tuning between subunits, either within an eye or between eyes. The data should then agree with this generalized version of the energy model. Thus, a more complete comparison of the spatial frequency tuning and the power spectrum of the disparity tuning is necessary to test the model.
To resolve this important question, we recorded the monocular spatial frequency and orientation tuning in both eyes. This is compared with the selectivity for disparity applied to random-dot patterns along an axis orthogonal to the preferred orientation. Both comparisons systematically violate the predictions of the energy model, even after it has been generalized to allow for differences between subunits. The disparity-tuning curves show more power at lower frequencies than is possible within these models, even allowing for the presence of several subunits that may differ in position and/or spatial frequency tuning. However, once again, the results may be explained by our modified version of the energy model incorporating a threshold nonlinearity before binocular combination.
In summary, therefore, we have compared two families of models of disparity selectivity: 1) the energy model and a set of generalizations of it, all postulating linear binocular summation; and 2) our modified version incorporating threshold nonlinearities before binocular combination. For a wide range of observations, the data are quantitatively at odds with the linear model and can be accounted for by the threshold model. We conclude that adding thresholds to the energy model, before inputs from the two eyes are combined, represents a substantial step forward in our understanding of disparity selectivity in V1.
|
|
METHODS |
|---|
|
Stimuli were generated on a Silicon Graphics Octane workstation and presented on two Eizo Flexscan F980 monitors (mean luminance 41.1 cd/m2, contrast 99%, frame rate 72 Hz) viewed through a Wheatstone stereoscope, in which the monitors are viewed through mirrors positioned in front of the animal's eyes. At the viewing distance used (89 cm) each pixel in the 1,280 x 1,024 display subtended 1.1 min arc. Antialiasing was used to render with subpixel accuracy [pixels are colored intermediate shades of gray to represent edges that only partially cover the pixel (Foley et al. 1990
)]. Glasscoated platinumiridium electrodes (FHC) were placed transdurally each day. Electrode position was controlled with a custom-made microdrive that used an ultralight stepper motor mounted directly onto the recording chamber.
The monkeys initiated a stimulus presentation by maintaining fixation on a binocularly presented spot to within ±1°. They were required to maintain fixation at this accuracy for 2.1 s to earn a fluid reward. During each such trial, 4 stimuli were presented, each lasting 420 ms, separated by 100 ms.
Stimuli
Sinusoidal luminance gratings were used to determine the minimum response field, spatial frequency, and orientation tuning of the cell. After an initial determination of the preferred spatial frequency and orientation, the monocular orientation-tuning curve in each eye was obtained using a circular patch of grating with spatial frequency reasonably close to optimal. Quantitative orientation-tuning curves usually spanned a range of 180° centered around the preferred orientation (or direction, for direction-selective cells). The spatial frequency tuning curve (SFTC) was then obtained using a large rectangular grating patch at the preferred orientation. The frequencies generally spanned 0.0625 to 16 cycles per degree (cpd) in steps of 1 octave. A pseudo-random sequence interleaving frequencies and eye of presentation was used in both cases. During monocular trials, the nonstimulated eye viewed a uniform screen of the same mean luminance.
Dynamic random-dot stereograms were composed of black and white dots, scattered at random on a gray background. The dots were usually 5 x 5 pixels (0.1° x 0.1°); for some cells, a different size was used if this enhanced the response rate. A new random stereogram was generated every frame (72 Hz). The dot density was sufficient to cover 50% of the gray background but, because the dots were allowed to overlap one another (dot location was randomly assigned with subpixel precision using antialiasing), the total coverage was slightly less. On average, 20% of pixels were black, 20% white, and 60% gray. Figure 2 shows an example stereogram, together with a circle indicating the size of typical V1 minimum response fields for comparison.
|
The energy model assumes that all receptive fields feeding into a cell have the same orientation. Its predictions are therefore most easily framed in terms of disparities parallel and orthogonal to this orientation, rather than horizontal and vertical disparities. Accordingly, to facilitate testing of energy model predictions, experimental disparities were applied along the axis orthogonal to each neuron's preferred orientation. These covered the range from 1.2° to +1.2° in the initial test for disparity selectivity, with the range 0.6° to +0.6° covered in steps of 0.1°, and steps of 0.2° outside this range. A larger range of disparities was used if necessary to ensure that there was no modulation at the extremes of the tuning curve (i.e., that the full response range had been explored). In neurons with preferred SFs >4 cpd, the central region of the curve was sampled more finely, to ensure that sampling exceeded the Nyquist limit predicted from the monocular SF tuning.
Data analysis
Data analysis such as curve fitting is greatly simplified if we can make the assumption that variance is constant across the data set. This assumption is invalid for neuronal firing rates, whose variance tends to increase in proportion with the mean (Dean 1981
). However, the square root of firing rates has variance that is roughly constant, independent of the mean (Cumming and Parker 2000
; Prince et al. 2002b
). This variance-stabilizing transformation greatly simplifies the analysis of neuronal data. For this reason, we performed all our analysis on the square root of the recorded firing rates.
To quantify the strength of disparity tuning, we used the disparity discrimination index (DDI) introduced by Prince et al. (2002b
)
![]() | (1) |
, respectively, and RMSerror is the square root of the residual variance around the mean
recorded across the whole tuning curve, including the response to uncorrelated stimuli (effectively, infinite disparity). Like the more familiar binocular interaction index, (Rmax Rmin)/(Rmax + Rmin), this is a contrast measure, except that here the difference in response between the preferred and null disparity is contrasted not with the mean response, but with the variability of the firing rate RMSerror. This means that cells in which the range in firing rates is largely the result of random fluctuations are not wrongly classified as being highly sensitive to disparity; equally, cells in which the change in firing as a function of disparity is relatively small but highly reliable are correctly described as strongly disparity-tuned. The term (Rmax Rmin) in the denominator of Eq. 1 ensures that the index is bounded at 1 when the variability is small. For most cells, monocular random-dot stimuli were also presented, in trials interleaved with binocular stimuli. Blank stimuli were also usually interleaved, where both eyes viewed a blank screen of the same mean luminance as the random-dot patterns. These were used to obtain an estimate of the spontaneous firing rate.
To allow a cell into the study, we required that binocular random dots at the optimal disparity elicit a response of at least 10 spikes/s. To proceed to quantitative analysis of the response's shape, we further required i) ANOVA indicates a significant (P < 0.05) main effect of disparity; ii) the disparity discrimination index exceeds 0.375. The second condition removes neurons with weak but significant disparity tuning because these tend to produce noisy estimates in the quantitative analysis that follows. Including these weakly tuned neurons did not change any of the substantial results; it only increased the scatter. To subject monocular spatial frequency data to quantitative analysis, we required that i) the optimal drifting grating in that eye elicits at least 10 spikes/s; ii) ANOVA indicates a significant (P < 0.05) main effect of spatial frequency. We do not require tuning to be band-pass, and our sample included a few neurons that showed a low-pass spatial frequency tuning.
Fitting tuning curves
We summarized our tuning curves by fitting them with analytical functions. If we fitted the function directly to the mean firing rates, we would have to reduce the weight given to residuals at higher firing rates, to take account of the higher variance there. As explained above, we avoided this complication by, instead, fitting the square root of our chosen fit function to the mean of the square-root firing rates. Given that
has approximately constant variance, we could then just minimize the sum of the squared residuals, without needing to weight them differently.
SFTCs were fitted with Gaussians, in either log or linear frequency space, whichever minimized the residuals. These had 4 free parameters: frequency of the peak f0, standard deviation
, baseline and amplitude above the baseline. The baseline was assumed to represent the spontaneous firing rate; thus, it was not allowed to be negative. The peak frequency was constrained to lie within the range of stimulus frequencies. The amplitude was not allowed to exceed twice the range of the response.
These fitted curves were used to extract a peak frequency, a low-frequency cutoff, and a high-frequency cutoff, defined as the positions where the tuning curve falls to half its maximum. Where the SFTC was fitted with a Gaussian in linear frequency, with peak at f0 and standard deviation
, the high and low cuts are
. Where the tuning curve was fitted with a Gaussian in log frequency, with standard deviation
in log space, the high and low cuts are
.
Disparity-tuning curves were fitted with half-wave rectified one-dimensional Gabor functions (the product of a sinusoid with an exponential; cf. APPENDIX B). The original energy model predicts a Gabor disparity-tuning curve, provided that the monocular receptive fields are narrow-band Gabor functions differing only in their position and phase. However, our main motivation for using Gabor functions is that they provide a succinct description of most experimental tuning curves (Cumming and Parker 1997
; Ohzawa et al. 1990
; Prince et al. 2002a
). In the RESULTS section, we verify that our conclusions do not depend on the use of a fitted Gabor. One-dimensional Gabors have 6 free parameters: the spatial frequency f and phase
of the carrier cosine, the standard deviation
, amplitude A and center
0 of the Gaussian envelope, and the baseline firing rate B about which the sinusoid oscillates. Uncorrelated responses, if available, were included in the fitting; the expected response to uncorrelated stimuli is just B.
0 was constrained to lie within the range of stimulus disparities and the amplitude A was not allowed to exceed twice the difference between the maximum and the minimum response. The spatial frequency of the fit was not allowed to exceed half the Nyquist limit (i.e., one-quarter of the maximum spatial sampling rate of the data). Although these curves generally gave good descriptions of the tuning curves, the parameters of the fitted Gabor must be interpreted with care (see Prince et al. 2002b
). When using these fits to summarize some property of the tuning curve, we therefore used appropriate measures applied to the fitted curve (illustrated by our measurement of disparity peak frequency in the next section), rather than using the parameters of the fit.
Disparity Fourier spectrum
The original energy model predicts that the disparity peak frequency, the frequency at which the disparity modulation has most power, should be the same as the preferred spatial frequency observed with monocular gratings. In making this comparison, the disparity must be applied at right angles to the cell's preferred orientation. Most previous work using random-dot stimuli in awake animals has employed only horizontal disparities. To enable a test of the energy model prediction, all disparities in the present study were applied orthogonal to the cell's preferred orientation (Cumming 2002
). It is also important that the disparity tuning is measured with a broadband stimulus such as random dots, to ensure that the disparity tuning curve shape is not trivially determined by the stimulus. If disparity tuning were measured with a grating, for instance, the periodicity of the stimulus would guarantee a periodic response (Cumming and Parker 2000
).
The disparity peak frequency is slightly different from the "disparity frequency"a term used by two previous authors in related but distinct senses. Ohzawa et al. (1997
) used a bar as the broadband stimulus to test this prediction in the anesthetized cat. They used the term "disparity frequency" to mean the carrier frequency of the Gabor fitted to the disparity tuning curve, which they then compared with the monocular spatial frequency tuning. Note, however, that this carrier frequency does not necessarily equal the disparity peak frequency. Thus their finding that the carrier frequency of fitted Gabors was systematically lower than the preferred spatial frequency in the dominant eye is not necessarily at odds with the energy model. For sufficiently narrow-band Gabor functions, the carrier frequency f and disparity peak frequency coincide (APPENDIX B), but many of the disparity tuning curves presented by Ohzawa et al. appear to be fairly broadband (e.g., their Fig. 15). In this case, the disparity peak frequency and the carrier frequency diverge. Which is higher depends on the phase of the disparity-tuning curve (APPENDIX B). The disparitytuning curves presented by Ohzawa et al. display phases across the spectrum: thus, both situations occur. Furthermore, for Gabors that are not narrow-band, the fitted carrier frequency is often poorly constrained by data (Prince et al. 2002a
; Fig. 6). For these reasons, it is not clear that the data presented by Ohzawa et al. (1997
) necessarily violate the energy model.
|
|
We compared two different ways of extracting the disparity peak frequency. The first was completely model-independent; here we used the response to uncorrelated stimuli as an estimate of the baseline firing rate, subtracted that from the disparity tuning curve, and took the continuous Fourier transform of the result (by trapezoidal integration). We also estimated the disparity spectrum from the Gabor function fitted to the tuning curve. When performing the fit, the Gabor function was half-wave rectified; that is, negative values of the fit function were replaced with zeros for the purpose of evaluating the residual (given that firing rates could not be lower than zero). When obtaining the disparity spectrum, we used the unrectified Gabor, and solved numerically for the peak and half-maximal points of the Fourier spectrum.
Bootstrap resampling
To interpret scientific results, it is important to have an estimate of significance, to be sure that features we observe in our data are not merely the result of the vagaries of finite sampling. Throughout this study we have used bootstrap resampling (Efron 1979
) to estimate significances. Given a data set consisting of n samples of the random variable, one generates a "new data set" by randomly selecting a member of this data set n times (with replacement). This provides a convenient, nonparametric way to estimate the distribution of some function of a random variable, avoiding the normality assumptions buried in many standard statistical tests. For resampling to be reliable, n must be large. This was one motivation for presenting stimuli for relatively short periods: it provided a large number of independent samples. To increase n further, we pooled the data across all disparities (or spatial frequencies, for the grating stimuli) and resampled the residuals. For this pooling to be valid, the SD must be the same at each disparity, so, as before, we transformed each datum by taking its square root. That is, for each disparity we calculated the mean of the square root of the firing rate, and the residual difference between this mean and the square root of the firing rate on each stimulus presentation. We then pooled all these residuals into a single population. To generate a resampled datum, we picked a residual at random from this pool, added it to the mean square-root firing rate, and squared it to obtain the resampled firing rate. We also explored resampling the data for each stimulus condition separately and found that this gave closely similar results. In the few cases where the results were different, the method of resampling the residuals generally gave the wider confidence interval. Because this yields more conservative estimates for significance testing, resampling of residuals was adopted throughout. This meant that the effective n was always >80 for the SFTCs and always >200 for the disparity tuning. All quoted significances are at the 5% level.
Classification as simple or complex
Within the energy model, complex cells are viewed as being made up from the summed output of several simple cells (Eq. 3 below). Our analysis holds for both simple and complex cells and our conclusions do not depend on a classification of cells as either simple or complex. For this reason we have not treated simple and complex cells differently, and hence avoided the complications of attempting to make the classification in awake animals in the face of small eye movements.
Data set
We recorded monocular and binocular responses to random-dot stimuli in 210 neurons, at eccentricities between 2° and 10°. Of these, 180 produced a maximum firing rate of at least 10 spikes/s; 138/180 were disparity-selective. Adequate data on spatial frequency tuning were available for 101/138 disparity-selective cells, and in an additional 23 disparity-selective neurons we had data on spatial frequency tuning but not monocular responses to random dots.
The energy model and our modified version
This study represents a critical comparison of the energy model (Adelson and Bergen 1985
; Fleet et al. 1996
; Ohzawa 1998
; Ohzawa et al. 1990
; Qian 1994
) and our modified version of it, introduced to explain the weaker response to anticorrelated stimuli (Read et al. 2002
). In this section, we lay out the key features of both models and explain how they differ. Detailed calculations are given in the APPENDICES.
The building blocks of all the models considered in this study are binocular subunits characterized by a receptive field in each eye, which performs a linear operation on the retinal image in that eye. The input from each eye,
L or
R, is the result of this operation (for details, see APPENDIX C, Eq. C2). The distinctive feature of the energy model is that the inputs from the two eyes are combined linearly: the response of a binocular subunit is a function of the sum (
L +
R) of the inputs from each eye separately. If this sum is negative, the binocular cell is silent because it cannot signal firing rates below zero. If this sum is positive, the energy model postulates that the binocular cell outputs the square of this sum. Thus, writing C for the output of the disparity-selective cell
![]() | (2) |
![]() | (3) |
Our modified version (Read et al. 2002
) differs from the energy model in postulating that inputs from the two eyes are half-wave rectified before being combined
![]() | (4) |
Figure 1B shows one physiologically plausible implementation of this nonlinearity. In the figure, inputs from the left and right eyes initially synapse onto monocular simple cells ("MS"), which impose an output threshold, before being combined in a binocular subunit. If the inputs are combined with an inhibitory synapse, as in the lower binocular subunit in Fig. 1B, we obtain units like
![]() | (5) |
in the right eye's receptive field. |
|
RESULTS |
|---|
|
DISPARITY SELECTIVITY AND MONOCULARITY. We present evidence that some cells receive purely suppressive input from one eye. We show that this is inconsistent with the linear binocular combination of the energy model, but can be explained in our nonlinear model.
SPATIAL FREQUENCY TUNING IN THE TWO EYES. Motivated by the assumption of the original energy model that receptive fields are identical up to phase, we investigate whether there is evidence for differences in spatial frequency tuning between eyes. We find that tuning in most cells agrees well, but a minority show significant differences.
DISPARITY FREQUENCY AND SPATIAL FREQUENCY TUNING. If the assumptions of the original energy model hold, then the disparity frequency should equal the preferred spatial frequency in the dominant eye. We show that this prediction is systematically violated, and that vergence movements cannot account for the difference. However, this prediction applied only to the original energy model, which included strong constraints on the receptive field profiles in addition to linear binocular combination.
GENERALIZING THE ENERGY MODEL. We therefore generalize the energy model to allow for receptive fields with different phases, positions, and spatial frequency tuning (both across subunits, and across eyes within a subunit). We derive a constraint that even this generalized model must fulfill. We show that the data systematically violate this constraint.
THRESHOLDING BEFORE BINOCULAR COMBINATION. We finally show that our modified version of the energy model, in which a threshold precedes binocular combination, can account for the observations on disparity and spatial frequency tuning.
Disparity selectivity and monocularity
SOME CELLS RECEIVE PURELY SUPPRESSIVE INPUT FROM ONE EYE. Cells that are sensitive to binocular disparity must receive information from both eyes. It is tempting to extrapolate from this that the cells that are most sensitive to binocular disparity must be those that respond most nearly equally to input in either eye. However, previous investigators (Ohzawa and Freeman 1986b
; Poggio and Fischer 1977
; Prince et al. 2002b
; Smith et al. 1997
) have found little support for this idea. In agreement with these studies, we find no relationship between monocularity and disparity selectivity. Many cells that respond nearly equally to monocular stimulation in either eye are not disparity selective, whereas many cells that show little or no response to monocular stimulation in one of the eyes nevertheless show clear disparity tuning. Examples are shown in Fig. 3 (see also Fig. 8). The response to monocular stimulation is shown by the broken horizontal lines labeled L and R (and marked with a leftward/rightward-point arrowhead respectively:
and
). In duf096 (Fig. 3A), monocular stimulation in the left eye evokes almost no response; in duf099 (Fig. 3B), it is the right eye that is silent. Yet the black dots show the cells' responses as a function of disparity (curve = fitted Gabor); clearly both cells are selective to disparity, and so must be receiving information from both eyes. Thus, it is important to distinguish between two common uses of the term "monocular": the classical sense, "responsive to monocular stimulation in one eye and not the other," must not be interpreted to mean "receiving input from only one eye" (Ohzawa and Freeman 1986a
,b
; Smith et al. 1997
).
|
|
One natural way to explain the phenomenon of disparity selectivity in "monocular" neurons is to propose that the input from one eye always has a net inhibitory effect, and thus no spikes are produced by stimulation in that eye alone. In the absence of complications such as response normalization (which could adjust the response to monocular stimuli relative to binocular), such a scheme makes two predictions. First, binocularly uncorrelated dots should produce a weaker response than monocular dots in the dominant eye (because adding dots to the other eye produces net inhibition). This was the case for 86/138 disparity-tuned cells (significant in 44). Second, the monocular response in the nondominant eye should not be significantly greater than the spontaneous response (it is rarely possible to observe a monocular response less than spontaneous, given that the latter is so frequently indistinguishable from zero); 30/138 disparity-selective neurons showed both these phenomena. In 9/30 cells, the spontaneous response was significantly greater than zero, so that if one eye had an inhibitory influence, it would be possible to observe suppression of the spontaneous response when this eye was stimulated. In 5 of these 9, the response to random-dot stimulation in the nondominant eye was smaller than the spontaneous response. The cells shown in Fig. 3 are two examples. The broken line labeled U and marked with a square (
) shows the response to uncorrelated stimuli; in both cells this is below the response to monocular stimulation in the dominant eye (i.e., adding stimulation to the nondominant eye has reduced the response). The broken line labeled S and marked with a circle (
) shows the spontaneous rate, estimated from the response to a blank screen: in both cells, the cell fires more to a blank screen than to monocular stimulation in the nondominant eye. Such examples are clearly indicative of a predominantly inhibitory input from one eye. We conclude that in many cells, stimulation in one of the eyes always has a suppressive effect.
THIS INDICATES A NONLINEARITY BEFORE BINOCULAR COMBINATION. It is important to note that this represents a substantial deviation from any model in which binocular summation is linear. By definition, a model with linear binocular summation is of the form C = f(
L +
R), where f is an arbitrary function and
L and
R represent the inputs from left and right eyes, respectively. If f(
L) is never positive for any value of the input
L, either positive or negative (no possible stimulus in the left eye elicits a positive response), then f(
R) can never be positive either, and so the cell would never respond. In a linear model, if the cell responds at all, then stimulation in each eye can exert either a suppressive or an enhancing effect, depending on the stimulus. To obtain the situation where one eye always exerts a suppressive effect, we must postulate some nonlinearity before binocular combination, such as half-wave rectification followed by an inhibitory synaptic connection. This is exactly what is proposed by our modified version of the energy model. Looking at Eq. 5, C = {Pos[Pos(
L) Pos(
R)]}2, it is obvious that stimulation in the right eye always has a suppressive effect. For monocular right-eye stimulation, the response C is zero, and yet with disparate binocular stimuli, this unit is disparity-selective. Figure 4 shows simulations of two subunits described by Eq. 5. The solid line shows the disparity tuning curve. In A, the left and right receptive fields are identical, so because input from one eye is inhibitorythe disparity tuning curve is of the tuned-inhibitory class. In B, the left and right receptive fields are 180° out of phase. When combined with the inhibitory synapse in Eq. 5, this results in tuned-excitatory disparity tuning. This demonstrates that an inhibitory synapse at binocular combination does not necessarily result in tuned-inhibitory tuning. Thus, our thresholding model explains the existence of cells that would classically be called "monocular" and yet are disparity-selective.
|
A THRESHOLDING NONLINEARITY CAN EXPLAIN THE RELATIVE AMPLITUDE OF MONOCULAR AND BINOCULAR RESPONSES. We now investigate the extent to which this model can account quantitatively for the relative amplitude of monocular and binocular responses. Prince et al. (2002b
) observed that the response to binocularly uncorrelated dot patterns was often close to the mean of the responses to monocular stimulation in the two eyes, whereas the energy model predicts that it should be their sum. Prince et al. suggested that this could be attributable to a normalization process that lowers the response to binocular stimuli. However, our modification to the energy model already allows us to build cells in which the uncorrelated response is the mean of the 2 monocular responses, without incorporating any normalization. The horizontal lines in Fig. 4 show the response to monocular stimuli (L
, R
) and binocularly uncorrelated stimuli (U
). In both cases, the uncorrelated response is close to the mean of the monocular responses, demonstrating that our model can explain this phenomenon, for both tuned-excitatory and tuned-inhibitory cells.
These simulations portray something of an extreme case: in both these examples, inhibition from the suppressive eye is much stronger than excitation from the excitatory eye, so that the response to monocular stimulation in the dominant eye, M, is nearly twice the response U to uncorrelated stimuli. In fact, 2U is an upper bound for M: our model predicts that M can never exceed 2U. The energy model has a similar upper bound: it predicts that M can never exceed U. We have seen that the energy model's upper bound is violated by most cells (86/138). We now investigate whether the upper bound predicted by our model is similarly violated. Figure 5 shows the distribution of M/U for the 138 disparity-selective cells in our data set. The vertical lines mark the upper bounds predicted by the energy model (dashed) and our model (solid). The mode of the distribution is close to M/U = 1, so over half the cells exceed the energy model upper bound. However, the distribution begins to fall off after M/U = 2, so that the upper bound predicted by our model is violated in only 23/138 cells. We used resampling to estimate the 95% confidence interval for M/U. If this interval lies entirely above 1, we can be 95% confident that the upper bound predicted by the energy is violated; this was the case for 44/138 cells (32%), shaded gray in Fig. 5. If this interval lies above 2, we can be 95% confident that the upper bound predicted by our model is violated; this was so for only 4/138 cells (3%), shaded black in Fig. 5. We conclude that almost all cells respond to monocular stimulation in the dominant eye at less than twice the rate for uncorrelated stimuli, and can therefore be accommodated within our modified model. Thus, our model can explain the observed spectrum of monocular and binocular response rates, without needing to invoke other mechanisms such as contrast normalization.
|
Spatial frequency tuning in the two eyes
The original implementation of the energy model (Ohzawa et al. 1990
) assumed that all receptive fields have the same spatial frequency and orientation tuning and bandwidth. They differ only in their amplitude, their position, and phase, and even so, the position and phase disparity between left and right receptive fields of a single subunit is assumed to be the same for all subunits. These constraints on the receptive fields have been assumed by all implementations of the energy model we are aware of (e.g., Fleet et al. 1996
; Lippert and Wagner 2001
; Ohzawa et al. 1997
; Qian 1994
; Read 2002
; Tsai and Victor 2003
). We shall use the phrase original energy model to denote Eq. 3 with these additional constraints on the receptive fields. (Later, we shall consider a generalized energy model in which many of these constraints are relaxed.)
The available evidence suggests that these constraints are generally observed in simple cells (Anzai et al. 1999b
; Ohzawa et al. 1996
). In complex cells, the situation is harder to assess. Preferred orientation is observed to be closely matched between the two eyes (Bridge and Cumming 2001
), supporting the view that all receptive fields share the same orientation. However, there is some evidence from the cat suggesting that there may be a population of cells in which spatial frequency differs between the two eyes (Hammond and Pomfrett 1991
; Ohzawa et al. 1996
). In this section, we investigate the agreement in spatial frequency tuning for our monkey data.
For 151 cells, the spatial frequency tuning to monocular drifting sinusoidal luminance gratings at the cell's preferred orientation was probed in both eyes; 84 of these were sufficiently responsive and selective to permit fitting in both eyes. We defined the preferred spatial frequency to be the frequency at which the Gaussian fitted to the tuning curve had its peak. To ensure this is meaningful, we required the fits in each eye to explain more than 60% of the variance of the tuning curve data. Figure 6 compares the preferred spatial frequency in the two eyes for the remaining 73/84 cells. The solid line shows the identity; the dotted lines mark difference in SF tuning of 1 octave. Clearly, spatial frequency tuning is usually well matched between eyes. The correlation coefficient is 0.87 (P < 105). Nonetheless, 25/73 cells showed a significant difference (P < 0.05, by resampling) in preferred spatial frequency between the eyes; these are colored black in Fig. 6. There was no correlation between the difference in preferred spatial frequency and the difference in peak response between the two eyes. The figure of 25 includes some cells where the difference in preferred frequency was small (but turned out to be significant because the peak positions were robust under resampling). However, for 6/25, the peak firing rates in the two eyes occurred for gratings differing in frequency by over an octave. Two examples are shown in Fig. 7. The arrowheads show the response of the cell to monocular grating stimuli as a function of the grating spatial frequency (L:
, R:
); the curves show the fit. The 95% confidence interval for the peak of the fitted function is shaded. The confidence intervals for the two eyes do not come close, indicating significant and substantial differences in spatial frequency tuning between the two eyes. About 10% of cells showed evidence of such a difference.
The selection criteria applied in obtaining Fig. 6 exclude an interesting class of cells in which the response in the nondominant eye was very weak, but was maximal at those frequencies that produced the weakest responses in the dominant eye. Two examples are illustrated in Fig. 8, A and B. On the face of it, these cells show a severe mismatch in spatial frequency tuning, with the nondominant eye being tuned to frequencies an order of magnitude lower than the dominant eye. However, we believe a more plausible explanation is that the spatial structure of the receptive fields is really similar in the two eyes (as in the vast majority of cells, Fig. 6), but that the nondominant eye exerts a suppressive effect. This interpretation is supported by the experiments with random-dot patterns. Both these cells are disparity-selective, but also show virtually no response to random dots in the nondominant eye (Fig. 8, C and D). Thus, such cases are further evidence for purely inhibitory input from one eye.
Disparity frequency and spatial frequency tuning
We now turn to possibly the most important prediction of the energy model: the shape of monocular receptive fields determines the shape of the disparity-tuned response (Anzai et al. 1999b
; Ohzawa et al. 1997
). Because most cells do indeed show similar spatial frequency and orientation tuning in the two eyes, we shall assume in this section that the assumptions of the original energy model hold true. Then, the original energy model predicts that the disparity-tuning curve is simply the cross-correlation of the receptive fields in the left and right eyes.
For simple cells, which are single binocular subunits (Eq. 2), this prediction can be tested directly. For complex cells, which represent the sum of several binocular subunits (Eq. 3), the disparity tuning curve is predicted to be the sum of the cross-correlations of the receptive fields in the component subunits. This makes the prediction hard to test in complex cells because it is difficult to obtain the receptive fields of the component subunits experimentally. Fortunately, provided all subunit receptive fields have the same preferred orientation, the comparison can be made without a direct measurement of receptive field profile. We simply need to obtain 1) the cell's response to binocular random-dot patterns as a function of disparity along an axis orthogonal to this preferred orientation, and 2) the cell's response to monocular sinusoidal gratings oriented parallel to this preferred orientation, as a function of spatial frequency. The energy model predicts that the shape of the Fourier amplitude spectrum of the disparity tuning curve measured in 1) will be given by the monocular spatial frequency tuning curves measured in 2). In particular, their peaks should coincide: that is, the disparity peak frequency, defined as the position of the peak in the Fourier amplitude spectrum, should be the preferred spatial frequency of the cell. This key prediction of the original energy model, which depends critically on its linear properties, holds for both simple and complex cells. Previous work (Ohzawa et al. 1997
; Prince et al. 2002b
) has suggested that this prediction is not fulfilled, but, as discussed above, these studies leave open a number of possible ways in which the data could be reconciled with the energy model. We carried out a detailed comparison, using bootstrap resampling to estimate the significance of any discrepancy.
Figure 9 shows the comparison for 3 neurons, illustrating the common patterns observed. The left-hand column shows the disparity tuning curves. On the right, the Fourier amplitude spectrum of the disparity-modulated component is compared with the spatial frequency tuning in the dominant eye. For both these quantities, two estimates are shown: one from the raw data and one from the fitted function. The raw SFTCs are shown with filled circles (
) in the plots on the right, whereas the fits are drawn with the black curve. The disparity-modulated component can be estimated from the raw data by subtracting the mean response to uncorrelated stimuli (horizontal line labeled with the letter U and the symbol
in the left-hand plots) from the mean response of the cell to random-dot stereograms at different disparities [filled circles (
) in the plots on the left]. The Fourier spectrum of this is shown on the right with a dotted gray line ["FT-DMC (data)" in the legend]. Alternatively, the disparity-modulated component can be obtained from the fitted Gabor (solid curve in the left-hand plots). The Fourier spectrum of this is shown on the right with a dashed gray line ["FT-DMC (fit)"].
|
In a few cases (Fig. 9A) the Fourier transform of the disparity-modulated component (FT-DMC) did closely resemble the SFTC, but for the majority of cases there were substantial discrepancies, of two types. First, the peak of the FT-DMC was often at a lower frequency than the peak of the SFTC (Fig. 9B). Second, the FT-DMC was often close to low-pass in form, despite a clear band-pass SFTC (Fig. 9C).
We had 105 disparity-selective neurons that were sufficiently responsive to gratings in the dominant eye, selective for spatial frequency, and adequately described (>60% variance explained) by the Gaussian fit. To avoid making the assumption that all disparity tuning curves were well described by Gabors, we first used a model-independent estimate of the disparity peak frequency, using the response to uncorrelated stimuli as an estimate of the baseline of the disparity tuning curve, and taking the continuous Fourier transform of the raw data. We compared this estimate of disparity peak frequency with the SFTC peak frequency of the Gaussian fit, for the 105/112 cells in which the response to uncorrelated stimuli was available and in which the Gaussian fitted to the SFTC explained >60% of the variance. The disparity peak frequency was less than the SFTC peak frequency in 84/105 of cells [P < 109 under the null hypothesis that the estimated disparity peak frequency is as likely to be above the SFTC peak frequency as below it (binomial distribution)]. The frequency difference was individually significant in 43/84 cells.
This model-independent method of extracting the disparity peak frequency has two disadvantages. First, in about 10% of cells, the disparity tuning curve appeared to be truncated by the lower limit of 0 spikes/s. These cells may represent an energy model unit followed by an output threshold. It is possible that the discrepancy between the disparity peak frequency obtained from the raw data, and the SFTC peak frequency, may reflect distortions introduced into the Fourier spectrum by the threshold. For these cells, a better estimate of the underlying response may be gained from the unrectified Gabor corresponding to the half-wave rectified Gabor fitted to the data (shown in Fig. 9A). Second, because the Fourier spectra of raw data are usually noisy and multimodal, it is hard to extract measures of bandwidth. Again, this is solved by using the fitted Gabor.
For those 99 cells in which the Gabor fitted to the disparity tuning curve explained >60% of the variance, we therefore repeated the analysis using the estimate of disparity peak frequency derived from the fit. The results are shown in Fig. 10A, which plots the disparity peak frequency against the preferred spatial frequency in the dominant eye, both derived from the fitted functions. The solid line marks the identity line; according to the energy model, all points should lie on this line. In fact, the SFTC peak frequency was greater than the disparity peak frequency in 80/99 cells (P < 109, binomial), and the difference was significant in 51/80 individual cells (resampling; these are the filled symbols in Fig. 10A). Thus, we obtain very similar results whether we use the fitted Gabor or the raw disparity tuning curve.
|
We also examined the high and low cutoff frequencies of the fitted functions. Figure 10, B and C plot the cutoff frequencies for the disparity tuning curve against those for the SFTCs. Again, the energy model predicts that all points should lie on the identity line (marked with the solid line). In fact, the low cuts differed significantly in 43/99 of cells, whereas the high cuts differed in 67/99 (filled symbols). Once again these significant differences nearly all reflect relatively more power at low frequencies in the FT-DMC than in the SFTC.
In many cases, there is so little attenuation of the FT-DMC at low frequencies that the low-cut frequency is not defined (plotted as a low cut of zero). The discrepancy in the response at very low frequencies is made clearer in Fig. 10D, which compares the relative power at the lowest frequency tested monocularly. In 77/99 cells, the FT-DMC contains more power at these low frequencies than the SFTC (P < 106, binomial). In 45/77 cases, this difference is significant (filled symbols). In many cases, the disparity tuning curve is close to Gaussian in form (relative power = 1; i.e., no attenuation at low frequencies). That this occurs in the presence of a band-pass SFTC is a dramatic deviation from the energy model. The band-pass SFTC implies that the receptive fields of the subunits contain both "ON" and "OFF" subregions. At a disparity equal to the separation of the "ON" and "OFF" regions, the contributions from the left and right eyes should be negatively correlated, producing a response that is smaller than the response to uncorrelated dots. These suppressive side lobes in the disparity-tuning curves are often not found. Prince et al. (2002b
) also noted that many disparity-tuning curves were Gaussian in form. However, those data were not clearly at odds with the energy model for two reasons. First, disparity was applied horizontally, regardless of receptive field orientation, so it remained possible that suppressive side lobes would have emerged if disparity had been applied orthogonally to the preferred orientation. Second, their data on spatial frequency tuning were not generally sufficient to exclude the possibility of a substantial low-pass component in the monocular SFTC. The present data eliminate both of these difficulties, and clearly indicate a need for more complex processing than the original energy model can provide.
VERGENCE EYE MOVEMENTS ARE UNLIKELY TO EXPLAIN THE MISMATCH. One possible explanation of the mismatch between disparity tuning and spatial frequency tuning is that the monkeys may be making small vergence movements. This would have the effect of introducing jitter into the disparity of each stimulus. Effectively, we would be summing several disparity tuning curves of the form predicted by the energy model, each with a random disparity offset. This tends to smear out the sidelobes, shifting the peak of the disparity power spectrum to lower frequencies. This process is illustrated in Fig. 11. Consider an energy-model binocular subunit, whose receptive fields in both eyes are identical, with no position disparity, both having the profile shown in Fig. 11A. The thin line in Fig. 11B shows the disparity tuning curve that would be obtained for this subunit in the absence of vergence movements. Now suppose the monkey makes random vergence movements, so that the disparity of his actual fixation point relative to the fixation target at any moment is a Gaussian centered on zero. This means that the disparity-tuning curve actually measured is the true curve convolved with this Gaussian. The result is shown with the thick line in Fig. 11B. Figure 11C shows the effect on the Fourier amplitude spectrum of the disparity-modulated component. The thin line shows the power spectrum for the original tuning curve, and the thick line for the observed version contaminated by vergence. The vergence has had two effects: it has shifted the peak of the disparity power spectrum toward DC, and it has greatly reduced the amplitude. For clarity, therefore, the broken line shows the observed disparity power spectrum scaled up to the same amplitude as the uncontaminated one. The same effect would be obtained if the neuron being recorded from represented the sum of several subunits that differed in their position disparities.
|
Either of these possibilities might explain why, in 80/99 cells, the peak of the disparity power spectrum is at a lower frequency than the preferred spatial frequency obtained with gratings. To estimate the vergence jitter that would be necessary to achieve this, we assume that the underlying disparitytuning curve is a Gabor with disparity peak frequency equal to the preferred spatial frequency in the dominant eye, and that the Gabor function fitted to the observed disparity-tuning curve represents this underlying disparity tuning curve convolved with a Gaussian distribution of vergence. In over half the cells (48/80), the amount of vergence jitter needed to bring about the requisite shift in frequency is larger than the SD of vergence reported by the scleral search coils, even though search coils clearly overestimate variability in vergence (Read and Cumming 2003). It therefore seems unlikely that the animal's vergence jitter is large enough to explain the mismatch in peak frequency in most cells.
Generalizing the energy model
It remains possible that combinations of subunits with different position disparities might be responsible for the lower disparity frequency. However, even if this is the case, the energy model places an upper limit on the power at any frequency. This can be appreciated by inspecting Fig. 11. The multiple subunits have shifted the peak toward lower frequencies, but they have done this by removing power at high frequencies rather than by adding power at any frequency. If multiple subunits are responsible for the mismatch between disparity peak frequency and preferred spatial frequency, the overall power in the disparity tuning should be greatly reduced. We therefore generalized the energy model to examine the possibility that such scatter in positions could explain our data.
In the original formulation of the energy model (Ohzawa et al. 1990
), all binocular subunits were assumed to have the same phase disparity, position disparity, spatial frequency, and orientation tuning. We now allow an arbitrary number of subunits, with different phases, positions, and spatial frequency tuning (both across subunits and across eyes within a subunit), including monocular subunits and binocular subunits that are not tuned to disparity. We shall, however, continue to assume that all receptive fields have the same orientation, and the same profile parallel to this orientation (see APPENDIX E). With this much weaker set of constraints on the receptive fields, it is no longer true that the disparity Fourier spectrum must have the same shape as the spatial frequency tuning. However, it can be shown (APPENDIX E; Eq. E9) that the disparity power spectrum, |
(f)|2, and the product of the monocular spatial frequency tuning curves, LSF(f)RSF(f), must still satisfy the following inequality, for every spatial frequency f:
![]() | (6) |
This inequality allows us to detect whether the lower disparity frequency can be explained by the presence of multiple subunits with different positions, as in Fig. 11. Then, the shift in the peak frequency would be achieved not by boosting the disparity power at low frequencies, but only by removing power at high frequencies. This would substantially weaken the disparity modulation (cf. Fig. 11B), so that the inequality Eq. 6 would be satisfied. If the inequality is violated, then multiple subunits are not a sufficient explanation.
The same generalization allows for differences in spatial frequency tuning between eyes. Our analysis in the previous section [like that of Ohzawa et al. (1997
)] considered only the dominant eye, and assumed that the SFTC in the nondominant eye differed only by a scaling of response magnitude. Yet our results (Figs. 6 and 7) show that there is a significant minority of cells in which spatial frequency tuning differs between eyes. The inequality of Eq. 6, which includes terms for the spatial frequency tuning in each eye, holds for the generalized energy model even in this case.
The top row of Fig. 12 (AC) examines Eq. 6 for one cell, ruf030, which satisfied the energy model prediction reasonably well. Figure 12A shows the SF tuning in the two eyes together with the fitted functions; Fig. 12B shows the disparity-tuning curve, with the 95% confidence interval at each disparity shaded. Figure 12C compares the fitted grating curve LSF(f)RSF(f)/LASFRASF (black) and disparity spectrum |
(f)|2/U2 (gray). The disparity peak frequency is at 0.82, whereas that of the grating curve is at 0.24. This difference was significant, so the data are incompatible with the simplest form of the energy model. Because the disparity peak frequency is, atypically, higher than the SFTC peak frequency, this cannot be explained by vergence eye movements. However, because the inequality of Eq. 6 is not significantly violated at most frequencies, the data are fairly compatible with our generalized version of the energy model, incorporating many subunits with different spatial frequency and/or disparity tuning. Note that the odd-symmetric disparity tuning in this cell cannot arise simply from a phase disparity of
/2 between receptive fields whose properties are otherwise identical, as originally envisaged by Ohzawa et al. (1990
), because this would require the normalized curves in Fig. 12C to peak at the same frequency. One possibility is that this cell receives input from one tunedexcitatory subunit with a position disparity of 0° and one tuned-inhibitory subunit with a position disparity of 0.5°, resulting in the odd-symmetric curve. Such a scheme is allowed within this generalized version of the energy model, but is nevertheless very different from previous explanations of oddsymmetric disparity tuning (DeAngelis et al. 1991
; Ohzawa et al. 1990
). Some properties of such a model are discussed in Read et al. (2002
).
|
More typical results are shown in the bottom two rows of Fig. 12. The cell in Fig. 12F is an example where the peak frequencies (marked by vertical lines) are close, so that an analysis only of the peak frequency would conclude that it is consistent with the energy model. However, over a range of low frequencies, the scaled disparity power spectrum is significantly higher than is possible under the energy model. The SFTCs indicate no response to a DC stimulus, so according to the energy model the disparity-tuning curve should have no DC component. In the energy model, the disparity-tuning curve of a complex cell is the sum of the disparity-tuning curves for the individual subunits. This cannot produce a curve with a DC component that is absent from all the subunits. The bottom row of Fig. 12 (GI) shows another example that severely violates the inequality. The SFTCs peak at high frequencies: over 10 cpd. In contrast, the disparity-tuning curve has most power at DC, and no power at all at 10 cpd. Once again, the power at DC is far beyond what could be accounted for by multiple subunits. The 3 example cells shown in Fig. 9 also violate the inequality.
To quantify the results across the entire data set, we used the difference
![]() | (7) |
should be positive or zero for all frequencies (Eq. 6); that is, the black curve above the gray curve in Fig. 12, C, F, and I. Using the functions fitted to the disparity and SF tuning, we evaluated
at a range of frequencies. We used resampling to estimate the 95% percentile of this difference; if this percentile is negative, we can reject the inequality of Eq. 6 with 95% confidence.
We had 83 disparity-tuned cells for which SFTCs were available in both eyes, and for which all 3 fitted functions explained more than 60% of the variance. For frequencies lower than the peak of the product of the SFTCs, the hypothesis
0 could confidently be rejected for over half of the cells. The significance of the rejection in individual cells was often very high. At a frequency of 0.01 cpd, the hypothesis could be rejected with 95% confidence for 47/83 cells, and with better than 99.99% confidence for 13/83 cells. Figure 13 shows how the proportion of cells for which the energy model can be rejected (i.e.,
significantly less than zero) varies depending on the frequency at which
is evaluated. To make a meaningful definition of "low frequencies" across a population with differing SF tuning, we express frequencies relative to the peak. This shows clearly that, whereas at high frequencies most cells appear consistent with the generalized energy model, significant discrepancies emerge as we move toward lower frequencies. Thus, although a more general version of the energy model can explain the differences in the peak frequencies previously reported (Ohzawa et al. 1997
; Prince et al. 2002b
), the absolute power at low frequencies in the FT-DMC still allows us to reject the energy model in over half the cells. Note that this analysis also demonstrates that no amount of vergence variability could explain the data in these cells. If the vergence variability was large enough to account for the necessary shift in disparity peak frequency, it would have reduced the amplitude of disparity tuning below that observed. We conclude that the disparity-selective responses of many V1 neurons are not compatible with the energy model, even after it has been generalized to include multiple subunits with different SF and disparity tuning.
|
THE POSSIBLE SOURCES OF ERROR CANNOT EXPLAIN THE MISMATCH. Spontaneous firing rates. An overestimate of the area under the SFTC would yield excessively small values for the normalized product of SFTCs, which might cause erroneous rejection of the energy model. This could occur if a substantial spontaneous firing rate was added to the response resulting from receptive field structure. However, the observed spontaneous rates were almost always very low. For 211 of the 252 cells, blank stimuli, consisting of a uniform gray screen at the mean luminance of the random dots, were interleaved with the disparate random dots. The mean blank response, averaged over these 211 cells, was 2.1 spikes/s. The mean response exceeded 10 Hz in only 13/211 cells.
Orientation misalignment. Care was taken to ensure that disparity tuning and SF tuning were measured along the same axis, but of course this may not have been exactly orthogonal to the receptive field orientation. Such a misalignment will mean that Eq. 6 does not hold in general, and the effects will depend on the monocular receptive field. For Gabor-receptive fields that are longer in the direction parallel to the carrier Gabor than in the direction orthogonal to it, such misalignment always moves the peak of the product of the monocular SFTC to lower frequencies, while having less effect on the disparity frequency (APPENDIX F). Thus, for these respective fields, the disparity frequency should be if anything higher than the frequency of the peak of the product of monocular SFTCs, the opposite of the observed pattern. Thus, it seems unlikely that our results reflect any artifact of this kind.
Thresholding before binocular combination can explain the mismatch
It is therefore clear that the energy model, either in its original form or as generalized here, cannot account for the data. More elaborate extensions of the energy model are needed to reconcile it with experimental observations. We previously (Read et al. 2002
) extended the energy model by adding thresholds before binocular combination (Eq. 4). We introduced this modification to explain the reduced response of V1 neurons to binocularly anticorrelated stimuli. We have already seen that this same modification also explains the phenomenon of disparity selectivity in classically "monocular" cells. We now show that the same modification also explains the existence of low-pass disparity tuning curves in cells that have band-pass SF tuning.
Figure 14 compares disparity and SF tuning for the generalized energy model (top row, AC) and for our modified version with two different sets of parameters (bottom two rows: DF, GI). The layout is the same as in Fig. 12: the left-hand column shows the SFTCs obtained with grating stimuli, the middle column shows the disparity tuning curve and the response to uncorrelated random-dot patterns, whereas the right-hand column compares the normalized Fourier power spectrum of the disparity-modulated component (FT-DMC, gray curve) and the product of the normalized SFTCs (black curve). We have chosen an example in which the SF tuning is different in the two eyes: the left eye is tuned to a SF of 2.5 cpd, and the right eye to 3.5 cpd.
|
In the top row, we show results for a single binocular subunit, combining left- and right-eye inputs linearly in accordance with the energy model. Because there is only a single subunit, the normalized FT-DMC is identical to the product of the normalized SFTCs (Fig. 14C) [i.e., Eq. 6 holds with an equals sign (cf. APPENDIX E)]. The inequality is thus satisfied.
The middle row, Figure 14, DF, shows the same quantities for a binocular subunit in which the inputs from left and right eyes have been passed through a high-threshold nonlinearity before being summed and squared. This has completely removed the inhibitory side lobes that were present for the energy model subunit in Fig. 14B, resulting in a low-pass disparity power spectrum. The inequality of Eq. 6 is violated, just as in real data.
The bottom row, Figure 14, GI, shows a similar nonlinear model, this time with an inhibitory synapse resulting in a tuned-inhibitory-type disparity-tuning curve (given that the left and right receptive fields are in phase). For a single subunit, the SFTC in the inhibitory eye would be zero, so the product of the SFTCs would be zero and the inequality would be severely violated. Because it was rare for a completely silent SFTC to be recorded in one eye, this would be a rather extreme example. Instead, we make our model cell the sum of 2 binocular subunits, with identical left and right receptive fields. In the first subunit, the left eye is excitatory and the right eye inhibitory; in the second subunit, it is the other way around. Thus, the response observed for grating stimuli in the left eye is entirely attributable to the first subunit, whereas the response to gratings in the right eye is entirely attributable to the second subunit, although both subunits contribute to the disparity-tuning curve. Thus, in our modified version of the energy model, power present in the disparity-tuning curve does not necessarily show up in the SFTC. This is why the disparity power spectrum for this model cell rises well above the product of the SFTCs at low frequencies (Fig. 14I). This behavior is incompatible with the generalized energy model, but is commonly observed experimentally (Fig. 12, F and I). We conclude that our modified version of the energy model provides a better match to the data.
|
|
DISCUSSION |
|---|
|
The first problem for the energy model is that there are many cells in which input from one eye appears to be predominantly inhibitory. The energy model is linear up to binocular combination, and this means that inputs from each eye can be both positive or negative, depending on the particular stimulus. For example, if a receptive field in one eye consists of a single ON region, then a random-dot pattern in which mainly white dots happen to fall on the receptive field will elicit an excitatory response from this eye, whereas a pattern in which the receptive field is covered predominantly with black dots will elicit an inhibitory effect. Thus we cannot classify this eye as being "excitatory" or "inhibitory." However, previous workers have noted cases where one eye appears to have a purely inhibitory effect (Ohzawa and Freeman 1986a
; Poggio and Fischer 1977
).
We investigated this by carrying out a quantitative comparison of the response rates to monocular and binocular randomdot stimuli. In agreement with previous reports, we find that many neurons show little or no response to stimulation in one eye, despite exhibiting clear disparity selectivity when tested with binocular stimuli. Indeed, in the small number of cells that show significant spontaneous firing, stimulation in the nondominant eye often reduces the response below the spontaneous level. We also find that in many cells that respond well to random-dot patterns in the dominant eye, adding dots to the nondominant eye reduces the response. The lack of this second, critical piece of evidence may explain why the challenge posed to the energy model by monocular disparity-selective neurons has so far been ignored. Further support for the idea of inhibitory input from one eye is provided by our study of SFTCs with grating stimuli. Spatial frequency tuning in most eyes is well matched between eyes, and for some of the cells where the peak frequencies appear mismatched, the data suggest that the underlying receptive field structure may be identical in the two eyes, but that one eye has an inhibitory effect. This is impossible in the energy model.
The observation of cells in which one eye has a net inhibitory effect implies a nonlinear operation before binocular combination. We previously proposed modifying the energy model to incorporate such a nonlinearity: half-wave rectification followed by an inhibitory synapse (Read et al. 2002
). This modified version of the model can explain our observations; we present simulations of classically "monocular" cells that are nevertheless tuned to disparity. The same modification enables us to construct cells in which the response to uncorrelated stimuli is close to the mean of the responses to monocular stimuli in the two eyes, as often observed experimentally, rather than fixed at the sum of the monocular responses, as required by the energy model. We conclude that this modified version of the energy model is a better explanation of the data.
The second discrepancy investigated in this study is the mismatch between disparity frequency and SF tuning (Ohzawa et al. 1997
). In the original energy model, the disparity peak frequencyloosely, the frequency of the undulations in the disparity-tuning curveshould be the same as the preferred spatial frequency observed with monocular sinusoidal luminance gratings. In fact, the disparity peak frequency is systematically found to be lower than predicted by the energy model. Crucial to this demonstration was the use of disparities applied at right angles to the preferred orientation of the cell. Previous work in monkeys using only horizontal disparities failed to make clear this failure of the energy model (Prince et al. 2002b
).
We considered the possibility that differences in SF tuning between the eyes are responsible for this discrepancy. Although the energy model in its original form assumes identical SF tuning in left and right eyes, we found a few clear examples of excitatory inputs from both eyes with different SF tuning, as previously reported in the cat (Hammond and Pomfrett 1991
). The energy model can easily be generalized to take such interocular differences into account. If the eyes differ in SF tuning, then the disparity peak frequency should be located at the peak of the product of the tuning curves from left and right eyes. (Note that this implies that the disparity peak frequency is always in between the preferred spatial frequencies in the two eyes.) However, we showed that even this generalized version of the energy model must obey an inequality relating SF tuning to disparity tuning. This inequality was violated by most of the cells in our data set: their disparity-tuning curves had more spectral power at low spatial frequencies than was possible even in the generalized energy model, given the observed response to grating stimuli.
However, the additional power at low spatial frequencies can be explained with our modified version of the energy model. Half-wave rectification before binocular combination removes the side lobes in disparity-tuning curves from band-pass receptive fields, shifting power to lower frequencies and explaining how a low-pass disparity-tuning curve can be observed alongside band-pass SF tuning. In addition, allowing purely inhibitory input from one eye means that subunits can contribute to the observed disparity tuning without affecting the SF tuning observed in one eye. Thus, in our modified model, the SF tuning is decoupled from the disparity tuning. This offers an explanation of why the correlation between disparity frequency and spatial frequency predicted by the energy model is not observed.
Thus, in all the areas where we have compared the energy model and our modified version, the latter has agreed more closely with the data. In addition, the modified version also explains the weaker disparity tuning observed with anticorrelated stimuli (Cumming and Parker 1997
; Livingstone and Tsao 1999
; Ohzawa et al. 1990
). This does not constitute a proof that binocular combination is nonlinear. It is possible that all of the discrepancies noted here and elsewhere (e.g., Cumming and DeAngelis 2001
) could be reconciled with linear binocular summation if sufficiently complex subsequent processing is postulated (perhaps incorporating contrast normalization and multiple binocular subunits with different output nonlinearities). Also, there may be individual neurons in which the original energy model remains a better description than our modified version. If it is true that binocular complex cells receive input from many binocular subunits, there may be a mixture of both mechanisms at work, with some binocular subunits receiving essentially linear input from both eyes and others receiving thresholded input, thus explaining the continuum of observed properties.
However, we propose that the most straightforward solution is to postulate that binocular combination is not linear in most neurons. With this one modification to the energy model, the key characteristics of disparity selectivity in striate cortex can be economically reproduced. This suggests we are close to an accurate mechanistic description of how this novel property of the visual cortex is produced.
|
|
APPENDIX A: FOURIER TRANSFORMS |
|---|
|
(x, y) is a receptive field function,
) represents its Fourier transform, where
![]() |
represents the orientation of each Fourier component relative to the x-axis and
represents the spatial frequency of each Fourier component. (We denote these with tildes to distinguish them from the orientation and spatial frequency of the receptive field, or of the stimulus, which we shall encounter later.) We shall sometimes need the Fourier spectrum in terms of the frequency components orthogonal and parallel to the x-axis,
and
. We denote this
, where the superscript C stands for Cartesian
![]() |
and phase
. The squared magnitude
gives the Fourier power spectrum. |
|
APPENDIX B: DISPARITY TUNING CURVES |
|---|
|
![]() | (B1) |
is disparity, D(
) is the disparity-modulated component of the tuning curve (i.e., after the subtraction of the baseline), f is the spatial frequency f and
the phase of the carrier cosine,
is the standard deviation and
0 the center of the Gaussian envelope, and A is the amplitude. The Fourier power spectrum of this one-dimensional Gabor is
![]() | (B2) |
2f 2 >> 1, this reduces to
![]() | (B3) |
), becomes nonnegligible. If cos (2
) > 0 (closer to even symmetry than odd symmetry), the disparity peak frequency is less than the carrier frequency; if cos (2
) < 0 (closer to odd symmetry), it is greater. |
|
APPENDIX C: MONOCULAR SPATIAL FREQUENCY TUNING FOR A HALF-SQUARED-LINEAR BINOCULAR SUBUNIT |
|---|
|
L(x, y),
R(x, y). The response of the binocular subunit to a particular stereogram is the square of the sum of the inner product of the image in each eye with that eye's receptive field
![]() | (C1) |
![]() | (C2) |
We consider how such a subunit responds when presented with a monocular sinusoidal grating of spatial frequency fg oriented at an angle
g away from the optimal orientation, drifting with a temporal angular frequency
g. We define the retinal coordinate system such that the x-axis is orthogonal to the optimal orientation of the subunit. Then, the grating stimulus is
![]() | (C3) |
The instantaneous response of the subunit at time t is
![]() |
and
(fg,
g) are, respectively, the Fourier power and phase of the receptive field function in the stimulated eye at the spatial frequency and orientation of the grating; we have chosen to express the orientation of Fourier components relative to the optimal orientation. The unit's mean response averaged over a stimulus temporal cycle is
![]() | (C4) |
g = 0, then this expression gives the mean response of the cell as a function of the spatial frequency fg of an optimally oriented grating. This is therefore the model prediction for the shape of our experimental SFTCs. Later, it will be convenient to express the Fourier power spectrum in Cartesian coordinates
![]() | (C5) |
|
|
APPENDIX D: DISPARITY TUNING CURVE FOR A HALF-SQUARED-LINEAR BINOCULAR SUBUNIT |
|---|
|
along the x-axis (i.e., orthogonal to the preferred orientation), the disparity-modulated component of its response is given by
![]() | (D1) |
![]() | (D2) |
2 is the area of a pixel. Then Eq. D1 becomes
![]() | (D3) |
(fx) is
![]() | (D4) |
is the Fourier transform of the receptive field expressed in Cartesian axes in which x is orthogonal to the preferred orientation, the suffixes L and R denote left and right eyes, and * denotes complex conjugation.
The response of a complex cell is modeled as the sum of n of the binocular subunits of Eq. C1
![]() | (D5) |
![]() | (D6) |

j(fx, fy) is the difference between the Fourier phases of the left and right receptive fields in the jth subunit, at the frequency component indicated. In general 
j(fx, fy) depends on frequency, but if the receptive fields are narrow-band Gabors, it is constant and equal to 
=
L
R, the phase disparity between the Gabor receptive fields.
It will be convenient to normalize the Fourier power spectrum by the response to monocular random images. The mean response to white noise in the left eye is
![]() | (D7) |
![]() | (D8) |
|
|
APPENDIX E: INEQUALITY RELATING MONOCULAR TUNING CURVES TO FOURIER POWERS PECTRUM OF DISPARITY TUNING CURVE |
|---|
|
. We restrict ourselves to the case where all receptive fields (for all subunits, in both eyes) are Cartesian-separable in the same coordinate frame, so that every receptive field
(x, y) can be written as the product of a function of x only and a function of y only. The x-function is arbitrary, but we require the y-function to be the same for all receptive fields, and to have zero Fourier phase. One example is if all the receptive fields are Gabors with the same orientation and the same extent parallel to the preferred orientation (but note that Gabor receptive fields are not required). Orientation tuning is assumed to be the same in all subunits, but SF tuning is allowed to differ both across eyes in a single binocular subunit, and across subunits. Even though we have referred so far only to binocular subunits, this framework also includes monocular subunits as a special case: we simply set the x-function in one eye to zero, so that the subunit contributes a term [Pos(vL)]2 or [Pos(vR)]2. Non-disparity-selective subunits can be represented by the sum of two monocular terms: [Pos(vL)]2 + [Pos(vR)]2. Thus, this formulation is extremely general.
Because we have assumed separability, the Fourier transform of each receptive field can be written, in Cartesian form, as
and so forth, where the subscript Lj indicates that this is the left-eye receptive field of the jth subunit. Using the results of APPENDIX C (Eq. C5), the monocular spatial frequency tuning curves obtained with optimally oriented drifting gratings in the left and right eyes, respectively, are
![]() | (E1) |
![]() | (E2) |
The product of the normalized tuning curves is
![]() | (E3) |
We now use the simplifying assumptions of separability in the rather unwieldy expression (Eq. D8) derived in APPENDIX D for the Fourier power spectrum of the disparity-modulated component of the cell's response to binocular white noise, normalized by the monocular responses. The integrals over y cancel out top and bottom, whereas the Fourier phase depends on the x-function only, because we assumed that the y-function has zero Fourier phase (e.g., is a Gaussian). We then obtain
![]() | (E4) |
Now consider the difference between the normalized Fourier power spectrum, Eq. E4, and the product of the normalized left and right monocular tuning surfaces, Eq. E3
![]() | (E5) |
The right-hand side of Eq. E5 is clearly unaffected by interchanging the summation indices j and k. We write it out twice, interchanging the indices the second time. It then becomes apparent that the terms other than the sine terms form a perfect square, so we can write
![]() | (E6) |
![]() | (E7) |
/2.
One practical problem with Eq. E7 is that it does not take account of any gain control mechanism that boosts responses to monocular stimulation relative to binocular stimulation. On expanding the squared term in Eq. D5, it is apparent that the model's mean response to binocularly uncorrelated random-dot stimuli is given by the sum of its mean responses to monocular random-dot stimuli
![]() | (E8) |
However, this means that we cannot use the observed response to monocular random dot patterns in Eq. E7, given that the quantities L and R in Eq. E7 need to be the responses to left and right monocular random-dot stimulation before any contrast normalization, and we do not have access to this experimentally. We therefore recast Eq. E7 using the response to binocularly uncorrelated stimuli U. The energy model (before response normalization) states that U = L + R. It then follows from (L R)2
0 that U2
4LR. Thus
![]() | (E9) |
In summary, for a significantly generalized version of the energy model, we have derived a useful upper limit on the normalized power spectrum of the disparity-tuning curve. The normalized disparity power at any frequency cannot exceed the normalized response to monocular drifting gratings at that frequency. This holds for a model of the same form as that proposed by Ohzawa et al. (1990
), but generalized to allow any number of subunits that may differ in their spatial frequency tuning, in the phase of their receptive fields, in the position of their receptive fields on the retina, and in the position and phase disparity between left and right receptive fields. The only property that is not allowed to vary between subunits is the orientation of the receptive fields and their extent along this axis. The upper limit on disparity power holds even in the presence of response normalization mechanisms that scale the responses to grating stimuli relative to the responses to random-dot patterns, or which scale the responses to monocular stimuli relative to the responses to binocular stimuli.
|
|
APPENDIX F: EFFECT OF STIMULUS MISALIGNMENT |
|---|
|
, restricted to the case where all receptive fields are narrow-band Gabor functions with identical spatial frequency and orientation tuning (i.e., differing only in phase and position on the retina), and where the position and phase disparity between left and right receptive fields is the same for all subunits. This is the form of the energy model used by all previous investigators that we are aware of (e.g., Fleet et al. 1996
![]() | (F1) |
![]() | (F2) |
As before, the equations are for disparities along the x-axis, and for the SFTCs obtained with gratings oriented parallel to the y-axis. However, we now allow for the possibility that the y-axis is not aligned along the true preferred orientation of the cellfor instance, because of experimental error in assessing this true orientation. The power spectrum
of this cell is shown in Fig. F1. The SFTC (Eq. F2) is simply a slice through this surface, along the fx axis. The Fourier spectrum of the disparity-tuning curve, on the other hand, is given by the line integral of this surface along lines parallel to the y-axis (Eq. F1). Referring to Fig. F1, and using the symmetry of the Gabor power spectrum, it is apparent that the peak of the disparity power spectrum will occur at the horizontal frequency f0 cos
0, where f0 and
0 are the spatial frequency and orientation of the Gabor, respectively. In contrast, it can be shown that the peak of the monocular SFTC will occur at
![]() | (F3) |
|| and 
describe the extent of the Gabor, respectively, parallel and orthogonal to its preferred orientation. It follows that, provided
|| > 
, misaligning the stimuli will cause the measured monocular SFTCs to peak at a lower frequency than the power spectrum of the disparity-tuning curve. Thus, at least for the class of model considered here, such misalignment cannot be responsible for the observation that the disparity frequency is usually lower than the preferred monocular spatial frequency.
|
|
|
APPENDIX G: GLOSSARY OF SYMBOLS USED IN THE APPENDICES |
|---|
|

)
(f)
x if x > 0, =0 otherwise
(x, y)

,
are the spatial frequency and orientation of each Fourier component)

x,
y are the components of spatial frequency orthogonal and parallel to the preferred orientation)

g
|
|
ACKNOWLEDGMENTS |
|---|
|
|
|
FOOTNOTES |
|---|
Address for reprint requests and other correspondence: J. Read, Building 49/Room 2A50, 49 Convent Dr., Bethesda, MD 20892-4435 (E-mail: jcr{at}lsr.nei.nih.gov).
|
|
REFERENCES |
|---|
|
Anzai A, Ohzawa I, and Freeman RD. Neural mechanisms for processing binocular information. II. Complex cells. J Neurophysiol 82: 909924, 1999a.
Anzai A, Ohzawa I, and Freeman RD. Neural mechanisms for processing binocular information. I. Simple cells. J Neurophysiol 82: 891908, 1999b.
Barlow HB, Blakemore C, and Pettigrew JD. The neural mechanisms of binocular depth discrimination. J Physiol 193: 327342, 1967.
Bridge H and Cumming BG. Responses of macaque V1 neurons to binocular orientation differences. J Neurosci 21: 72937302, 2001.
Cumming B and Parker A. Local disparity not perceived depth is signaled by binocular neurons in cortical area V1 of the macaque. J Neurosci 20: 47584767, 2000.
Cumming BG. An unexpected specialization for horizontal disparity in primate primary visual cortex. Nature 418: 633636, 2002.[Medline]
Cumming BG and DeAngelis GC. The physiology of stereopsis. Annu Rev Neurosci 24: 203238, 2001.[Web of Science][Medline]
Cumming BG and Parker AJ. Responses of primary visual cortical neurons to binocular disparity without depth perception. Nature 389: 280283, 1997.[Medline]
Cumming BG and Parker AJ. Binocular neurons in V1 of awake monkeys are selective for absolute, not relative, disparity. J Neurosci 19: 56025618, 1999.
Dean AF. The variability of discharge of simple cells in the cat striate cortex. Exp Brain Res 44: 437440, 1981.[Web of Science][Medline]
DeAngelis GC, Ohzawa I, and Freeman R. Depth is encoded in the visual cortex by a specialised receptive field structure. Nature 352: 156159, 1991.[Medline]
de Valois RL, Albrecht DG, and Thorell LG. Spatial frequency selectivity of cells in macaque visual cortex. Vision Res 22: 545559, 1982.[Web of Science][Medline]
Efron B. Bootstrap methods: another look at the jackknife. Ann Stat 7: 126, 1979.
Fleet D, Wagner H, and Heeger D. Neural encoding of binocular disparity: energy models, position shifts and phase shifts. Vision Res 36: 18391857, 1996.[Web of Science][Medline]
Foley JD, Dam Av, Feiner SK, and Hughes JF. Computer Graphics: Principles and Practice. Reading, MA: AddisonWesley, 1990.
Hammond P and Pomfrett CJ. Interocular mismatch in spatial frequency and directionality characteristics of striate cortical neurones. Exp Brain Res 85: 631640, 1991.[Web of Science][Medline]
Lippert J and Wagner H. A threshold explains modulation of neural responses to opposite-contrast stereograms. Neuroreport 12: 32053208, 2001.[Web of Science][Medline]
Livingstone MS and Tsao DY. Receptive fields of disparity-selective neurons in macaque striate cortex. Nat Neurosci 2: 825832, 1999.[Web of Science][Medline]
Nikara T, Bishop PO, and Pettigrew JD. Analysis of retinal correspondence by studying receptive fields of binocular single units in cat striate cortex. Exp Brain Res 6: 353372, 1968.[Web of Science][Medline]
Ohzawa I. Mechanisms of stereoscopic vision: the disparity energy model. Curr Opin Neurobiol 8: 509515, 1998.[Web of Science][Medline]
Ohzawa I, DeAngelis G, and Freeman R. Stereoscopic depth discrimination in the visual cortex: neurons ideally suited as disparity detectors. Science 249: 10371041, 1990.
Ohzawa I, DeAngelis G, and Freeman R. Encoding of binocular disparity by complex cells in the cat's visual cortex. J Neurophysiol 77: 28792909, 1997.
Ohzawa I, DeAngelis G, and Freeman RD. Encoding the binocular disparity by simple cells in the cat's visual cortex. J Neurophysiol 75: 17791805, 1996.
Ohzawa I and Freeman RD. The binocular organization of complex cells in the cat's visual cortex. J Neurophysiol 56: 243259, 1986a.
Ohzawa I and Freeman RD. The binocular organization of simple cells in the cat's visual cortex. J Neurophysiol 56: 221242, 1986b.
Poggio GF and Fischer B. Binocular interaction and depth sensitivity of striate and prestriate cortex of behaving rhesus monkey. J Neurophysiol 40: 13921405, 1977.
Prince SJ, Cumming BG, and Parker AJ. Range and mechanism of encoding of horizontal disparity in macaque V1. J Neurophysiol 87: 209221, 2002a.
Prince SJ, Pointon AD, Cumming BG, and Parker AJ. Quantitative analysis of the responses of V1 neurons to horizontal disparity in dynamic random-dot stereograms. J Neurophysiol 87: 191208, 2002b.
Qian N. Computing stereo disparity and motion with known binocular cell properties. Neural Comput 6: 390404, 1994.
Qian N and Zhu Y. Physiological computation of binocular disparity. Vision Res 37: 18111827, 1997.[Web of Science][Medline]
Read JCA. A Bayesian model of stereopsis depth and motion direction discrimination. Biol Cybern 86: 117136, 2002.[Web of Science][Medline]
Read JCA, Parker AJ, and Cumming BG. A simple model accounts for the reduced response of disparity-tuned V1 neurons to anti-correlated images. Vis Neurosci 19: 735753, 2002.[Web of Science][Medline]
Smith EL 3rd, Chino YM, Ni J, Ridder WH 3rd, and Crawford ML. Binocular spatial phase tuning characteristics of neurons in the macaque striate cortex. J Neurophysiol 78: 351365, 1997.
Tsai JJ and Victor JD. Reading a population code: a multi-scale neural model for representing binocular disparity. Vision Res 43: 445466, 2003.[Web of Science][Medline]
This article has been cited by other articles:
![]() |
K. Miura, Y. Sugita, K. Matsuura, N. Inaba, K. Kawano, and F. A. Miles The Initial Disparity Vergence Elicited With Single and Dual Grating Stimuli in Monkeys: Evidence for Disparity Energy Sensing and Nonlinear Interactions J Neurophysiol, November 1, 2008; 100(5): 2907 - 2918. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Tanabe and B. G. Cumming Mechanisms Underlying the Transformation of Disparity Signals from V1 to V2 in the Macaque J. Neurosci., October 29, 2008; 28(44): 11304 - 11314. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Kumano, S. Tanabe, and I. Fujita Spatial Frequency Integration for Binocular Correspondence in Macaque Area V4 J Neurophysiol, January 1, 2008; 99(1): 402 - 408. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. M. Sanada and I. Ohzawa Encoding of Three-Dimensional Surface Slant in Cat Visual Areas 17 and 18 J Neurophysiol, May 1, 2006; 95(5): 2768 - 2786. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. A. Read and B. G. Cumming Effect of Interocular Delay on Disparity-Selective V1 Neurons: Relationship to Stereoacuity and the Pulfrich Effect J Neurophysiol, August 1, 2005; 94(2): 1541 - 1553. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Neri A Stereoscopic Look at Visual Cortex J Neurophysiol, April 1, 2005; 93(4): 1823 - 1826. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. A. Read and B. G. Cumming Ocular Dominance Predicts Neither Strength Nor Class of Disparity Selectivity With Random-Dot Stimuli in Primate V1 J Neurophysiol, March 1, 2004; 91(3): 1271 - 1281. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |