|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1Graduate School of Engineering Science and 2Graduate School of Frontier Biosciences, Osaka University, Osaka, Japan
Submitted 9 September 2005; accepted in final form 29 December 2005
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
How does the processing of stereoscopic information proceed once binocular disparity for small localized areas is available? Is a possible next stage of processing that of detecting the rate of change of binocular disparity, i.e., detecting 3D orientations of surfaces in depth? Some recent studies have examined these possibilities and report that a subset of neurons in higher visual areas such as MT, V4, and CIPs encode information regarding slant/tilt of surfaces (Hinkle and Connor 2002
; Nguyenkim and DeAngelis 2003
; Taira et al. 2000
). Response to 3D curvature is also reported in the inferotemporal cortex (IT) (Janssen et al. 1999
, 2000
Liu et al. 2004
). It is not known, however, whether such surface slant/tilt sensitivity is a unique feature of these higher-order visual areas. Because neurons in these areas receive inputs from primary visual cortex, selectivity for 3D surface slant/tilt may be inherited from the early visual areas. Historically, the role of interocular orientation difference has been examined in some detail (Blakemore et al. 1972
; Nelson et al. 1977
). More recent work has examined whether a subset of V1 neurons encode surface tilt by orientation disparity based on physiological experiments in the monkey and a computational study (Bridge and Cumming 2001
; Bridge et al. 2001
). However, possible roles of interocular spatial frequency difference have not been examined physiologically.
As illustrated in Fig. 1A, projection of a slanted surface onto the two retinae produces a spatial frequency difference, such that the eye closer to the nearer end of the slanted surface sees higher spatial frequency than the other eye. Such a difference in spatial frequency across the eyes is a potent cue for perceiving surface slants. With psychophysical experiments, Blakemore and later investigators reported that a difference of spatial frequency across the eyes produces a perception of slant-in-depth (Blakemore 1970
; Fiorentini and Maffei 1971
; Wilson 1976
). Binocular disparity caused by interocular spatial frequency difference is designated dif-frequency disparity (Tyler and Sutter 1979
). As expected, the angle of perceived surface slant depends on interocular ratio of spatial frequencies. Despite these psychophysical results, we are not aware of any physiological study that has systematically examined possible roles of dif-frequency disparity for encoding surface slant in the early visual cortex. In this study, we will thus address this question using modern receptive field-mapping techniques.
|
Considering the predictions illustrated above from a modified version of the disparity energy model, we will first examine the extent to which neurons in areas 17 and 18 of the cat visual cortex exhibit tilted binocular RFs. We will also examine the validity of the dif-frequency disparity energy model by comparing the degree of tilt of binocular RF and monocular spatial frequency-tuning curves for the two eyes.
| METHODS |
|---|
|
|
|---|
Surgical procedure and animal maintenance
Forty-four adult cats (1.54 kg) were prepared for electrophysiological recording as follows. First, subcutaneous injection was given of atropine sulfate (0.017 mg/kg) and hydroxyzine hydrochloride (Atarax, 0.83 mg/kg). Anesthesia was induced and maintained during surgery with isoflurane (2.53.5% in O2). Cefotiam hydrochloride (Panspolin, 2.8 mg/kg) and dexamethasone sodium phosphate (Decadron, 0.13 mg/kg) were administered. Electrocardiogram (ECG) electrodes and a rectal temperature probe were installed. The rectal temperature probe was coated with lidocaine ointment. ECG and core temperature were monitored using a custom-built PC-based physiological monitoring system. Catheters were inserted into femoral veins of two limbs for infusion of drugs and fluids. A glass tracheal cannula was inserted after tracheostomy. A stereotaxic apparatus was used to securely position the animals head. Lidocaine ointment was used at pressure points of ear bars. After securing the animal to the stereotaxic apparatus, anesthesia was switched to thiopental sodium (Ravonal, administered continuously in infusion, 1.01.5 mg · kg1 · h1). Then, paralysis was induced with an initial dose of gallamine triethiodide and the animal was placed under artificial respiration at the rate of 2030 strokes/min. The respiration rate and stroke volume were adjusted to maintain the end-tidal CO2 between 3.5 and 4.3%. A CO2 sensor (Datex-Ohmeda) was used to maintain a proper level of respiration. Anesthesia for the rest of recording session was maintained by a combination of 70% N2O-30% O2 and thiopental sodium as noted above. Paralysis was maintained by continuous infusion of Ravonal, gallamine triethiodide (10 mg · kg1 · h1) in lactated Ringer solution containing 50% glucose (40 mg · kg1 · h1). Body temperature was maintained near 38.3°C with the use of a servo-controlled heating pad. After securing the animal, a craniotomy was performed to access the central representation of the visual area 17 or 18 (HorsleyClarke P4 L2.5 for recordings of A17, A3 L3 for A18). The dura was carefully removed to allow insertion of microelectrodes. Pupils were dilated with atropine (1%), and nictitating membranes were retracted with phenylephrine hydrochloride (Neosynesin, 5%). Contact lenses of appropriate power with 4-mm artificial pupil were placed over the corneas.
The area of recording was primarily determined by the coordinate of electrode penetrations, although histological confirmation of recorded areas was conducted for the majority of animals. There is a possibility that a small fractions of neurons, especially from long penetrations, may be classified into a wrong cortical area. However, we did not eliminate those neurons (for which we were not completely certain of the area) from our analyses because they still represent important and valid samples for purposes of this study and there were no obvious areal differences.
Experimental apparatus
Tungsten microelectrodes (5 M
, A-M Systems) were used to record spike activities extracellularly. To increase the chance of encountering cells, two electrodes were mounted in parallel in a protective single guide tube and driven by a common microelectrode drive (Narishige). After confirming under a microscope that the electrodes do not penetrate blood vessels on the cortical surface, agar in warm Ringer solution was applied to stabilize and protect the cortex. Then, melted wax was applied over the agar to form a sealed chamber. An oscilloscope and audio speakers were used to monitor raw signals from the microelectrodes. Electrical signals from the microelectrodes were amplified (10,000x) and band-pass filtered (3005,000 Hz). Then spike sorting was achieved using a custom-built spike sorter (Ohzawa et al. 1996
), where each spike was sorted by their waveforms and time stamped with 40-µs resolution.
Visual stimulation and recording procedures
Experiment control functions and generations of visual stimuli were performed using custom-built software. Visual stimuli are generated by a dedicated PC and displayed on a CRT display (Sony GDM-FW900, a resolution of 1,600 x 1,024 pixels, covering the display area of 46.6 x 29.9 cm, 34.3 dot/deg; refresh rate: 76 Hz). A custom-built mirror haploscope was used to present stimuli to left and right eyes separately (Fig. 2A). To preclude projection of stimulus to contralateral eye, a separator was placed between the left and right visual fields. Distance between the screen and the eyes was set to 57 cm, subtending the visual field of 23.3(horizontal) x 29.9(vertical) degrees for each eye. Because we were examining interocular differences in neuronal responses, we carefully set up the haploscope and adjusted distances to the screen to equate the viewing conditions for two eyes as much as possible. The display surface of CRT monitor was carefully set perpendicular to the lines of sights for the subject.
|
Binocular receptive field mapping
Binocular reverse correlation procedure was equivalent to that used by Ohzawa et al. (1997)
. A pair of one-dimensional bar stimuli was simultaneously presented to left and right eyes by a mirror haploscope setup (Fig. 2A). Twenty stimulus locations of the bar were used to stimulate receptive fields for each eye. This defined 20 x 20-point stimulus grid in the (XL, XR) domain (Fig. 2B). Therefore the binocular receptive field was measured by tallying up responses to 1,600 (20 x 20 x 4) different dichoptic pairs of stimuli. The orientation of the bar stimuli was set to the preferred orientation for each eye and for each cell. All possible combinations of left and right eye stimulus positions were included for each leftright permutation of contrast sign (darkdark, brightbright, darkbright, brightdark). All pairs of positions and combinations of stimulus contrast were presented in a random order, each stimulus lasting for 26 ms (two video frames) or 53 ms (four video frames) without any blank stimulus. Stimulus sequence was reshuffled for each set. A complete stimulus sequence lasted 42 s. Typically 2040 sequences were used, which took 2040 min in all. The response map for each contrast subset was calculated by cross-correlating spike trains with stimulus sequences (Fig. 2B). Binocular receptive field is a sum of response maps for matched polarity (brightbright and darkdark) conditions minus those for mismatched polarity (darkbright and brightdark) conditions. Monocular responses are cancelled by this computation and do not appear in the binocular RF (Ohzawa et al. 1997
). We calculated the binocular RF for correlation delays from 100 to +300 ms in 5-ms steps. Because there is no correlation between spike train and stimulus sequence for negative time delays, we defined the response at negative time delays as noise. To obtain the optimal correlation delay, the sum of squared value of all data points in the RF at each correlation delay was obtained for the range of delays, and the peak delay was determined. A binocular receptive field is constructed at this optimal correlation delay. To evaluate the signal-to-noise ratio, we calculated the SD of the response at the optimal correlation delay divided by the average SD for negative correlation delays (100 to 5 ms in 5-ms steps). We rejected data when the total spikes are <1,000 impulses and the peak response at the optimal delay did not exceed the mean of response at negative correlation delays +10SD.
Spatial frequency-tuning test
Left and right spatial frequency tunings were obtained by using drifting sinusoidal gratings in a separate test. Orientations of grating stimuli were fixed at the optimal value for each eye because preferred orientations were typically different by 515° for the two eyes, probably arising from cyclorotation of the eye after paralysis (Nelson et al. 1977
). The gratings were presented in a random order and each presentation lasted for 4 s interspersed with 1 s of interstimulus intervals. Mean firing rates were calculated at each spatial frequency. One-dimensional Gaussian functions were fitted to each spatial frequency tuning. Preferred spatial frequencies were obtained by the peak position of the fitted Gaussian function.
| RESULTS |
|---|
|
|
|---|
Binocular RFs for representative examples from the separable and inseparable types are illustrated in Fig. 3. Figure 3A depicts a binocular RF for a simple cell recorded in area 18. The binocular RF appears to be described reasonably well by a product of left and right monocular receptive field profiles for simple cells, as reported by Anzai et al. (1999a)
. Correlation between the standard simple/complex RF types and separability of binocular RF is high, but these classifications are not identical. This issue, including the basis for our choice of using the separability, will be described later (see following text). An exemplar complex cell recorded in area 18 is illustrated in Fig. 3C. The binocular RF showed a horizontally elongated structure like that in previous studies (Anzai et al. 1999b
; Ohzawa et al. 1990
, 1997
). Complex cells tend to exhibit binocular RFs that are not leftright separable. Such inseparable receptive fields are well described by a disparity energy model where the sum of output of quadrature pairs of separable RFs constructs an inseparable RF (Anzai et al. 1999a
,b
; Ohzawa et al. 1990
, 1997
). On closer examination of this binocular RF, we noticed a small amount of tilt in the binocular RF from the frontoparallel axis in the clockwise direction. We wished to determine whether these small tilts are reliable properties of the neurons or arise from experimental noise or variability. Note that a small degree of tilt in the (XL, XR) domain translates into a substantially larger surface slant in real object space in front of the animal. This is because, under realistic viewing conditions, the lines of sight from the two eyes to a fixation point crosses with a much more acute angle than the 90 ° angle for the (XL, XR) domain. For example, given a viewing distance of 57 cm and interpupillary distance of 3 cm, a tilt of 5° in the (XL, XR) domain is equal to the surface slant that is 73.3° from the frontoparallel plane (see APPENDIX). Therefore even a small visible tilt in the (XL, XR) domain may have a large perceptual significance.
|
Alternatively, the same frequency domain may be referenced by a pair of orthogonal axes, along the vertical and horizontal directions corresponding to the diagonals of the square domain (Fig. 3B). These dimensions are defined as the disparity frequency and the frontoparallel frequency for vertical and horizontal axes, respectively (see APPENDIX). Interestingly, the four quadrants of the domain may be assigned to either disparity frequency tuning or frontoparallel frequency tuning. Top and bottom quadrants represent tuning for disparity, as indicated by two spectral peaks in Fig. 3D. The locations of the peaks in these domains allow extraction of such parameters as the optimal disparity frequency and binocular RF tilt. Left and right quadrants, on the other hand, will have substantial peaks only for separable neurons, and represent spatial frequency tuning of combined input from the two eyes. Therefore the peaks in these quadrants define the optimal frontoparallel frequency.
The process of determining binocular RF parameters in the frequency domain is illustrated further in Fig. 4. Based on the observation that substantial peaks are present in the left and right quadrants only for separable RFs, we define an index of separability of receptive field in the XL, XR domain, the binocular separability index (BSI), as follows
![]() | (1) |
|
) may be defined as the angular deviation of the spectral peak from the disparity frequency axis, connecting the top and bottom corners of Fig. 4C. If the spectral peak is exactly on the disparity frequency axis, original binocular RF has zero tilt. A nonzero
indicates a corresponding tilt of binocular RF. To estimate peak frequencies with greater accuracy, we interpolated the Fourier spectrum by cubic spline before evaluating the binocular RF tilt. A spatial frequency step of 0.005 (cycles/deg) is used as the resolution of interpolation for all neurons. To determine the step size for interpolation, we calculated percentage errors for binocular RF tilts for various resolutions of interpolation by simulations. Fourier transforms are performed on simulated binocular RF data obtained from model binocular complex cells with various interocular spatial frequency ratios (fL/fR = 0.66 to 1.5), and various disparity frequencies (fdisparity = 0.07 to 0.5; see APPENDIX). The data array is set to the same size as that in our experiments (20 x 20 grid). Then, interpolations are tested for various final resolutions (0.005 to 0.1 cycle/deg). On average, a sufficiently small error level (0.96 ± 0.03% error) for binocular RF tilt is obtained with the interpolation resolution of 0.005 cycle/deg. The percentage error increased to 13.43 ± 0.17% at the 0.1 cycle/deg resolution.
Because there is always a spectral peak in the bottom quadrant regardless of binocular RF separability, the calculations outlined above are applicable both to separable and to inseparable type of neurons. Note that cross sections going through the spectral peak that are parallel to the left and right frequency axes depict monocular spatial frequency-tuning curves, as estimated from the binocular RF data. These tuning curves are illustrated at the bottom left and right insets of Fig. 4C. The "tilt" angle of the binocular RF (
) may be determined from the peak coordinate of the binocular RF (f0L, f0R), as follows
![]() | (2) |
Binocular RF tilt
is transformed into disparity gradient, which is more commonly used to quantify slants of oriented surfaces in 3D. Disparity gradient represents surface slant independent of viewing distance. It is usually defined as
![]() | (3) |
is the angular separation between the directions for the two objects as viewed from the cyclopean eye, i.e., the midpoint between the two eyes (Burt and Julesz 1980
, as described in the following equation
![]() | (4) |
is 10°, the disparity gradient is 0.35, which corresponds to about 80° of physical surface slant at 57 cm of viewing distance. Using the disparity gradient as defined above, we will quantify and summarize RF slant for all neurons below. As illustrated in Figs. 3 and 4, simple and complex cells tended to show different binocular RF profiles, binocularly separable and inseparable, respectively. However, simple/complex and separable/inseparable classifications are not the same. There are simple cells that are classified as inseparable, and vice versa. For the reasons outlined below, we will use the separable/inseparable type classification throughout the paper. However, before we set out to perform all the analyses based on this classification, we should examine the correlation between the two classification methods.
Note that an ideal complex cell based exactly on the disparity energy model will have a BSI of exactly 0 (Anzai et al. 1999b
; Ohzawa et al. 1990
, 1997
). On the other hand, ideal binocular simple cells that linearly sum left and right eye input will have a BSI of 1 (Anzai et al. 1999a
; Ohzawa et al. 1990
, 1996
). The actual population of neurons we have recorded exhibited substantial deviations from the ideal cases as illustrated in Fig. 5.
|
Recall that one of the purposes of this study is to examine whether the apparent tilt of binocular RF profile is based on the difference in the optimal spatial frequencies across the eyes (Fig. 1). The question is addressed in the next several figures based on results of binocular RF and spatial frequency-tuning measurements from both binocularly separable and inseparable neurons. Data from representative examples of binocularly separable neurons are illustrated in Fig. 6. Binocular RF profiles are shown in the left column. In the middle column, monocular Fourier spectra derived from the binocular RF are shown as solid and dashed curves for the left and right eyes, respectively. These are cross sections through the peak of the Fourier spectrum as illustrated in Fig. 4C, taken parallel to the left and right frequency axes. Actual spatial frequency-tuning curves obtained by drifting sinusoidal grating stimuli are illustrated in the right column. The predicted tuning curves in the middle column and those in the right column should be comparable directly under certain linearity assumptions (DeAngelis et al. 1993a
,b
). Open and filled symbols depict responses for the left and right eyes, respectively. Error bars represent the SE. A horizontal dashed line indicates the spontaneous firing rate. A Gaussian function of the following form is fitted to each tuning curve
![]() | (5) |
|
The binocular RF tilt determined as above is automatically reflected as a difference in the predicted spatial frequency-tuning curves shown in Fig. 6A (middle). The predicted disparity gradient for this cell is 0.25, as calculated by Eq. 4. A similar statistically significant difference in the optimal spatial frequencies for the two eyes is also observed for the actual tuning curves measured by drifting sinusoidal gratings (Fig. 6A, right; P < 0.05, bootstrap test) in that the optimal spatial frequency for the right eye (vertical dashed line) is higher than that for the left eye (vertical solid line). Therefore for this neuron, there is a good correspondence between the tilt of the binocular RF (measured by reverse correlation) and the interocular difference between the optimal spatial frequencies (measured by drifting gratings).
Similar additional data from two separable binocular RFs are shown in Fig. 6, B and C in the same format as that of Fig. 6A. For these two cells (both of which were simple), tilt angles
of binocular RFs were significantly different from zero (P < 0.05, bootstrap test). The tilt angles of binocular RF for Fig. 6, B and C are 6.3 and 8.5°, which correspond to predicted preferred disparity gradients of 0.22 and 0.3, respectively. Again, for these additional cells, the actual spatial frequency-tuning curves measured by drifting gratings (right column) also show statistically significant difference between the eyes (P < 0.05, bootstrap test). The ratios of optimal spatial frequencies (left/right) are 0.71, 0.69, and 0.70 for cells in Fig. 6, AC, respectively. Again, the direction of the difference in predicted spatial frequency-tuning profiles (middle column) corresponds well to that for the measured data (right column) for each neuron. Therefore these results for binocularly separable neurons indicate that the tilt angles of their binocular RFs and their predicted disparity gradients correspond well with the leftright differences of optimal spatial frequencies measured by monocularly presented drifting gratings.
Data from representative examples of inseparable neurons are illustrated in Fig. 7 in the same format as that of Fig. 6. Spatial frequency-tuning curves are not available for B and D either because spikes for one of the cells appeared after the initial tuning tests were already completed or, for the case of D, data for the frequency-tuning test did not show significantly modulated responses as a function of spatial frequency (P > 0.05, ANOVA). Binocular RFs shown in Fig. 7, pairs A and B, C and D are from neurons that were recorded simultaneously. The neuron shown in A is an example for which the binocular RF was significantly tilted from the frontoparallel plane (P < 0.05, bootstrap test). In fact, all of the examples except for that in Fig. 7C had statistically significant tilt for their binocular RFs. Note that the neuron illustrated in B had a statistically significant tilt in the opposite direction from the other member of the pair shown in A. The opposite tilt directions for the pairs of neurons clearly indicate that the tilts of binocular RFs do not arise from optical factors such as errors in the eye-display distances or magnification differences between the eyes. Because there are significant differences in the degree of tilt among simultaneously recorded neurons, these variations must be neural in origin.
|
As with binocularly separable neurons presented in Fig. 6, independent measurements of spatial frequency-tuning curves are also conducted using drifting sinusoidal gratings. Preferred spatial frequencies, shown as vertical solid and dashed thin lines in the right column, differ significantly across the eyes for Fig. 7, A, E, and F (P < 0.05, bootstrap test), but not for Fig. 7C (P > 0.05, bootstrap test). This is consistent with the lack of significant tilt of binocular RF for this neuron. Therefore for all cases shown in Fig. 7, A, C, E, and F, directions of interocular spatial frequency difference correspond well to the frequency difference of binocular RFs.
Paired recordings are also possible between neurons of different binocular separability. Such an example is shown in Fig. 8. Binocular RFs shown in Fig. 8, A and B are separable and inseparable RF, respectively. For both neurons, binocular RFs exhibit significant tilts from the frontoparallel plane (P < 0.05, bootstrap test). Furthermore, the tilts are in opposite directions between the two neurons. The tilt angles for cells in Fig. 8, A and B are 3.2 and 3.7°, with the corresponding preferred disparity gradients of 0.11 and 0.13, respectively. Actual spatial frequency-tuning curves measured with drifting gratings are shown in the right column. As expected from the binocular RF tilts, the interocular difference in the preferred spatial frequencies are opposite for the two neurons. The left preferred spatial frequency is significantly higher than the right frequency (frequency ratio = 1.23, P < 0.05, bootstrap test) for A; the difference is significant and opposite (frequency ratio = 0.87, P < 0.05, bootstrap test) for B. Taken together with the results from the previous figure, both separable and inseparable binocular RFs show a variety of tilts that are consistent with the interocular difference in the monocularly measured preferred spatial frequencies. Therefore the notion of the basis of 3D surface tilt representation, as illustrated in Fig. 1, appears quite likely based on these examples.
|
|
) of the error distribution is set such that 1.96
= 0.1. Statistical tests for data and artifactual distributions are carried out by the F-test. Distribution of preferred disparity gradients is significantly wider than that of the error distribution (test for equal variance, F = 13.4, P < 0.001 for separable type; F = 7.42, P < 0.001 for inseparable type). Similarly, we also calculated possible contributions of interocular magnification differences.
|
Relationship between disparity gradient and spatial frequency ratio
In representative examples shown in Figs. 68, the spatial frequency differences across the eyes were generally qualitatively correlated with the tilt of binocular RFs. How does this correlation hold for the entire population of neurons? In general, how do other binocular tuning characteristics correlate with monocular tuning properties? Figure 11 summarizes the results relevant for addressing these questions.
|
To examine the correlation between the interocular frequency difference and the tilt of binocular RF, the ratios of preferred spatial frequencies were computed as follows and compared with the disparity gradients. The frequency ratio is given by
![]() | (6) |
An additional point was examined in relation to predicting binocular properties from monocular tuning characteristics. Figure 11, E and F shows the relationship between binocular disparity frequency and monocular preferred spatial frequency. The disparity energy model predicts identity between the two frequencies. Regarding this question, Ohzawa et al. (1997)
reported the discrepancy between the model prediction and the data. They reported that the disparity frequency tended to be lower than the monocular spatial frequency as measured by drifting grating stimuli. Because their analyses were performed only for complex cells, it is not clear at which stage of binocular processing this discrepancy occurs. Based on a new set of data and a more robust analysis method, we have addressed this issue. In our analysis, we use Fourier analysis both for separable and inseparable RFs to obtain disparity frequencies. For the monocular preferred spatial frequency, the average of left and right preferred spatial frequencies (from data in Fig. 11, A and B) are used. Cells recorded from areas 17 and 18 are plotted as circles and triangles. The scatterplot for the inseparable binocular RFs showed a discrepancy between the disparity frequency and the spatial frequency, in that the disparity frequency tends to be lower than the monocularly measured preferred spatial frequency (Fig. 11F). Deviations of actual data from the identity line tended to be larger for high spatial frequencies (slope = 0.71). Because most inseparable binocular RFs are from complex cells (Fig. 5B), our data show a trend similar to that reported in previous work (Ohzawa et al. 1997
). In contrast, separable binocular RFs show a much better fit with the identity relationship between the two frequencies. The slope of separable RF is close to 1 (slope = 0.92) (Fig. 11E). These results probably suggest that separable cells sum monocular inputs through linear processing, whereas neurons with inseparable RFs have substantial nonlinearities in their processing. The source of the deviation must therefore lie between the linear subunits of complex cells and the final complex cell stage if we assume the hierarchical organization similar to that in the disparity energy model.
Aspect ratio of binocular receptive field
Although the dif-frequency version of the disparity energy model accounts for the trend in the data as we have seen in the previous section, we wondered whether there are additional mechanisms by which tilted binocular RFs are constructed. Another possibility we examine is a hierarchical organization as illustrated in Fig. 12. A tilt in the binocular RF profile may be generated if the outputs of multiple disparity energy units are combined, where each unit is tuned to a specific disparity without tilt and its preferred disparity progressively shifts as a function of its frontoparallel position (Fig. 12A). Such a hierarchical pooling produces a binocular RF, shown in Fig. 12B. This neuron (Fig. 12B) will have a highly elongated and tilted binocular RF. The angle of tilt depends on the rate at which subunits preferred disparities shift with the frontoparallel position. Such an organization predicts a substantial elongation of the binocular RF in the frontoparallel dimension. The degree of pooling may be quantified by an aspect ratio of binocular RF. If the hierarchical organization underlies slant sensitivity of binocular neurons, there should be a correlation between the tilts of binocular RF and their aspect ratios.
|
![]() |
d and
f are the SDs of the fitted Gaussian in the disparity and frontoparallel frequencies, respectively. The aspect ratio of a binocular RF is defined by the ratio of SDs as
![]() |
Distributions of aspect ratios for separable and inseparable RFs are shown in Fig. 13, A and B. For most neurons, aspect ratios were >1 for inseparable RFs (Fig. 13B). Mean aspect ratios are 1.15 and 1.67 for separable and inseparable RFs, respectively. The result for inseparable RFs, the majority of which are complex cells, indicates a substantial degree of spatial pooling, deviating substantially from prediction of the disparity energy model. The relationship between the aspect ratio and the disparity gradient is presented in Fig. 13, C and D. If the hierarchical organization hypothesis (Fig. 12A) is correct as a basis for slant selectivity, neurons with highly elongated receptive fields should possess a wide range of disparity gradients. In contrast, neurons with aspect ratios close to 1 should show a narrow distribution for disparity gradients near zero. However, our data show the opposite: Disparity gradients tended to be highly variable for neurons with low aspect ratios, but were relatively small for those with high aspect ratios for inseparable RFs (separable: n = 45, P = 0.09, MannWhitney U test; inseparable: n = 90, P < 0.05, MannWhitney U test).
|
2), indicating that pooling may play at least some role in constructing slightly tilted binocular RFs. The fact that the model of Fig. 12 is rejected should not be interpreted to mean that pooling is not important. Rather, it may play an important role in slant discrimination. It is possible that the role of pooling for constructing RF with a high aspect ratio is to create neurons that can signal near-zero surface slants with greater accuracy, allowing fine slant discriminations for surfaces near frontoparallel. Our results are certainly consistent with such a possibility.
Having defined the aspect ratio of binocular RF, we return to the question of the relationship between the interocular spatial frequency difference and binocular RF tilt. Do neurons with untilted binocular RF with a clear interocular spatial frequency difference have elongated RFs (with high aspect ratios)? These neurons cannot be explained by either of the models we have examined so far. However, one possibility we have not considered is the opposite of the model in Fig. 12 where the spatial pooling is performed over highly tilted subunits but along the exact frontoparallel direction. That is, although individual unpooled units possess tilted binocular RFs, the spatial pooling produces a counteracting effect, thereby canceling the tilts of pooled members. We therefore examined the aspect ratios of representative neurons of this type. Four neurons in the rightmost part of the scatterplot in Fig. 11D have been selected. These neurons have a frequency ratio >1.5. Three of the four neurons had large aspect ratios of 1.91-2.53, and one of them had an aspect ratio of 1.08. The results are not conclusive, but there is a tendency for these neurons to have highly elongated binocular RFs.
Does the aspect ratio relate to other parameters of binocular RF? Figure 14 illustrates relationships among the depth-domain aspect ratios, RF sizes, and preferred spatial frequencies. The relationships between the depth-domain aspect ratio and RF sizes are illustrated in Fig. 14, A and B. Sizes of binocular RFs are defined both in the disparity and frontoparallel directions as illustrated in Fig. 12. There is no correlation between the aspect ratio and the RF size in the frontoparallel direction (Fig. 14A) (separable: r = 0.14, P > 0.05, n = 45; inseparable: r = 0.02, P > 0.05, n = 90). In contrast, there is a significant negative correlation between the aspect ratio and the RF size in the disparity direction (Fig. 14B) (separable: r = 0.1, P > 0.05, n = 45; inseparable: r = 0.48, P < 0.001, n = 90). These results indicate that RFs with high aspect ratios tended to have narrow absolute RF sizes in the depth dimension.
|
Relationship between orientation and disparity gradient
Orientation bias was found for encoding of binocular disparity in that neurons with dissimilar RF profiles (RF phases) between the two eyes tended to prefer near-vertical orientations (DeAngelis et al. 1991
, 1995
; Ohzawa et al. 1996
). Is there a similar orientation bias for the neural representation of slant-in-depth? The relationship between the preferred orientation and disparity gradient is illustrated in Fig. 15. The preferred orientation was evaluated by the peak of fitted Gaussian function to the orientation tuning data measured by drifting sinusoidal gratings. The average orientation for the two eyes was used. Each preferred orientation is represented as an angle from the horizontal. Black and gray symbols depict binocular RFs tilted significantly and nonsignificantly from the frontoparallel plane. Circles and triangles indicate cells recorded from areas 17 and 18, respectively. If the slant-in-depth encoding depends on the preferred orientation of RFs, there should be a positive correlation between these parameters, although no correlations are observed between the two parameters (Pearsons r = 0.09, n = 168, P > 0.05).
|
As illustrated in Fig. 11F, there is a discrepancy between the preferred spatial frequency and the disparity frequency for inseparable RFs as originally reported by Ohzawa et al. (1997)
and confirmed in the present study. That the discrepancy was found for inseparable binocular RFs, but not for separable ones, suggests that the discrepancy originates from a stage that pools the output of multiple simple-type subunits of complex disparity energy units. Moreover, nonlinearities in these pooling processes may be the source of the discrepancy. If this is the case, there may be a correlation between the size of discrepancy and the degree of pooling quantified by the aspect ratio. To examine this, we compared the aspect ratio with the size of discrepancy, which is quantified by the ratio of the preferred spatial frequency to the disparity frequency (SF/DF ratio). Because the preferred spatial frequency tended to be higher than the disparity frequency (Fig. 11F), the SF/DF ratio is >1 for most neurons.
Figure 16A illustrates the relationship between the aspect ratio and the SF/DF ratio. Filled and open symbols depict separable and inseparable RFs, respectively. Circles and triangles indicate cells recorded from areas 17 and 18, respectively. There is a positive significant correlation between the two parameters for inseparable RFs but not for separable RFs (separable: r = 0.09, P > 0.05, n = 45; inseparable: r = 0.55, P < 0.001, n = 90). Figure 16, B and C illustrates relationships between the RF size in the frontoparallel direction and the SF/DF ratio, and between RF size in the disparity direction and the SF/DF ratio, respectively. No significant correlations are observed from these scatters (B: r = 0.006, P > 0.05, n = 45 for separable; r = 0.08, P > 0.05, n = 90 for inseparable; C: r = 0.03, P > 0.05, n = 45 for separable; r = 0.2, P > 0.05, n = 90 for inseparable). The significant correlation between the SF/DF ratio and the aspect ratio suggests that the pooling process is responsible for the discrepancy between the monocular preferred spatial frequency and the binocular disparity frequency.
|
| DISCUSSION |
|---|
|
|
|---|
Selectivity of neural responses for 3D surface slant was estimated by the tilt of binocular RF from the frontoparallel plane. Because the degrees of tilts are small, approximately 10° at most, we had to show that these tilts do not arise from noise, variabilities of experimental calibrations, or other nonneural factors. We have presented three pieces of evidence to establish that observed variations of binocular RF tilts are real and neural in origin. First, bootstrap tests were performed to show that measured binocular RFs possess significant nonzero tilt in the presence of neural response variability. Second, we have shown that pairs of neurons recorded simultaneously had significantly different RF tilts. If the RF tilts are attributable to optical factors such as errors in distance or magnification adjustments for the two eyes, the degree and direction of tilt should be similar for the pair of neurons. Our examples show tilts of binocular RFs that are in opposite directions between the pair of cells (Figs. 7 and 8). Third, we have determined that the contribution of artifactual sources of errors is much smaller than the variance we observe in the data. Distributions of actual RF tilts are significantly wider than that for artifacts (Fig. 10). Based on these pieces of evidence, we conclude that there are true variations in the tilt of binocular RFs that are neural in origin.
The range of distribution of disparity gradients, which were converted from the tilts of binocular RF maps, is ±0.5 both for separable and inseparable binocular RFs (Fig. 9). These ranges are capable of representing actual surface slants in the real world of >70° assuming the 57-cm viewing distance. The range of disparity gradient representation we have found (for the cat) is also similar to that for neurons in area MT of the monkey (Nguyenkim and DeAngelis 2003
).
Although the possible ranges of slant encoding are similar across visual areas, there is a critical difference between the slant representation in the early visual cortex and higher-order areas. Neurons in extrastriate areas are known to have selectivities to surface orientations that are invariant with respect to positional disparity (Hinkle and Connor 2002
; Nguyenkim and DeAngelis 2003
; Taira et al. 2000
). Although we have not tested explicitly for the disparity invariance of slant-in-depth selectivity, it is clear that a model based on the tilted binocular RF by itself cannot be position-disparity invariant. Therefore neurons in areas 17 and 18 are likely to be highly sensitive to position disparity, but they also carry additional information on surface slant because the RF model predicts the maximum firing for the neuron when both the disparity and the surface slant match the binocular RF.
Dif-frequency organization for slant-in-depth encoding
We have examined a modified disparity energy model based on the interocular spatial frequency difference (dif-frequency). When viewing a slanted 3D surface, the spatial frequency contents for the corresponding areas of the surface are different between the eyes (Blakemore 1970
; Fiorentini and Maffei 1971
; Tyler and Sutter 1979
; Wilson 1976
). It was known that for some neurons in early visual cortex, the preferred spatial frequencies for the two eyes were not always the same (Hammond and Pomfrett 1991
; Read and Cumming 2003
). However, no examination of corresponding predicted tilt in the binocular RF (Fig. 1), which would be more direct evidence for surface slant representation, was available. Comparison of the preferred disparity gradient and the ratio of optimal spatial frequencies for the two eyes as measured by drifting gratings shows a significant correlation between these two parameters. However, the correlation was significant only for neurons with inseparable RFs. In general, the correlation was not as good as we initially expected. In fact, only about one third of the neurons show responses consistent with the theoretical prediction. Other neurons are distributed outside the range of prediction for the dif-frequency disparity energy model. Thus we have explored additional alternative possibilities.
Hierarchical organization for slant-in-depth encoding
Another obvious possibility for generating slant-in-depth selectivity is by spatially pooling multiple neurons with progressive shifts of their preferred disparities (Fig. 12A). To examine this possibility, we analyzed the aspect ratio of binocular RFs. The prediction based on the pooling model of Fig. 12 was not fulfilled despite the evidence for extensive spatial pooling. On the contrary, the relationship between the aspect ratio and the disparity gradients correlated in the opposite direction from our expectation. The neurons with little pooling (aspect ratio near 1) tended to have a variety of preferred disparity gradients, whereas RFs of those with substantial pooling were not tilted (Fig. 13D). However, the opposite result does not necessary rule out possible roles for the spatial pooling. For example, we should also note the possibility that spatial pooling actively generates neurons with high aspect ratios and tuned to near-frontoparallel surface slants to enhance slant discrimination performance for near-frontoparallel surfaces. Such a possibility is consistent with our findings.
In addition, our findings may have a possible basis in the way disparity gradients and lateral spatial extents are negatively correlated. If we assume equal average physical spatial extents for depth and frontoparallel directions for a large number of objects in the physical world, slanted surfaces on average should occupy a narrower frontoparallel extent than that of nonslanted surfaces. It would be of interest to examine stereoimage statistics of natural scenes to determine the exact form of such a correlation.
Two kinds of dif-frequency models
Psychophysically, Halpern et al. (1996)
reported that dif-frequency organization per se does not provide a robust slant-in-depth signal. Such a result appears to contradict the premise of the dif-frequency notion. However, we must note that there are two distinct levels of dif-frequency models. One is the strong form of the dif-frequency model that was examined and ruled out by Halpern et al. This model is based on the notion that a spatial frequency difference as such (without consistent local binocular correlations) is sufficient to signal surface slant. The other, weaker form of dif-frequency model, which we have examined, is an extension to the disparity energy model where the spatial frequency difference provides additional information regarding slant on top of local disparity information. Neurons with tilted binocular RFs will respond maximally when both the local binocular disparity and the surface slant simultaneously match the RF parameters. Therefore such a neuron is tuned to both the disparity and the interocular frequency difference.
Our findings on the effects of the interocular frequency difference is highly analogous to those reported by Bridge and Cumming (2001)
with respect to the interocular orientation difference. They have found that monkey V1 neurons show responses to interocular orientation difference in a predictable manner based on the "dif-orientation" disparity energy model, and that the neural responses depend on both the binocular disparity and orientation difference. This is exactly what we find for spatial frequency. They have also found that the V1 neurons are not tuned for the relative orientation difference. Tuning for the relative orientation difference means that the optimal orientation difference is invariant regardless of the absolute orientations of the stumuli. Similarly, the model based on tilted binocular RF predicts no tuning for the relative spatial frequency difference. As with the disparity invariance of surface orientation tunings found in higher-order visual areas, tunings for the relative orientation or spatial frequency difference may be found in those cortical areas.
Is there an orientation bias for slant-in-depth encoding?
We investigated the possible orientation bias for slant-in-depth encoding because such an orientation bias has been found for phase-disparity encoding (DeAngelis et al. 1991
, 1995
). As apparent from Fig. 15, there is no orientation bias in the distributions of disparity gradients. Perhaps, this difference may be explained by the fact that the ratio of spatial frequencies across the eyes is independent of the orientation. In other words, neurons with any preferred orientation, except those tuned to the exact horizontal, can make equal contributions for signaling a given surface slant (i.e., by signaling a given spatial frequency ratio). The situation is quite different for the RF phase disparity because the key parameterthe horizontal disparity (the primary determinant of depth)is dependent on the orientation. To produce neurons tuned to a given horizontal disparity, the required RF phase difference is smaller for neurons tuned to orientations closer to horizontal (Ohzawa et al. 1996
). Therefore although there is no need for neurons having large phase difference at near horizontal orientations, neurons tuned to any orientation are equally important and useful for signaling slant information. Admittedly, this ishighly speculative, but the results presented in Fig. 15 appear quite natural based on these considerations.
Discrepancies between the monocular and the disparity-tuning properties
The discrepancies between the optimal spatial and disparity frequencies (Fig. 11F) are similar to those reported previously (Ohzawa et al. 1997
; Read and Cumming 2003
). However, we have found substantial discrepancies only for neurons with inseparable RFs but not those with separable RFs. These results suggest that the discrepancy originates from some form of nonlinearity in the pooling process that underlies a hierarchical chain of processing where outputs of units with separable RFs are used to construct neurons with inseparable RFs. This notion is strengthened by the results presented in Fig. 16 in that neurons with larger aspect ratios (thus more pooling) tended to have a greater degree of discrepancy. Unfortunately, from our study it is not possible to determine details of where exactly the presumed nonlinearity lies. For example, it is still not known whether neurons with large aspect ratios receive input from complex cells organized as a disparity energy unit (with aspect ratio = 1) or if they directly collect input from neurons with separable RFs without the intermediate units. Further studies will be needed to address these questions.
In conclusion, neurons in areas 17 and 18 appear to encode slant-in-depth of 3D surfaces by having a variety of tilts in their binocular RFs. There are sufficient variations in the RF tilt angles for representing the range of 3D surface slants that occur in the real world. However, there may be multiple mechanisms by which tilted binocular RFs are generated. RF tilts for a subset of neurons could be accounted for by the dif-frequency model. However, neither the dif-frequency model nor the hierarchical pooling model could completely explain the entire data. It is possible that these neurons in the early visual areas contribute to surface slant selectivity of neurons in higher-order visual areas.
| APPENDIX |
|---|
|
|
|---|
When two objects, A and B, are separated in 3D space as shown in Fig. A1, disparity gradient for the line connecting the two objects (
d) is defined as the difference of binocular disparities for the two objects, (dA dB), divided by their spatial separation in cyclopean space (
). Thus the disparity gradient is defined as
![]() | (A1) |
![]() | (A2) |
and
indicate the separation between the objects in the monocular retinal space for the left and right eyes, respectively.
and
are calculated from the viewing distance and the separation of eyes as
![]() | (A3) |
is the slant angle in the real-world space.
|
and
, the binocular RF size is described as a function of slant of binocular RF,
![]() | (A4) |
and
into Eqs. A1 and A2, we obtain the disparity gradient as
![]() | (A5) |
![]() | (A6) |
![]() | (A7) |
![]() | (A8) |
Disparity frequency and frontoparallel frequency of tilted binocular RF of complex cell are derived as follows. We begin with a model of complex cells based on a generalized disparity energy model (Ohzawa et al. 1990
, 1997
; Qian and Mikaelian 2000
) where the left and right spatial frequencies may be different. According to this model, a complex cell receives input from quadrature pairs of simple cells. Members of the quadrature pairs may be modeled as having left and right monocular RFs that are even (Weven) and odd (Wodd) symmetric
![]() | (A9) |
![]() | (A10) |
is the envelope width and fL, fR are spatial frequencies of the left and right RFs, respectively.
depicts phase disparity. Response of the complex cell is the sum of squared sums of the left and right RF profiles
![]() | (A11) |
To present the binocular RF data, we remove the contributions of monocular terms by taking the difference of binocular RFs (measured with contrast-matched and mismatched stimuli) as described by Ohzawa et al. (1997)
, thereby extracting the pure binocular interaction component. The last term of Eq. A11, 2(WLevenWReven + WLoddWRodd) is the binocular interaction component.
![]() | (A12) |
degrees of visual angle, frontoparallel space, which is oriented 45° from left position axis in Fig. A2B, also spans
degrees. In contrast, the disparity dimension is expanded twofold from frontoparallel space, spanning 2
degrees. This unevenness is based on the definition of binocular disparity. The binocular disparity is described as the difference of leftright positions, and the frontoparallel position is defined as the average of left and right positions
![]() | (A13) |
![]() | (A14) |
![]() | (A15) |
![]() | (A16) |
|
| GRANTS |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: I. Ohzawa, Graduate School of Frontier Biosciences and School of Engineering Science, Osaka University, 1-3 Machikaneyama, Toyonaka, Osaka 560-8531 Japan (E-mail: ohzawa{at}fbs.osaka-u.ac.jp)
| REFERENCES |
|---|
|
|
|---|
Anzai A, Ohzawa I, and Freeman RD. Neural mechanisms for processing binocular information II. Complex cells. J Neurophysiol 82: 909924, 1999b.
Barlow HB, Blakemore C, and Pettigrew JD. The neural mechanism of binocular depth discrimination. J Physiol 193: 327342, 1967.
Blakemore C. A new kind of stereoscopic vision. Vision Res 10: 11811199, 1970.[CrossRef][Web of Science][Medline]
Blakemore C, Fiorentini A, and Maffei L. A second neural mechanism of binocular depth discrimination. J Physiol 226: 725749, 1972.
Bridge H and Cumming BG. Responses of macaque V1 neurons to binocular orientation differences. J Neurosci 21: 72937302, 2001.
Bridge H, Cumming BG, and Parker AJ. Modeling V1 neuronal responses to orientation disparity. Vis Neurosci 18: 879891, 2001.[Web of Science][Medline]
Burt P and Julesz B. A disparity gradient limit for binocular fusion. Science 208: 615617, 1980.
DeAngelis GC, Ohzawa I, and Freeman RD. Depth is encoded in the visual cortex by a specialized receptive field structure. Nature 352: 156159, 1991.[CrossRef][Medline]
DeAngelis GC, Ohzawa I, and Freeman RD. Spatiotemporal organization of simple-cell receptive fields in the cats striate cortex. I. General characteristics and postnatal development. J Neurophysiol 69: 10911117, 1993a.
DeAngelis GC, Ohzawa I, and Freeman RD. Spatiotemporal organization of simple-cell receptive fields in the cats striate cortex. II. Linearity of temporal and spatial summation. J Neurophysiol 69: 11181135, 1993b.
DeAngelis GC, Ohzawa I, and Freeman RD. Neuronal mechanisms underlying stereopsis: how do simple cells in the visual cortex encode binocular disparity? Perception 24: 331, 1995.[Web of Science][Medline]
De Valois RL, Albrecht DG, and Thorell LG. Spatial frequency selectivity of cells in macaque visual cortex. Vision Res 22: 545559, 1982.[CrossRef][Web of Science][Medline]
Efron B. The Jackknife, the Bootstrap, and Other Resampling Plans. Philadelphia, PA: Society for Industrial and Applied Mathematics, 1982.
Efron B and Tibshirani R. An Introduction to the Bootstrap. New York: Chapman & Hall, 1993.
Ferster D. A comparison of binocular depth mechanisms in areas 17 and 18 of the cat visual cortex. J Physiol 311: 623655, 1981.
Fiorentini A and Maffei L. Binocular depth perception without geometrical cues. Vision Res 11: 12991305, 1971.[CrossRef][Web of Science][Medline]
Halpern DL, Wilson HR, and Blake R. Stereopsis from interocular spatial frequency differences is not robust. Vision Res 36: 22632270, 1996.[CrossRef][Web of Science][Medline]
Hammond P and Pomfrett CJ. Interocular mismatch in spatial frequency and directionality characteristics of striate cortical neurones. Exp Brain Res 85: 631640, 1991.[Web of Science][Medline]
Hinkle DA and Connor CE. Three-dimensional orientation tuning in macaque area V4. Nat Neurosci 5: 665670, 2002.[CrossRef][Web of Science][Medline]
Hubel DH and Wiesel TN. Receptive fields, binocular interaction and functional architecture in the cats visual cortex. J Physiol 160: 106154, 1962.
Hubel DH and Wiesel TN. Receptive fields and functional architecture of monkey striate cortex. J Physiol 195: 215243, 1968.
Hughes A. A useful table of reduced schematic eyes for vertebrates which includes computed longitudinal chromatic aberrations. Vision Res 19: 12731275, 1979.[CrossRef][Web of Science][Medline]
Janssen P, Vogels R, and Orban GA. Macaque inferior temporal neurons are selective for disparity-defined three-dimensional shapes. Proc Natl Acad Sci USA 96: 82178222, 1999.
Janssen P, Vogels R, and Orban GA. Three-dimensional shape coding in inferior temporal cortex. Neuron 27: 385397, 2000.[CrossRef][Web of Science][Medline]
Jones JP and Palmer LA. The two-dimensional spatial structure of simple receptive fields in cat striate cortex. J Neurophysiol 58: 11871211, 1987.
Jones JP, Stepnoski A, and Palmer LA. The two-dimensional spectral structure of simple receptive fields in cat striate cortex. J Neurophysiol 58: 12121232, 1987.
LeVay S and Voigt T. Ocular dominance and disparity coding in cat visual cortex. Vis Neurosci 1: 395414, 1988.[Web of Science][Medline]
Li B, Peterson MR, and Freeman RD. Oblique effect: a neural basis in the visual cortex. J Neurophysiol 90: 204217, 2003.
Liu Y, Vogels R, and Orban GA. Convergence of depth from texture and depth from disparity in macaque inferior temporal cortex. J Neurosci 24: 37953800, 2004.
Mata ML and Ringach DL. Spatial overlap of ON and OFF subregions and its relation to response modulation ratio in macaque primary visual cortex. J Neurophysiol 93: 919928, 2005.
Mechler F and Ringach DL. On the classification of simple and complex cells. Vision Res 42: 10171033, 2002.[CrossRef][Web of Science][Medline]
Nelson JI, Kato H, and Bishop PO. Discrimination of orientation and position disparities by binocularly activated neurons in cat striate cortex. J Neurophysiol 40: 260283, 1977.
Nguyenkim JD and DeAngelis GC. Disparity-based coding of three-dimensional surface orientation by macaque middle temporal neurons. J Neurosci 23: 71177128, 2003.
Nikara T, Bishop PO, and Pettigrew JD. Analysis of retinal correspondence by studying receptive fields of binocular single units in cat striate cortex. Exp Brain Res 6: 353372, 1968.[Web of Science][Medline]
Nishimoto S, Arai M, and Ohzawa I. Accuracy of subspace mapping of spatiotemporal frequency domain visual receptive fields. J Neurophysiol 93: 35243536, 2005.
Ohzawa I, DeAngelis GC, and Freeman RD. Stereoscopic depth discrimination in the visual cortex: neurons ideally suited as disparity detectors. Science 249: 10371041, 1990.
Ohzawa I, DeAngelis GC, and Freeman RD. Encoding of binocular disparity by simple cells in the cats visual cortex. J Neurophysiol 75: 17791805, 1996.
Ohzawa I, DeAngelis GC, and Freeman RD. Encoding of binocular disparity by complex cells in the cats visual cortex. J Neurophysiol 77: 28792909, 1997.
Ohzawa I and Freeman RD. The binocular organization of complex cells in the cats visual cortex. J Neurophysiol 56: 243259, 1986a.
Ohzawa I and Freeman RD. The binocular organization of simple cells in the cats visual cortex. J Neurophysiol 56: 221242, 1986b.
Prazdny K. On the disparity gradient limit for binocular fusion. Percept Psychophys 37: 8183, 1985.[Web of Science][Medline]
Qian N and Mikaelian S. Relationship between phase and energy methods for disparity computation. Neural Comput 12: 279292, 2000.[CrossRef][Web of Science][Medline]
Read JC and Cumming BG. Testing quantitative models of binocular disparity selectivity in primary visual cortex. J Neurophysiol 90: 27952817, 2003.
Ringach DL, Sapiro G, and Shapley R. A subspace reverse-correlation technique for the study of visual neurons. Vision Res 37: 24552464, 1997.[CrossRef][Web of Science][Medline]
Skottun BC, De Valois RL, Grosof DH, Movshon JA, Albrecht DG, and Bonds AB. Classifying simple and complex cells on the basis of response modulation. Vision Res 31: 10791086, 1991.[CrossRef][Web of Science][Medline]
Taira M, Tsutsui KI, Jiang M, Yara K, and Sakata H. Parietal neurons represent surface orientation from the gradient of binocular disparity. J Neurophysiol 83: 31403146, 2000.
Tanabe S, Doi T, Umeda K, and Fujita I. Disparity-tuning characteristics of neuronal responses to dynamic random-dot stereograms in macaque visual area V4. J Neurophysiol 94: 26832699, 2005.
Trivedi HP and Lloyd SA. The role of disparity gradient in stereo vision. Perception 14: 685690, 1985.[Web of Science][Medline]
Tyler CW and Sutter EE. Depth from spatial frequency difference: an old kind of stereopsis? Vision Res 19: 859865, 1979.[CrossRef][Web of Science][Medline]
Wilson HR. The significance of frequency gradients in binocular grating perception. Vision Res 16: 983989, 1976.[CrossRef][Web of Science][Medline]
This article has been cited by other articles:
![]() |
R. Kimura and I. Ohzawa Time Course of Cross-Orientation Suppression in the Early Visual Cortex J Neurophysiol, March 1, 2009; 101(3): 1463 - 1479. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. S. Sasaki and I. Ohzawa Internal Spatial Organization of Receptive Fields of Complex Cells in the Early Visual Cortex J Neurophysiol, September 1, 2007; 98(3): 1194 - 1212. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |