|
|
||||||||
Group in Vision Science, School of Optometry, and Helen Wills Neuroscience Institute, University of California, Berkeley, California 94720-2020
Submitted 13 June 2003; accepted in final form 28 November 2003
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
None of the early neurophysiological studies was concerned with neural mechanisms. It was tacitly assumed that RFs with lateral displacements were disparity detectors. Furthermore, left and right RFs were thought to be identical (Hubel and Wiesel 1962
) so that lateral displacement made sense as an encoding mechanism. More recent work has shown that differences in internal RF structure or phase between left and right eyes constitute an important encoding mechanism. This has been demonstrated in anesthetized paralyzed cats (Anzai et al. 1997
; DeAngelis et al. 1992
, 1995
; Freeman and Ohzawa 1990
, 1992
; Ohzawa et al. 1996
, 1997
) and in awake behaving monkeys (Cumming and Parker 1999
, 2000
). In addition, a disparity-encoding scheme, based on the energy model (Adelson and Bergen 1985
; Watson and Ahumada 1985
), has been developed under the assumption of serial processing from simple-to-complex cells (Ohzawa et al. 1990
, 1997
).
Although these studies of mechanisms are illuminating, a number of questions remain. One important area concerns the combination and transmission of stereoscopic information. There are two variables relevant to spatial scale; disparity frequency and disparity range. Disparity frequency is important because visual scenes contain a wide variety of detail, and neurons are selectively responsive to specific spatial bands. Disparity information must therefore be combined across spatial scale. Disparity range is relevant because of a trade-off with frequency. Coarse stereopsis can occur over a wide range of depths. Fine stereopsis provides the required resolution but is limited to a small range. A central question is: how does stereoscopic processing occur so that different spatial scales are combined to provide coarse and fine levels? In the preceding paper, we addressed this question using two types of neurophysiological analysis. First, we found that disparity tuning generally sharpens during the time course of the response. Second, we determined that neurons tuned to large spatial scale have relatively short temporal latencies (Menz and Freeman 2004
). Both these findings are consistent with a coarse-to-fine neural processing sequence. In the study reported here, we have examined this topic by carrying out an analysis of simultaneous extracellular recordings from pairs of neurons in striate cortex. Both sparse and dense noise visual stimulation were used to obtain disparity-tuning curves for simple and complex cells. (Anzai et al. 1999a
, b
, c
; Ohzawa et al. 1996
, 1997
) Cross-correlations between pairs of neurons were analyzed to determine functional intercellular relationships for disparity-tuned neurons.
We have determined several types of relationships between disparity-tuned cortical cells. The most important concerns the connections between disparity frequency and range. The most common connection is one in which the presynaptic cell of the pair has a relatively high disparity frequency. However, cross-correlograms are generally stronger for pairs in which the presynaptic cell is lower in disparity frequency. Considered together, this combination of findings and the results reported in the preceding paper (Menz and Freeman 2004
) are expected for a system in which there is a weighted averaging across spatial scale, and coarse processing occurs prior to fine. This type of coarse-to-fine process has been proposed in theoretical work as a solution to the correspondence problem (Marr and Poggio 1979
). A similar mechanism may apply to overall stereoscopic processing and to other perceptual events.
| METHODS |
|---|
|
|
|---|
Surgical procedures and animal maintenance
The surgical and animal maintenance procedures are described in detail in the preceding companion paper (Menz and Freeman 2004
). Experiments lasted for 34 days. At the end of the experiment, animals are killed with pentobarbital sodium (Nembutal,
50 mg/kg). The animal is immediately perfused with a formaldehyde solution. Electrode tracks are reconstructed and cortical lamina identified. Recordings were made at all depths and all cells were recorded from area 17.
Experimental apparatus
Visual stimuli are generated by a computer with two high-resolution graphics boards. Images are displayed on a pair of video monitors, one for each eye, that the cat views by means of beam splitters. The mean luminance of the displays viewed through the beam splitters is 23 cd/m2. In one recording configuration, two tungsten-in-glass or varnished tungsten microelectrodes are advanced together into visual cortex via a piezoelectric micropositioner. The lateral tip spacing is
250 µm for the varnished tungsten and
350 µm for the tungstenin-glass electrodes. Vertical position tip differences are
200 µm. An alternative electrode positioner with a hydraulic system allowed four microelectrodes to be moved independently; this facilitated locating neurons on separate electrodes at the same time. Relative depth differences between these electrodes did not exceed 500 µm. In some experiments, the four electrodes were arranged in a lateral-medial line to facilitate recording from different layers within a single column. In other configurations, the four electrodes were arranged as close as possible to each other in a square-shaped configuration. Action potentials are discriminated by custom created software (Ohzawa et al. 1996
) and time stamped with 40-µs resolution.
Recording procedures
The recording procedures are identical to those of the preceding companion paper (Menz and Freeman 2004
).
RF mapping
For most pairs, dichoptic one-dimensional binary m-sequence noise is used to map RFs (Anzai et al. 1999a
,b
). For some pairs, a sparse noise stimulus is used, and sometimes both methods are employed. Sixteen adjacent bars are presented for each eye at the mean optimal orientation of multiple neurons for a given eye as illustrated in Fig. 1A. The width of the bars is approximately one-fourth the period of the (mean) optimal frequency. The length of the bar is equal to 16 times the width. This square pattern is centered over the (mean) RF center. Computationally, each spike train is cross-correlated with the stimulus sequence by means of the fast m-transform (Sutter 1991
) to obtain space-time RF maps for dense noise. Conceptually, a spike train is cross-correlated at a specific time delay with the stimulus sequence at each spatial position to obtain a RF map at that time delay as depicted in Fig. 1B. This is repeated for all time delays of interest (0200 ms) in increments of 5 ms to obtain a space-time RF. The binocular view field is a representation of the stimuli in the axial plane (i.e., the plane defined by the visual axes) assuming a nonhorizontal stimulus orientation (i.e., there can be no horizontal disparity, that encodes stereoscopic depth, if the orientation is horizontal). In Fig. 1A, when the luminance polarity of the bars is the same, that position is marked in white. Opposite polarity bar combinations are shown in black. A nonlinear binocular interaction map is obtained by cross-correlating the spike train with each spatial location in the binocular view field pattern. In the maps shown in Fig. 1, C and D, the white and dark regions indicate excitatory responses to same polarity and opposite polarity bars, respectively.
|
Data analysis
The two-dimensional binocular interaction maps are reduced to one-dimensional disparity-tuning data by integrating along lines of equal disparity (Ohzawa et al. 1997
) as illustrated in Fig. 1, C and D. The optimal time delay is defined as the correlation delay that produces the largest root mean square signal strength in the one-dimensional tuning data. For this study, only the disparity tuning curves at the optimal delay are considered. The noise of the RF measurement is estimated from the average root mean square of noncausal time slices.
The similarity index (i.e., Pearson product-moment correlation) is a model-free measure of the similarity of a pair of disparity tuning curves
![]() | (1) |
,
are the mean values. The disparity tuning curves are fit with a Gabor function by the Levenberg-Marquardt algorithm (Press et al. 1992
![]() | (2) |
d is the disparity range or size parameter along the disparity axis, fd is the disparity frequency,
d is the disparity phase. Differences in some parameters (e.g., frequency and size, but not phase) between pairs of cells are normalized by the average value. The quality of the fit is evaluated with the rms of the error. The ratio of signal strength to model error must exceed 1.6 for both cells of the pair to be included in this study. This is an arbitrary value used to indicate reliable signals and determined from observations of all the data. If there are multiple repetitions, the results from the run that produces the strongest signal to noise ratio is used.
Raw neural cross-correlograms are computed between disparity-tuned pairs of cells for all stimulus runs that have at least two repetitions and 1,000 spikes per cell; this includes orientation and spatial frequency tuning runs with sinusoidal gratings and dense and sparse noise stimuli. All raw neural correlograms are shuffle-subtracted to eliminate stimulus-based correlations (Perkel et al. 1967b
). The following steps are used. First, a cross-correlogram is computed using all repetitions, so this includes both stimulus and neural based correlations. Second, the responses from the two cell types are cross-correlated across different repetitions, so this includes only stimulus based correlations and it excludes those that are neurally based. Third, these "shuffled" correlations are subtracted from the first step, yielding neurally based correlations only.
The neural cross-correlogram is binned in increments of 0.5 ms and computed from ±100 ms (the figures only display ±50 ms). Amplitude is normalized by
, and multiplied by 100 to give percentage, where N1 is the number of spikes from cell 1, and N2 is the number of spikes from cell 2 (Brosch and Schreiner 1999
). We compute a correlogram asymmetry index, which is a measure of how much the correlogram peak area is shifted from zero (Alonso and Martinez 1998
): AI = (R L)/(R + L), where R and L are the area of the bins to the right and left of zero, respectively. The cross-correlogram is smoothed with a five-bin Gaussian function. If the pair of cells is recorded from the same electrode, there is a small region around zero correlation (
±2.5 ms) where the two spikes cannot be discriminated. This region is identified and excluded from analysis (exclusion zone). We compute the mean and SD of the shuffled cross-correlogram in the region between ±50 and 100 ms. If the amplitude of a bin exceeds the mean by 3 SDs, then it is considered statistically significant. Cross-correlograms with no significant bins in the range of ±20 ms are excluded from this study.
Multiple significant bins occur adjacent to each other to create an excitatory "peak" in the cross-correlogram. The latency of this peak is at the bin of maximum amplitude. The width of the peak is defined as the width at half-maximum amplitude. The strength of cross-correlation is the area of the peak between half-heights of maximum amplitude (i.e., width). For correlograms measured from the same electrode, sometimes the peak extends across the small exclusion zone
0 ms, in which case, the estimates of width and strength continue across the exclusion zone. Because of the presence of the exclusion zone, the magnitude of strength and width is underestimated when recordings are from the same electrode. There is no interpolation across the exclusion zone. For any pair of cells, there are multiple cross-correlograms based on the different stimulus runs. The run that produces the greatest cross-correlation strength defines the strength, width, and latency for that pair.
Quantitative criteria are used to distinguish between the different connection types. Monosynaptic connections are defined as having a peak latency
3.5 ms, width
5 ms, and asymmetry index
0.2. Polysynaptic connections are defined as nonmonosynaptic connections having a peak latency
10 ms and asymmetry index
0.2. Common input connections are defined as having a peak latency
5 ms and asymmetry index <0.2 (Menz and Freeman 2003
). These quantitative criteria are consistent with known properties of monosynaptic, polysynaptic, and common input connections (Moore et al. 1970
). Note that these are mutually exclusive categories, and a cross-correlogram that does not fall into any of these is excluded from this study. After classification, mono- and polysynaptic cross-correlograms are renormalized by the number of presynaptic spikes (i.e., "effectiveness") (Aertsen and Gerstein 1985
) and multiplied by 100 to give percentage.
For the population data, robust regression (Li 1985
) is used frequently to quantify correlation between parameters. This technique gives lower weight to data with large residual error and calculates the regression iteratively. It is a quantitative method for reducing the influence of outliers.
| RESULTS |
|---|
|
|
|---|
Figure 2 shows the distribution of neural cross-correlograms according to their latency, width, and asymmetry index. Cross-correlograms that are consistent with monosynaptic connections have very small latencies and widths but a large value of the asymmetry index (indicating that 1 neuron consistently fires before the other). Connections that could be polysynaptic are also highly asymmetric, but their widths and latencies are generally larger. Pairs of cells that receive some common input are likely to have relatively symmetrical peaks with small latencies but widths may vary.
|
|
The motion energy model (Adelson and Bergen 1985
; Watson and Ahumada 1985
) and binocular energy model (Ohzawa et al. 1990
) describe specific characteristics of subunits of complex cells. These subunits have properties that match the RF properties of simple cells (Anzai et al. 1999b
,c
). If simple cells are the subunits, then one might expect monosynaptic connections from simple-to-complex cells when the simple cell has the same disparity tuning curve as the complex cell and their binocular interaction maps overlap. Previous work has established that there are direct connections from simple-to-complex cells, but it is not known whether these fulfill the requirements of the energy model (Alonso and Martinez 1998
). Our expectation is that pairs of cells that closely match the energy model requirements will exhibit stronger neural cross-correlograms.
What are the requirements of the binocular energy model? The subunits derive their disparity tuning from either a position or a phase shift difference between monocular RFs, or a combination of both mechanisms(Anzai et al. 1999b
,c
). The monocular RFs are summed linearly, and the summation is followed by a half-squaring nonlinearity process. The subunits differ only in their monocular phase. A minimum of four subunits is required, and they must be in quadrature.
A good example of a simple-to-complex cell monosynaptic connection that closely matches the requirements of the binocular energy model is shown in Fig. 4. The binocular interaction RF maps (Fig. 4, A and B) have the typical structure of simple and complex cells, and they are centered and overlap extensively. The one-dimensional disparity tuning data are fit with Gabor functions (Fig. 4C). The similarity of the disparity tuning is quantified by the similarity index (SI = 0.73, where 1.0 is identical, 0 dissimilar, 1.0 opposite, see METHODS). In terms of the Gabor fit, the presynaptic simple cell is 0.08 cycle/° higher in disparity frequency, exhibits a 0.4° position shift and differs in phase by 25°. The negative subregions, which are excitatory to opposite contrast, are highly overlapping. The positive subregions, which are excitatory to same contrast, are shifted a little from the combination of position, phase, and spatial frequency differences. This cell pair is not a perfect fit for the energy model, yet the neural cross-correlogram has a strong peak [strength = 34.9 (see METHODS)] and is classified as monosynaptic (Fig. 4D). These cells were recorded from adjacent electrodes in layer 4 with an estimated lateral separation of 250 µm.
|
|
|
We next examine the relationship between strength of neural cross-correlation and similarity of the pairs in the population. The obvious question is do monosynaptic pairs with similar disparity tuning have stronger neural cross-correlation? Based on previous cross-correlogram studies, pairs of cells with similar RFs tend to have stronger connections (Alonso and Martinez 1998
; Brosch and Schreiner 1999
; DeAngelis et al. 1999
; Reid and Alonso 1995
). This question is addressed in Fig. 7 where the most similar pairs are to the left side of the x axis. There are different ways of characterizing similarity of tuning. In Fig. 7A, we use our similarity index. There is a group of cells with SI >0.6 (similar,
), and another group with SI <0.05 (dissimilar,
). The large gap between the groups is a natural zone of discrimination because the distribution is bimodal. The similar group has stronger neural cross-correlation values compared with the dissimilar group (P = 0.039, Wilcoxon rank-sum test). It is important to note here that there are connections between cells with dissimilar tuning, some of them with nearly opposite tuning profiles, but the strength of these connections tends to be very weak. We should also point out that we first made a general comparison for the entire population of cells without reference to categories of monosynaptic, polysynaptic and common input. There were generally no correlations between parameters of interest. We then used the classification system on a subpopulation to reveal interesting correlations between parameters.
|
). A third measure of similarity is based on spatial frequency difference. The difference in frequency is normalized by the average frequency. The data are split into similar and dissimilar groups at the median value of 22.5%. The similar group has a stronger neural cross-correlogram compared with the dissimilar group with larger frequency differences (P = 0.025, Wilcoxon rank-sum test). The tendency for similar disparity-tuned pairs to have stronger neural cross-correlograms only holds for monosynaptic pairs. This relationship is not maintained for polysynaptic and common input connections as determined by the same methods used for the data in Fig. 7. Because binocular disparity tuning is determined by the difference in right and left RFs, it is possible for a pair of cells to have very different disparity-tuning properties but have similar monocular tuning. It is also possible for a pair of cells to have identical disparity tuning and different monocular RFs in the two eyes. If connectivity between neurons is based on dominant-eye monocular RF similarity, that does not necessarily translate into disparity-tuning similarity. It is possible that cells with dissimilar disparity-tuning properties and strong neural cross-correlations are connected together because of dominant-eye monocular RF similarity.
Some functional implications
We must consider the possibility that some of the binocular disparity-tuned cell pairs we have studied may not play a role in stereopsis. They may be signaling some other aspect of the stimulus, such as motion. Stereopsis requires horizontal disparity, which results when cell-orientation preferences are near vertical. Vertical disparity in natural scenes arises from the difference in image size created when objects are closer to one eye. Vertical disparity is useful to the visual system (Backus et al. 1999
; Berends and Erkelens 2001
), but it is not necessary for the perception of depth through stereopsis. Cell pairs tuned to nearly horizontal orientations (vertical disparities) do not have a direct perceptual role in stereopsis.
The orientation preference distribution of our cell-pair sample is shown in Fig. 8. For mono- and polysynaptic pairs, there is a clear bias in favor of vertical orientations (Fig. 8A). More cell pairs are tuned to nearly vertical compared with nearly horizontal orientations. The distribution for common input connections appears to be different (Fig. 8B). Horizontal and vertical orientations are equally well represented, but there is a deficit of oblique orientations. This could be related to the oblique effect, i.e., human subjects exhibit reduced spatial resolution for visual detail at oblique orientations (Appelle 1972
). There are analogous effects in the visual cortex (Li et al. 2003
). For mono- and polysynaptic connections, the bias for vertical orientations suggests that these cells could be involved in stereopsis. For common input cell pairs, about half are not likely to be processing stereopsis because they are tuned to nearly horizontal orientations.
|
We now consider possible differences between cell pairs recorded from the same electrode versus those for which neurons are recorded from different electrodes. Same electrode pairs are presumed to be physically closer together and might exhibit different RF relationships compared with those from separate electrodes. Pairs of cells involving at least one simple cell have slightly different distributions of similarity index values for same versus different electrode pair samples (Fig. 9). Same electrode pairs (
) can have similar or near quadrature disparity tuning, but it is rare for them to have opposite tuning, i.e., large negative values. A quadrature relation is one in which all parameters are equal except phase, which is 90° different. Near quadrature tuning has a similarity index near zero. In terms of similarity index values, the simple and complex cells shown in Fig. 5 and 6, respectively, may be categorized as similar or quadrature types. The pairs recorded from different electrodes (Fig. 9,
) tend to lack quadrature (only 1 pair with a near 0 SI), are more often nearly opposite and also more frequently have similar disparity tuning. There are reports that pairs of adjacent simple cells have monocular RFs in quadrature (Liu et al. 1992
; Pollen and Ronner 1981
), but other work establishes that many phase relationships exist between cell pairs. (DeAngelis et al. 1999
). A recent report suggests a bias for odd and even symmetric spatial phase in visual cortex (Ringach 2002
).
|
The most common type of cell pair in our sample is between complex cells (n = 27). There is a sufficient number of these pairs to make some generalizations about connections between disparity-tuned complex cells. A representative sampling of complex-to-complex mono- and polysynaptic pairs of disparity-tuned cells is shown in Fig. 10. An example of an oppositely tuned monosynaptic pair is shown in Fig. 10A. This pair has a similarity index of 0.72 due to the obvious phase difference (164°). There is also a difference in spatial frequency of 0.15 cycle/° in which the presynaptic cell (spike 1) has the higher value. A polysynaptic example is shown in Fig. 10B. This pair has a phase difference of 91°, but no quadrature because of other differences in their parameters. The presynaptic cell has the higher spatial frequency (by 0.11 cycle/°). There is also a position difference; the presynaptic cell is shifted to the right by 0.3°. The combination of this position shift and higher spatial frequency content moves the location of the optimal disparity closer to that of the postsynaptic cell (i.e., the positive peaks nearly coincide, in spite of the 90° phase difference). We would expect a similarity index of 0.0 for a 90° phase shift, but because of the shift in the optimal disparity due to spatial frequency and position differences, the similarity index is 0.58. In Fig. 11C, the presynaptic cell (spike 1) has a higher spatial frequency (by 0.08 cycle/°) and a fairly large position shift of 0.8°, which results in alignment of the negative peaks. A negative peak is excitatory to opposite contrast visual stimulation, and we assume that same contrast stimulation causes suppression. As a result of this alignment of negative peaks, the similarity index is 0.66, which is indicative of a fairly similar pair. In all of the examples presented so far, the presynaptic cell has a higher disparity frequency. In Fig. 10D, we present an example in which the presynaptic cell (spike 2) has a slightly lower disparity frequency by (0.04 cycle/°) and a much larger size parameter (0.8°). The cross-correlograms in Fig. 10 are all plotted on the same scale to facilitate amplitude of the correlation strength comparisons. The example in Fig. 10D has a much stronger correlogram peak compared with the other examples shown. We observed a wide variety of differences in the fitted Gabor parameters: phase, position, disparity frequency, and size. Nonetheless, pairs tend to be either similar or opposite in tuning based on similarity index. They are different in ways that tend to bring the peaks or troughs into alignment.
|
|
0) is striking. The strength of the neural cross-correlograms as a function of similarity index is shown in Fig. 11B. Our expectation that the opposite pairs will have weak correlograms does not hold. There is no difference in cross-correlogram strength between opposite and similar pairs (P = 0.92 Wilcoxon rank-sum). The expectation that cells with similar RFs tend to have stronger neural cross-correlograms is not necessarily borne out. Fine-to-coarse and coarse-to-fine tuning
In the examples shown in the preceding text, there is a tendency for the presynaptic cell to have a higher disparity frequency compared with the postsynaptic cell. To explore this more completely, a population summary for all mono- and polysynaptic connections is given in Fig. 12A. This is a histogram of the normalized disparity frequency difference in which HF and LF refer to high and low frequency, respectively. As the histogram shows, most of the data fall to the left of zero, indicating that the presynaptic cell has the higher disparity frequency (P = 0.006, sign test). The mean frequency difference is 12.3%, but there are a few pairs of cells with large frequency differences. We expect disparity range (size) to be correlated to disparity frequency; higher frequency cells should have smaller RFs. This implies that the disparity range of the presynaptic cell should be smaller compared with that of the postsynaptic neuron. The anticipated relationship is confirmed in Fig. 12B, although there is not as strong an effect as that for frequency (P = 0.028, sign test). This is consistent with the observation that the correlation between spatial frequency and size is affected by the fact that higher frequency cells tend to have more subregions (DeAngelis et al. 1995
).
|
10 for nearly all coarse-to-fine pairs, and the values are much lower for smaller disparity range presynaptic cells (to the right of 0, fine-to-coarse). Figure 12D shows a disparity frequencysize correlation, which applies to complex-to-complex cell connections. When the presynaptic cell is higher in disparity frequency, it also tends to be smaller in disparity range (size). There is a spatial frequency-size correlation for complexto-complex cell combinations. In summary, there are more pairs in which the presynaptic cell has higher spatial frequency (a fine-to-coarse system). Connection strength is stronger when the presynaptic cell is larger in size (a coarse-to-fine system). Considered together, these findings suggest that there are both coarse-to-fine and fine-to-coarse processes at work. A postsynaptic cell that pools across neurons of different spatial frequency content receives a weighted average of its inputs. This configuration contains pairs of both coarse-to-fine and fine-to-coarse cell combinations. The system may function as follows. Pooling over neurons that share a similar peak or trough in their disparity tuning curves (see Fig. 12, BD) but vary across disparity scale (i.e., different disparity frequency and disparity range) will disambiguate the true disparity and reduce false matches (Fleet et al. 1996| DISCUSSION |
|---|
|
|
|---|
Vertical orientation encodes horizontal disparity that is necessary for stereopsis. We find that for mono- and polysynaptic connections there is a bias for vertical orientations, implying that these pairs are playing a role in stereopsis. For common input connections, there is an oblique effect, a deficit of pairs tuned to oblique orientations, and an equal quantity of horizontal and vertical orientations. Even though all pairs are tuned to binocular disparity, the common input pairs may be playing a different functional role.
Cell-pair tuning relationships
Most previous studies of cortical cell cross-correlations support the idea that neurons with similar RF locations tend to have stronger connections (Alonso and Martinez 1998
; Arnett and Spraker 1981
; Brosch and Schreiner 1999
; DeAngelis et al. 1999
; Ghose et al. 1994b
; Hata et al. 1991
; Kruger and Aiple 1988
; Nelson et al. 1992
; Reid and Alonso 1995
; Schwarz and Bolz 1991
; Tanaka 1983
). One report suggests that the strength of cross-correlations depends only on lateral cortical distance and is independent of orientation (Das and Gilbert 1999
). This result is at odds with all other findings that indicate stronger connections between similar orientation pairs. Our monosynaptic results are consistent with the general trend of stronger connections between similar pairs. We also find a correlation between disparity-tuning similarity and cross-correlation strength, as described in the preceding text.
Some previous reports indicate that adjacent simple cells are either in quadrature (90 degree phase difference) or anti-phase (180 degree phase difference) (Liu et al. 1992
; Pollen and Ronner 1981
). Their quadrature pairs (13/16, 83105° difference as shown previously) (Liu et al. 1992
) generally had no structure in their neural cross-correlogram, whereas the anti-phase pairs (3/16, 160194° difference) exhibited mutual inhibition. No excitatory connections were found, and one common input connection was reported between adjacent simple cells. Other results of recordings from pairs of adjacent simple cells have shown mono- and polysynaptic excitation and common input connections (DeAngelis et al. 1999
; Ghose et al. 1994a
). The phase differences between pairs that were otherwise similar consisted of all values including 90 and 180° (9/12 are between 120 and 180°, 2 pairs are <30°, and only 1 pair is between 90 and 120°) (DeAngelis et al. 1999
). This distribution is clearly not consistent with the quadrature/anti-phase distribution described previously (Liu et al. 1992
). However, the distribution of spatial phase in visual cortex appears to be biased toward even and odd symmetry, which would favor quadrature pairing for connected cells (Ringach 2002
). In our current study, the sample size for simple-to-simple connections from the same electrode is very small (n = 6). Nonetheless, we still find all types of neural cross-correlograms: monosynaptic, polysynaptic, and common input.
Disparity quadrature is very different from monocular RF quadrature. Disparity tuning is determined by the relationship between left and right eye RFs. If responses of a pair of cells show quadrature for both left and right eye RFs, their disparity tuning is the same (binocular energy model subunits). Similar disparity tuning does not imply similar monocular RF tuning. It means that there is a similar relationship between left and right eye RFs. If a pair of cells have one eye's RFs in quadrature but the other eye's RFs have the same (or opposite) tuning, that will produce quadrature in disparity tuning. If there were only quadrature and anti-phase monocular relationships, that limits the possible disparity relationships to similar tuning and quadrature tuning. If we expand our sample to include connections between simple and complex, in addition to simple with simple, then we have a reasonable sample size (n = 18 for same electrode, n = 14 for different electrode, Fig. 9). We find that for same electrode pairs the distribution includes both similar tuning and near quadrature tuning. There are other ways of generating this kind of distribution besides quadrature/anti-phase monocular RFs, but the data are consistent with that hypothesis. The distribution for pairs from different electrodes appears to lack quadrature. This also supports the hypothesis that there is a special quadrature relationship between nearby cells. Quadrature disparity tuning relationships among physically nearby simple cells may be more common than monocular RF quadrature. The three binocular simple cell examples shown previously (DeAngelis et al. 1999
) are in near quadrature (i.e., similarity indices close to zero) for their disparity tuning but not in their monocular RFs.
Our pairs of complex cells tend to have either similar or opposite tuning with an obvious lack of quadrature. The main peaks and troughs in the disparity tuning curves tend to be aligned, which creates the biphasic distribution. It is not the case that the phase difference is always 0 or 180° with the other parameters being equal. The distribution includes all three types of neural cross-correlations.
Monocular quadrature and similarity of disparity tuning is the basis for the binocular energy model, but what function is served by disparity quadrature and opposite disparity tuning? If a cell pools over disparity-tuned subunits in quadrature followed by a half-squaring nonlinearity, then the postsynaptic cell will respond to all disparities just as a complex cell responds to all monocular phases. We find some complex cells that are clearly binocular but are not disparity selective. They may be the same population of complex cells that does not exhibit phase-specific binocular interaction to dichoptically presented gratings (Ohzawa and Freeman 1986
). This disparity-energy model consists of subunits in which some pairs have opposite disparity tuning, whereas other pairs have quadrature disparity tuning. This potentially explains the presence of quadrature and opposite pairs but not why the subunits are connected to each other in an excitatory manner. It could be that this connectivity is based on monocular relationships. The energy model may represent a cortical building block replicated at higher levels of signal processing.
Spatial pooling
To solve the global stereo correspondence problem it is desirable to combine information across spatial scale (Marr and Poggio 1979
). Low-frequency cells can respond to a large range of disparities but with poor resolution. High-frequency neurons have good resolution but limited range. The original idea was that the solution at coarse scales would shift (with or without eye movement) the range of high-frequency neurons. This particular implementation of coarse-to-fine called "shifting" is not likely to occur (Rohaly and Wilson 1993
; Smallman and MacLeod 1997
). However, the basic and appealing notion, developed in the preceding paper (Menz and Freeman 2004
) is that a coarse-to-fine mechanism operates such that a solution at a coarse spatial scale occurs more quickly and constrains one at higher frequencies. The current study of characteristics of cell pairs complements that of dynamics (Menz and Freeman 2004
) by showing that coarse-to-fine connections are relatively stronger than fine-to-coarse types.
Our study shows that the presynaptic cell more frequently has a higher disparity frequency and smaller disparity range (fine-to-coarse) compared with the postsynaptic cell. However, when the presynaptic cell has a larger disparity range (coarse-to-fine), the connection strength tends to be much stronger compared with the more common situation with a presynaptic cell that has a smaller range. Thus the fine-to-coarse connection is more common, but the coarse-to-fine connection is stronger. We propose that pooling across spatial scale can explain both types of connection. The postsynaptic cell has disparity tuning that is a weighted sum of its inputs. More weight is given to the less common coarse-to-fine connection; less weight is given to the more common fine-to-coarse connection. Both coarse-to-fine and fine-to-coarse mechanisms coexist with disparity averaging. The psychophysics literature contains studies supporting coarse-to-fine (Glennerster 1996
; Rohaly and Wilson 1993
; Watt 1987
; Wilson et al. 1991
) and fine-to-coarse processing (Smallman 1995
) and disparity averaging (Rohaly and Wilson 1994
). Certain tasks may emphasize particular pathways, but all three mechanisms are compatible. Pooling across spatial scale is desirable to disambiguate disparity signaled by one cell (Fleet et al. 1996
). If there is a temporal bias for low frequencies having shorter latencies, then pooling across scale also produces a temporal coarse-to-fine mechanism as described in the preceding paper (Menz and Freeman 2004
).
| ACKNOWLEDGMENTS |
|---|
|
|
|---|
GRANTS
This work was supported by research and CORE grants (EY-01175 and EY-03716) from the National Eye Institute.
| FOOTNOTES |
|---|
Address for reprint requests and other requests: R. D. Freeman, 360 Minor Hall, University of California, Berkeley, CA 94720-2020 (E-mail: freeman{at}neurovision.berkeley.edu).
| REFERENCES |
|---|
|
|
|---|
Aertsen AM and Gerstein GL. Evaluation of neuronal connectivity: sensitivity of cross-correlation. Brain Res 340: 341354, 1985.[CrossRef][ISI][Medline]
Alonso JM and Martinez LM. Functional connectivity between simple cells and complex cells in cat striate cortex. Nat Neurosci 1: 395403, 1998.[CrossRef][ISI][Medline]
Anzai A, Ohzawa I, and Freeman RD. Neural mechanisms underlying binocular fusion and stereopsis: position versus phase. Proc Natl Acad Sci USA 94: 54385443, 1997.
Anzai A, Ohzawa I, and Freeman RD. Neural mechanisms for encoding binocular disparity: receptive field position versus phase. J Neurophysiol 82: 874890, 1999a.
Anzai A, Ohzawa I, and Freeman RD. Neural mechanisms for processing binocular information. I. Simple cells. J Neurophysiol 82: 891908, 1999b.
Anzai A, Ohzawa I, and Freeman RD. Neural mechanisms for processing binocular information. II. Complex cells. J Neurophysiol 82: 909924, 1999c.