|
|
||||||||
INVITED REVIEW
School of Optometry, University of California, Berkeley, California
Submitted 12 October 2004; accepted in final form 18 November 2004
ABSTRACT
Three recent studies offer new insights into the way visual cortex handles binocular disparity signals. Two of these studies recorded from single neurons in two different visual areas of the monkey brain, one (V5/MT) in dorsal and one (V4) in ventral cortex. While V5/MT neurons respond similarly to neurons in primary visual cortex (V1), V4 neurons appear to reflect a more advanced stage in the analysis of retinal disparity, closer to the perceptual experience of stereoscopic depth. Both studies are consistent with a third study using fMRI to address similar questions in humans. Together with previous evidence, these results suggest a new framework for understanding stereoscopic processing based on the separation between ventral and dorsal streams in visual cortex.
Humans, like many other animals, possess two eyes that are frontally placed and can thus simultaneously inspect the same region of visual space. Objects within this overlapping region project slightly disparate images to the two retinae. Our brain uses these retinal disparities to infer the distance of objects, to guide our fixational eye movements, and to provide us with the perception of stereoscopic depth. This important phenomenon was not discovered until the beginning of the 19th century (Wheatstone 1838
), and it was only in the 1960s that neurons in cat V1 were shown to respond selectively to retinal disparity (Barlow et al. 1967
; Nikara et al. 1968
). It was hypothesized that these neurons carry the signals used by the brain to support stereoscopic vision. These results were subsequently confirmed in monkeys (Poggio 1995
) and other animals (Pettigrew 1980
).
Although many neurons in V1 respond selectively to retinal disparity, and retinal disparity certainly plays a central role in stereoscopic perception, it turns out that the selectivity of these neurons differs markedly from the selectivity experienced by humans in stereoscopic perception, suggesting that V1 is only part of the story (Parker 2004
). There are two major differences, outlined in the following text.
The first difference has to do with absolute versus relative disparity. The absolute disparity of an object is determined by the differences in the two retinal images generated by that object alone. These differences are measured with reference to the coordinate system of the two retinae. The retinal coordinate system, however, depends on where the eyes fixate, meaning that the absolute disparity of an object is not unique but varies all the time as we change our fixation. The relative disparity between two objects is the difference between the absolute disparity of one object and that of the other object. This measure is more robust in that it does not depend on where we look: as the eyes verge in and out to change fixation, both absolute disparities change, but their difference remains constant. Humans are especially sensitive to relative disparity, much less to absolute (Westheimer 1979
). Moreover, large changes in absolute (but not relative) disparities can go unnoticed (Erkelens and Collewijn 1985
). In stark contrast to these perceptual observations, neurons in primary visual cortex are selective for absolute not for relative disparities (Cumming and Parker 1999
), providing the first piece of evidence that further processing downstream of V1 must be happening.
The second difference has to do with correlated versus anticorrelated random-dot stereograms [random arrays of dots that look unstructured when viewed monocularly, but yield disparity-defined structure under binocular inspection (Julesz 1971
)]. With random-dot stereograms (RDSs), the task of correctly matching one dot in the left eye with its counterpart in the right eye can be a challenging one for our visual system, particularly when dot density is high and despite corresponding dots having the same contrast in both eyes (i.e., their being contrast-correlated, white-matching-white and black-matching-black). If the contrast of the image in one eye is reversed (i.e., black dots are turned white, and white dots are turned black, generating an anti-correlated RDS), the matching mechanism breaks down and humans do not experience a sense of stereoscopic depth (Cogan et al. 1995
; Cumming et al. 1998
). This is in contrast with the properties of V1 neurons that respond to both types of RDSs rather than discarding the anti-correlated type (Cumming and Parker 1997
). In particular, V1 neurons respond to anti-correlated patterns in a manner predicted by a widely accepted energy model of how these neurons detect retinal disparities (Ohzawa et al. 1990
).
The conclusion from the studies cited so far is that the final stages of stereoscopic processing must reside somewhere outside primary visual cortex (Parker 2004
). The question is, of course, where. This question has been the subject of intense research over the past 10 yr, and very recent results appear to provide a preliminary, unexpected answer.
Some neurons in the cortical area immediately after V1 in the visual system, area V2, show selectivity for relative disparity (Thomas et al. 2002
). This is encouraging because it confirms our expectation that signals downstream of V1 would be more similar to what we perceive. However, the coding of absolute and relative disparity in V2 is not as markedly different from V1 as one would hope. The next area is V3. There is only one study that provides some relevant information about V3 (Poggio et al. 1988
), and it indicates that V3 neurons respond to anti-correlated RDSs in a manner similar to V1 neurons. It must be noted that this study does not provide conclusive evidence in this respect because it makes no clear distinction between anticorrelated and uncorrelated RDSs and only examined sensitivity at zero disparity, so the question remains open in V3. After this area, the visual system takes two different directions: dorsally toward V3A, V7 (in humans), V5/MT and MST, and ventrally toward V4, V8 (in humans) and IT (Fig. 1). Which direction to go?
|
Despite all the preceding unresolved issues relating to which aspects of stereoscopic perception are supported by V5/MT neurons (a topic that clearly requires further investigation), the evidence in favor of a role for the ventral stream in stereoscopic perception seemed too limited to grant attention as a potential alternative. This evidence mainly came from one published study (Janssen et al. 2003
) demonstrating that neurons in macaque IT discard anti-correlated signals and from a preliminary report (Fujita et al. 2003
) that signals in IT show high choice probabilities in a fine stereoacuity task. However, IT [being a terminal station in the ventral stream (Felleman and Van Essen 1991
)] is far removed from the early areas (V1V3) and receives extensive inputs from the dorsal stream (Baizer et al. 1991
; Saleem et al. 2000
), leaving open the possibility that the relevant processing is happening in dorsal cortex and is later relayed to ventral cortex.
Three very recent papers now provide us with a clearer picture. They indicate that, rather unexpectedly, the processing supporting stereoscopic perception is likely to be completed in the ventral stream. The first study (Krug et al. 2004
) recorded from neurons in macaque V5/MT and tested their responses to correlated and anticorrelated RDSs. The authors show that V5/MT neurons, like MST neurons (Takemura et al. 2001
), respond to both types of RDSs in a way that is similar to V1 neurons. This paper appears in the Journal of Neurophysiology together with an fMRI study of selectivity for absolute and relative disparity in human visual cortex (Neri et al. 2004
) that relied on a fMR-adaptation technique designed to target neuronal selectivity at the subpopulation level (Grill-Spector and Malach 2001
). The fMRI results indicate that the dorsal stream is primarily involved with processing absolute disparity, whereas areas in the ventral stream show a significant degree of selectivity for relative disparity. These two studies are therefore consistent in providing evidence that signals in the dorsal stream do not directly support the final stages of stereoscopic perception. The fMRI study, moreover, is consistent with the already cited IT study (Janssen et al. 2003
) in showing that the ventral stream may be a better candidate. Given these various indications, one would like to see what neurons do in an early area along the ventral stream. This piece of information now comes from a study by Tanabe et al. (2004)
, who show that neurons in V4 (the earliest area in the ventral stream) show a noticeable degree of exclusion for anti-correlated RDSs, significantly more pronounced than in V1. Moreover, recent results from the same group show that V4 neurons are remarkably selective for relative disparity (Umeda et al. 2004
). The quantitative difference in the response to anti-correlated RDSs between V4 (Tanabe et al. 2004
) and V1 (Cumming and Parker 1997
) is clearly significant, but it should not be overstated in that it is relatively small and the two studies are not directly comparable [in particular, Tanabe et al. used stimuli of very low luminance (1.5 cd/m2), possibly compromising stereovision]. Rather, the V4 result should be combined with the earlier results in IT (Fujita et al. 2003
; Janssen et al. 2003
). Taken together, this evidence is clearly pointing toward an important role for the ventral stream in stereoscopic perception.
The picture that is emerging from these studies fits nicely with current theories on the dorsal/ventral dichotomy in visual processing. Ventral cortex is believed to support perceptually relevant signals, whereas dorsal cortex is believed to process visual information of immediate relevance to motor responses (Milner and Goodale 1992
). In the specific context of stereoscopic vision, motor processing is mostly related to vergence eye movements, which allow the two eyes to change fixation by rotating inward or outward. Absolute disparity signals are of obvious importance to vergence control, and vergence eye movements are driven by anti-correlated RDSs in a manner that is consistent with the response of V1 and MST neurons (Masson et al. 1997
; Takemura et al. 2001
). In other words, the motor side of stereoscopic processing displays characteristics that are easy to reconcile with the signals that have been measured along the dorsal stream.
In the same way that (as discussed in relation to the V5/MT literature) the perceptual compartmentalization of absolute and relative disparities is not water-tight and is only viable to a limited extent, the dorsal/ventral separation discussed so far is to be intended as a rough architectural distinction rather than a rigid dichotomy (Maunsell 1992
). Behavior tells us that perception influences action, and action influences perception, so it is certainly simplistic to imagine that the two would be happening in isolation from each other in two different parts of the brain. Depending on task demands, quality of the available signals, required speed of response and other parameters that determine the performance of the observer, different cortical modules may display different degrees of involvement in different aspects of behavior (Fias et al. 2002
). Even if it became possible to identify clear specializations, caution should be used in comparing modules across processing streams: is it really correct, for example, to rank V4 and V5/MT at similar stages within their respective hierarchies in the ventral and dorsal streams? This rather arbitrary comparison, although in line with current literature (Felleman and Van Essen 1991
), may turn out to be more problematic than expected. Nevertheless, despite the obvious limitations of such an oversimplified description, the dorsal/ventral framework described in this article does an excellent job of bringing together a dozen available studies on the topic (see Fig. 1), and similar ideas have already been proposed by other authors (Parker 2004
; Tyler 1990
).
It is remarkable how much was learnt about stereoscopic processing over the past 1015 yr. We now know that the energy model provides an adequate description of the earliest stage in disparity processing (Ohzawa 1998
). This model will need to be modified in its details (Cumming 2002
; Read and Cumming 2003
, 2004
; Read et al. 2002
; Uka and DeAngelis 2002b
), but the main features are likely to be correct. The new findings reviewed in this article offer a preliminary picture of what happens after this initial stage. They indicate that early disparity signals may be channeled in two directions: toward the dorsal stream to provide guidance to the vergence motor system and possibly support coarse stereo judgements; and toward the ventral stream, where they would undergo further analysis to establish a richer perceptual representation of surface structure and possibly of the detailed three-dimensional arrangement of objects in space. Further research will be necessary to confirm whether this simple framework gets anywhere close to providing a veridical description of stereoscopic processing in visual cortex.
Address for reprint requests and other correspondence: School of Optometry, University of California, Berkley, CA 94720-2020. (E-mail: pn{at}white.stanford.edu)
REFERENCES
Backus BT, Fleet DJ, Parker AJ, and Heeger DJ. Human cortical activity correlates with stereoscopic depth perception. J Neurophysiol 86: 20542068, 2001.
Baizer JS, Ungerleider LG, and Desimone R. Organization of visual inputs to the inferior temporal and posterior parietal cortex in macaques. J Neurosci 11: 168190, 1991.[Abstract]
Barlow HB, Blakemore C, and Pettigrew JD. The neural mechanisms of binocular depth discrimination. J Physiol 193: 327342, 1967.
Bradley DC, Chang GC, and Andersen RA. Encoding of three-dimensional structure-from-motion by primate area MT neurons. Nature 392: 714717, 1998.[CrossRef][Medline]
Christiansen DL, Chowdhury SA, and DeAngelis GC. Vertical disparities modulate depth perception in monkeys but do not alter disparity tuning in area MT. Soc Neurosci Abstr 865.9, 2004.
Cogan AI, Kontsevich LL, Lomakin AJ, Halpern DL, and Blake R. Binocular disparity processing with opposite-contrast stimuli. Perception 24: 3347, 1995.[Web of Science][Medline]
Cumming BG. An unexpected specialization for horizontal disparity in primate primary visual cortex. Nature 418: 633636, 2002.[CrossRef][Medline]
Cumming BG and Parker AJ. Responses of primary visual cortical neurons to binocular disparity without depth perception. Nature 389: 280283, 1997.[CrossRef][Medline]
Cumming BG and Parker AJ. Binocular neurons in V1 of awake monkeys are selective for absolute, not relative, disparity. J Neurosci 19: 56025618, 1999.
Cumming BG, Shapiro SE, and Parker AJ. Disparity detection in anti-correlated stereograms. Perception 27: 13671377, 1998.[Web of Science][Medline]
DeAngelis GC, Cumming BG, and Newsome WT. Cortical area MT and the perception of stereoscopic depth. Nature 394: 677680, 1998.[CrossRef][Medline]
Dodd JV, Krug K, Cumming BG, and Parker AJ. Perceptually bistable three-dimensional figures evoke high choice probabilities in cortical area MT. J Neurosci 21: 48094821, 2001.
Erkelens CJ and Collewijn H. Motion perception during dichoptic viewing of moving random-dot stereograms. Vision Res 25: 583588, 1985.[CrossRef][Web of Science][Medline]
Felleman DJ and Van Essen DC. Distributed hierarchical processing in the primate cerebral cortex. Cereb Cortex 1: 147, 1991.
Fias W, Dupont P, Reynvoet B, and Orban GA. The quantitative nature of a visual task differentiates between ventral and dorsal stream. J Cogn Neurosci 14: 646658, 2002.[CrossRef][Web of Science][Medline]
Fujita I, Uka T, Tanabe S, and Watanabe M. Neural correlates of fine depth localization in monkey inferior temporal cortex. Soc Neurosci Abstr 2003.
Grill-Spector K and Malach R. fMR-adaptation: a tool for studying the functional properties of human cortical neurons. Acta Psychol 107: 293321, 2001.[CrossRef][Medline]
Janssen P, Vogels R, Liu Y, and Orban GA. At least at the level of inferior temporal cortex, the stereo correspondence problem is solved. Neuron 37: 693701, 2003.[CrossRef][Web of Science][Medline]
Julesz B. Foundations of Cyclopean Perception. Chicago, IL: Chicago University Press, 1971.
Krug K, Cumming BG, and Parker AJ. Comparing perceptual signals of single V5/MT neurons in two binocular depth tasks. J Neurophysiol 92: 15861596, 2004.
Masson GS, Busettini C, and Miles FA. Vergence eye movements in response to binocular disparity without depth perception. Nature 389: 283286, 1997.[CrossRef][Medline]
Maunsell JHR. Functional visual streams. Curr Opin Neurobiol 2: 506510, 1992.[CrossRef][Medline]
Milner AD and Goodale MA. Separate visual pathways for perception and action. Trends Neurosci 15: 2025, 1992.[Medline]
Neri P, Bridge H, and Heeger DJ. Stereoscopic processing of absolute and relative disparity in human visual cortex. J Neurophysiol 92: 18801891, 2004.
Nguyenkim JD and DeAngelis GC. Disparity-based coding of three-dimensional surface orientation by macaque middle temporal neurons. J Neurosci 23: 71177128, 2003.
Nikara T, Bishop PO, and Pettigrew JD. Analysis of retinal correspondence by studying receptive fields of binocular single units in cat striate cortex. Exp Brain Res 6: 353372, 1968.[Web of Science][Medline]
Ohzawa I. Mechanisms of stereoscopic vision: the disparity energy model. Curr Opin Neurobiol 8: 509515, 1998.[CrossRef][Web of Science][Medline]
Ohzawa I, DeAngelis G, and Freeman RD. Stereoscopic depth discrimination in the visual cortex: neurons ideally suited as disparity detectors. Science 249: 10371041, 1990.
Parker AJ. From binocular disparity to the perception of stereoscopic depth. In: The Visual Neurosciences, edited by Chalupa LM and Werner JS. Cambridge, MA: MIT Press, 2004, p. 779792.
Pettigrew JD. Comparative physiology of binocular vision. Aust J Optom 63: 204210, 1980.
Poggio GF. Mechanisms of stereopsis in monkey visual cortex. Cereb Cortex 3: 193204, 1995.
Poggio GF, Gonzalez F, and Krause F. Stereoscopic mechanisms in monkey visual cortex: binocular correlation and disparity selectivity. J Neurosci 8: 45314550, 1988.[Abstract]
Prince SJD, Pointon AD, Cumming BG, and Parker AJ. The precision of single neuron responses in cortical area V1 during stereoscopic depth judgments. J Neurosci 20: 33873400, 2000.
Read JCA and Cumming BG. Testing quantitative models of binocular disparity selectivity in primary visual cortex. J Neurophysiol 90: 27952817, 2003.
Read JCA and Cumming BG. Understanding the cortical specialization for horizontal disparity. Neural Comput 16: 19832020, 2004.[CrossRef][Web of Science][Medline]
Read JCA, Parker AJ, and Cumming BG. A simple model accounts for the response of disparity-tuned V1 neurons to anticorrelated images. Visual Neurosci 19: 735753, 2002.[CrossRef][Web of Science][Medline]
Saleem KS, Suzuki W, Tanaka K, and Hashikawa T. Connections between anterior inferotemporal cortex and superior temporal sulcus regions in the macaque monkey. J Neurosci 20: 50835101, 2000.
Takemura A, Inoue Y, Kawano K, Quaia C, and Miles FA. Single-unit activity in cortical area MST associated with disparity-vergence eye movements: evidence for population coding. J Neurophysiol 85: 22452266, 2001.
Tanabe S, Umeda K, and Fujita I. Rejection of false matches for binocular correspondence in macaque visual cortical area V4. J Neurosci 24: 81708180, 2004.
Thomas OM, Cumming BG, and Parker AJ. A specialization for relative disparity in V2. Nat Neurosci 5: 472478, 2002.[CrossRef][Web of Science][Medline]
Tyler CW. A stereoscopic view of visual processing streams. Vision Res 30: 18771895, 1990.[CrossRef][Web of Science][Medline]
Uka T and DeAngelis GC. MT neurons do not signal relative disparity (Abstract). J Vision 2: 37a, 2002a.
Uka T and DeAngelis GC. Binocular vision: an orientation to disparity coding. Curr Biol 12: R764R766, 2002b.[CrossRef][Web of Science][Medline]
Uka T and DeAngelis GC. Contribution of middle temporal area to coarse depth discrimination: comparison of neuronal and psychophysical sensitivity. J Neurosci 23: 35153530, 2003.
Uka T and DeAngelis GC. Contribution of area MT to stereoscopic depth perception: choice-related response modulations reflect task strategy. Neuron 42: 297310, 2004.[CrossRef][Web of Science][Medline]
Umeda K, Tanabe S, and Fujita I. Relative-disparity-based coding of stereoscopic depth in V4. Soc Neurosci Abstr 865.8, 2004.
Westheimer G. Cooperative neural processes involved in stereoscopic acuity. Exp Brain Res 36: 585597, 1979.[Web of Science][Medline]
Wheatstone C. Contributions to the physiology of vision: part the first. On some remarkable, and hitherto unobserved, phenomena of binocular vision. Philos Trans Roy Soc Lond B Biol Sci 128: 371394, 1838.
This article has been cited by other articles:
![]() |
T. J. Preston, S. Li, Z. Kourtzi, and A. E. Welchman Multivoxel Pattern Selectivity for Perceptually Relevant Binocular Disparities in the Human Brain J. Neurosci., October 29, 2008; 28(44): 11315 - 11327. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Chandrasekaran, V. Canon, J. C. Dahmen, Z. Kourtzi, and A. E. Welchman Neural Correlates of Disparity-Defined Shape Discrimination in the Human Brain J Neurophysiol, February 1, 2007; 97(2): 1553 - 1565. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Uka and G. C. DeAngelis Linking neural representation to function in stereoscopic depth perception: roles of the middle temporal area in coarse versus fine disparity discrimination. J. Neurosci., June 21, 2006; 26(25): 6791 - 6802. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Uka, S. Tanabe, M. Watanabe, and I. Fujita Neural Correlates of Fine Depth Discrimination in Monkey Inferior Temporal Cortex J. Neurosci., November 16, 2005; 25(46): 10796 - 10802. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Tanabe, T. Doi, K. Umeda, and I. Fujita Disparity-Tuning Characteristics of Neuronal Responses to Dynamic Random-Dot Stereograms in Macaque Visual Area V4 J Neurophysiol, October 1, 2005; 94(4): 2683 - 2699. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |