|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1Department of Psychology, Princeton University, Princeton, New Jersey; 2Max-Planck Institute for Biological Cybernetics, Tübingen, Germany; 3Department of Physiology, Anatomy and Genetics, Oxford University, Oxford; and 4School of Psychology, University of Birmingham, Birmingham, United Kingdom
Submitted 9 October 2006; accepted in final form 30 November 2006
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Neurophysiological and imaging studies revealed selectivity for binocular disparity at multiple levels of the visual hierarchy in the monkey and the human brain from early visual areas, to object- and motion-selective areas and the parietal cortex (for reviews see Cumming and DeAngelis 2001
; Neri 2005
; Orban et al. 2006a
,b
; Parker 2004
). However, understanding how activity in these multiple disparity-selectivity regions relates to key visual functions that guide human behavior remains an important challenge. Specifically, how do different cortical networks support key computational functions that exploit depth estimates to perceive the shape of three-dimensional (3D) objects, interpret their material properties, recognize 3D objects in complex scenes, and control the movement of body parts when interacting with objects? Previous work (for reviews see Neri 2005
; Orban et al. 2006b
; Parker 2004
; Tyler 1990
) has proposed a functional dichotomy between the dorsal visual pathway ascending into parietal cortex that processes disparity signals relating to spatial position and object-related actions and the ventral visual pathway leading to temporal cortex that supports shape discrimination and object recognition. Our study aimed to investigate which of the cortical areas in the human ventral or dorsal visual pathway that are known to be involved in the processing of disparity information contribute to the integration and perception of global shape from disparity.
To address this question, we examined the relationship between cortical activity and coherent shape perception from horizontal disparity signals by measuring functional magnetic resonance imaging (fMRI) activity concurrently with observers' judgments of shape. We used random-dot displays in which 3D convex shapes were apparent only after global binocular correspondence had been established (Fig. 1). To investigate the cortical areas whose activity related to processing of disparity signals for global shape perception, we degraded the visual stimulus to measure effects on both perceptual discrimination and fMRI response. We parametrically varied shape coherence by shuffling the position of dots whose disparity defined the 3D shape. The advantage of this approach is that it allowed us to degrade the perception of a global 3D shape while keeping the image disparity content constant. Our findings provide novel evidence that activity in both dorsal (V3, hMT+/V5) and ventral (LOC) visual areas varies in correspondence with an observer's perception of shape defined by disparity. These findings are consistent with interactions, rather than a dichotomy, between ventral and dorsal areas that mediate the perception of coherent disparity-defined shapes possibly by supporting a continuum of visual integration and recognition processes from coarse contour segmentation to fine curvature discrimination.
|
| METHODS |
|---|
|
|
|---|
Ten students from the University of Tübingen participated in the experiment. All had normal/corrected (contact lenses) visual acuity, were able to perceive disparity-specified depth (Stereo Fly test), and were screened for color deficiencies (Ishihara plates). All observers gave informed written consent.
Stimuli
Ten convex shapes defined by horizontal disparity were used as stimuli. Shapes were defined by an average of 200 dots and were embedded in background noise (average number of dots: 850) positioned in the plane of the screen (two exemplars are shown in Fig. 1). Stimuli were presented on a gray background and were 14.42 x 14.42° in size. Individual dots subtended 0.2° and were either black (1.55 cd/m2) or white (415 cd/m2). Disparity was maximal (0.21°) at the center of the shape and fell off smoothly to zero toward the boundaries along both the X and Y directions. The 3D shapes covered 9.64 x 9.64° in the center of the stimulus display. The position of the shapes was jittered across trials to control for local position cues that could facilitate extraction of the shape contour. Half of the shapes were symmetrical along the vertical axis and half along the horizontal axis. Observers indicated (by a button press) whether presented stimuli were symmetrical along the vertical or horizontal axis while maintaining fixation on a central target. Nonius fixation targets were provided to promote maintained eye vergence to the fixation target (in the plane of the screen). The symmetry task ensured that the observers attended to the whole shape, rather than the local elements.
We manipulated the coherence of the 3D shapes by randomly swapping the position of disparity-defined stimulus dots with background dots. This shuffling manipulation was used to create eight conditions in which different percentages of stimulus dots were exchanged to vary shape coherence. Shapes were most coherent ("90% coherence") when only 10% of dots were swapped and 90% of stimulus dots were in their original positions. We examined conditions of varying shape coherence (either 10, 20, 30, 40, 50, 70, 80, or 90% of stimulus dots were in their original positions). Our pilot experiments showed that these conditions were sufficient to estimate observers' threshold performance; 0, 60, and 100% coherence levels were not tested because our pilot data showed that observers' performance at these levels was very similar to nearby coherence levels. Shuffling dot positions across the whole stimulus rather than only within the shape boundary ensured that observers could not perform the shape discrimination based on a disparity-defined region whose contours would be constant across coherence levels.
Stimuli were rear-projected onto a screen inside the scanner bore and viewed through a mirror 10 cm above the eyes (viewing distance: 78 cm). Dichoptic presentation was effected by redgreen anaglyphs (cross talk only 0.6%), encouraging a natural vergence posture. Photometric measures of the red and green signals from the NEC GT950 video projector were used for gamma correction and to equate the luminance of the images presented to each eye.
Design
Psychophysical and fMRI data were collected concurrently from each observer for each condition in eight event-related scans (coherence levels were intermixed on individual scans). Each scan consisted of one experimental trial epoch and two 8-s fixation epochs (one at start; one at end). The experimental trial epoch in each scan consisted of 125 trials in total: 25 trials from each of four of the eight conditions and 25 fixation trials (providing a measure of baseline activity). Different coherence levels were presented across scans: stimuli of 10, 30, 50, and 80% coherence were presented in half of the scans and stimuli of 20, 40, 70, and 90% coherence were presented in the remaining scans. This grouping of coherence levels across scans ensured that both low and high coherence levels were presented in each scan and resulted in scans that were both fully balanced for trial history (two trials back) and not too long for the participants (6.25 min in duration). The 60% stimulus coherence level was not tested because pilot testing showed that this level was distant from any of the individual observer's threshold performance, consistent with the behavioral results during scanning (Fig. 3A). Presentation order was counterbalanced so that trials from each condition (including fixation) were preceded equally often by trials from other conditions. A new trial began every 3 s and consisted of one stimulus (duration = 300 ms) followed by a blank (2,700 ms).
|
Imaging
We used a 3T Siemens scanner (University Clinic, Tübingen, Germany) with gradient echo pulse sequence (TR = 2 s, TE = 40 ms localizers; TR = 1.5 s, TE = 40 ms event-related scans) and an eight-channel head coil. We collected 18 near-coronal slices (5 mm thick; 3 x 3 mm in-plane resolution) covering occipitotemporal, parietal, and part of frontal cortex.
Data analysis
PSYCHOPHYSICAL DATA.
A standard logistic function was used to fit each observer's behavioral responses and obtain estimates of threshold performance
![]() | (1) |
where
is the baseline,
is the offset,
is the intercept, and
is the slope of the psychometric function.
FMRI DATA. fMRI data were processed using BrainVoyager (Brain Innovation, Maastricht, The Netherlands) and MATLAB (The MathWorks, Natick, MA). Preprocessing of functional data included head-movement correction, high-frequency temporal filtering, and linear trend removal. Two-dimensional functional images were aligned to 3D anatomical data, transformed to standardized Tailarach coordinates, inflated, unfolded, and flattened.
For each observer, we identified ROIs using standard mapping techniques (Fig. 2; Table S1 for Talairach coordinates). This approach allowed us to localize ROIs based on independent data from the data collected during the experimental trials. 3D statistical maps for each ROI were obtained by correlating the fMRI time courses with a reference time course based on hemodynamic response properties. The borders of early retinotopic regions (V1, V2, V3, V3A, VP/V3, V4) were localized using rotating wedge stimuli and eccentricity mapping was achieved using concentric rings, as described in our previous studies (Welchman et al. 2005
). The lateral occipital complex (LOC) constituted the voxels in the ventral occipitotemporal cortex showing significantly stronger activation (P < 104, corrected) to intact than to scrambled images of objects. Two separate LOC subregions were identified: a posterior lateral occipital (LO) region and an anterior region in the posterior fusiform gyrus (pFs), as described in previous studies (Welchman et al. 2005
). The human medial temporal area (hMT+/V5) constituted the contiguous voxels in the ascending limb of the inferior temporal sulcus showing significantly stronger activation (P < 104, corrected) to moving than to static low-contrast concentric rings (Tootell et al. 1995
; Zeki et al. 1991
). We distinguished between medial temporal (MT) and medial superior temporal (MST) subregions using rotating triangular wedge (45°) stimuli rendered by moving dots (Huk et al. 2002
). V3B/KO was defined as the set of contiguous voxels anterior to V3A and posterior to hMT+/V5 that showed significantly stronger activation (P < 104, corrected) to kinetic boundaries versus transparent motion (Van Oostende et al. 1997
). Finally, areas along the intraparietal sulcus (IPS: VIPS, POIPS, DIPSM, DIPSA) and the frontal eye field (FEF) region showed significantly stronger activation (P < 104, corrected) to intact than to scrambled 3D shapes (the same 10 shapes used in the experiment) defined by disparity. These areas were identified as 3D shape-related ROIs, in accordance with previous studies (Orban et al. 1999
, 2006a
).
|
| RESULTS |
|---|
|
|
|---|
During scanning, observers judged whether the viewed disparity-defined shape was symmetrical along the vertical or horizontal axis. We manipulated the coherence of the disparity-defined shape by randomly repositioning dots carrying disparity information about the shape across the display. As the proportion of repositioned dots increased, the shape appeared less coherent and the task became increasingly difficult (Fig. 3A). This was quantified by fitting the behavioral data with a logistic function to estimate the 75% correct discrimination threshold. On average, observers required 32.21 ± 1.68% stimulus coherence to determine reliably whether the shapes were symmetrical along the horizontal or vertical axis (see Table S2 for individual observer thresholds).
fMRI responses
We examined the average fMRI response evoked by stimuli with different levels of disparity-defined shape coherence in early visual areas (V1, V2, V3, V3A, V3B/KO, VP/V3, V4), higher ventral areas (LOC: LO, pFs), and dorsal visual areas (hMT+/V5: MT, MST) previously implicated in the processing of disparity information in the human brain (i.e., disparity-defined planes: Backus et al. 2001
; Gulyas and Roland 1994
; Neri et al. 2004
; Rutschmann and Greenlee 2004
; Tyler et al. 2006
; shape contours: Gilaie-Dotan et al. 2002
; Kourtzi et al. 2002
, 2003
; Mendola et al. 1999
; or gradients: (Brouwer et al. 2005
; Welchman et al. 2005
).
Figure 3, BD shows the effect of increasing disparity-defined shape coherence on the fMRI response across cortical areas (V1, V2, ventral, dorsal areas). Responses in higher visual areas hMT+/V5 and LOC (and their subregions) increased as a function of shape coherence, confirmed by significant regressions of fMRI response on coherence in these areas (hMT+/V5: P = 0.001; LOC: P < 0.01). In contrast, responses in the early visual areas did not change significantly as a function of coherence (Table 1), with the exception of area V3 (P < 0.05) and a marginal effect in V3A (P = 0.059). Additional analysis considered fMRI responses across areas based on trials only on which observers' shape-discrimination judgments were correct. This analysis showed a similar pattern of results (Fig. S2, Table S3), suggesting that the lack of significant activations in the remaining early visual areas (V1, V2, VP/V3, V4) did not arise from weak responses for incorrect trials. These results suggest a role for V3 and higher dorsal (hMT+/V5) and ventral (LOC) visual areas in processing disparity signals that underlie the integration and perception of shape from disparity. In contrast, early visual areas, known to be involved in processing disparity, responded similarly at different levels of shape coherence, suggesting local processing of disparity signals that is unaffected by the disparity shuffling manipulation.
|
|
Relationship between psychophysical and fMRI responses
Our data show that increasing disparity coherence results in enhanced performance in shape discrimination (psychophysical results) and higher levels of activity in some of the visual areas (fMRI results). Based on the relationships between stimulus coherence and our two dependent measures (psychophysics and fMRI) using between-subjects data, we would expect a relationship between psychophysical and fMRI responses. To test this expectation, we performed regression analyses across visual areas based on the data from individual subjects (Fig. 4). This analysis provided evidence for a relationship between perceived shape from disparity and fMRI activation in V3 and higher (hMT+/V5, LOC) visual areas (Table 2). Specifically, there was evidence of a significant positive relationship between psychophysical and fMRI responses in hMT+/V5 (P < 0.01) and LOC (P = 0.01) but not in the early visual areas with the exception of area V3 (P < 0.05).
|
|
![]() | (2) |
where
is the intercept, and
is the slope of the psychometric function. We did this by adjusting the baseline and scaling parameters using a nonlinear least-squares method while constraining the slope (
) and intercept (
) parameters of the model to be those obtained from the psychophysical data. This approach is consistent with previous studies (Buracas et al. 2005
; Zenger-Landolt and Heeger 2003
) that evaluated the relationship between fMRI responses and contrast or speed discrimination performance in V1 and hMT+/V5. Figure 5 shows the fMRI data across the visual areas fit with the logistic model obtained from the psychophysical data. A Pearson correlation coefficient was used to estimate the goodness of fit of the psychophysical model to the fMRI data. There was good agreement between the psychophysical model and the fMRI response in hMT+/V5 (P < 0.05) and LOC (P = 0.05), but not in the early visual areas (Table 3) with the exception of V3 (P < 0.05). This evidence is confirmatory in suggesting a relationship between processing in V3 and higher visual areas (hMT+/V5, LOC) and the observers' ability to discriminate coherent shapes defined by horizontal disparity rather than processing of local disparity signals in these areas.
|
|
Recent imaging studies have revealed a network of parietal and frontal regions beyond the occipitotemporal cortex that respond to 3D shape information (Orban et al. 1999
, 2006a
; Peuskens et al. 2004
; Sakata et al. 2005
; Sereno et al. 2002
; Shikata et al. 2001
; Taira et al. 2001
; Tsao et al. 2003
; Vanduffel et al. 2002
). We identified these areas in individual subjects by comparing activations to 3D shapes defined by disparity (100% coherence) with random-dot displays (0% coherence) based on an independent data set collected in a localizer scan (Fig. 2; see METHODS). Analysis of the fMRI responses for the shape coherence experiment provided evidence for a relationship between shape coherence and fMRI activity in these areas (VIPS, POIPS, DIPSM, DIPSA, FEF). However, the relationship was opposite in sign to that observed in areas V3, hMT+/V5, and LOC. Specifically, activity in parietal areas and FEF decreased as the coherence of the stimulus increased (Fig. 6A, Table 1). Regression analyses confirmed significant negative relationships between the observers' ability to discriminate shape and fMRI responses in these areas (Fig. 6B; Table 2).
|
A further possibility is that activity in these areas relates to the demanding nature of the task performed. An analysis of the time taken for observers to make their perceptual decision indicated that observers responded faster when shape coherence was higher and the shape's symmetry more readily appreciated (Fig. 7A). We observed a significant positive relationship between response time and activity in parietal areas and FEF (Fig. 7B; Table 4), suggesting that activity in these areas reflects the difficulty of the subject's task. This finding appears intriguing in that these regions showed significantly stronger activation to coherent disparity-defined shapes than random disparity fields when the observers performed a fixation-dimming task during a localizer scan. However, activity in these areas during the visual discrimination of shapes at different coherence levels appears more closely matched to the perceptual decision made by the observers and the task demands. A further possibility is that this task-dependent effect results from increases in peripheral clusters of dots carrying disparities that become more salient with shuffling. This could enhance activation in these cortical regions that are known to be engaged in the processing of salient targets (Claeys et al. 2003
). In contrast to these task-related effects in parietal areas and the FEF, we observed a significant negative relationship between fMRI responses and observers' response times in V3 and hMT+/V5 and no relationship in the LOC subregions (Fig. S5, Table 4). These results provide additional evidence that the activity in these areas relates to disparity processing for coherent shape perception, rather than nonspecific attentional modulation of activity arising from task difficulty.
|
Further analysis of the psychophysical data revealed that observers were better at discriminating disparity-defined shapes with a vertical rather than horizontal symmetry axis (Fig. 8A), consistent with previous psychophysical studies (Wagemans 1997
). We examined whether these differences in the observers' performance were reflected by differences in fMRI responses. In particular, we asked whether it is possible to determine whether a disparity-defined shape with a horizontal or vertical symmetry axis was presented to observers based only on the fMRI response. To answer this question we calculated a receiver operating characteristic (ROC) curve (cf. Britten et al. 1992
) based on distributions of the fMRI responses evoked by disparity-defined shapes with a vertical, compared with a horizontal, axis of symmetry. For each observer we computed the average fMRI response (% signal change) for each trial from voxels (n = 50) in each ROI that showed the most significant activation to the stimulus compared with the fixation baseline. We compared the distributions of fMRI responses across trials evoked by vertically and horizontally symmetrical shapes in these stimulus-selective voxels (the same 50 voxels were used to calculate the response to horizontal and vertical stimuli). To construct the ROC curve, we calculated the probability of obtaining an fMRI response at or above some criterion level of activity for each distribution and then plotted these probabilities against each other for a range of criterion values. The area under the resulting curve provides an estimate of the relative selectivity of the region for a 3D shape with a horizontal or vertical symmetry axis, where 0.5 represents chance performance (total overlap of the distributions) and a value of 1.0, perfect performance (total separation of the distributions). To assess the statistical significance of the calculated ROC values (95% confidence intervals), we used a permutation test based on the distribution of ROC values obtained when the data were repeatedly shuffled (4,000 iterations) to disrupt the correlation between the presented stimulus and the fMRI response.
|
A possible limitation of this analysis is that the magnitude of the observed ROC values is considerably lower than that typically seen with electrophysiology (e.g., Britten et al. 1992
; Dodd et al. 2001
; Krug et al. 2004
; Uka and DeAngelis 2004
). This potentially reflects the spatial resolution of fMRI where voxel responses accord to activity in large neural populations that may respond heterogeneously to different stimulus types. Despite this limitation, the conservative statistical validation of the data suggests reliably higher fMRI responses for disparity-defined shapes of vertical than of horizontal symmetry at the 40% coherence level, consistent with behavioral performance and previous imaging studies (Sasaki et al. 2005
). Further, a recent physiological study (Liu and Newsome 2006
) calculated ROC values based on measurements of local field potential (LFP) signals, thought to contribute to the fMRI response (Logothetis et al. 2001
). These ROC values were of similar magnitude to those observed in our study and reliably reflected trial-to-trial associations between behavioral judgments and neural activity. Our ROC analysis provides additional evidence for a link between coherent shape perception from disparity and activity in higher visual areas (hMT+/V5, LOC): activity in these areas predicts differences in shape symmetry consistent with performance differences. Interestingly, the ROC analysis showed significant values in V3B/KO and VP/V3 (marginal effect in V4: P = 0.061), suggesting that differences in stimulus symmetry can be predicted by activity in these areas. This is consistent with previous studies implicating these areas in symmetry processing (Sasaki et al. 2005
).
| DISCUSSION |
|---|
|
|
|---|
These findings advance our understanding of the neural correlates of 3D shape perception in the human brain by extending beyond the localization of regions involved in 3D shape processing to investigate the relationship between activity in these areas and the perception of coherent disparity-defined shape. In particular, previous imaging studies have identified multiple areas in the visual, temporal, and parieral cortex that show stronger activations for stimuli defined by binocular or monocular depth cues than those of two-dimensional versions of these stimuli (for reviews see Neri 2005
; Orban et al. 2006a
,b
; Parker 2004
). Further, previous studies have used parametric manipulations to investigate the neural correlates of surface depth judgments (Backus et al. 2001
; Gilaie-Dotan et al. 2002
). Our study is the first to use a stimulus coherence manipulation to investigate whether disparity-defined shape judgments correlate with cortical activity in the human brain. Similar stimulus manipulations were used in psychophysical (e.g., Harris and Parker 1992
) and physiological studies (e.g., Uka and DeAngelis 2003
) investigating the estimation of surface depth position (i.e., near vs. far) as well as numerous studies on motion coherence (e.g., Britten et al. 1992
; Rees et al. 2000
). Parametrically varying the coherence of 3D shapes allowed us to investigate the effects of stimulus degradation on both shape judgments and cortical activity as measured by fMRI.
Disparity processing and 3D shape perception
Estimating an object's shape based on disparity signals requires a cascade of processing that starts with the local registration of corresponding visual features and proceeds through global constraints on feature matching, the extraction of information about changes in local disparity signals (gradients), and more regional registration of the rate of change of disparity gradient (curvature). Early neurophysiological recordings revealed selectivity for binocular disparity at multiple levels of the visual hierarchy: V1 (Barlow et al. 1967
; Poggio and Fischer 1977
), V2 (Thomas et al. 2002
; von der Heydt et al. 2000
), V3 (Adams and Zeki 2001
; Felleman and Van Essen 1987
; Poggio et al. 1988
), V4 (Hegde and Van Essen 2005
; Hinkle and Connor 2002
, 2005
; Tanabe et al. 2004
; Watanabe et al. 2002
), MT/V5 (DeAngelis and Newsome 1999
; Krug et al. 2004
; Palanca and DeAngelis 2003
; Uka and DeAngelis 2004
, 2006
), MST (Eifuku and Wurtz 1999
; Takemura et al. 2002
), IT (Janssen et al. 2000
, 2001
, 2003
; Liu et al. 2004
; Tanaka et al. 2001
; Uka et al. 2000
, 2005
), and in regions of the parietal (caudal intraparietal sulcus) cortex (Taira et al. 2000
; Tsutsui et al. 2002
). Complementary evidence from human brain imaging implicated several areas across the visual, object-related, motion-related, and parietal cortex in the processing of disparity information (for reviews see Neri 2005
; Orban et al. 2006a
,b
). Further, several recent studies suggest that areas involved in disparity processing primarily in the temporal and parietal cortex are also engaged in the processing of monocular cues to depth (e.g., texture, motion, shading) (James et al. 2002
; Kourtzi et al. 2003
; Liu et al. 2004
; Murray et al. 2003
; Orban et al. 2006a
; Peuskens et al. 2004
; Sakata et al. 2005
; Sereno et al. 2002
; Shikata et al. 2001
; Taira et al. 2001
; Tsutsui et al. 2002
; Vanduffel et al. 2002
).
How is activity at these multiple stages of visual processing related to the perception of shape derived from disparity signals? Neurophysiological evidence suggests that primary visual cortex does not respond to global constraints on correspondence (Cumming and Parker 1997
, 2000
), whereas this computation appears partially solved in V4 (Tanabe et al. 2004
) and predominantly so in IT (Janssen et al. 2003
). Selectivity for disparity-defined gradients is evident in MT/V5 (Nguyenkim and DeAngelis 2003
), IT (Liu et al. 2004
), and CIP (Tsutsui et al. 2002
) and responses to shapes defined by curvature are found in IT (Janssen et al. 2000
, 2001
). Reviews of these studies point to the anatomy-based dichotomy of the visual pathways suggesting that the ventral visual pathway computes 3D shape information for fine discriminations (e.g., object curvature) important for the recognition of objects and their parts whereas the dorsal visual pathway computes 3D scene layout important for spatial positioning and interactions with objects (for reviews see Neri 2005
; Orban et al. 2006b
; Parker 2004
; Tyler 1990
).
Our findings provide evidence for a link between disparity processing and coherent shape perception not only in ventral areas (LOC) implicated in shape analysis but also in dorsal areas (V3, hMT+/V5) thought to be involved in the analysis of spatial structure. This multisite activity is consistent with the shape task we used relying on multiple stages of visual analysis that include integrating local disparity signals to extract extended surface patches, segmenting the bounding contours of the object, and registering shape curvature. The shuffling manipulation we used disrupted the continuity of disparity across the scene and likely interfered with these processes, potentially accounting for the reduced fMRI activity we observed as shape coherence was reduced. The advantage of this manipulation is that it disrupted both contours and curvature to ensure that observers could not perform the shape discrimination based on the contour of a disparity-defined region constant across all stimulus coherence levels.
Recent neurophysiological evidence points to a distinction between the processing of coarse and fine disparity information in the cortical hierarchy. Specifically, MT neurons have a sensitivity to coarse disparity information (surfaces located near vs. far) that is sufficient to account for behavioral performance (Uka and DeAngelis 2003
, 2004
) and microstimulation of MT during nearfar disparity discrimination biases behavioral choice (DeAngelis et al. 1998). In contrast, microstimulation of neurons in MT has little effect on behavioral performance when monkeys are engaged in fine judgments of relative depth position (Uka and DeAngelis 2006
). The symmetry task we used relies on information about global structure, rather than fine depth position. In line with neurophysiological evidence we find evidence for a relationship between behavioral measures and cortical activity in hMT+/V5. Our current data do not allow us to determine whether the same relationship would hold for fine depth discriminations.
Responses in early visual areas were not found to vary significantly with stimulus coherence. This is compatible with a local analysis of disparity signals that is unaffected by the disruption to surface structure caused by the shuffling manipulation or with neural responses that are not related to the perceptual task (cf. Cumming and Parker 1997
, 2000
). Is it possible that our stimuli were not optimal for neurons in the early visual areas leading to a null effect? Specifically, the population sensitivity to disparity is known to be slightly biased toward crossed disparity in monkey MT/V5 (DeAngelis and Uka 2003
) and V4 (Hinkle and Connor 2005
), whereas it is less biased and closer to a peak sensitivity at zero binocular disparity in V1 (Prince et al. 2002
). If the presented stimulus fell outside the signaling capacities of neurons in early visual cortices, we would not expect to see an effect of the coherence manipulation even though this might be observed in these areas for a different set of stimuli. We view this possibility as unlikely. First, the population sensitivity to disparity in monkey V1 is sufficiently broad that disparities in the range 0 to 0.21° should evoke a robust response close to the central tendency of the population response (Prince et al. 2002
). Second, robust human fMRI responses selective to binocular disparity have been observed in areas V1 and V2 for stimuli containing disparities exceeding those used in our study (Backus et al. 2001
). Finally, it has been reported that the population sensitivity to disparity declines slightly with eccentricity (Prince et al. 2002
). Were it the case that the near disparities in our stimuli did not evoke optimal responses in V1, we might expect that the fMRI response would increase as stimulus coherence decreased as the result of a larger proportion of zero-disparity dots in the center of the visual field. In contrast, fMRI responses in V1 did not vary with stimulus coherence.
Interestingly, area V3 rather than earlier visual areas (V1, V2) appeared sensitive to the coherence manipulation, suggesting sensitivity to more global properties of the stimulus for the integration of shape contours or extended surface regions. This suggestion is consistent with the disparity columnar organization of V3 (Adams and Zeki 2001
) similar to that observed in MT (DeAngelis and Newsome 1999
), its input from both the magno- and the parvo-cellular pathways (Callaway 1998
), and the strong anatomical connectivity between V3 and MT (Van Essen et al. 1986
). It is possible that the activity we observed in dorsal areas (V3, hMT+/V5) reflects the role of these areas in pooling disparity signals and extracting 3D shape contours and regions important for the coarse segmentation and detection of 3D targets in complex scenes. Further finer analysis of 3D shape curvature may proceed in temporal areas to support identification of complex 3D objects and their parts.
In contrast, previous studies implicated areas V3A (Backus et al. 2001
; Gulyas and Roland 1994
; Mendola et al. 1999
; Tsao et al. 2003
) and V3B/KO (Brouwer et al. 2005
; Tyler et al. 2006
; Van Oostende et al. 1997
; Zeki et al. 2003
) in the analysis of disparity-defined surfaces and boundaries. Further, caudal intaparietal (CIP) regions were previously implicated in processing 3D object orientation and surface slant (James et al. 2002
; Naganuma et al. 2005
; Sakata et al. 2005
; Taira et al. 2000
) rather than being involved in form discrimination (Shikata et al. 2001
). The anatomical connectivity (Adams and Zeki 2001
; Nakamura et al. 2001
) established between areas V3A, V3B/KO, CIP, and anterior intraparietal areas involved in object grasping (Orban et al. 2006a
; Sakata et al. 2005
) suggest a role of disparity-selective mechanisms in object-related actions rather than visual recognition tasks. Consistent with these studies, activations in areas V3A and V3B/KO observed in our study did not correlate significantly with the observers' performance in discriminating disparity-defined shapes, while activations in parietal areas were consistent with task demands. Taken together, these findings suggest a coarse to fine analysis of spatial surface properties from dorsal visual areas V3A, V3B/KO to the parietal cortex important for computing distances in complex layouts and preparing for actions toward objects.
In conclusion, our results suggest that the visual discrimination of disparity-defined shapes recruits both ventral and dorsal visual areas that mediate a continuum of shape extraction and recognition processes from coarse contour segmentation to fine curvature discrimination. Although our study was not designed to discern these processes (i.e., 3D shape contour extraction vs. 3D curvature analysis) and their contribution to the visual discrimination of 3D shapes, our findings are consistent with proposed interactions (Orban et al. 2006b
; Peuskens et al. 2004
) rather than a strict anatomical dichotomy of function between the visual pathways for 3D shape processing. Our study did not set out to investigate functional interactions between the pathways. Nevertheless, our findings are in agreement with accumulating evidence for form processing in higher dorsal visual areas (i.e., hMT+/V5) (e.g., Kourtzi et al. 2002
; Krekelberg et al. 2003
; Malonek et al. 1994
; Peuskens et al. 2004
). Further imaging and neurophysiological studies are necessary to elucidate the cortical dynamics including feedforward and recurrent interactions between areas in this disparity-selective neural circuit that determine its functionality and critical importance for successful interactions in the complex three-dimensional environment we inhabit.
| GRANTS |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: A. Welchman, School of Psychology, University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK
| REFERENCES |
|---|
|
|
|---|
Backus BT, Fleet DJ, Parker AJ, Heeger DJ. Human cortical activity correlates with stereoscopic depth perception. J Neurophysiol 86: 20542068, 2001.
Barlow HB, Blakemore C, Pettigrew JD. The neural mechanism of binocular depth discrimination. J Physiol 193: 327342, 1967.
Britten KH, Shadlen MN, Newsome WT, Movshon JA. The analysis of visual motion: a comparison of neuronal and psychophysical performance. J Neurosci 12: 47454765, 1992.[Abstract]
Brouwer GJ, van Ee R, Schwarzbach J. Activation in visual cortex correlates with the awareness of stereoscopic depth. J Neurosci 25: 1040310413, 2005.
Buracas GT, Fine I, Boynton GM. The relationship between task performance and functional magnetic resonance imaging response. J Neurosci 25: 30233031, 2005.
Callaway EM. Local circuits in primary visual cortex of the macaque monkey. Annu Rev Neurosci 21: 4774, 1998.[CrossRef][ISI][Medline]
Claeys KG, Lindsey DT, De Schutter E, Orban GA. A higher order motion region in human inferior parietal lobule: evidence from fMRI. Neuron 40: 631642, 2003.[CrossRef][ISI][Medline]
Cumming BG, DeAngelis GC. The physiology of stereopsis. Annu Rev Neurosci 24: 203238, 2001.[CrossRef][ISI][Medline]
Cumming BG, Parker AJ. Responses of primary visual cortical neurons to binocular disparity without depth perception. Nature 389: 280283, 1997.[CrossRef][Medline]
Cumming BG, Parker AJ. Local disparity not perceived depth is signaled by binocular neurons in cortical area V1 of the macaque. J Neurosci 20: 47584767, 2000.
DeAngelis GC, Newsome WT. Organization of disparity-selective neurons in macaque area MT. J Neurosci 19: 13981415, 1999.
DeAngelis GC, Uka T. Coding of horizontal disparity and velocity by MT neurons in the alert macaque. J Neurophysiol 89: 10941111, 2003.
Dodd JV, Krug K, Cumming BG, Parker AJ. Perceptually bistable three-dimensional figures evoke high choice probabilities in cortical area MT. J Neurosci 21: 48094821, 2001.
Eifuku S, Wurtz RH. Response to motion in extrastriate area MSTl: disparity sensitivity. J Neurophysiol 82: 24622475, 1999.
Felleman DJ, Van Essen DC. Receptive field properties of neurons in area V3 of macaque monkey extrastriate cortex. J Neurophysiol 57: 889920, 1987.
Gilaie-Dotan S, Ullman S, Kushnir T, Malach R. Shape-selective stereo processing in human object-related visual areas. Hum Brain Mapp 15: 6779, 2002.[CrossRef]