JN Fuel your research with LabChart
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Neurophysiol 95: 3712-3726, 2006. First published March 22, 2006; doi:10.1152/jn.00112.2006
0022-3077/06 $8.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Supplemental Figure
Right arrow All Versions of this Article:
95/6/3712    most recent
00112.2006v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (5)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Barthélemy, F. V.
Right arrow Articles by Masson, G. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Barthélemy, F. V.
Right arrow Articles by Masson, G. S.

Behavioral Receptive Field for Ocular Following in Humans: Dynamics of Spatial Summation and Center-Surround Interactions

Frédéric V. Barthélemy, Ivo Vanzetta and Guillaume S. Masson

Institut de Neurosciences Cognitives de la Méditerranée, Centre National de la Recherche Scientifique Unité Mixte de Recherche 6193, Aix-Marseille Université, Marseille, France

Submitted 1 February 2006; accepted in final form 7 March 2006


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Visual neurons integrate information over a finite part of the visual field with high selectivity. This classical receptive field is modulated by peripheral inputs that play a role in both neuronal response normalization and contextual modulations. However, the consequences of these properties for visuomotor transformations are yet incompletely understood. To explore those, we recorded short-latency ocular following responses in humans to large center-only and center-surround stimuli. We found that eye movements are triggered by a mechanism that integrates motion over a restricted portion of the visual field, the size of which depends on stimulus contrast and increases as a function of time after response onset. We also found evidence for a strong nonisodirectional center-surround organization, responsible for normalizing the central, driving input so that motor responses are set to their most linear contrast dynamics. Such response normalization is delayed about 20 ms relative to tracking onset, gradually builds up over time, and is partly tuned for surround orientation/direction. These results outline the spatiotemporal organization of a behavioral receptive field, which might reflect a linear integration among subpopulations of cortical visual motion detectors.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
The "receptive field" is a central notion in visual neuroscience: at the single-cell level, neurons integrate and respond to information over a restricted part of the visual field, with both high selectivity and sensitivity (Barlow 1953Go; Hartline 1938Go; Kuffler 1953Go). The receptive field’s functional properties can be characterized from the neuronal responses by a set of basic operators such as spatial summation and contrast-response functions (for a review see Albrecht et al. 2003Go; Carandini 2004Go). Furthermore, this (excitatory) receptive field is often modulated by visual inputs from its periphery (Allman et al. 1985Go). Such center-surround organization is involved in many fundamental tasks of visual processing such as input normalization (e.g., Bonin et al. 2005Go; Carandini et al. 1996Go; Heeger 1992Go; Schwartz and Simoncelli 2001Go), object motion segmentation (Allman et al. 1985Go; Nakayama and Loomis 1974Go), or contour detection (Field et al. 1993Go).

Several attempts have been made to measure the functional consequences of neuronal receptive field properties, such as for motion perception (e.g., Anderson and Burr 1991Go; Chubb et al. 1989Go; Tadin et al. 2003Go; Watson and Turano 1995Go; Xing and Heeger 2001Go) and, to a lesser extent, for visuomotor transformations such as those underlying ocular tracking behavior (Heinen and Watamaniuk 1998Go; Miles et al. 1986Go). Ocular following responses are reflexive eye movements that, in both humans and monkeys, are initiated at ultrashort latencies (about 85 and 55 ms, respectively) and exhibit many of the properties attributed to the earliest stages of motion detection and integration (for reviews, see Masson 2004Go; Miles 1998Go). We and others have shown in humans that the earliest phase of ocular following is driven by luminance-defined motion (Masson et al. 2002Go) with a very high contrast gain (Masson and Castet 2002Go; Sheliga et al. 2005Go). Moreover, in monkeys, ocular following is initiated in response to motion presented in the central (<30°) visual field but can be positively modulated by an antagonistic peripheral motion (Miles et al. 1986Go), suggesting that the large-scale center-surround organization of visual motion mechanisms might have a functional role for tracking eye movements (Miles 1998Go). In humans, however, this center-surround modulation has so far been difficult to demonstrate in similar experiments (Gellman et al. 1990Go; Heinen and Watamaniuk 1998Go). Nevertheless, both ocular following (Gellman et al. 1990Go) and slow phases of optokinetic nystagmus (Abadi et al. 2005Go; Howard and Ohmi 1984Go) appear to be more efficiently driven by motion in the center of the visual field.

Here we introduce the concept of a large-scale "behavioral receptive field" that describes how a population of direction-selective neurons is organized to drive ocular following responses. First, we proceed to define such a functional entity with the help of three fundamental operators commonly used to characterize a receptive field (Carandini 2004Go): 1) a spatial summation function, 2) its dependency on contrast for central stimuli, and 3) its modulation by peripheral stimuli. Next, we explore those operators’ temporal dynamics and, finally, we show that these behavioral receptive field’s properties mimic many of the properties of neuronal receptive fields found at the earliest stages of the cortical visual motion pathway. Although further research is clearly needed to bridge the gap between the single-neuron level and the behavioral output, the functional characterization of such a "behavioral receptive field" provides theoretical constraints on the various suggested models and mechanisms of how local motion signals are integrated from neuronal populations to extract an efficient motor drive.


    METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Subjects

Five subjects (two naïve and the three authors, mean age: 34 ± 6 yr) participated in the study. All subjects were free of neurological diseases and underwent eye examination before participating in the experiments. All subjects had normal or corrected-to-normal acuity. All experiments were in accordance with the principles of the Declaration of Helsinki and followed the CNRS guidelines for conducting experiments in humans.

Eye movements recording and visual stimuli

The techniques used were described previously (Masson and Castet 2002Go; Masson et al. 2000Go). Eye movements were recorded using the electromagnetic search-coil technique (Robinson 1963Go) using coils embedded in a Silastin scleral ring placed in one eye after application of one to two drops of oxybuprocaine (Collewijn et al. 1985Go). Data acquisition, on-line control of the behavior, and stimulus triggering were controlled by a PC using the REX software package with the real-time QNX operating system (Hays et al. 1982Go). Voltage signals separately encoding horizontal and vertical positions of the right eye were low-pass filtered (Bessel, six poles, DC, 180 Hz) and sampled at 1 kHz, with a resolution of 16 bits.

Subjects were seated in a fiberglass chair, with chin and headrests, and faced a large (70 x 70°) vertical screen at a viewing distance of 1 meter. Visual stimuli were back-projected using a high-resolution Barco 809s video-projector (resolution: 1,280 x 1,024 at 76 Hz, frame duration: 13.2 ms). Visual stimuli were precomputed movies, generated using the HIPS libraries (Landy et al. 1984Go) and stored into the memory of a SGI Fuel workstation. Visual workstation and experimental PC communicated throughout a serial port. Synchronization between the two computers was fully described elsewhere (Masson et al. 2000Go).

Center stimuli were always vertical sinusoidal luminance gratings drifting either rightward or leftward (spatial frequency: 0.27 cpd; temporal frequency: 10 Hz; speed: 37°/s). The motion directions of the central, driving, pattern were fully randomized across trials, together with the different conditions of one particular experiment (center sizes, center contrast, center-surround). All motion stimuli were presented within a circular aperture, whose diameter was varied in some experiments. Mean grating luminance was of 22.5 cd/m2. Display luminance levels were calibrated and linearized by means of a look-up table. For the different sizes of the center-surround patches, the ratio between the outer diameter of the surround and the center stimulus diameter was kept constant at about 2.4. To prevent edge effects, the center disk and the surround ring were always separated by a small gray annulus of mean luminance (width: 17 pixels, about 1°). Stimuli were presented over a gray background of the same mean luminance. All surround stimuli had the same spatial and temporal frequencies as those of the center stimulus. The counterphase gratings used as dynamic surrounds were generated by summing two identical gratings drifting in opposite directions.

Behavioral paradigm

The behavioral paradigm was extensively described previously (Gellman et al. 1990Go; Masson and Castet 2002Go; Masson et al. 2000Go; Miles et al. 1986Go). Trials started with a gray background of mean luminance (22.5 cd/m2) and a target spot produced by a light-emitting diode (LED) back-projected onto the screen, 10° to the right of the center. Subjects were required to fixate this spot for a time interval of random duration, after which the spot disappeared and a second spot appeared at the center of the screen. The subject had to make a saccadic eye movement to this new target, which was then switched off during the saccade. With gaze now directed at the center of the screen, and after a postsaccadic delay of 50 ms, the motion stimulus was presented for 220 ms (about 17 frames) before the screen was blanked, ending the trial. In the different experiments, all conditions were fully randomized and interleaved with "saccade-only trials" in which the subjects made the same saccade as above, but no motion stimulus was presented.

Data analysis

Experiments were organized into several daily recording sessions of nearly 1 h duration, to collect usually around 150–200 trials for each condition. The signal-to-noise ratio (SNR) necessary to adequately resolve the responses was then achieved through averaging. The data from all sessions were pooled and analyzed off-line as described previously (Masson et al. 2000Go). Because ocular following was triggered in the close temporal vicinity of a saccade, all data shown here have the saccade-only condition subtracted to eliminate effects arising from postsaccadic drift.

Quantitative analysis was performed by measuring, for each trial, the changes in vertical ({Delta}ev) and horizontal ({Delta}eh) eye position over several 20-ms time windows. To achieve a complete description of the behavioral tuning for a given parameter (such as total contrast, stimulus diameter), we fitted the data with well-established descriptive functions using the Marquardt–Lindberg algorithm (Matlab, The MathWorks, Natick, MA). The goodness of fits was estimated by computing a normalized {chi}N2 value (Cavanaugh et al. 2002Go). First, to estimate the contrast at half-maximum response (C50) and the slope (n) of the contrast-response functions of grating-driven tracking responses, we fitted the contrast-response function of the ocular following responses with a Naka–Rushton function (Albrecht and Hamilton 1982Go)

Formula
Here and in all subsequent formulae, C is contrast and R is the response (i.e., the change in position over a given 20-ms time window) to be fitted to the experimental data. Second, we assumed visual information to be integrated over the central, excitatory part of the visual field. This spatial summation function was expressed as an integral of a Gaussian error function (erf) (DeAngelis and Uka 2003Go; Pack et al. 2005Go)

Formula
and fitted to the responses R(d); d is the stimulus size, {omega} is the size of the excitatory part of the visual field (i.e., the width of the Gaussian function), R0 is the response amplitude offset, and Ae is the modulation amplitude. In some cases, largest stimulus radii resulted in a slight suppression of the amplitude of responses. To include such suppression, we also fitted these responses to a function of the integral of the difference between two Gaussians, with different amplitudes and widths for the excitatory and suppressive component

Formula
For convenience, in the text we refer to those two functions shortly as erf1 and erf2.

The surround suppression’s strength was expressed by defining a contrast normalization index (CNI), computed for each time window, using the following formula

Formula
where C50cs and C50c are the half-saturation contrast values obtained with and without surround, respectively, for a given time window. Positive values of the CNI indicate depression of the contrast gain function resulting from the surrounding stimulus, whereas negative values indicate enhancement.

The responses’ amplitude tuning with respect to the surround relative to center orientation ({theta}) was modeled using a Gaussian tuning function of width {sigma}, amplitude Ao, and offset Ro

Formula
(see Fig. 5B). To measure the strength of the suppression’s orientation tuning we computed a suppression index (SI) for both iso- (SIiso) and cross-oriented (SIcross) surrounds

Formula
where Ro and Ao are best-fit parameters of the response tuning function and Rc is the response amplitude for the center-only condition. In the absence of surround modulation (Ro = Rc; Ao + Ro = Rc), those indices are equal to zero, whereas they are equal to 1 when either the iso- or the cross-oriented surround totally suppresses the response (as Ro = 0 or Ao + Ro = 0, respectively). Finally, we computed an orientation modulation index (OMI) as Ao/(RcRo). When OMI is equal to zero, surround suppression is not tuned for orientation. As OMI increases (≤1), the surround suppression is increasingly tuned for orientation.


Figure 5
View larger version (32K):
[in this window]
[in a new window]
 
FIG. 5. Orientation tuning of surround inhibition. A: mean eye velocity profiles of ocular following responses to a 30% contrast center vertical grating, drifting rightward and presented alone (continuous lines) or together with a surround with an orientation difference of 0 (i.e., iso-oriented), 30, 60, or 90° (i.e., cross-oriented). Contrast of the surround grating was either 10% (left) or 80% (right). Naïve subject AM. B: mean changes in eye position over the earliest (95–105 ms, light shaded background) and latest (165–175 ms, dark shaded background) time windows, plotted as a function of the surround, relative to the center orientation. White, gray, and black circles are data points obtained with 10, 30, or 80% surround contrast. Horizontal dotted lines and gray squares plot the ocular following amplitudes obtained with a 30% contrast center alone. C: mean suppression index for iso-oriented (SIiso) or cross-oriented (SIcross) surround, as a function of surround contrast. D: widths ({sigma}) of the best-fit orientation tuning functions, plotted vs. surround contrast. Closed symbols are data obtained for each subject and motion direction. Open symbols are their average (±SD). E: orientation modulation index (OMI; see METHODS) illustrates the proportion of the orientation tuning in the total surround suppression, plotted vs. surround contrast. F: mean best-fit tuning functions for surround suppression, at 3 different surround contrasts (strength of suppression vs. orientation of surround relative to center stimulus).

 

    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Our goal was to define a limited set of descriptive, time-varying functions describing spatial integration of motion, by measuring the magnitude of the initial, open-loop phase of ocular following (Miles and Kawano 1987Go) at different points in time. We used sine-wave gratings of low spatial and temporal frequencies to probe spatial summation and center-surround interactions within one single spatiotemporal channel.

Spatial summation area of ocular following eye movements

First, we accurately measured the spatial summation area of the motion mechanism driving ocular following responses to a single drifting grating. Figure 1A plots, for one subject, the horizontal eye velocity profiles of tracking responses to a rightward drifting grating presented behind a circular aperture of various sizes (diameters: 3.6–43.2°). As diameter increased, the initial eye velocity rose very rapidly. Latencies increased only weakly if at all for decreasing stimulus sizes. We estimated the latency using an objective method (Krauzlis and Miles 1996Go) from the mean velocity profiles. Across subjects and grating motion directions, we found that mean (±SD) response latencies were of 97 ± 11 and 90 ± 8 ms for stimulus diameters of 3.6 and 7.2°, respectively. For stimuli >7° diameter, the response latencies remained largely constant at about 85 ms after stimulus motion onset (i.e., mean ± SD, 81 ± 4 ms for a stimulus diameter of 43.2°). Thus a small (nearly 15 ms) but significant increase in latency was observed only with the smallest aperture size [3.6 vs. 43.2° diameters, t(14) = 3.14, P < 0.01; 7.2 vs. 43.2°, t(14) = 1.8, NS]. The relationship between stimulus size and amplitude of ocular responses showed a complex temporal dynamics. For the earliest part of the response (i.e., <120 ms after stimulus onset), the initial eye velocity first increased quasi-linearly up to nearly 20°, and then saturated for larger grating diameters (Fig. 1A). For the later part of the responses (>120 ms after stimulus onset) the saturation observed with the larger stimuli disappeared, so that eye velocity increased with stimulus sizes >20°, albeit with a lower rate.


Figure 1
View larger version (22K):
[in this window]
[in a new window]
 
FIG. 1. Spatial summation area for ocular following. A: eye velocity profiles of responses to a low spatial frequency vertical grating, drifting rightward behind apertures of various diameters. B: mean (±SE) change in eye position over the 95- to 115-ms time window (shaded area in A), as a function of stimulus diameter. Continuous line shows the best fit of the response amplitudes, obtained with an integral of a Gaussian spatial summation function (erf1) (subject GM). For comparison, the broken line shows the best fit obtained with the difference of 2 integral of a Gaussian functions (erf2). C: mean (±SD) across subjects and directions best-fit parameter ({omega}) coding for the width of the excitatory spatial summation area (see METHODS for details) is plotted vs. 4 different 20-ms time windows to illustrate how temporal integration led to an increase of the spatial summation area over time. Inset: earliest (gray curve) and latest (black curve) Gaussian functions yielding the best-fit erf1 functions (response amplitude vs. distance in the visual space).

 
The dependency of the ocular following responses on stimulus diameter at early and late times can be better seen in Fig. 1B. The data were well fitted (solid line) by a spatial summation modeled with an integral of a Gaussian function (erf1; see METHODS). However, because in one subject (GM) the initial responses displayed a small hypersaturation for large stimulus values, we also tested whether the data might not be better fitted by assuming the presence of a weak peripheral suppression by the isodirectional surround stimulus, modeled by a difference between two integral of a Gaussian functions (erf2; see METHODS). We found that the goodness of the fit was very similar to that obtained with an integral of one Gaussian model alone (i.e., {chi}N2 were similar: 0.21 and 0.29, respectively). The optimal size (i.e., the width of the excitatory erf function) was similar for the two fits. Suppression at larger diameters was significant but weak (Subject GM, mean ± SE across motion directions: 12 ± 5.2%) and the width of the inhibitory erf function was very large. For the other subjects (as well as for GM at later times), we found only very little evidence for peripheral suppression with large stimuli. Altogether, the erf2 model did not significantly improve description of the data. Although we cannot completely rule out a small isodirectional surround suppression at very early times, our data rather suggest that, at least for the low spatial frequency gratings used (0.28 cpd), the oculomotor driving motion signal is read out by a spatial summation function that integrates isodirectional motion signals over a restricted area of the visual field. Furthermore, this central integration area grows with time.

To quantify this temporal dynamics, the extension of the excitatory spatial summation area driving ocular following was estimated by the width parameter ({omega}) of the best-fit erf1 functions, for four different time windows (Fig. 1C). With a 0.28-cpd grating presented at full contrast, best-fit {omega} for the earliest time window (95–115 ms) ranged from 9.6 to 18.1° across subjects and directions, with a mean ± SD of 12.2 ± 3.2°. As can be seen, {omega} significantly increased over time, up to a mean value of 28.5 ± 9.2° for the latest time window [95–115 vs. 155–175: t-test, t(10) = 4.09, P < 0.01].

Effect of contrast on the optimal spatial summation area

It was previously shown at the single-neuron level that the spatial summation area of macaque V1 and MT neurons changes at very low contrast (Cavanaugh et al. 2002Go; Kapadia et al. 1999; Pack et al. 2005Go; Sceniak et al. 1999Go). These neuronal effects should lead to a change in ocular following responses in primates. Therefore we investigated the effects of stimulus contrast on the ocular spatial summation function. To do this, we presented the same set of grating diameters as above, but at three different contrast levels (5, 20, and 80%), selected to cover the complete contrast dynamics of ocular following (Masson and Castet 2002Go).

Figure 2A illustrates, for each subject, the relationships between response amplitude and stimulus diameter, observed at different contrasts. At very low contrast, all curves became more sluggish, showing only very little saturation for at least two subjects, suggesting a larger spatial summation area compared with that underlying the processing of medium and high contrast stimuli, which displayed response functions that were much steeper and saturated for stimulus diameters of about 20°. As before, an erf1 function was fitted to each data set and the best-fit width parameter {omega} is plotted against contrast in Fig. 2B. Increasing the contrast from 5 to 80% resulted in a significant decrease in the mean {omega} from 21.6 ± 7.5 to 12.1 ± 3.3° [t-test, t(10) = 2.98, P < 0.01], indicating that the highest-contrast gratings were sampled with spatial summation areas of about half the size of those with which the lowest-contrast ones were sampled. This reduction is further illustrated in the inset of Fig. 2B, where we plot the profiles of the mean, best-fit Gaussian envelopes. Overall, these results suggest that increasing the contrast both enhanced the response amplitude and reduced the width of the spatial integration area.


Figure 2
View larger version (29K):
[in this window]
[in a new window]
 
FIG. 2. Effect of contrast on the spatial summation area. A: for 3 subjects, the mean change in eye position (95- to 115-ms time window) is plotted as a function of center diameter for a 5% (black symbols), 20% (gray symbols), and 80% (white symbols) center contrast. Continuous lines are best fits of an integral of a Gaussian (erf1) spatial summation model. B: relationship between contrast and the best-fit estimate of the width ({omega}) of the spatial summation function. Closed symbols are data obtained for each subject and direction. Open circles plot their mean (±SE). Inset: mean best-fit Gaussian functions, as response amplitude vs. distance in the visual space, for each different contrast level.

 
Center-surround interactions for ocular tracking: contrast normalization

Our next goal was to probe the oculomotor effects of center-surround interactions for nonisodirectional motion signals. We first found that a static peripheral grating had no effect on the tracking responses, showing that center-surround interactions occur only between two dynamic stimuli. Next, we used moving peripheral stimuli. However, recording responses to stimuli made of different motion directions presented simultaneously is inadequate because each visual motion would drive antagonistic (or synergistic depending on their relative directions) eye movements (see Supplemental Data1), hampering our objective to measure the behavioral consequences of center-surround modulations at the level of visual processing, rather than at the motor output level. To eliminate such motor interactions, we thus used a dynamic surround constructed by summing two gratings of identical orientation and spatial frequency but drifting in opposite directions at the same temporal frequency (speed). Such summation generates a "counterphase"-modulated grating that contains no net motion signal but drives neuronal mechanisms tuned for each of the two opposite motions (Qian and Andersen 1995Go). Indeed, when the surround was presented alone, no or only very small ocular following was observed, which was not consistent among subjects and thus most likely resulted from a small asymmetry in the individual sensitivity to the two antagonistic motion signals (Fig. 3, C and D, gray symbols). In other words, the motor effects of the two opposite peripheral motion signals cancel out. Figure 3 shows, for one subject, the dependency of ocular following on the contrast of a central grating presented behind a circular aperture of optimal diameter (20°) with or without such a counterphase grating in the surround. In Fig. 3A, we plot the mean velocity profiles of responses to a center grating presented alone.


Figure 3
View larger version (29K):
[in this window]
[in a new window]
 
FIG. 3. Center-surround interactions for early and late ocular following. A: eye velocity profiles of ocular following responses to a vertical grating drifting leftward behind a 20° diameter aperture and presented at different contrasts. B: same center grating presented with a counterphase surround grating of same orientation. Light and dark gray shaded bars indicate the early (95- to 115-ms) and late (155- to 175-ms) time windows used for measurements. C: early (95- to 115-ms time window) change in eye position plotted vs. the center contrast, for center-only (open symbols) and center-surround (closed symbols) conditions. D: same as C, but for late time window (155–175 ms). Continuous lines are best fits of the Naka–Rushton functions. Gray symbol in C and D plots the response to the surround-only condition, showing that no significant ocular following response was observed even at high contrast.

 
The initial velocity of ocular following increased with grating contrast, up to a maximum reached for contrast of roughly 30–40%. Figure 3B illustrates the mean eye velocity profiles for the same sample of center contrast but now presented together with the high-contrast (80%) surround stimulus. Comparison between Fig. 3A and Fig. 3B shows that the earliest phase of ocular following responses was largely unaffected by a peripheral counterphase grating of same orientation: latencies remained largely unchanged at about 85 ms, particularly at high contrast, and initial eye velocity was only weakly affected. On the contrary, the later part of the tracking response was strongly suppressed by the peripheral grating. In particular, the maximum eye velocity reached at the time of the closing of the oculomotor loop (about 170–180 ms) was largely reduced.

These results are seen more clearly in Fig. 3, C and D, where open and closed symbols plot the mean changes in eye position for center-alone and center-surround conditions, respectively. They were measured over two different time windows: one early (95–115 ms), centered at response initiation, and one late (155–175 ms), set immediately before the closing of the oculomotor loop. Ocular contrast-response functions were fitted with a Naka–Rushton function, best describing the contrast dynamics of ocular following (Masson and Castet 2002Go; Sheliga et al. 2005Go). Again, the figure shows that the earliest contrast-response functions were very similar for the two conditions (Fig. 3C), whereas the two response functions largely differed for the late phase of ocular following (Fig. 3D). Similar results were obtained for all four subjects and for the two center grating motion directions.

Time course of contrast dynamics

Next, we compared early and late responses, separately for both conditions. Without surround, a clear shift toward low contrast values was observed in the late contrast-response function with respect to the early one. In the presence of a dynamic surround, on the other hand, this shift was much smaller, the early and late contrast-responses functions being very similar, as can be seen by comparing Fig. 3C and Fig. 3D. To ease the comparison, in Fig. 4A we plot the half-saturation contrast (C50, obtained from best-fit Naka–Rushton functions) for the center-only versus the center-surround condition, for each subject and motion direction.


Figure 4
View larger version (13K):
[in this window]
[in a new window]
 
FIG. 4. Early and late change in half-saturation contrast: effect of surround. A: for each subject and direction, best fit C50 parameters obtained with either a center-alone or with a center-surround stimulus are plotted one vs. the other. Open and closed symbols represent the estimates for early and late time windows, respectively. Squares are the mean value across subjects and directions. B: mean (±SD) contrast normalization index (CNI) is plotted vs. time to illustrate the temporal dynamics of surround suppression.

 
Open symbols are data obtained with the earliest time window: they were clustered only slightly above the identity line (first from bottom), indicating that, at early times, C50 values with and without surround were very similar. Filled symbols are data obtained with the late time window. Squares plot mean values across subjects and directions. The leftward shift of the open versus the filled symbols reflects the C50 decrease observed at late time with center-only gratings (Fig. 3D), indicating an increased contrast gain. On the other hand, we observed no decrease of the late C50 obtained with a center-surround stimulus: rather, the data were slightly shifted in the upward direction, indicating, if anything, a decrease in contrast gain that, however, was found not to be significant (see following text). We quantified these changes by comparing C50 values across subjects. We first compared these values across time windows, within each condition. With a center grating alone, the mean (±SD) C50 values were of 10.2 ± 3.4 and 5.4 ± 1.2% for the early and late time windows, respectively, yielding a significant decrease over time [t(14) = 3.78, P < 0.01]. With an iso-oriented surround, mean (±SD) early versus late C50 values were not significantly different [15.6 ± 6.4 and 23.14 ± 11.8%, respectively; t(14) = 1.58, P = 0.13]. Second, we compared C50 across center-only and center-surround conditions, for each time window. For the earliest time window, values of C50 were not significantly different between the two conditions [t(14) = 2.08, P > 0.05], whereas in the later time window the presence of a dynamic surround significantly increased C50 [t(14) = 4.21, P < 0.01]. The change in contrast gain was not only the result of a change in half-saturation contrast (C50). Indeed, the presence of a peripheral surround also significantly reduced the slope (n) of the contrast-response function from 1.96 ± 0.36 to 1.47 ± 0.33 [mean across four subjects and two directions, t(14) = 2.84, P < 0.05], whereas the peak of the response (Rmax) was not significantly affected [mean across subjects and directions, 0.026 ± 0.017 and 0.022 ± 0.015, t(14) = 0.62, NS].

To illustrate the temporal dynamics of the surround suppression, we computed a contrast normalization index (CNI) for each time window. The mean (±SD) index is plotted against time in Fig. 4B, showing that no significant effects of the surround were seen before about 110 ms. Indeed, with the 95- to 115-ms time windows, the mean CNI was only of 0.15 ± 0.11. However, this index rapidly increased to reach about 0.6 at roughly 140 ms after motion onset. At the end of the open-loop period, the CNI had a value of 0.61 ± 0.1: at this point in time, the surround had suppressed the center C50 by nearly 60%. Taken together, these results indicate that the iso-oriented dynamic surround prevented the temporal evolution of the contrast dynamics of ocular following, clamping it to its earliest, quasi-linear range.

Late surround suppression: orientation selectivity

In the previous experiment, contrast and orientation of the surround were fixed. To investigate whether surround suppression is tuned for orientation and whether it scales with contrast, we kept both contrast (30%) and orientation (vertical) of the center grating constant, while we varied both contrast and orientation of the peripheral counterphase grating. Figure 5A illustrates, for one naïve subject, the mean eye velocity profiles of the ocular responses to a center grating presented alone (continuous lines) or together with a surround of 0, 30, 60, or 90° orientation difference with respect to the center orientation (broken lines). Left- and right-hand plots illustrate the effects of either low- or high-contrast surrounds, respectively. With low-contrast surrounds, suppression by the peripheral stimulus was nearly absent or very weak. On the contrary, high-contrast surrounds produced a strong, albeit delayed suppression. Moreover, suppression developed over time. As in the previous experiment, the first 20–30 ms of the responses were indistinguishable from the center-alone condition, but at about 110 ms after stimulus onset (vertical arrow, Fig. 5A, right), a strong suppression began to emerge. Furthermore, the surround grating induced an overall reduction in eye velocity that became gradually and partly tuned for surround orientation (Fig. 5B): at the end of the open-loop period, ocular responses were clearly more suppressed by an iso-oriented (0°) than by a cross-oriented (90°) surround. Similar results to those shown in Fig. 5 were found in three other subjects (the authors).

To illustrate these four different aspects of modulation by the surround (i.e., global and orientation-tuned suppression, contrast dependency, and temporal dynamics), we computed the mean (±SE across trials) change in horizontal eye position over 20-ms time windows, covering the whole open-loop period from 95 to 175 ms after stimulus onset. Figure 5B plots, for the same naïve subject and for several contrast values, the mean change in eye position as a function of surround orientation, for the earliest time bin where a clear response could be seen (95–115 ms), as well as for the latest time bin (155–175 ms) before the closing of the oculomotor loop (i.e., 170 ms after stimulus onset, twice the response latency of about 85 ms). Continuous lines are Gaussian functions that were fitted to the data to evaluate both amplitude and width of the orientation tuning of surround suppression. As a comparison, broken horizontal lines depict the changes in position obtained without surround. For the early time window, no surround suppression could be seen for any of the surround contrast values (Fig. 5B, left). For the late time window, strong surround suppression could be detected, which increased with contrast (Fig. 5B, right). Both global suppression and its orientation specificity increased with surround contrast, as can be seen by comparing the various curves in Fig. 5B (right) to the horizontal dotted line marking the response to the center-only condition. To quantify this result, we computed a suppression index, separately for each one of these two conditions. Figure 5C plots their mean values (±SD across subjects and directions) against surround contrast. For both iso- and cross-oriented surround gratings, the suppression indices increased with contrast from 0.07 ± 0.04 to 0.49 ± 0.12 and from 0.02 ± 0.01 to 0.33 ± 0.09, respectively. At low and medium contrast, the suppression indices of the two conditions were not significantly different. At high surround contrast, however, iso-oriented suppression was roughly 30% stronger than cross-oriented suppression [t(9) = 2.61, P < 0.05].

Increasing the surround contrast thus resulted in a stronger surround orientation tuning of the response amplitude which was best described by a Gaussian function peaking at 90° relative orientation. More specifically, we found that the modulation amplitude increased with surround contrast, whereas the width of the tuning function remained fairly constant. This can be seen in Fig. 5D, where we plot the individual best-fit orientation widths (closed symbols) as well as the mean (±SD) across subjects and directions, as a function of surround contrast: Increasing the contrast from 10 to 80% produced a small but nonsignificant widening of the orientation tuning, the mean {sigma} of the tuning function increasing only marginally from 22.8 ± 4.5 to 26.8 ± 6.9°. However, the modulation amplitude Ao increased regularly—and significantly—with contrast, from 0.09 ± 0.05 to 0.25 ± 0.1° [t(10) = 3.5 P < 0.01; Fig. 5E]. To further illustrate the orientation tuning of surround suppression, we plotted (Fig. 5F) the mean Gaussian tuning functions drawn from the data illustrated in Fig. 5, D and E.

Finally, Fig. 6A illustrates the temporal dynamics of surround suppression and its orientation tuning by plotting best-fit tuning functions computed every 10 ms, for a rightward motion, showing that both orientation-tuned and global suppression gradually developed over time. Figure 6B plots the mean OMI (±SE across subjects and directions) against time, showing quantitatively that orientation-tuned suppression started only about 100 ms after stimulus onset, regularly increasing thereafter over the whole open-loop period: At the end of the open-loop period, about 30% of the surround suppression can be attributed to a mechanism tuned for orientation of the peripheral grating (Fig. 6B).


Figure 6
View larger version (27K):
[in this window]
[in a new window]
 
FIG. 6. Temporal dynamics of orientation-tuned surround suppression. A: best-fit tuning functions computed from changes in eye position over 10-ms time windows ranging from 75 to 175 ms after stimulus onset (subject GM). Amplitude of ocular following responses obtained with a center alone is illustrated for the last time window by the straight line. B: temporal dynamics of the mean (±SE, across subjects and motion directions) OMI of surround suppression for an 80% surround contrast.

 
Surround suppression: effect of spatial scale

The results described above show a strong suppression of the center-driven responses by iso-oriented peripheral stimuli. Moreover, this suppression appears only about 20 ms after response onset and develops over time. Such suppression can be described as a mainly divisive change in the contrast gain, similar to the center-surround effects found in single neurons at the early stages of a visual motion stream (see Carandini 2004Go). However, the very large center stimuli (20°) used here are most likely to cover the receptive fields of a large sample of neurons located at different eccentricities and having different spatial summation functions. Moreover, at least in the macaque, ocular following responses depend strongly on the neuronal activity at the level of areas MT/MST (see Kawano 1999Go), where most neurons have receptive field sizes that are much smaller than the size of our stimulus. Using large stimuli, we might thus have driven only a small fraction of those neuronal center-surround interactions because neurons with a counterdirectional center-surround suppression mechanism would be suppressed only if located at or near the edge of our center stimuli, eventually leading to an underestimation of their impact on the ocular tracking responses. Alternatively, it is possible that the detected tracking suppression does not result from center-surround suppression at the level of each individual unit, but rather results from inhibitory interactions between two subpopulations of neurons located in the central and the peripheral part of the visual field, respectively.

Following this reasoning, we decided to test the spatial scale of surround suppression for ocular following and designed two control experiments. In a first test, we measured the contrast-response functions of eye responses to a small central patch (10° diameter, i.e., smaller than the optimal size) presented either alone or together with an iso-oriented, counterphase grating in the surround. As a second test, we presented a set of nine small patches (each 6° in diameter) forming an array covering the same visual surface as the large stimuli used above. Each patch was presented either as center-alone or together with an iso-oriented surround. As for the case of the large stimulus used before, to measure center-surround interactions we varied the contrast of the central patch while keeping the contrast of the surround constant at 80%.

Figure 7, A and B illustrates the results obtained with an array of small patches and with a single, midsize (center diameter, 10°) patch, respectively. To maintain a sufficient number of grating cycles (>2.5) inside each aperture, we here used a spatial frequency (0.45 cpd) different from that in the preceding experiments. To preserve the same temporal frequency (10 Hz), drifting grating speed was set to about 22°/s. Clearly, because we used both suboptimal stimulus sizes and spatiotemporal properties, ocular following responses were very small and somewhat delayed (about 5–10 ms). The five 20-ms time windows were adjusted accordingly, starting at 85 ms after stimulus onset. Open and closed symbols plot the amplitude of ocular following responses over three of these time windows, two early (85–105 and 105–125 ms, left) and one late (165–185 ms, right) for center-alone and center-surround conditions, respectively. Continuous lines are best-fit Naka–Rushton functions. As found with a single, large (20° diameter) patch, the effect of the added surround developed in time. Such suppressive effect resulted in a rightward shift of the contrast-response function, yielding a large and significant increase in the best-fit C50 parameters with respect to the center-only condition, for the later phase of the responses (time window: 165–185 ms). On the contrary, during the earliest phase (time-window: 85–105 ms) of ocular following the responses were only weakly, if at all, affected by the presence of an iso-oriented surround. A closer examination of the different contrast-response functions reveals that, as observed for the large stimuli used above, ocular contrast-response functions for the center-only condition gradually shifted to lower contrast values over time, whereas those obtained in the presence of a surround remained remarkably constant. Such dynamics was very similar for both types of motion stimuli (single and multiple patches). Finally, in both experiments, the maximum amplitudes of the ocular responses were substantially reduced by the presence of a surround, a result that is somewhat different from that reported above for a large stimulus. This reduction in the ocular response gain might be attributable either to the slightly different spatial frequency and/or speed used or to stronger surround suppression. These two hypotheses remain to be further investigated.


Figure 7
View larger version (33K):
[in this window]
[in a new window]
 
FIG. 7. Contrast normalization at different spatial scales. A: contrast-response functions at 3 different time windows for ocular following responses to a micropattern made of 9 small (center diameter: 6°) patches of a leftward-drifting grating. Each circular patch was presented either alone or with a peripheral counterphase grating of same orientation. Open and closed symbols illustrate the amplitude of responses to center-only and center-surround stimuli, respectively. Continuous lines are best-fit Naka–Rushton functions. Gray symbols are the ocular following responses to the surround-only condition. B: same plots but for ocular following responses to one single, small (center diameter: 10°) patch, presented either alone or with a counterphase grating of same orientation in the periphery.

 
The above effects are summarized in Fig. 8, A and B, where we plot the half-saturation contrast values measured with center-alone and center-surround conditions one against the other, for three subjects and each grating motion direction. Note that, because of the small stimulus sizes used here, the early responses were very small (see first column of Fig. 7), yielding only noisy estimates of change in eye position over the first 20 ms and thus poorer fits for this time window. For this reason, in Fig. 8 we plotted the data obtained from three different time windows. As expected, the C50 values for the center stimulus decreased between the first (85–105 ms) and the second (105–125 ms), as well as between the second and the third (165–185 ms) time window (arrows in Fig. 8, A and B). Moreover, C50 for the center-surround condition remained roughly constant for the micropattern (Fig. 8B). For the single patch (the smallest stimulus in terms of overall surface), on the other hand, C50 decreased between the first and the second time window even for the center-surround condition (first arrow segment, Fig. 8A), as opposed to the data obtained with a large stimulus (Fig. 4A). This apparent discrepancy can be ascribed to a poor fit of the Naka–Rushton functions arising from the low SNR of the measurement at the earliest times. Indeed, between the second and the latest time window, where the quality of the fit was much better, the C50 values for the center-surround stimulus again increased (second arrow segment in Fig. 8A), as found with the large-size stimulus used above (Fig. 4A).


Figure 8
View larger version (27K):
[in this window]
[in a new window]
 
FIG. 8. Dynamics of contrast normalization at different spatial scales. A: best-fit half-saturation contrast values (C50) for center-alone and center-surround conditions are plotted one vs. the other for the earliest, middle, and latest time windows (white, gray, and black symbols, respectively). Circles are individual data (each subject and each grating motion direction). Squares are mean data, across subjects. Vertical shift of squares was statistically not significant, as opposed to their horizontal shift (see text). B: same plot but for a stimulus of multiple small patches. Note how, over time, center-alone C50 drastically shifted to lower values, whereas center-surround C50 values remained nearly constant (arrow). C: mean (±SE) contrast normalization index (CNI) across subjects, plotted vs. time. Single-patch and multipatch conditions are indicated by white and black symbols, respectively. Data from Fig. 4 are replotted with gray symbols for ease of comparison.

 
With a small (10° diameter) central grating (Fig. 8A), the earliest C50 points (open circle) were scattered around a mean of 19.1 ± 3.5% (center-alone) and 32.8 ± 4.5% (center-surround) (open square). In the next time window (gray circles), the mean C50 point was found at 6.7 ± 1.7 and 11 ± 3.5%, indicating a small (roughly 4%) albeit significant [t-test, t(10) = 2.7, P = 0.02] difference in mean C50 values. At the end of the open-loop period, data points were scattered around mean values of 4.4 ± 0.8% (center-alone) and 18.1 ± 4.1% (center-surround), indicating a threefold difference in half-saturation contrast [t-test, t(10) = 8.03, P < 0.0001]. Mean C50 values obtained with the center-only condition at the earliest time window and the center-surround condition at the latest time window were not significantly different [19.1 ± 3.5 and 18.1 ± 4.1%, respectively, t-test, t(10) = 0.8, P > 0.4]. With the micropattern (Fig. 8B), the earliest C50 data points were scattered around mean values of 28.9 ± 3.5% (center-alone conditions) and 20.2 ± 4.8% (center-surround conditions). In the 105- to 125-ms time window, the data were scattered around a mean value of 8.8 ± 2.4 and 21.9 ± 9.2%. Within 100 ms of tracking, data points moved purely along the horizontal axis, scattering around 4.5 ± 0.5 and 25.2 ± 2.2%, which is a fivefold increase in mean C50 [t-test, t(10) = 22.5, P < 0.0001]. Moreover, C50 values obtained for the center-only condition in the earliest time window and for the center-surround condition with the last time window were not significantly different [28.9 ± 3.5 and 25.2 ± 2.2%, respectively, t-test, t(10) = 2.19, P > 0.05]. This result demonstrates that an iso-oriented surround clamps the contrast gain parameter to its initial value, as opposed to the case of center-alone conditions, where C50 dramatically decreased over time.

Finally, Fig. 8C plots the CNI for the two control conditions (open symbols: small single patch; closed symbols: multiple small patches) together with the single large center grating condition, replotted from Fig. 4B. It is clear that, for all conditions, the surround suppression grew over time with very similar dynamics. It reached a maximum of about 50–60% after 100 ms. Mean (±SE) maximum CNI values were of 56 ± 6 and 69 ± 3% for single and multiple small patches conditions, respectively [for comparison, with a large (20° diameter) central patch, we observed a mean CNI of 61 ± 10%; see above]. Notice that for the case of the micropattern stimulus, the CNI during the earliest time window was negative (–20 ± 0.13), indicating that, for some subjects, C50 values were actually smaller in the presence of a surround, meaning that somewhat larger ocular following responses were observed at very low center contrast in the presence of a surround. We attribute this phenomenon to a slight contamination of the responses by one of the two opposite motion direction components of the peripheral stimulus, as shown by the small but significant transient tracking response observed with the surround-stimulus alone (the gray symbol in Fig. 7A, left) for the earliest time window. Whereas in the case of large responses to the center stimulus (see Figs. 3C and 4B) this surround-only response is negligible, its effect becomes relevant (Fig. 8C) when the center responses are small (Fig. 7A).

Altogether, very similar results were obtained for center-surround stimuli of different sizes and for different spatial arrangements, suggesting that the observed center-surround effects result from neuronal interactions at the population level, rather than from center-surround properties of single neurons.


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
By probing spatial summation and center-surround interactions underlying the rapid initiation of tracking eye movements, we have mapped several key properties of motion integration for ocular following. These properties can be described with a small set of operators that summarize both linear and nonlinear mechanisms and define a behavioral receptive field (BRF), in a way reminiscent of the neuronal and perceptual receptive fields used to synthesize an input–output transfer function at a given level of brain organization. We suggest thereafter that properties of the BRF emerge from the activity of large populations of neurons that are cascaded at different stages of the cortical motion pathway. Moreover, several functional consequences for ocular tracking can be predicted from the properties of this BRF. Finally, such definition of a BRF should contribute to a better understanding of how fundamental properties of motion-selective neurons are readout at the population level in the context of a simple action system (Priebe and Lisberger 2004Go; Takemura et al. 2001Go).

Behavioral receptive field: spatial properties

The earliest phase of ocular tracking is driven by a mechanism that integrates center motion signals in a quasi-linear fashion, up to an optimal stimulus diameter of about 20° for high-contrast, low spatial frequency stimuli. This optimal size defines the driving center of the BRF. Similar values have been found using low spatial frequency random-dot patterns in monkeys (Miles et al. 1986Go) and humans (Gellman et al. 1990Go). Altogether, these results demonstrate that ocular following responses are not driven by the en masse motion of the visual field but rather by a large central subportion of it (see Miles 1998Go; Miles and Kawano 1987Go). Within the BRF’s driving center, motion signals are linearly integrated, as shown by the linear relationship found between response amplitude and stimulus size in the 0–20° diameter range. Such linear integration is consistent with our previous finding that ocular following is initiated by a linear combination (i.e., vector sum/average) of the different motion direction signals present in the central visual field (Masson 2004Go; Masson and Castet 2002Go; Masson et al. 2001Go).

The BRF, however, also exhibits several nonlinearities. First, the saturation of the initial tracking responses found for stimuli >20° suggests that, at short latencies (<100 ms), neurons in the peripheral visual field are given little weight in the integration of motion signals. Second, the only marginal or absent decrease of the initial response amplitudes for stimuli exceeding the optimal size (20°) suggests that suppression by peripheral stimuli of the same motion direction as that of the center is only very weak, if not absent, with low spatial frequency inputs. Third, the suppressive effects observed in the presence of a dynamic surround unravel the suppressive effect of nonisodirectional peripheral motion signals. Indeed, the experiments performed with supraoptimal size center stimuli show that peripheral motion isodirectional with the center have no or only small effects; the suppression observed with counterphase peripheral gratings can thus be attributed only to the peripheral motion component of direction opposite to that of the center. Finally, showing that surround-only stimuli did not trigger ocular following, we ruled out any possible interactions between motor responses. Such visual peripheral suppression acts mainly by dividing the contrast gain of ocular tracking and mimics many of the properties of the divisive process postulated to explain contrast normalization at different stages of the motion pathways: a rightward shift in the contrast-response function (Figs. 3 and 7), a suppression scaled with surround contrast (Fig. 5) and partially tuned for orientation, peaking with iso-oriented surround gratings (Fig. 5) (see Heeger 1993Go; Schwartz and Simoncelli 2001Go; Simoncelli and Heeger 1998Go). Finally, the last experiments demonstrate that contrast normalization for ocular tracking is not restricted to one particular, optimal spatial scale or geometry. Instead, reducing the size of the center-surround stimulus below the optimal center size or presenting several center-surround patches inside the BRF produced a similar contrast gain change together with a stronger reduction in response amplitudes. This indicates that interactions between visual signals occur at several spatial scales within the BRF.

Because we used counterphase gratings as peripheral stimuli, we could not directly distinguish between directional and orientation tuning of the surround suppression. However, oriented static surrounds have no modulatory effects and, most important, in the case of iso-oriented surrounds, suppression resulted from gratings moving in the opposite rather than in the same direction, strongly suggesting that this selective component is actually tuned for motion direction and that suppression results from interactions between different subpopulations of direction-selective neurons. Moreover, because orientation/direction tuning explains only a part of surround suppression, our results show that contrast normalization also contains a global suppressive component in addition to the tuned one. All these nonlinear properties have already been demonstrated for motion processing at both neuronal (e.g., Cavanaugh et al. 2002Go; Heuer and Britten 2002Go; also see Carandini 2004Go) and perceptual (e.g., Chubb et al. 1989Go; Petrov et al. 2005Go; Solomon et al. 1993Go; Xing and Heeger 2001Go) levels. To our knowledge, however, the results presented herein are the first demonstration of the functional consequences of these early visual nonlinearities on motor behavior.

It shall be noticed, however, that the peripheral suppression reported herein is largely different from the peripheral modulation reported earlier by Miles and co-workers for ocular following (monkeys: Miles et al. 1986Go; humans: Gellman et al. 1990Go). With random-dot patterns, they observed a facilitation of center-driven responses with an antagonistic peripheral motion, an effect called antiphase enhancement. Several differences in the experimental conditions can explain the apparent contradiction between the two studies. More important, in both humans and monkeys, antiphase enhancement was observed only with very large center motion (>40°), that is, for center stimuli much larger than the optimal sizes observed herein with low spatial frequency gratings. Second, this antiphase enhancement was found only very late in the response, roughly after the closing of the oculomotor loop. During the initial part of the responses, a weak suppressive influence was observed that can be attributed to a cancellation between two antagonistic responses (see also supplemental data). In fact, Masson et al. (2001)Go observed that when motions in opposite directions are presented in the central part of the visual field, no net ocular following responses were seen, suggesting that each motion drives a tracking response in its own direction. Further investigation of the differences and similarities between ocular following cancellation and surround suppression is clearly needed to decipher the visuomotor and visual interactions driven by complex motion stimuli.

Behavioral receptive field: temporal properties

We propose that the BRF includes a weighted linear integration and divisive contrast normalization through center-surround interactions. We probed their temporal dynamics by measuring these operators at different points in time over the nearly 90-ms open-loop period. Temporal motion integration for ocular tracking has two different signatures. First, in the absence of a suppressive surround, there is both a progressive expansion of the BRF’s center (Fig. 1) and a shift of the ocular contrast-response function toward lower contrast values (i.e., an increase in the contrast gain; Fig. 3). These changes are gradual but seem to saturate at the end of the open-loop period. A dynamic surround prevents the latter temporal evolution by both clamping the contrast gain to its earliest range and decreasing the response gain. Such suppressive interaction gradually develops over time so that about 90 ms after response onset the contrast gain decreased by nearly 60% with respect to its value observed in the center-only condition. At this time, the maximum eye velocity obtained with the highest contrast is also lowered, indicating an overall reduction of the ocular tracking response.

There is a second striking aspect of the BRF’s temporal properties. Suppression by surround motion of direction different from that of the center starts only about 20 ms after response onset, suggesting that surround signals have an effect on the center-driven responses only after a fixed delay. We previously showed that linear integration of local grating motion information is done at ultrashort latency (about 80–85 ms) (Masson and Castet 2002Go). Moreover, inverted ocular following responses to reversed-phi motion have the same latency as that of normal responses (Masson et al. 2002Go). Ocular contrast gain is also set immediately at response onset (Masson and Castet 2002Go; Sheliga et al. 2005Go), albeit its value changes over time. On the contrary, nonlinear motion computations are delayed by about 20 ms, as shown by response timing to pattern motion direction when using plaids (Masson et al. 2002Go). The similarity between the timing found here for center-surround interactions and that observed for one-dimensional/two-dimensional (1D–2D) motion integration using plaids (Masson and Castet 2002Go) is of particular interest in the context of recent modeling work, which has demonstrated that both nonlinear contrast gain control and inhibitory inputs are necessary to reconstruct pattern motion direction (Rust et al. 2005Go), leading to the speculation that the temporal dynamics of 2D motion integration might in fact reflect the temporal dynamics of these two nonlinear elements.

A neuronal population coding for the BRF

Characteristics of the BRF should be seen as features emerging from the properties of neuronal populations involved in integrating motion signals for driving eye movements. Short-latency ocular following responses were first documented in monkeys (Miles et al. 1986Go) and numerous studies have been conducted since to elucidate their neuronal substrate, particularly the key roles played by areas MT and MST (see Kawano 1999Go). First, correlated neuronal activity was found in areas MT and MST, preceding ocular following onset by about 10 ms (Kawano et al. 1994Go). Second, lesions of area MST completely abolish ocular following (Takemura et al. 2002Go). Third, another type of short-latency ocular responses, vergence, is driven by a vector averaging readout of MT/MST disparity-selective neurons (Takemura et al. 2001Go). Similar studies have yet to be conducted for ocular following, although these physiological findings are consistent with the idea that initiation of smooth pursuit eye movement is driven by a linear readout (vector sum/averaging) of a population of MT neurons tuned for direction and speed (Priebe and Lisberger 2004Go). Finally, it has been suggested that the dynamics of ocular following responses to plaids (Masson and Castet 2002Go) closely mimics the time course of neuronal selectivity for pattern motion direction in area MT (Pack et al. 2001Go; Smith et al. 2005Go).

What are the neuronal population mechanisms implementing a BRF? Most properties of ocular following are very similar in both humans and monkeys (Busettini et al. 1996Go) and one might therefore speculate that properties of the BRF can be explained from those observed at neuronal levels in monkeys. First, the ocular following spatial summation function can be explained by a simple model, where activity is linearly integrated over a population of MT neurons paving the visual field, with receptive field sizes increasing linearly with eccentricity (Albright and Desimone 1987Go). The fact that we found only little if any evidence for isodirectional surround suppression at all spatial scales down to <5° suggests that information at low spatial frequency might be linearly integrated over a neuronal population. A suitable candidate could be wide field neurons, a subclass of MT neurons having large receptive field sizes and weak center-surround suppression (Born 2000Go; Born and Tootell 1991Go). Interestingly, wide-field neurons have been found to be involved in global motion processing and initiation of reflexive tracking eye movements in monkeys (Born et al. 2000Go).

A simple way to account for the observed saturation for stimulus diameters >20° consists then in assuming that the activity of the individual neurons is weighted at the readout level when integrating over the visual field, the weighting function being characterized by a sharp peripheral decay so that the contribution of peripheral neurons rapidly vanishes beyond a given eccentricity. The dependency of spatial summation on contrast and temporal integration can then be easily explained within such a linear weighted integration. First, the spatial integration’s peripheral cutoff could be shifted toward larger values simply by an increase in neuronal receptive field sizes for low contrast, as observed in both V1 (Cavanaugh et al. 2002Go; Kapadia et al. 1999; Sceniak et al. 1999Go) and MT (Pack et al. 2005Go). Second, a broadening of the weight function itself with time would allow increasingly peripheral neurons to contribute to the motion signal readout.

How to explain center-surround interactions? The simplest hypothesis of neurons having a large excitatory receptive field (essentially matching the stimulus) and a suppressive surround activated by the nonisodirectional motion in the periphery is unlikely to be true because we show similar center-surround effects using both small and large stimuli as well as single- and multipatch patterns. Furthermore, neurons with such large excitatory fields in macaque areas MT and MST have little surround suppression (Born 2000Go). On the contrary, neurons with a strong center-surround interaction tend to have much smaller receptive fields (Born and Tootell 1991Go; Eifuku and Wurtz 1998Go; Pack et al. 2005Go). With large center stimuli, most of these units would be suppressed. Moreover, with large center-surround stimuli, neurons located only at the center-surround border would be modulated, resulting in only weak contrast gain change. Thus the strong modulation detected here, similar for large, small, and micropattern stimuli, indicates that the BRF is unlikely to result simply from the properties of neurons that belong to a particular subpopulation of direction-selective neurons in MT/MST.

We propose that the BRF should be understood at a more functional level as the product of interactions between different neuronal populations activated by the two motion stimuli (center vs. periphery). Similarly to what has been proposed at the neuronal level, one can assume that the two populations form an excitatory (or driving) and a suppressive (or modulating) field, respectively (see Carandini 2004Go). The suppressive field is formed by a subpopulation of neurons with direction selectivity different from that of the center motion direction. This suppressive population implements contrast normalization by dividing the activity of the neuronal population detecting center motion (Simoncelli and Heeger 1998Go) and units tuned for opposite motion direction have the highest weight in the divisive term (Schwartz and Simoncelli 2001Go). The structure of the BRF could then be modeled as a ratio of the two populations forming the excitatory and suppressive fields, somewhat similar to the neuronal ratio of a Gaussian model (Bonin et al. 2005Go; Carandini 2004Go; Cavanaugh et al. 2002Go). Such a mechanism is independent of the spatial scale of center-surround stimuli. Moreover, if the suppressive population’s weight decreases with eccentricity, the observed stronger suppression with multipatch stimuli is expected because with these stimuli, surround motions covered a lar