JN AJP: Cell Physiology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Neurophysiol 93: 3659-3673, 2005. First published February 9, 2005; doi:10.1152/jn.01214.2004
0022-3077/05 $8.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
93/6/3659    most recent
01214.2004v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (15)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Bell, A. H.
Right arrow Articles by Munoz, D. P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bell, A. H.
Right arrow Articles by Munoz, D. P.

Crossmodal Integration in the Primate Superior Colliculus Underlying the Preparation and Initiation of Saccadic Eye Movements

Andrew H. Bell1, M. Alex Meredith2, A. John Van Opstal3 and Douglas P. Munoz1

1Centre for Neuroscience Studies, Canadian Institutes of Health Research Group in Sensory-Motor Systems, Department of Physiology, Queen’s University, Kingston, Ontario, Canada; 2Department of Anatomy and Neurobiology, Virginia Commonwealth University School of Medicine, Richmond, Virginia; and 3Institute for Neuroscience, Department of Biophysics, Radboud University Nijmegen, Nijmegen, The Netherlands

Submitted 29 November 2004; accepted in final form 5 February 2005


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Saccades to combined audiovisual stimuli often have reduced saccadic reaction times (SRTs) compared with those to unimodal stimuli. Neurons in the intermediate/deep layers of the superior colliculus (dSC) are capable of integrating converging sensory inputs to influence the time to saccade initiation. To identify how neural processing in the dSC contributes to reducing SRTs to audiovisual stimuli, we recorded activity from dSC neurons while monkeys generated saccades to visual or audiovisual stimuli. To evoke crossmodal interactions of varying strength, we used auditory and visual stimuli of different intensities, presented either in spatial alignment or to opposite hemifields. Spatially aligned audiovisual stimuli evoked the shortest SRTs. In the case of low-intensity stimuli, the response to the auditory component of the aligned audiovisual target increased the activity preceding the response to the visual component, accelerating the onset of the visual response and facilitating the generation of shorter-latency saccades. In the case of high-intensity stimuli, the auditory and visual responses occurred much closer together in time and so there was little opportunity for the auditory stimulus to influence previsual activity. Instead, the reduction in SRT for high-intensity, aligned audiovisual stimuli was correlated with increased premotor activity (activity after visual burst but preceding saccade-aligned burst). These data provide a link between changes in neural activity related to stimulus modality with changes in behavior. They further demonstrate how crossmodal interactions are not limited to the initial sensory activity but can also influence premotor activity in the SC.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
The ability to integrate sensory inputs of different modality is vital for rapid and accurate responses to dynamic events within the environment. Oftentimes, such integration confers a distinct behavioral advantage over the processing of unimodal information (see Calvert et al. 2004Go for review). For example, saccadic eye movements to spatially aligned visual and auditory stimuli have reduced reaction times and increased accuracy over those generated to unimodal stimuli (Colonius and Arndt 2001Go; Corneil et al. 2002Go; Frens et al. 1995Go). Conversely, visually guided saccades made amid spatially misaligned auditory distractors can have increased reaction times (Corneil and Munoz 1996Go; Frens et al. 1995Go). Our research focuses on how these behavioral effects of crossmodal integration are represented in the brain.

One area that has been implicated in both crossmodal integration and orienting behavior is the midbrain superior colliculus (SC) (see Stein and Meredith 1993Go for review). Neurons in the intermediate/deep layers of the SC (dSC) are involved in the initiation and control of saccades (for review, see Munoz and Fecteau 2002Go; Sparks 1999Go; Sparks et al. 2001Go) and many of these same neurons are also capable of integrating converging sensory inputs, resulting in stronger responses (Bell et al. 2003Go; Frens and Van Opstal 1998Go; King and Palmer 1985Go; Meredith and Stein 1986Go; Populin and Yin 2002Go; Wallace et al. 1996Go). Crossmodal interactions depend on several factors. For example, they are greatest for spatially aligned stimuli and decrease in magnitude as spatial disparity increases (Bell et al. 2001aGo; Frens and Van Opstal 1998Go; Meredith and Stein 1996Go). Stimulus intensity also influences the magnitude of crossmodal interactions: they are greatest for pairings of weaker stimuli ("principle of inverse effectiveness"; Meredith and Stein 1986Go). These same factors are known to influence the behavioral benefits of crossmodal integration (Corneil et al. 2002Go; Stein et al. 1989Go).

The objective of the current study is to investigate how crossmodal processing in the dSC contributes to the reduction of SRTs to combined audiovisual stimuli. We recorded single-unit activity from neurons in the dSC while monkeys generated saccades to visual and audiovisual stimuli. We varied both the intensity and spatial relationship of the visual and auditory stimuli to elicit crossmodal interactions of different strength. Two different pairs of auditory and visual stimulus intensities were selected and on combined audiovisual trials, the visual and auditory stimuli were placed either in spatial alignment or in opposite hemifields. These stimulus combinations were presented in 2 interleaved saccade tasks. The gap-saccade task was used to facilitate the merging of the sensory- and motor-related activities, thereby increasing the probability that changes to the sensory response will affect behavior. The delayed-saccade task was used to dissociate sensory and motor activity, allowing assessment of the effect of stimulus condition on the sensory and motor responses independently. We show that the nature of the behavioral benefit provided by aligned audiovisual stimuli is different for high- versus low-intensity stimuli and that these behavioral effects are correlated with early versus later influences of the additional auditory stimulus. Preliminary data have been presented in abstract form (Bell et al. 2001bGo).


    METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Preparation of experimental animals

All procedures were approved by the Queen’s University Animal Care Committee and were in accordance with the Canadian Council on Animal Care policy on the use of laboratory animals in research. Three adult male rhesus monkeys (Macaca mulatta), weighing between 6 and 12 kg were used in this study. Animals were prepared for chronic experiments in one aseptic surgical session. Anesthesia was induced with an injection of ketamine hydrochloride [10 mg/kg, administered intramuscularly (im)] and maintained during surgery with isofluorane (1–2%), which was administered by an endotracheal tube. Heart rate, respiratory rate, and body temperature were monitored closely throughout the surgical session.

A single craniotomy (19 mm in diameter) was performed using stereotaxic techniques and a stainless steel recording chamber was centered on the midline and oriented 38–40 ° posterior from vertical, which allowed access to both SC. The recording chamber and a stainless steel head post (for securing the animal’s head during the recording sessions) were embedded in an explant formed with dental acrylic. The explant was secured to the animal’s skull using stainless steel, surgical-quality screws. Eye coils, preformed from insulated stainless steel wire (19–20 mm, 3 turns), were inserted under the conjunctiva of each eye (Judge et al. 1980Go) and used to measure eye position (Robinson 1963Go). The eyecoil leads were led subcutaneously to the explant and the connectors were embedded within the acrylic.

Animals were given prophylactic antibiotics (enrofloxacin, 5 mg/kg im) and analgesic medication (buprenorphine hydrochloride, 0.01 mg/kg im), administered daily for 7 days after surgery. They were allowed a recovery period of ≥2 wk before initiation of behavioral training.

Experimental procedures and behavioral paradigm

The experiments took place in darkened, sound-attenuated rooms. The animals were seated in a primate chair (Crist Instruments) with their heads restrained. Monkeys Z and R were seated 94 cm from a tangent screen spanning approximately ±45o of the visual field. Experiments with Monkey O were performed in a different lab, in which the monkey was seated 86 cm from a tangent screen spanning approximately ±35o of the visual field. Between trials, the tangent screen was illuminated diffusely to prevent dark adaptation.

The monkeys were trained to perform 2 interleaved audiovisual saccade tasks (Fig. 1). Each trial began with the removal of the background light and the appearance of a central visual fixation point (FP) that was back-projected onto the tangent screen. The monkeys fixated the FP for a period of 800–1,200 ms, after which one of the following 2 tasks was presented: on half of the trials, the saccade target was presented but the FP remained illuminated for an additional 400–800 ms before being extinguished ("DELAYED-SACCADE TASK"; Fig. 1A). The removal of the FP was the animal’s cue to generate a saccade toward the target. On the other half of the trials, the FP was extinguished 200 ms before the presentation of the target ("GAP-SACCADE TASK"; Fig. 1B). The 2 different saccade tasks were randomly interleaved with equal distribution.



View larger version (16K):
[in this window]
[in a new window]
 
FIG. 1. Schematic representation of the 2 saccade tasks used in this study (see METHODS for details). A: delayed-saccade task with aligned audiovisual stimulus—monkey is required to withhold generating a saccade until the fixation point (FP) is removed, which occurs 400–800 ms after stimulus presentation. B: gap-saccade task with misaligned audiovisual stimulus—monkey is required to maintain central fixation after FP removal and is trained to generate a saccade to the visual stimulus immediately after target onset.

 
The visual saccade target was presented either into the receptive field of the neuron (such that the saccade necessary to foveate the target was the neuron’s preferred vector) or to its diametrically opposite position, across the horizontal and vertical meridians. It was presented either alone (visual condition) or in combination with an auditory stimulus, appearing at the same location (aligned audiovisual condition; e.g., Fig. 1A) or at the diametrically opposite location (misaligned audiovisual condition; e.g., Fig. 1B). Trials were presented in the following distributions: 28.5% unimodal visual trials, 28.5% aligned audiovisual trials, 28.5% misaligned audiovisual trials, and 14.5% catch trials where no target stimulus was presented and monkeys were simply required to maintain central fixation to receive their reward (included to minimize anticipatory responses).

The visual stimulus was generated with either a laser (Power Technologies) or an LED to yield 2 very different intensities (8.0 and 0.05 cd/m2, respectively). The auditory stimulus consisted of a white noise burst, produced by small 4-cm, 8.0-{Omega} speakers suspended in front of the tangent screen, facing the animal. The stimuli were grouped to yield 2 intensity pairs: high-intensity stimuli (visual: 8.0 cd/m2; auditory: 46–51 dB, A-weighted) and low-intensity stimuli (visual: 0.05 cd/m2; auditory: 43.5 dB, A-weighted). Monkey Z performed the experiment exclusively with high-intensity stimuli, whereas Monkeys O and R performed the experiment under both conditions.

Monkeys were given a liquid reward if they maintained central fixation (i.e., held their eyes stable within 2–3° of the FP) for the duration of the fixation period and generated a saccade within 400 ms of FP disappearance (in the delayed-saccade task) or visual target onset (in the gap-saccade task) without first orienting to the auditory stimulus (in the case of the misaligned audiovisual condition). They worked until fully satiated, at which point they were returned to their home cages. Weight and water intake were recorded on a daily basis and the institute veterinarian monitored the animals closely throughout the study.

Recording techniques and receptive field mapping

Extracellular, single-neuron activity was recorded using tungsten microelectrodes (Frederick Haer) with impedances of 0.5–3 M{Omega} at 1 kHz. Electrodes were driven with a hydraulic microdrive (Narishige MO-95) through stainless steel guide tubes supported by a Delrin grid placed inside the recording chamber (Crist et al. 1988Go). Single-neuron activity was sampled at 1 kHz after passing through a window discriminator (Bak Electronics) that excluded action potentials that did not meet both amplitude and temporal constraints. The behavioral paradigms as well as storage of eye position and neural activity were controlled by a Pentium PC running a real-time data-acquisition software package (REX Ver. 5.4; Hays et al. 1982Go). Horizontal and vertical eye positions were sampled at 500 Hz.

To approximate a neuron’s visual receptive field, a handheld ophthalmoscope was used to back-project moving spots and bars of light onto the tangent screen while the monkey maintained central fixation. In many instances, visual stimuli were also systematically presented throughout the visual field. The center of the receptive field was defined as the point where the maximum visual response was elicited.

Data analysis

The data were analyzed off-line using a Sun Ultra 60 Sparcstation running user-generated programs and a Pentium PC running MatLab software (The MathWorks). Data were first run through an automated saccade detection program, which identified the beginning and end of each saccade based on velocity and acceleration template matching (Waitzman et al. 1991Go). All marks were later verified by the experimenter and adjusted when necessary. Before analysis, all incorrect trials were eliminated. Incorrect trials were defined as: 1) those where the saccade was generated before the removal of the FP in the delayed-saccade task or during the gap in the gap-saccade task; 2) the saccade was generated toward the auditory stimulus during misaligned audiovisual trials; 3) the saccade landed outside the 2–3° acceptance window around the target; or 4) saccades with latencies >400 ms.

Neuronal responses were analyzed by constructing spike density functions (Richmond and Optican 1987Go) based on a normal (Gaussian) probability distribution, as follows

(1)
where the activation level (A) varied as a function of time (t). Each action potential was converted to an individual Gaussian pulse having a total area of 1 (spike) and SD ({sigma}) of 4 ms. The individual pulses were summed together to yield a single spike density function for each trial.

Neuronal responses were quantified in several ways by measuring activity in the following epochs. To evaluate the effect of the auditory stimulus on the activity preceding the onset of the visual response (Bell et al. 2003Go, 2004Go; Wallace et al. 1996Go), the activity 40–50 ms after target onset was measured (previsual epoch). The sensory epoch was used to measure the magnitude of the target-aligned sensory response and differed according to task and stimulus intensity. In the delayed-saccade task, the sensory epoch was defined as the peak spike density 0–200 ms after target onset for both stimulus intensities. In the gap-saccade task, the sensory epoch was defined as the peak spike density 50–100 and 70–120 ms after target onset for high- and low-intensity stimuli, respectively. The different epochs were necessary as a result of the different response onset latencies for high- versus low-intensity stimuli (see RESULTS). To minimize the inclusion of motor activity in the calculation of sensory response magnitude in the gap-saccade task, trials with SRTs <120 and <140 ms for high- and low-intensity stimuli, respectively, were excluded for this portion of the analysis.

The premotor epoch, used as an estimate of the relative state of motor readiness after the onset of the initial sensory response but before the saccade-aligned burst of activity (see RESULTS for details), was defined as the peak activity 80–90 ms after target onset for high-intensity stimuli. The threshold epoch, used as an estimate of the amount of activity necessary to evoke a saccade ("saccadic threshold"; see RESULTS and DISCUSSION; Carpenter and Williams 1995Go; Hanes and Schall 1996Go; Paré and Hanes 2003Go), was defined as the average level of activity from 20 to 10 ms immediately before saccade onset. This epoch was chosen because it approximates the delay between activity in the SC and saccade initiation. Microstimulation studies have shown that saccades can be elicited as early as 20 ms after dSC stimulation (Robinson 1972Go; Stanford et al. 1996Go) and perturbed or modified in midflight about 10 ms after SC stimulation (Gandhi and Keller 1999Go; Miyashita and Hikosaka 1996Go; Munoz and Wurtz 1993Go; Sparks and Mays 1983Go). The saccade-aligned activity was defined as the average activity ±5 ms surrounding saccade onset.

To estimate the onset of target-related activity, we generated a second spike density function based on an exponential growth/decay function (Thompson et al. 1996Go). This asymmetric activation waveform mimics an excitatory postsynaptic potential and is physiologically more plausible for estimating response onset latency than a Gaussian activation function. A spike exerts an influence only forward and not backward in time and so the symmetrical Gaussian function underestimates response onset latencies. The spike density function was obtained by convolving each spike with the following function

(2)
where the activation level (A) varies as a function of time (t), according to {tau}g, the growth time constant that was set to 1 ms, and {tau}d, the decay time constant that was set to 20 ms. Response onset latency was defined as the point where the activation level exceeded baseline (defined as 400–0 and 100–0 ms before target onset in the delayed- and gap-saccade tasks, respectively) plus 3SDs. The activity had to remain above this level for a minimum of 10 ms to be classified as a valid response.

Population analyses of the behavioral and neuronal data were performed using pairwise Wilcoxon signed-rank sum tests unless otherwise stated. In all instances, an alpha of 0.05 was chosen as significant. All data are presented as mean ± SE, unless otherwise indicated. For display purposes only, population spike density functions are shown as floating averages of 10-ms bin widths, plotted every 5 ms (i.e., 0–10, 5–15, 10–20 ms, etc.).

Neuron classification

Neurons in the dSC were classified into one of 3 different categories (sensory only, sensory-motor, motor only) based on their sensory and motor response properties, assessed using the delayed-saccade task. All responses were first analyzed using an automated classification scheme. A sensory response was defined as activity >50 spikes/s above baseline (see above) after target onset. Neurons with sensory activity were then classified as visual, auditory, or bimodal (responsive to both unimodal visual and auditory stimuli) based on their individual responses. Auditory-only neurons were identified as those with no visual response but that were responsive to the auditory stimulus when it alone was presented to the receptive field in the misaligned audiovisual condition. A motor response was defined as saccade-aligned activity >80 spikes/s for saccades to the neuron’s preferred direction and eccentricity in the delayed-saccade task. The experimenter later verified accuracy and consistency of all neuron classifications.

Laminar distribution of the neurons was estimated based on activity landmarks and relative depths. The superficial (dorsal-most) border of the SC was identified from the onset of brisk, visually evoked multiunit activity. The intermediate layers were then estimated to begin about 1,000–1,500 µm below this point, corresponding to the appearance of saccade-related activity (Ma et al. 1991Go). Neurons shallower than 1,500 µm below the dorsal-most superficial border but that displayed auditory and/or motor-activity were also classified as intermediate layer neurons (Wallace et al. 1996Go). Neurons recorded from the superficial layers of the SC were excluded from all further analyses.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Analysis of behavior

Three monkeys performed a total of 12,936 correct trials over the course of the recording sessions (Monkey R: 6,457, Monkey O: 4,340, Monkey Z: 2,129). Stimulus modality had no significant effect on saccadic reaction times (SRTs) in the delayed saccade task for both high-intensity (mean SRT for visual: 250 ± 2 ms, aligned audiovisual: 251 ± 2 ms, misaligned audiovisual: 254 ± 2 ms; P values >0.3) and low-intensity stimuli (mean SRT for visual: 254 ± 1 ms, aligned audiovisual: 253 ± 1 ms, misaligned audiovisual: 256 ± 1 ms; P values >0.4). Likewise, there was no significant difference in mean SRT across the 2 stimulus intensities in the delayed-saccade task for any of the 3 stimulus conditions (P values >0.5).

In the gap-saccade task, stimulus modality and intensity had a significant influence on the mean and distribution of SRTs. We first describe the behavior elicited by the 2 stimulus intensities independently and then summarize with a comparison of the two.

Saccades to high-intensity stimuli.    The distribution of SRTs for the 3 stimulus conditions (visual, aligned audiovisual, misaligned audiovisual) is shown in Fig. 2. For high-intensity stimuli, correct stimulus-triggered saccades began at about 65 ms after target onset for all stimulus conditions (Fig. 2, A, C, and E). Trials with SRTs <65 ms were classified as anticipatory responses because they had equal probability of being directed toward or away from the target (Fig. 3A) and were eliminated from further analysis.



View larger version (30K):
[in this window]
[in a new window]
 
FIG. 2. Distribution of saccadic reaction times (SRTs) for saccades generated to high- and low-intensity stimuli in the gap-saccade task. Data are plotted as the proportion of the total number of trials per given stimulus condition, with values below zero indicating saccades (incorrectly) directed away from the saccadic target. Distributions of SRTs for high-intensity stimulus trials are further subdivided into express (shaded portion) and regular-latency (unshaded portion) saccades. Dashed and solid arrows indicate median and mean SRT, respectively.

 


View larger version (13K):
[in this window]
[in a new window]
 
FIG. 3. Detailed breakdown of behavior in response to high-intensity stimuli. A: performance in gap-saccade task for high-intensity stimulus trials. This panel plots the difference in number of correct vs. incorrect trials as a function of time from target onset. B: proportion of express saccades generated to the different stimulus conditions. C: effect of stimulus condition on mean SRT. Effect of stimulus condition was strongest when express saccades were excluded from the calculation (no ES), suggesting that the benefit of aligned audiovisual stimuli arises primarily from changes to regular-latency saccades (SRT >95 ms). Asterisk indicates a significant difference from the visual condition (Wilcoxon rank-sum test, P < 0.05; {dagger}P < 0.10).

 
Among saccades with SRTs >65 ms, very few were generated away from the visual target (85/2411; 3.5%) and they occurred most often in the misaligned audiovisual condition (visual: 17/85, aligned audiovisual: 28/85, misaligned audiovisual: 40/85; {chi}2 = 9.35, P < 0.01). SRTs for correct, stimulus-triggered saccades formed a bimodal distribution, corresponding to express (SRTs: 65–95 ms; shaded portion, Fig. 2, A, C, and E) and regular-latency saccades (SRTs: >95 ms; Fischer and Boch 1983Go; Paré and Munoz 1996Go). These ranges did not change across the 3 stimulus conditions.

A detailed breakdown of the effect of stimulus condition on express saccade generation and mean SRT is shown in Fig. 3, B and C. High-intensity stimuli evoked relatively few express saccades, which were distributed evenly across the 3 stimulus conditions (Fig. 3B; {chi}2 = 2.12, P > 0.3). There was no significant difference in the mean SRT of express saccades across the 3 stimulus conditions ("ES only," Fig. 3C; mean SRT for visual: 80 ± 1 ms, aligned audiovisual: 82 ± 1 ms, misaligned audiovisual: 81 ± 1 ms; Wilcoxon rank-sum tests, P values >0.2). When express saccades were included in the calculation of mean SRT, the advantage of placing an auditory stimulus on the same side as the visual target failed to reach significance ("with ES," Fig. 3C; mean SRT for visual: 158 ± 2 ms, aligned audiovisual: 154 ± 2 ms, misaligned audiovisual: 165 ± 2 ms; Wilcoxon rank-sum test, P < 0.15). However, when express saccades were excluded, a significant advantage of the aligned audiovisual condition was revealed ("no ES," Fig. 3C; mean SRT for visual: 164 ± 2 ms, aligned audiovisual: 158 ± 2 ms, misaligned audiovisual: 170 ± 2 ms, P < 0.05). There was also a trend for longer SRTs to the misaligned audiovisual condition that approached significance in both cases (P = 0.07 and P = 0.06, respectively).

Saccades to low-intensity stimuli.    Contrary to what was observed after high-intensity stimuli, the proportion of saccades generated toward versus away from the low-intensity visual target was influenced by stimulus condition (Fig. 2, B, D, and F). Figure 4A plots the performance as a function of time from target onset for each of the low-intensity stimulus conditions. Initially, all 3 low-intensity stimulus conditions averaged an equal number of correct versus incorrect responses, identifying these saccades as anticipatory responses. However, at about 55 ms after target onset, the proportion of correct saccades directed toward the aligned audiovisual stimulus (blue traces; Fig. 4A) increased. At about the same time, the proportion of correct saccades to the misaligned audiovisual stimulus (green traces; Fig. 4A) decreased substantially. This suggests that the auditory stimulus was contributing to the triggering of saccades, in the correct direction when aligned with the visual target and in the incorrect direction when presented to the opposite hemifield.



View larger version (18K):
[in this window]
[in a new window]
 
FIG. 4. Detailed breakdown of behavior in response to low-intensity stimuli. A: performance in gap-saccade task for low-intensity stimulus trials. This panel plots both the percentage correct responses (solid traces, left y-axis) as well as the difference in number of correct vs. incorrect trials as a function of time from target onset (dashed traces, right y-axis). B: mean SRTs (±SE) for the different low-intensity stimulus conditions. Mean SRTs for the aligned audiovisual stimulus are shown for when trials with SRTs 70–100 ms are included (solid blue bar in foreground) and excluded (shaded blue bar in background). Asterisk indicates a significant difference from the visual condition (Wilcoxon rank-sum test, P < 0.05; {dagger}P < 0.10).

 
On the basis of the above analysis, anticipatory responses in the low-intensity stimulus sessions were thus defined as those with SRTs <70 ms for the aligned audiovisual condition and <100 ms for the visual and misaligned audiovisual stimuli. This corresponds to a performance rate of ≥80% correct in the respective conditions and is thus well above "chance." To evaluate the extent to which the behavioral effects observed for low-intensity, aligned audiovisual stimuli are attributable to the additional trials with SRTs 70–100 ms (which, in the case of the other 2 low-intensity stimulus conditions, were classified as anticipatory and thus excluded), we report the values and statistics for when these trials are included and excluded. It is also important to note that cutoffs within ±10 ms of those selected yielded the same results. Saccades generated away from the saccadic target, but with SRTs beyond the range for anticipatory responses (i.e., direction errors), showed no systematic effect of stimulus condition (visual: 84/269, aligned audiovisual: 80/269, misaligned audiovisual: 105/269; {chi}2 = 4.02, P > 0.1).

The mean SRTs for the 3 low-intensity stimulus conditions are shown in Fig. 4B. Saccades to the aligned audiovisual stimulus (mean SRT: 155 ± 1 ms) had significantly shorter SRTs compared with the unimodal visual stimulus (mean SRT: 160 ± 1 ms; P < 0.001). When saccades to the aligned audiovisual stimulus with SRTs 70–100 ms were eliminated (mean SRT: 158 ± 1 ms, light blue bar, Fig. 4B), the trend persisted but failed to reach statistical significance (P = 0.08). Furthermore, saccades to the misaligned audiovisual stimulus (mean SRT: 165 ± 1) had, on average, significantly longer SRTs compared with the unimodal visual stimulus (P < 0.01).

The above analyses revealed several important differences between saccades generated to high- versus low-intensity stimuli in the gap-saccade task, which we propose are linked to early versus later influences of the auditory stimulus on activity in the dSC. To address this hypothesis, we separated the remaining analyses according to stimulus intensity into 2 independent investigations of crossmodal integration at the neuronal level and their consequences on behavior.

Analysis of neuronal activity

A total of 132 neurons were recorded from the dSC of 3 monkeys [Monkey R: 57 (high-intensity: 9; low-intensity: 48), Monkey Z: 36 (all high); Monkey O: 39 (high: 13; low: 29; N.B. 3 neurons were recorded using both intensities)]. Of these, 109 (83%) exhibited sensory and/or motor-related activity that satisfied our criteria for analysis (see METHODS). Over half of these neurons exhibited both stimulus- and saccade-related activity (64/109; 59%). Smaller proportions had stimulus-related activity but no saccade-related activity (25/109; 23%) or only saccade-related activity (20/109; 18%). The majority of neurons with stimulus-related activity responded exclusively to visual stimuli (79/89; 89%). A smaller proportion responded to both visual and auditory stimuli ("bimodal"; 9/89; 10%) and one neuron in our sample population responded to auditory stimuli only (1/89; 1%). Because this latter neuron responded only to the combined audiovisual target (and not the unimodal visual target) and because of the small sample size, it was eliminated from further analysis.

Crossmodal integration underlying saccades to low-intensity stimuli

The behavioral advantage provided by low-intensity, aligned audiovisual stimuli was driven by a reduction in the onset of the earliest correct stimulus-triggered saccades relative to the other 2 conditions (Fig. 4). Auditory stimuli are known to elicit responses in the dSC with shorter response onset latencies but decreased response magnitude compared with those elicited by visual stimuli (Bell et al. 2003Go, 2004Go; Jay and Sparks 1987Go; Wallace et al. 1996Go). We examined the effect of the auditory stimulus on activity preceding the onset of the visual response (previsual activity), the response onset latency (ROL), and the magnitude of the target-aligned sensory response.

Variable response onset latency and previsual activity.    Figure 5, AC compares the mean activity of all sensory and sensory-motor neurons for trials where the visual target appears inside (solid traces) versus opposite to (dashed traces) the receptive field of the neuron. The point where these 2 curves diverge approximates when activity related to the sensory stimuli is being registered by the neurons. In the case of the aligned audiovisual stimulus condition (Fig. 5B), the curves initially diverge at about 30 ms, which likely represents the arrival of the earliest auditory input to the dSC. The curves then diverge dramatically at about 70–80 ms, likely representing the arrival of the visual input. Interestingly, there is also an early divergence in the case of the misaligned audiovisual condition but in the opposite direction. In this case, the auditory stimulus, located in the opposite hemifield as the visual target, falls into the receptive field of neurons represented by the dashed traces in Fig. 5 and so there is a slight increase in activity about 40 ms after target appearance (Fig. 5C).



View larger version (14K):
[in this window]
[in a new window]
 
FIG. 5. Effect of stimulus condition on response onset latency (ROL) in the dSC for low-intensity stimuli. AC: mean population activity of all sensory and sensory-motor neurons recorded in the intermediate/deep layers of the superior colliculus (dSC) in the gap-saccade task. Solid traces represent trials where the visual saccade target fell within the neuron’s receptive field. Dotted traces represent trials where the target fell outside the neuron’s receptive field. Traces are aligned on target onset. Point where the 2 traces diverge approximates when the sensory response is being registered by the neurons. D: mean ROL for each low-intensity stimulus condition in the gap-saccade task. Asterisk indicates a statistically significant difference from the unimodal visual condition (Wilcoxon signed-rank test, P < 0.05).

 
When the ROL was calculated on a neuron-by-neuron basis for the 3 stimulus conditions (see METHODS for details), it was revealed that aligned audiovisual stimuli elicited responses with significantly shorter ROLs compared with both the visual and misaligned audiovisual conditions (Fig. 5D, Table 1; P values <0.05). The small increase in activity after the aligned audiovisual stimulus was insufficient to be detected as a valid response according to our algorithm but provided enough of an increase in activity to accelerate the detection of the visual response.


View this table:
[in this window]
[in a new window]
 
TABLE 1. ROLs for high- vs. low-intensity stimuli for sensory and sensory-motor neurons in the SC

 
Population spike density functions for the 3 stimulus conditions show how the aligned auditory stimulus evokes greater activity in the period leading up to the onset of the visual response (blue trace, Fig. 6A). To quantify this difference in activity, we selected an epoch 40–50 ms after target onset ("previsual epoch"; shaded portion, Fig. 6, A and B), which encompasses the period shortly after the initial divergence in activity patterns described earlier but before the onset of the visual response. As predicted, the aligned audiovisual stimuli evoked significantly greater activity compared with that of the other 2 stimulus conditions (solid bars, Fig. 6C; P < 0.05). When the visual target was located outside the receptive field (Fig. 6B), the misaligned auditory stimulus (green trace, Fig. 6B) fell within the receptive field of these neurons, evoking greater activity compared with that of the other 2 conditions (hollow bars, Fig. 6C, P < 0.05). Figure 6, DF compares the magnitude of activity during this previsual epoch on a neuron by neuron basis when the visual target was located in versus out of the receptive field for each of the 3 stimulus conditions. Not surprisingly, the unimodal visual stimulus showed no change in previsual activity (Fig. 6D). The aligned audiovisual stimulus, however, showed an increase in previsual activity when the target was located in the receptive field that was highly significant across the population (Fig. 6E). Conversely, the misaligned auditory stimulus evoked significantly greater activity in neurons opposite those responsive to the visual target (Fig. 6F), confirming that the auditory stimulus was influencing the activity preceding the onset of the visual response.



View larger version (28K):
[in this window]
[in a new window]
 
FIG. 6. Analysis of activity preceding onset of visual response in the dSC. A and B: mean population activity of all sensory and sensory-motor neurons recorded in the dSC in the gap-saccade task, for trials where the target fell in (A) vs. out of (B) the receptive field. Shaded portion indicates the "previsual" epoch 40–50 ms after target onset. C: mean previsual activity for each low-intensity stimulus condition in the gap-saccade task. Asterisk indicates a statistically significant difference from the unimodal visual condition (Wilcoxon signed-rank test, P < 0.05). Solid bars indicate trials where the visual target fell within the receptive field; hollow bars indicate trials where the visual target fell outside the receptive field. DF: comparison of previsual activity for the different low-intensity stimulus conditions. Each point represents the mean previsual activity for an individual sensory or sensory-motor neuron. Points lying below the unity line indicate greater activity when the target appeared within the receptive field of the given neuron. In the case of the aligned audiovisual stimulus, there was a highly significant trend for stronger previsual activity when both the auditory and visual stimuli were located in the receptive field of the neuron (E, Wilcoxon signed-rank sum test; P = 0.0001); the opposite was true for the misaligned audiovisual stimulus (F, P < 0.05).

 
Variable sensory response magnitude.    Several studies have shown that when auditory and visual stimuli are presented simultaneously, the resulting sensory response in the dSC is often stronger in magnitude compared with the responses elicited by the stimuli when presented independently (see Stein and Meredith 1993Go for review). To determine whether changes in the sensory response magnitude related to stimulus condition also contributed to the behavioral outcome in our task, we first examined sensory responses independent of motor processes using data obtained in the delayed-saccade task and then contrasted these results to those obtained in the gap-saccade task. Figure 7 compares the sensory response properties for the 3 low-intensity stimulus conditions for all sensory and sensory-motor neurons. Stimulus modality did not significantly influence the magnitude of the sensory response in either the delayed- (Fig. 7A) or gap-saccade tasks (Fig. 7B). A comparison of the sensory response to the visual versus audiovisual stimulus confirmed these observations (Fig. 7, C and D). These figures show the mean change in activity for the aligned- (gray) and misaligned- (black) audiovisual conditions relative to the visual condition on a neuron-by-neuron basis. Although there was a trend for stronger responses in the audiovisual conditions (particularly for the aligned audiovisual condition; gray boxes, Fig. 7, C and D) compared with the visual condition, it failed to reach significance for either stimulus condition in either task (P values >0.05). Given that there was a potent behavioral effect of the aligned audiovisual stimulus (Fig. 4), it would seem that the subtle differences in sensory response magnitude across the different stimulus conditions did not contribute to the determination of SRT in our study.



View larger version (22K):
[in this window]
[in a new window]
 
FIG. 7. Effect of stimulus condition on sensory response magnitude in the dSC for low-intensity stimuli in the delayed- (A, C) and gap- (B, D) saccade tasks. A and B: mean population activity of all sensory and sensory-motor neurons recorded in the dSC. Traces are aligned on target onset. C and D: mean change in sensory response magnitude from the visual condition. Mean percentage change in sensory response magnitude from the visual condition was calculated on a neuron-by-neuron basis using the following formula [(AV – V/V) x 100%]. Positive values indicate stronger responses, on average, in the corresponding audiovisual condition.

 
For comparison purposes, analyses of ROL and sensory response magnitude are also shown for high-intensity stimuli (Figs. 8 and 9, respectively). Although direct comparison across stimulus intensity is limited because the data were collected from different neurons, this analysis revealed a dramatic reduction in ROL for high-intensity stimuli compared with low-intensity stimuli, irrespective of stimulus modality or task (Fig. 5D vs. Fig. 8D; Table 1; P values <0.001). No further improvement in ROL for high-intensity stimuli was observed in the gap-saccade task or when aligned audiovisual stimuli were presented, suggesting that the minimum ROLs physiologically possible with these high-intensity stimuli had been obtained. These data provide a neural correlate for why high-intensity stimuli evoked correct stimulus-triggered saccades earlier compared with those triggered by low-intensity visual and misaligned audiovisual stimuli (Fig. 2) and why no further benefit was provided by the high-intensity aligned audiovisual stimulus to the generation of express saccades. High-intensity audiovisual stimuli also failed to evoke statistically significant changes in sensory response magnitude in the delayed- or gap-saccade tasks (Fig. 9).



View larger version (14K):
[in this window]
[in a new window]
 
FIG. 8. Effect of stimulus condition on ROL in the dSC for high-intensity stimuli. See Fig. 5 for details.

 


View larger version (22K):
[in this window]
[in a new window]
 
FIG. 9. Effect of stimulus condition on sensory response magnitude in the dSC for high-intensity stimuli. See Fig. 7 for details.

 
Crossmodal integration underlying saccades to high-intensity stimuli

As shown in Fig. 3, the behavioral benefit of high-intensity aligned audiovisual stimuli was strongest for regular-latency saccades (i.e., SRTs >95 ms) and had little effect on the generation of express saccades (SRTs 65–95 ms). We assessed the contribution of changes to the ROL and sensory response magnitude to changes in SRTs of regular-latency saccades to high-intensity stimuli. Two correlation analyses were performed, examining the relationship between ROL and sensory response magnitude with the SRTs of high-intensity, regular-latency saccades (Fig. 10). All express saccades were removed from the following analyses.



View larger version (34K):
[in this window]
[in a new window]
 
FIG. 10. Relationship between SRT and sensory response properties for regular-latency saccades to high-intensity stimuli. A and B: histograms of the Pearson correlation coefficients for SRT vs. ROL (A) and sensory response magnitude (B) in the gap-saccade task (0.1 bin width). Neurons with statistically significant correlations are indicated by the solid bars (P < 0.05). Vertical arrows indicate mean correlation value across the sample population. To avoid including activity corresponding to the saccade-aligned burst, trials with SRTs <120 ms were excluded from the calculation (see METHODS). Neither measure evoked a substantial number of significantly correlated neurons, nor was the mean correlation coefficient considerably different from zero. C and D: relationship between the difference in SRT and the difference in ROL (C) or sensory response magnitude (D) across the visual and audiovisual stimulus conditions. These panels plot the arithmetic difference between the mean SRT vs. the corresponding response property on a neuron-by-neuron, session-by-session basis (see text for details). A relationship between the 2 would be indicated by a clustering in one or 2 (opposite corners) of the 4 quadrants. Neither variable exhibited such clustering.

 
Both ROL and sensory response magnitude produced mean correlation coefficients close to zero (Fig. 10, A and B), indicating that neither variable was strongly correlated with SRTs of regular-latency saccades. This is further illustrated in the plots of mean difference in SRT against mean difference in ROL (Fig. 10C) and sensory response magnitude (Fig. 10D). No clear relationship was evident between SRT and either of the 2 variables, indicating that a given change in sensory response magnitude or ROL in the audiovisual conditions was not consistently associated with a corresponding change in SRT. Thus the effect of high-intensity audiovisual stimuli on the generation of regular-latency saccades did not appear to be driven by significant changes to the initial sensory-related activity of the dSC.

To gain further insight into what was driving the changes in SRT, we performed a "floating correlation" analysis on all regular-latency saccades to the 3 high-intensity stimulus conditions (Fig. 11). All express saccades have been removed from this analysis, which is restricted to sensory-motor neurons only (accounting for why these curves do not match those in Fig. 9B). For this analysis, the mean spike density in 10-ms bins of all sensory-motor neurons was correlated with SRT, every 5 ms (i.e., 0–10, 5–15 ms, etc.). The initial sensory burst, which ranged from about 40 to 80 ms, exhibited relatively weak correlations. However, shortly after the peak of the sensory burst, neural activity became more and more negatively correlated with SRT, achieving the strongest negative correlation at about 100 ms after target onset (preceding saccade onset by about 50 ms). Beyond this point, activity once again became less and less correlated with SRT as it approached the range of saccade onsets. Importantly, the curves for all 3 stimulus conditions appeared similar.



View larger version (23K):
[in this window]
[in a new window]
 
FIG. 11. Floating correlation coefficient analysis for regular-latency saccades to high-intensity stimuli (express saccades have been excluded). In this analysis, the neural activity for each sensory-motor neuron was correlated with SRT every 10 ms, moving in 5-ms steps (i.e., 0–10, 5–15, etc.). Solid traces show the mean population activity and the dotted traces indicate the corresponding mean correlation value across all sensory-motor neurons. Correlation between neural activity and SRT is strongest between 80 and 100 ms across all 3 stimulus conditions. Solid and dashed arrows show the mean and median SRT for each stimulus condition, respectively.

 
Therefore SRTs of regular-latency saccades to high-intensity stimuli appeared most correlated with the activity between the peak of the initial sensory response and the motor burst. To determine how the premotor and motor-related activity relates to the effect of stimulus modality on behavior, we selected 3 epochs, representing the premotor activity, the saccadic threshold, and the saccade-aligned burst (see METHODS) and compared the activity elicited during these epochs for the 3 different stimulus conditions.

Figure 12 compares the mean level of activity during the premotor epoch, defined here as the mean spike density 80–90 ms after target onset (shown as the shaded portion, Fig. 11), for all sensory-motor neurons (sensory-only neurons were omitted from the remaining analyses as they do not contribute to the motor output of the dSC). To avoid including activity that might be part of the saccade-aligned burst, while still including the majority of relevant trials, we excluded all trials with SRTs <110 ms.



View larger version (15K):
[in this window]
[in a new window]
 
FIG. 12. Effect of high-intensity stimulus condition on the premotor activity of sensory-motor neurons in the dSC. A: change in premotor activity from the unimodal visual condition for all regular-latency saccades to high-intensity stimuli. Neurons showed significantly stronger premotor activity for aligned audiovisual stimuli compared with visual stimuli (Wilcoxon signed-rank test; P < 0.05). A similar trend was observed for misaligned audiovisual stimuli but failed to reach significance (P > 0.05). B: histograms of the Pearson correlation coefficients for SRT vs. premotor activity in the gap-saccade task (0.1 bin width). Neurons with statistically significant correlations are indicated by the solid bars (P < 0.05). Vertical arrows indicate mean correlation value across the sample population. A clear negative bias is revealed, indicating that greater premotor activity is correlated with shorter SRTs. C: relationship between the difference in SRT and the difference in premotor activity across the visual and audiovisual stimulus conditions. Majority of data points is clustered in either the top-left or bottom-right quadrants, supporting the negative correlation between SRT and premotor activity. Correlation between SRT and premotor activity: r = –0.41, P < 0.05 and r = –0.18, P < 0.40 for aligned and misaligned audiovisual stimuli, respectively.

 
On average, sensory-motor neurons in the dSC exhibited significantly greater premotor activity after aligned audiovisual stimuli compared with unimodal visual stimuli (Fig. 12A; P < 0.05). When trials from all 3 stimulus conditions were grouped and correlated with SRT on a neuron-by-neuron basis, a negative bias was revealed in the population (Fig. 12B). Of these neurons 93% (27/29) produced negative correlations, indicating that as the level of activity during this period 80–90 ms after target onset increased, SRT decreased. To further illustrate this point, we compared the mean difference in SRT for audiovisual versus visual trials against the mean difference in premotor activity (Fig. 12C). The majority of neurons (21/29 for aligned audiovisual, 17/29 for misaligned audiovisual) lay within either the bottom-right or top-left quadrants, indicating that on a neuron-by-neuron, session-by-session basis, greater premotor activity was associated with shorter SRTs and vice versa. Together, these results suggest a link between the magnitude of premotor activity and changes in the SRTs of regular-latency saccades.

To confirm that reductions in saccadic threshold or increases in the relative motor output of the dSC are not contributing to the reduction in mean SRT for regular-latency saccades to high-intensity, aligned audiovisual stimuli, we examined the effect of stimulus modality on these 2 motor epochs (see METHODS). Neither of these variables was well correlated with SRT (Fig. 13, A and B), nor did they exhibit any systematic relationship with stimulus modality (Fig. 13, C and D). Thus it would appear that high-intensity audiovisual stimuli drive changes in SRT by influencing the processes leading up to the crossing of saccadic threshold but are no longer influential once a saccade is triggered and about to be executed.



View larger version (33K):
[in this window]
[in a new window]
 
FIG. 13. Lack of relationship between saccadic reaction time and motor response properties for regular-latency saccades to high-intensity stimuli. A and B: histograms of the Pearson correlation coefficients for SRT vs. saccadic threshold (A) and saccadic burst magnitude (B) in the gap-saccade task (0.1 bin width). Neurons with statistically significant correlations are indicated by the solid bars (P < 0.05). Vertical arrows indicate mean correlation value across the sample population. Very few significantly correlated neurons were observed for either measure and the mean correlation coefficient was close to zero, suggesting a lack of clear relationship between SRT and these 2 motor response properties. C and D: relationship between the difference in SRT and the difference in saccadic threshold (C) or saccadic burst magnitude (D) across the visual and audiovisual stimulus conditions (see text and Fig. 10 for analysis details). Neither saccadic threshold nor saccadic burst magnitude displayed any systematic clustering when plotted vs. SRT, further supporting the lack of relationship between SRT and these 2 motor response properties.

 

    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
This study examined the role of neurons in the dSC in reducing SRTs to audiovisual stimuli. Consistent with previous studies in both humans (e.g., Colonius and Arndt 2001Go; Corneil et al. 2002Go) and monkeys (e.g., Frens and Van Opstal 1998Go), saccades to aligned audiovisual stimuli had significantly shorter SRTs compared with either unimodal visual or misaligned audiovisual stimuli (Figs. 24). The reduction in mean SRT observed for low-intensity, aligned audiovisual stimuli was correlated with a reduction in the onset of the earliest correct stimulus-triggered saccades relative to the other 2 conditions, which contributed to a decrease in mean SRT (Fig. 4). High-intensity audiovisual stimuli reduced the SRTs of regular-latency saccades (Fig. 3C). These behavioral effects were associated with 2 different neural correlates (increase in activity preceding visual response with subsequent reduction in ROL for low-intensity stimuli, Figs. 5 and 6; vs. increase in premotor activity for high-intensity stimuli, Fig. 12), demonstrating how auditory stimuli and crossmodal processing can influence both the sensory and premotor activity of the dSC to evoke changes in behavior.

Audiovisual stimuli accelerate the onset of stimulus-triggered saccades

Recent neurophysiological studies and models of saccadic initiation have postulated that neural activity among saccade neurons must exceed a threshold level to initiate a saccade ("saccadic threshold"; Carpenter and Williams 1995Go; Hanes and Schall 1996Go; Trappenberg et al. 2001Go). SRT is thus determined by the time it takes for a neural population to achieve this level of activation and any factors capable of influencing this time will have a direct impact on the SRT.

Presenting an auditory stimulus at the same time and place as the visual target in the low-intensity condition resulted in a significant increase in activity before the onset of the visual burst (Fig. 6), which appeared to facilitate the onset of the visual response relative to the other conditions (Fig. 5D). Increases in activity before the onset of the visual response have a strong impact on SRTs (Dorris et al. 1997Go). In both cases, the stronger activity preceding the onset of the visual burst likely facilitates the generation of shorter-latency saccades by providing a "head start" to the neural population ultimately responsible for triggering the saccade.

It would also appear that when the auditory stimulus appeared in the opposite hemifield as the visual target, a proportion of saccades was triggered to this distracting stimulus to eliminate any benefit of the additional auditory stimulus (Fig. 4). The presence of activity in the opposite dSC brought about by the misaligned auditory stimulus produced a competition between neurons in the 2 dSC (Munoz and Istvan 1998Go), potentially contributing to the longer SRTs observed after the misaligned audiovisual stimulus.

These data show how the combination of an auditory and visual stimulus could affect neural activity in the dSC leading to a change in behavior. This type of mechanism could represent a relatively low-level example of crossmodal integration where the response to one stimulus modality directly affects the response to another modality within the same structure. This type of mechanism could easily account for the "best-of-both worlds" result seen in crossmodal behavioral studies (e.g., Corneil et al. 2002Go), where subjects benefit from the spatial accuracy provided by the visual component and the shorter movement onset triggered by the auditory component of a spatially aligned audiovisual target.

One might expect a similar mechanism to influence the generation of express saccades to the high-intensity stimuli. However, this was not the case. The onset of the visual response occurred so soon after stimulus presentation (Fig. 8), leaving little opportunity for the auditory stimulus to bias the previsual activity before the arrival of the visual response in the dSC. Essentially, all 3 high-intensity stimulus conditions were evoking responses too soon after target onset to evoke a differential proportion of express saccades under the conditions used in this study.

Audiovisual stimuli enhance premotor activity in the dSC

As mentioned previously, saccades are believed to be triggered when neural activity exceeds saccadic threshold. It has been demonstrated in the dSC (Paré and Hanes 2003Go) and frontal eye fields (Hanes and Schall 1996Go) that variance in SRT is linked to changes in the rate of rise of premotor activity among saccade neurons: the steeper the rise, the sooner the threshold will be crossed and the shorter the SRT. These observations were made in a countermanding task, which produces a broad distribution of SRTs. An attempt to calculate the rate of rise of premotor activity (i.e., the slope of the spike density function preceding saccade onset) with our data set proved extremely difficult. Specifically, there was insufficient variance in SRT to allow subtle changes in slope to show meaningful correlations. In addition, because SRTs in our task were so short (Fig. 2), it was difficult to locate a period of time that consistently offered a stable linear rise in neural activity from trial to trial.

We therefore used a different approach to characterize the premotor activity of neurons in the dSC (Fig. 11). Using a floating correlation analysis, we identified a premotor epoch that was significantly correlated with SRT. High-intensity audiovisual stimuli evoked significantly greater premotor activity compared with the unimodal visual condition, thus facilitating the earlier crossing of saccadic threshold (Fig. 12). This significant enhancement of neural activity was strongest for spatially aligned audiovisual stimuli, likely representing a crossmodal interaction between the visual and auditory stimuli. Although the initial sensory response did not show a significant enhancement (Fig. 9), which could indicate a ceiling effect for such highly salient visual stimuli, these results show that the benefit of crossmodal interactions between auditory and visual stimuli can extend beyond the initial sensory response and affect premotor processing as well.

Interestingly, the misaligned audiovisual stimulus showed a similar trend that failed to reach significance (Fig. 12). This somewhat counterintuitive result suggests that the benefit of auditory stimuli could, in fact, be 2-fold. Presenting an auditory stimulus in combination with a visual target can provide additional spatial information (if presented in spatial alignment with the visual stimulus), in essence serving as a redundant target (Forster et al. 2002Go; Miller 1991Go; Murray et al. 2001Go). The auditory stimulus can also provide a potent nonspatial "alerting effect" (e.g., Farah et al. 1989Go; Ross and Ross 1981Go) that can further serve to facilitate orienting. These 2 effects are consistent with a recently proposed 2-stage model of crossmodal integration (Arndt and Colonius 2003Go; Colonius and Arndt 2001Go). These authors propose that the influence of stimulus intensity and location on SRT constitutes 2 separable mechanisms linked to the early unimodal pathways and later integrative processes, respectively. In the misaligned audiovisual condition, monkeys are still able to benefit from this latter effect, particularly with a highly salient auditory stimulus. The misaligned auditory stimulus likely served as a nonspatial "alerting" cue to trigger saccades with shorter SRTs, which may account for the increased premotor activity compared with the unimodal visual stimulus (Fig. 12).

One interesting question still remains: what is driving the observed effect of stimulus modality on premotor activity? Although our data do not address this question directly, one logical possibility is that the cerebral cortex is somehow involved. The importance of cortical inputs for crossmodal integration in the cat dSC has been demonstrated (e.g., Jiang et al. 1999Go, 2001Go; Wallace and Stein 1997Go, 1999Go; Wilkinson et al. 1996Go). Perhaps the effect of stimulus modality on premotor activity depends on descending cortical inputs that are absent or less active under unimodal conditions. Further study will be necessary to better understand the relationship between the SC, cortex, and crossmodal integration in awake monkeys.

Interaction between spatial location and stimulus intensity

Consistent with previous studies (Frens and Van Opstal 1998Go; Meredith and Stein 1996Go; Stein et al. 1989Go; Wallace et al. 1998Go), the behavioral and neuronal consequences of audiovisual stimulation depended on their spatial alignment and relative intensity. The interaction between these 2 variables, however, revealed an additional interesting point. The majority of direction errors in the high-intensity stimulus sessions were generated to the misaligned audiovisual stimulus, whereas those generated in the low-intensity stimulus sessions were equally distributed among the 3 stimulus conditions. Furthermore, if the saccade was directed toward the visual target correctly in the misaligned condition, monkeys did not show a significant increase in SRT for the high-intensity stimuli but did for the low-intensity stimuli (Figs. 3 and 4). Thus it is possible that in the high-intensity case, the monkeys were either able to ignore the spatial ambiguity of the auditory stimulus or were distracted by it to the point that they oriented toward the stimulus incorrectly (often correcting themselves in a subsequent saccade). In the case of the low-intensity stimuli, monkeys appeared less prone to orient to the auditory stimulus but still showed a negative effect on behavior. This effect of stimulus intensity is unlike the principle of inverse effectiveness previously described in the anesthetized preparation (Meredith and Stein 1986Go; Wallace et al. 1996Go) and thus represents one more example of how crossmodal integration in the awake monkey is subject to different/additional factors compared with the anesthetized animal (e.g., state of visual fixation: Bell et al. 2003Go; spatial attention: Bell and Munoz 2002Go).

In conclusion, we have shown that aligned audiovisual stimuli facilitate shorter latency saccades in at least 2 ways. In the case of low-intensity stimuli, aligned audiovisual stimuli reduced the mean SRT by increasing the proportion of shorter-latency saccades through a reduction in the ROLs of neurons in the dSC. In the case of high-intensity stimuli, the aligned audiovisual condition increased the premotor activity of dSC neurons, facilitating the generation of regular-latency saccades with shorter latencies. Thus our results demonstrate that crossmodal interactions in the dSC go beyond the sensory response to also influence premotor processing and orienting behavior.


    GRANTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
This work was supported by the Human Frontiers Science Program (RG0174/1998-B) and the Canadian Institutes of Health Research. A. H. Bell was supported by a Doctoral Research Award from the Canadian Institutes of Health Research and D. P. Munoz was supported by the Canada Research Chair Program.


    ACKNOWLEDGMENTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
The authors thank A. Lablans, D. Hamburger, C. Wellstood, F. Paquin, and K. Moore for invaluable assistance and technical expertise; M. Van Wanrooij for help with data collection; and J. Fecteau, S. Boehnke, J. Gore, I. Armstrong, and R. Marino for commenting on earlier versions of this manuscript.


    FOOTNOTES
 
The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

Address for reprint requests and other correspondence: D. Munoz, Centre for Neuroscience Studies, Queen’s University, Kingston, Ontario, Canada K7L 3N6 (E-mail: doug{at}eyeml.queensu.ca)


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION