1Centre for Neuroscience Studies, Canadian Institutes of Health Research Group in Sensory-Motor Systems, Department of Physiology, Queen's University, Kingston, Ontario, Canada; 2Department of Anatomy and Neurobiology, Virginia Commonwealth University School of Medicine, Richmond, Virginia; and 3Institute for Neuroscience, Department of Biophysics, Radboud University Nijmegen, Nijmegen, The Netherlands
Submitted 29 November 2004; accepted in final form 5 February 2005
| ABSTRACT |
|---|
| INTRODUCTION |
|---|
One area that has been implicated in both crossmodal integration and orienting behavior is the midbrain superior colliculus (SC) (see Stein and Meredith 1993 for review). Neurons in the intermediate/deep layers of the SC (dSC) are involved in the initiation and control of saccades (for review, see Munoz and Fecteau 2002; Sparks 1999; Sparks et al. 2001) and many of these same neurons are also capable of integrating converging sensory inputs, resulting in stronger responses (Bell et al. 2003; Frens and Van Opstal 1998; King and Palmer 1985; Meredith and Stein 1986; Populin and Yin 2002; Wallace et al. 1996). Crossmodal interactions depend on several factors. For example, they are greatest for spatially aligned stimuli and decrease in magnitude as spatial disparity increases (Bell et al. 2001a; Frens and Van Opstal 1998; Meredith and Stein 1996). Stimulus intensity also influences the magnitude of crossmodal interactions: they are greatest for pairings of weaker stimuli ("principle of inverse effectiveness"; Meredith and Stein 1986). These same factors are known to influence the behavioral benefits of crossmodal integration (Corneil et al. 2002; Stein et al. 1989).
The objective of the current study is to investigate how crossmodal processing in the dSC contributes to the reduction of saccadic reaction times (SRTs) to combined audiovisual stimuli. We recorded single-unit activity from neurons in the dSC while monkeys generated saccades to visual and audiovisual stimuli. We varied both the intensity and spatial relationship of the visual and auditory stimuli to elicit crossmodal interactions of different strength. Two different pairs of auditory and visual stimulus intensities were selected and, on combined audiovisual trials, the visual and auditory stimuli were placed either in spatial alignment or in opposite hemifields. These stimulus combinations were presented in 2 interleaved saccade tasks. The gap-saccade task was used to facilitate the merging of the sensory- and motor-related activities, thereby increasing the probability that changes to the sensory response will affect behavior. The delayed-saccade task was used to dissociate sensory and motor activity, allowing assessment of the effect of stimulus condition on the sensory and motor responses independently. We show that the nature of the behavioral benefit provided by aligned audiovisual stimuli is different for high- versus low-intensity stimuli and that these behavioral effects are correlated with early versus later influences of the additional auditory stimulus. Preliminary data have been presented in abstract form (Bell et al. 2001b).
| METHODS |
|---|
All procedures were approved by the Queen's University Animal Care Committee and were in accordance with the Canadian Council on Animal Care policy on the use of laboratory animals in research. Three adult male rhesus monkeys (Macaca mulatta), weighing between 6 and 12 kg, were used in this study. Animals were prepared for chronic experiments in one aseptic surgical session. Anesthesia was induced with an injection of ketamine hydrochloride [10 mg/kg, administered intramuscularly (im)] and maintained during surgery with isoflurane (1–2%), which was administered by an endotracheal tube. Heart rate, respiratory rate, and body temperature were monitored closely throughout the surgical session.
A single craniotomy (19 mm in diameter) was performed using stereotaxic techniques and a stainless steel recording chamber was centered on the midline and oriented 38–40° posterior from vertical, which allowed access to both SC. The recording chamber and a stainless steel head post (for securing the animal's head during the recording sessions) were embedded in an explant formed with dental acrylic. The explant was secured to the animal's skull using stainless steel, surgical-quality screws. Eye coils, preformed from insulated stainless steel wire (19–20 mm, 3 turns), were inserted under the conjunctiva of each eye (Judge et al. 1980) and used to measure eye position (Robinson 1963). The eye-coil leads were led subcutaneously to the explant and the connectors were embedded within the acrylic.
Animals were given prophylactic antibiotics (enrofloxacin, 5 mg/kg im) and analgesic medication (buprenorphine hydrochloride, 0.01 mg/kg im), administered daily for 7 days after surgery. They were allowed a recovery period of 2 wk before initiation of behavioral training.
Experimental procedures and behavioral paradigm
The experiments took place in darkened, sound-attenuated rooms. The animals were seated in a primate chair (Crist Instruments) with their heads restrained. Monkeys Z and R were seated 94 cm from a tangent screen spanning approximately ±45° of the visual field. Experiments with Monkey O were performed in a different lab, in which the monkey was seated 86 cm from a tangent screen spanning approximately ±35° of the visual field. Between trials, the tangent screen was illuminated diffusely to prevent dark adaptation.
The monkeys were trained to perform 2 interleaved audiovisual saccade tasks (Fig. 1). Each trial began with the removal of the background light and the appearance of a central visual fixation point (FP) that was back-projected onto the tangent screen. The monkeys fixated the FP for a period of 800–1,200 ms, after which one of the following 2 tasks was presented: on half of the trials, the saccade target was presented but the FP remained illuminated for an additional 400–800 ms before being extinguished ("DELAYED-SACCADE TASK"; Fig. 1A). The removal of the FP was the animal's cue to generate a saccade toward the target. On the other half of the trials, the FP was extinguished 200 ms before the presentation of the target ("GAP-SACCADE TASK"; Fig. 1B). The 2 different saccade tasks were randomly interleaved with equal distribution.
The visual stimulus was generated with either a laser (Power Technologies) or an LED to yield 2 very different intensities (8.0 and 0.05 cd/m², respectively). The auditory stimulus consisted of a white noise burst, produced by small 4-cm, 8.0-Ω speakers suspended in front of the tangent screen, facing the animal. The stimuli were grouped to yield 2 intensity pairs: high-intensity stimuli (visual: 8.0 cd/m²; auditory: 46–51 dB, A-weighted) and low-intensity stimuli (visual: 0.05 cd/m²; auditory: 43.5 dB, A-weighted). Monkey Z performed the experiment exclusively with high-intensity stimuli, whereas Monkeys O and R performed the experiment under both conditions.
Monkeys were given a liquid reward if they maintained central fixation (i.e., held their eyes stable within 2–3° of the FP) for the duration of the fixation period and generated a saccade within 400 ms of FP disappearance (in the delayed-saccade task) or visual target onset (in the gap-saccade task) without first orienting to the auditory stimulus (in the case of the misaligned audiovisual condition). They worked until fully satiated, at which point they were returned to their home cages. Weight and water intake were recorded on a daily basis and the institute veterinarian monitored the animals closely throughout the study.
Recording techniques and receptive field mapping
Extracellular, single-neuron activity was recorded using tungsten microelectrodes (Frederick Haer) with impedances of 0.5–3 MΩ at 1 kHz. Electrodes were driven with a hydraulic microdrive (Narishige MO-95) through stainless steel guide tubes supported by a Delrin grid placed inside the recording chamber (Crist et al. 1988). Single-neuron activity was sampled at 1 kHz after passing through a window discriminator (Bak Electronics) that excluded action potentials that did not meet both amplitude and temporal constraints. The behavioral paradigms as well as storage of eye position and neural activity were controlled by a Pentium PC running a real-time data-acquisition software package (REX Ver. 5.4; Hays et al. 1982). Horizontal and vertical eye positions were sampled at 500 Hz.
To approximate a neuron's visual receptive field, a handheld ophthalmoscope was used to back-project moving spots and bars of light onto the tangent screen while the monkey maintained central fixation. In many instances, visual stimuli were also systematically presented throughout the visual field. The center of the receptive field was defined as the point where the maximum visual response was elicited.
Data analysis
The data were analyzed off-line using a Sun Ultra 60 Sparcstation running user-generated programs and a Pentium PC running MATLAB software (The MathWorks). Data were first run through an automated saccade detection program, which identified the beginning and end of each saccade based on velocity and acceleration template matching (Waitzman et al. 1991). All marks were later verified by the experimenter and adjusted when necessary. Before analysis, all incorrect trials were eliminated. Incorrect trials were defined as those in which: 1) the saccade was generated before the removal of the FP in the delayed-saccade task or during the gap in the gap-saccade task; 2) the saccade was generated toward the auditory stimulus during misaligned audiovisual trials; 3) the saccade landed outside the 2–3° acceptance window around the target; or 4) the saccade latency was >400 ms.
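The four exclusion criteria above can be expressed as a single predicate. The following is a minimal sketch rather than the authors' analysis code: the `Trial` record, its field names, and the choice of 3° as the acceptance radius (the text gives a 2–3° range) are assumptions made for illustration. SRT is taken relative to the go cue (FP removal in the delayed-saccade task, target onset in the gap-saccade task), so a negative value marks a premature saccade in either task.

```python
from dataclasses import dataclass


@dataclass
class Trial:
    """Hypothetical per-trial record; field names are illustrative only."""
    condition: str            # "visual", "aligned_av", or "misaligned_av"
    srt_ms: float             # saccade onset relative to the go cue
    toward_auditory: bool     # True if the saccade was aimed at the auditory stimulus
    landing_error_deg: float  # distance between saccade endpoint and target


def is_correct(trial, window_deg=3.0, max_srt_ms=400.0):
    """Return True if the trial passes all four exclusion criteria."""
    if trial.srt_ms < 0:                                   # 1) premature saccade
        return False
    if trial.condition == "misaligned_av" and trial.toward_auditory:
        return False                                       # 2) oriented to the sound
    if trial.landing_error_deg > window_deg:               # 3) outside acceptance window
        return False
    if trial.srt_ms > max_srt_ms:                          # 4) latency > 400 ms
        return False
    return True


trials = [
    Trial("visual", 150.0, False, 1.0),
    Trial("misaligned_av", 140.0, True, 1.0),
    Trial("aligned_av", -20.0, False, 1.0),
]
correct = [t for t in trials if is_correct(t)]
```

Only the first of the three example trials survives the filter.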
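Neuronal responses were analyzed by constructing spike density functions (Richmond and Optican 1987) based on a normal (Gaussian) probability distribution, as follows

$$A(t) = \frac{1}{\sigma\sqrt{2\pi}}\exp\left(-\frac{t^{2}}{2\sigma^{2}}\right) \qquad (1)$$

where each spike was replaced by a Gaussian pulse with a standard deviation (σ) of 4 ms. The individual pulses were summed together to yield a single spike density function for each trial.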
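The Gaussian spike density construction described above can be sketched in a few lines. This is an illustrative sketch under stated assumptions, not the authors' code: the function name `gaussian_sdf`, the 1-ms output grid (chosen to match the 1-kHz sampling rate), and the conversion to spikes/s are choices made here for clarity.

```python
import math


def gaussian_sdf(spike_times_ms, duration_ms, sigma_ms=4.0):
    """Sum one unit-area Gaussian pulse per spike; return spikes/s on a 1-ms grid."""
    norm = 1.0 / (sigma_ms * math.sqrt(2.0 * math.pi))
    sdf = []
    for t in range(duration_ms):
        density_per_ms = sum(
            norm * math.exp(-((t - s) ** 2) / (2.0 * sigma_ms ** 2))
            for s in spike_times_ms
        )
        sdf.append(1000.0 * density_per_ms)  # per-ms density -> spikes/s
    return sdf


# A single spike at 50 ms produces a pulse centered on 50 ms.
sdf = gaussian_sdf([50], 100)
peak_time_ms = sdf.index(max(sdf))
```

Because each pulse has unit area, the integral of the trace over time equals the number of spikes, which makes peak and mean values from different trials directly comparable.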
Neuronal responses were quantified in several ways by measuring activity in the following epochs. To evaluate the effect of the auditory stimulus on the activity preceding the onset of the visual response (Bell et al. 2003, 2004; Wallace et al. 1996), the activity 40–50 ms after target onset was measured (previsual epoch). The sensory epoch was used to measure the magnitude of the target-aligned sensory response and differed according to task and stimulus intensity. In the delayed-saccade task, the sensory epoch was defined as the peak spike density 0–200 ms after target onset for both stimulus intensities. In the gap-saccade task, the sensory epoch was defined as the peak spike density 50–100 and 70–120 ms after target onset for high- and low-intensity stimuli, respectively. The different epochs were necessary as a result of the different response onset latencies for high- versus low-intensity stimuli (see RESULTS). To minimize the inclusion of motor activity in the calculation of sensory response magnitude in the gap-saccade task, trials with SRTs <120 and <140 ms for high- and low-intensity stimuli, respectively, were excluded for this portion of the analysis.
The premotor epoch, used as an estimate of the relative state of motor readiness after the onset of the initial sensory response but before the saccade-aligned burst of activity (see RESULTS for details), was defined as the peak activity 80–90 ms after target onset for high-intensity stimuli. The threshold epoch, used as an estimate of the amount of activity necessary to evoke a saccade ("saccadic threshold"; see RESULTS and DISCUSSION; Carpenter and Williams 1995; Hanes and Schall 1996; Paré and Hanes 2003), was defined as the average level of activity from 20 to 10 ms immediately before saccade onset. This epoch was chosen because it approximates the delay between activity in the SC and saccade initiation. Microstimulation studies have shown that saccades can be elicited as early as 20 ms after dSC stimulation (Robinson 1972; Stanford et al. 1996) and perturbed or modified in midflight about 10 ms after SC stimulation (Gandhi and Keller 1999; Miyashita and Hikosaka 1996; Munoz and Wurtz 1993; Sparks and Mays 1983). The saccade-aligned activity was defined as the average activity ±5 ms surrounding saccade onset.
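The two saccade-aligned measures defined above reduce to window averages over a spike density trace. The sketch below assumes a trace sampled every 1 ms and indexed from target onset, with inclusive window edges; the helper names are hypothetical and the edge convention is an assumption, since the text does not specify it.

```python
def mean_in_window(sdf, start_ms, stop_ms):
    """Mean of a 1-ms-sampled trace over [start_ms, stop_ms], inclusive."""
    window = sdf[start_ms:stop_ms + 1]
    return sum(window) / len(window)


def threshold_activity(sdf, saccade_onset_ms):
    """Average activity from 20 to 10 ms before saccade onset (threshold epoch)."""
    return mean_in_window(sdf, saccade_onset_ms - 20, saccade_onset_ms - 10)


def saccade_aligned_activity(sdf, saccade_onset_ms):
    """Average activity within +/-5 ms of saccade onset."""
    return mean_in_window(sdf, saccade_onset_ms - 5, saccade_onset_ms + 5)


# Toy trace that rises by 1 spike/s per ms, with a saccade at 150 ms.
ramp = [float(i) for i in range(300)]
thr = threshold_activity(ramp, 150)
sac = saccade_aligned_activity(ramp, 150)
```

On this linear ramp each measure simply returns the value at the center of its window, which is a convenient sanity check before applying the functions to real traces.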
To estimate the onset of target-related activity, we generated a second spike density function based on an exponential growth/decay function (Thompson et al. 1996). This asymmetric activation waveform mimics an excitatory postsynaptic potential and is physiologically more plausible for estimating response onset latency than a Gaussian activation function. A spike exerts an influence only forward and not backward in time and so the symmetrical Gaussian function underestimates response onset latencies. The spike density function was obtained by convolving each spike with the following function

$$A(t) = \left[1 - \exp\left(-\frac{t}{\tau_{g}}\right)\right]\exp\left(-\frac{t}{\tau_{d}}\right) \qquad (2)$$

where the activation level A as a function of time t is governed by τg, the growth time constant that was set to 1 ms, and τd, the decay time constant that was set to 20 ms. Response onset latency was defined as the point where the activation level exceeded baseline (defined as 400–0 and 100–0 ms before target onset in the delayed- and gap-saccade tasks, respectively) plus 3 SDs. The activity had to remain above this level for a minimum of 10 ms to be classified as a valid response. Population analyses of the behavioral and neuronal data were performed using pairwise Wilcoxon signed-rank sum tests unless otherwise stated. In all instances, an alpha of 0.05 was chosen as significant. All data are presented as mean ± SE, unless otherwise indicated. For display purposes only, population spike density functions are shown as floating averages of 10-ms bin widths, plotted every 5 ms (i.e., 0–10, 5–15, 10–20 ms, etc.).
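The asymmetric kernel and the onset criterion described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, both traces are assumed to be sampled every 1 ms, the sample standard deviation is used for the baseline, and "a minimum of 10 ms" is read as 10 consecutive samples above criterion.

```python
import math
import statistics


def epsp_kernel(t_ms, tau_g=1.0, tau_d=20.0):
    """Growth/decay kernel: zero before the spike, fast rise, slow decay."""
    if t_ms < 0:
        return 0.0
    return (1.0 - math.exp(-t_ms / tau_g)) * math.exp(-t_ms / tau_d)


def response_onset_latency(activity, baseline, n_sd=3.0, min_duration_ms=10):
    """First time (ms after target onset) at which activity exceeds
    baseline mean + n_sd * SD and stays there for min_duration_ms samples.
    Returns None when no such run exists."""
    criterion = statistics.mean(baseline) + n_sd * statistics.stdev(baseline)
    run = 0
    for t, value in enumerate(activity):
        run = run + 1 if value > criterion else 0
        if run >= min_duration_ms:
            return t - min_duration_ms + 1
    return None


# A 5-ms blip at 60 ms is too brief; the sustained response starts at 70 ms.
baseline = [10.0, 12.0] * 50
activity = [11.0] * 60 + [50.0] * 5 + [11.0] * 5 + [50.0] * 30
rol = response_onset_latency(activity, baseline)
```

The duration requirement is what rejects the brief blip in the example, so isolated noise excursions do not register as a response onset.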
Neuron classification
Neurons in the dSC were classified into one of 3 different categories (sensory only, sensory-motor, motor only) based on their sensory and motor response properties, assessed using the delayed-saccade task. All responses were first analyzed using an automated classification scheme. A sensory response was defined as activity >50 spikes/s above baseline (see above) after target onset. Neurons with sensory activity were then classified as visual, auditory, or bimodal (responsive to both unimodal visual and auditory stimuli) based on their individual responses. Auditory-only neurons were identified as those with no visual response but that were responsive to the auditory stimulus when it alone was presented to the receptive field in the misaligned audiovisual condition. A motor response was defined as saccade-aligned activity >80 spikes/s for saccades to the neuron's preferred direction and eccentricity in the delayed-saccade task. The experimenter later verified accuracy and consistency of all neuron classifications.
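The automated first pass of this classification amounts to two threshold tests. The sketch below is illustrative only: the function name, argument names, and the "unclassified" label for neurons meeting neither criterion are assumptions, and the manual verification step described in the text is not modeled.

```python
def classify_neuron(sensory_rate, baseline_rate, motor_rate,
                    sensory_criterion=50.0, motor_criterion=80.0):
    """Classify a neuron from delayed-saccade task activity (spikes/s).

    sensory_rate: target-aligned response after target onset
    baseline_rate: pre-target baseline activity
    motor_rate: saccade-aligned activity for the preferred saccade vector
    """
    has_sensory = (sensory_rate - baseline_rate) > sensory_criterion
    has_motor = motor_rate > motor_criterion
    if has_sensory and has_motor:
        return "sensory-motor"
    if has_sensory:
        return "sensory only"
    if has_motor:
        return "motor only"
    return "unclassified"  # label assumed here; such neurons fail both criteria


label = classify_neuron(sensory_rate=80.0, baseline_rate=10.0, motor_rate=120.0)
```

Note the asymmetry taken from the text: the sensory criterion is measured relative to baseline, whereas the motor criterion is an absolute firing rate.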
Laminar distribution of the neurons was estimated based on activity landmarks and relative depths. The superficial (dorsal-most) border of the SC was identified from the onset of brisk, visually evoked multiunit activity. The intermediate layers were then estimated to begin about 1,000–1,500 µm below this point, corresponding to the appearance of saccade-related activity (Ma et al. 1991). Neurons shallower than 1,500 µm below the dorsal-most superficial border but that displayed auditory and/or motor activity were also classified as intermediate layer neurons (Wallace et al. 1996). Neurons recorded from the superficial layers of the SC were excluded from all further analyses.
| RESULTS |
|---|
Three monkeys performed a total of 12,936 correct trials over the course of the recording sessions (Monkey R: 6,457, Monkey O: 4,340, Monkey Z: 2,129). Stimulus modality had no significant effect on saccadic reaction times (SRTs) in the delayed-saccade task for either high-intensity (mean SRT for visual: 250 ± 2 ms, aligned audiovisual: 251 ± 2 ms, misaligned audiovisual: 254 ± 2 ms; P values >0.3) or low-intensity stimuli (mean SRT for visual: 254 ± 1 ms, aligned audiovisual: 253 ± 1 ms, misaligned audiovisual: 256 ± 1 ms; P values >0.4). Likewise, there was no significant difference in mean SRT across the 2 stimulus intensities in the delayed-saccade task for any of the 3 stimulus conditions (P values >0.5).
In the gap-saccade task, stimulus modality and intensity had a significant influence on the mean and distribution of SRTs. We first describe the behavior elicited by the 2 stimulus intensities independently and then summarize with a comparison of the two.
Saccades to high-intensity stimuli. The distribution of SRTs for the 3 stimulus conditions (visual, aligned audiovisual, misaligned audiovisual) is shown in Fig. 2. For high-intensity stimuli, correct stimulus-triggered saccades began at about 65 ms after target onset for all stimulus conditions (Fig. 2, A, C, and E). Trials with SRTs <65 ms were classified as anticipatory responses because they had equal probability of being directed toward or away from the target (Fig. 3A) and were eliminated from further analysis.
χ² = 9.35, P < 0.01). SRTs for correct, stimulus-triggered saccades formed a bimodal distribution, corresponding to express (SRTs: 65–95 ms; shaded portion, Fig. 2, A, C, and E) and regular-latency saccades (SRTs: >95 ms; Fischer and Boch 1983
A detailed breakdown of the effect of stimulus condition on express saccade generation and mean SRT is shown in Fig. 3, B and C. High-intensity stimuli evoked relatively few express saccades, which were distributed evenly across the 3 stimulus conditions (Fig. 3B; χ² = 2.12, P > 0.3). There was no significant difference in the mean SRT of express saccades across the 3 stimulus conditions ("ES only," Fig. 3C; mean SRT for visual: 80 ± 1 ms, aligned audiovisual: 82 ± 1 ms, misaligned audiovisual: 81 ± 1 ms; Wilcoxon rank-sum tests, P values >0.2). When express saccades were included in the calculation of mean SRT, the advantage of placing an auditory stimulus on the same side as the visual target failed to reach significance ("with ES," Fig. 3C; mean SRT for visual: 158 ± 2 ms, aligned audiovisual: 154 ± 2 ms, misaligned audiovisual: 165 ± 2 ms; Wilcoxon rank-sum test, P < 0.15). However, when express saccades were excluded, a significant advantage of the aligned audiovisual condition was revealed ("no ES," Fig. 3C; mean SRT for visual: 164 ± 2 ms, aligned audiovisual: 158 ± 2 ms, misaligned audiovisual: 170 ± 2 ms, P < 0.05). There was also a trend for longer SRTs to the misaligned audiovisual condition that approached significance in both cases (P = 0.07 and P = 0.06, respectively).
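The latency categories used for high-intensity stimuli in the gap-saccade task (anticipatory <65 ms, express 65–95 ms, regular >95 ms) and the "with ES" versus "no ES" means can be sketched as below. The function names and the example SRT values are hypothetical; only the cutoffs come from the text.

```python
def classify_srt_high_intensity(srt_ms):
    """Assign a gap-task saccade to a latency category (high-intensity cutoffs)."""
    if srt_ms < 65:
        return "anticipatory"
    if srt_ms <= 95:
        return "express"
    return "regular"


def mean_srt(srts_ms, include_express=True):
    """Mean SRT of stimulus-triggered saccades, with or without express saccades."""
    keep = {"express", "regular"} if include_express else {"regular"}
    kept = [s for s in srts_ms if classify_srt_high_intensity(s) in keep]
    return sum(kept) / len(kept)


example_srts = [50, 80, 90, 150, 170]   # made-up values for illustration
with_es = mean_srt(example_srts)                       # anticipatory trial dropped
no_es = mean_srt(example_srts, include_express=False)  # express trials also dropped
```

Excluding express saccades necessarily raises the mean, which is why the text reports the two versions separately for each stimulus condition.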
Saccades to low-intensity stimuli. Contrary to what was observed after high-intensity stimuli, the proportion of saccades generated toward versus away from the low-intensity visual target was influenced by stimulus condition (Fig. 2, B, D, and F). Figure 4A plots the performance as a function of time from target onset for each of the low-intensity stimulus conditions. Initially, all 3 low-intensity stimulus conditions averaged an equal number of correct versus incorrect responses, identifying these saccades as anticipatory responses. However, at about 55 ms after target onset, the proportion of correct saccades directed toward the aligned audiovisual stimulus (blue traces; Fig. 4A) increased. At about the same time, the proportion of correct saccades to the misaligned audiovisual stimulus (green traces; Fig. 4A) decreased substantially. This suggests that the auditory stimulus was contributing to the triggering of saccades, in the correct direction when aligned with the visual target and in the incorrect direction when presented to the opposite hemifield.
80% correct in the respective conditions and is thus well above "chance." To evaluate the extent to which the behavioral effects observed for low-intensity, aligned audiovisual stimuli are attributable to the additional trials with SRTs 70–100 ms (which, in the case of the other 2 low-intensity stimulus conditions, were classified as anticipatory and thus excluded), we report the values and statistics for when these trials are included and excluded. It is also important to note that cutoffs within ±10 ms of those selected yielded the same results. Saccades generated away from the saccadic target, but with SRTs beyond the range for anticipatory responses (i.e., direction errors), showed no systematic effect of stimulus condition (visual: 84/269, aligned audiovisual: 80/269, misaligned audiovisual: 105/269; χ² = 4.02, P > 0.1). The mean SRTs for the 3 low-intensity stimulus conditions are shown in Fig. 4B. Saccades to the aligned audiovisual stimulus (mean SRT: 155 ± 1 ms) had significantly shorter SRTs compared with the unimodal visual stimulus (mean SRT: 160 ± 1 ms; P < 0.001). When saccades to the aligned audiovisual stimulus with SRTs 70–100 ms were eliminated (mean SRT: 158 ± 1 ms, light blue bar, Fig. 4B), the trend persisted but failed to reach statistical significance (P = 0.08). Furthermore, saccades to the misaligned audiovisual stimulus (mean SRT: 165 ± 1 ms) had, on average, significantly longer SRTs compared with the unimodal visual stimulus (P < 0.01).
The above analyses revealed several important differences between saccades generated to high- versus low-intensity stimuli in the gap-saccade task, which we propose are linked to early versus later influences of the auditory stimulus on activity in the dSC. To address this hypothesis, we separated the remaining analyses according to stimulus intensity into 2 independent investigations of crossmodal integration at the neuronal level and their consequences on behavior.
Analysis of neuronal activity
A total of 132 neurons were recorded from the dSC of 3 monkeys [Monkey R: 57 (high-intensity: 9; low-intensity: 48), Monkey Z: 36 (all high); Monkey O: 39 (high: 13; low: 29; N.B. 3 neurons were recorded using both intensities)]. Of these, 109 (83%) exhibited sensory and/or motor-related activity that satisfied our criteria for analysis (see METHODS). Over half of these neurons exhibited both stimulus- and saccade-related activity (64/109; 59%). Smaller proportions had stimulus-related activity but no saccade-related activity (25/109; 23%) or only saccade-related activity (20/109; 18%). The majority of neurons with stimulus-related activity responded exclusively to visual stimuli (79/89; 89%). A smaller proportion responded to both visual and auditory stimuli ("bimodal"; 9/89; 10%) and one neuron in our sample population responded to auditory stimuli only (1/89; 1%). Because this latter neuron responded only to the combined audiovisual target (and not the unimodal visual target) and because of the small sample size, it was eliminated from further analysis.
Crossmodal integration underlying saccades to low-intensity stimuli
The behavioral advantage provided by low-intensity, aligned audiovisual stimuli was driven by a reduction in the onset of the earliest correct stimulus-triggered saccades relative to the other 2 conditions (Fig. 4). Auditory stimuli are known to elicit responses in the dSC with shorter response onset latencies but decreased response magnitude compared with those elicited by visual stimuli (Bell et al. 2003, 2004; Jay and Sparks 1987; Wallace et al. 1996). We examined the effect of the auditory stimulus on activity preceding the onset of the visual response (previsual activity), the response onset latency (ROL), and the magnitude of the target-aligned sensory response.
Variable response onset latency and previsual activity. Figure 5, A–C compares the mean activity of all sensory and sensory-motor neurons for trials where the visual target appears inside (solid traces) versus opposite to (dashed traces) the receptive field of the neuron. The point where these 2 curves diverge approximates when activity related to the sensory stimuli is being registered by the neurons. In the case of the aligned audiovisual stimulus condition (Fig. 5B), the curves initially diverge at about 30 ms, which likely represents the arrival of the earliest auditory input to the dSC. The curves then diverge dramatically at about 70–80 ms, likely representing the arrival of the visual input. Interestingly, there is also an early divergence in the case of the misaligned audiovisual condition but in the opposite direction. In this case, the auditory stimulus, located in the opposite hemifield from the visual target, falls into the receptive field of neurons represented by the dashed traces in Fig. 5 and so there is a slight increase in activity about 40 ms after target appearance (Fig. 5C).
As shown in Fig. 3, the behavioral benefit of high-intensity aligned audiovisual stimuli was strongest for regular-latency saccades (i.e., SRTs >95 ms) and had little effect on the generation of express saccades (SRTs 65–95 ms). We assessed the contribution of changes to the ROL and sensory response magnitude to changes in SRTs of regular-latency saccades to high-intensity stimuli. Two correlation analyses were performed, examining the relationship between ROL and sensory response magnitude with the SRTs of high-intensity, regular-latency saccades (Fig. 10). All express saccades were removed from the following analyses.
To gain further insight into what was driving the changes in SRT, we performed a "floating correlation" analysis on all regular-latency saccades to the 3 high-intensity stimulus conditions (Fig. 11). All express saccades have been removed from this analysis, which is restricted to sensory-motor neurons only (accounting for why these curves do not match those in Fig. 9B). For this analysis, the mean spike density in 10-ms bins of all sensory-motor neurons was correlated with SRT, every 5 ms (i.e., 0–10, 5–15 ms, etc.). The initial sensory burst, which ranged from about 40 to 80 ms, exhibited relatively weak correlations. However, shortly after the peak of the sensory burst, neural activity became more and more negatively correlated with SRT, achieving the strongest negative correlation at about 100 ms after target onset (preceding saccade onset by about 50 ms). Beyond this point, activity once again became less and less correlated with SRT as it approached the range of saccade onsets. Importantly, the curves for all 3 stimulus conditions appeared similar.
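A floating correlation of this kind slides a 10-ms window along the spike density traces in 5-ms steps and correlates the windowed activity with SRT at each step. The sketch below is a rough illustration under several assumptions: it uses Pearson's r (the text does not name the correlation statistic), treats each input trace as one observation paired with one SRT (the text does not state whether the unit of observation is the trial or the neuron), and uses hypothetical function names.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient; NaN when either input has no variance."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / math.sqrt(sxx * syy)


def floating_correlation(traces, srts_ms, bin_ms=10, step_ms=5):
    """Return (bin_start_ms, r) pairs for windows 0-10, 5-15, 10-20 ms, ...

    traces: equal-length, 1-ms-sampled spike density traces aligned to target onset
    srts_ms: one SRT per trace
    """
    n_samples = len(traces[0])
    results = []
    for start in range(0, n_samples - bin_ms + 1, step_ms):
        bin_means = [sum(tr[start:start + bin_ms]) / bin_ms for tr in traces]
        results.append((start, pearson_r(bin_means, srts_ms)))
    return results


# Toy data: higher activity goes with shorter SRT, so r is -1 in every bin.
toy_traces = [[60.0] * 20, [50.0] * 20, [40.0] * 20]
toy_srts = [100.0, 120.0, 140.0]
curve = floating_correlation(toy_traces, toy_srts)
```

Plotting `r` against bin start time would give a curve of the sort described in the text, where the most negative value marks the epoch whose activity best predicts SRT.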
Figure 12 compares the mean level of activity during the premotor epoch, defined here as the mean spike density 80–90 ms after target onset (shown as the shaded portion, Fig. 11), for all sensory-motor neurons (sensory-only neurons were omitted from the remaining analyses as they do not contribute to the motor output of the dSC). To avoid including activity that might be part of the saccade-aligned burst, while still including the majority of relevant trials, we excluded all trials with SRTs <110 ms.
To confirm that reductions in saccadic threshold or increases in the relative motor output of the dSC are not contributing to the reduction in mean SRT for regular-latency saccades to high-intensity, aligned audiovisual stimuli, we examined the effect of stimulus modality on these 2 motor epochs (see METHODS). Neither of these variables was well correlated with SRT (Fig. 13, A and B), nor did they exhibit any systematic relationship with stimulus modality (Fig. 13, C and D). Thus it would appear that high-intensity audiovisual stimuli drive changes in SRT by influencing the processes leading up to the crossing of saccadic threshold but are no longer influential once a saccade is triggered and about to be executed.
| DISCUSSION |
|---|
Audiovisual stimuli accelerate the onset of stimulus-triggered saccades
Recent neurophysiological studies and models of saccadic initiation have postulated that neural activity among saccade neurons must exceed a threshold level to initiate a saccade ("saccadic threshold"; Carpenter and Williams 1995; Hanes and Schall 1996; Trappenberg et al. 2001). SRT is thus determined by the time it takes for a neural population to achieve this level of activation and any factors capable of influencing this time will have a direct impact on the SRT.
Presenting an auditory stimulus at the same time and place as the visual target in the low-intensity condition resulted in a significant increase in activity before the onset of the visual burst (Fig. 6), which appeared to facilitate the onset of the visual response relative to the other conditions (Fig. 5D). Increases in activity before the onset of the visual response have a strong impact on SRTs (Dorris et al. 1997). In both cases, the stronger activity preceding the onset of the visual burst likely facilitates the generation of shorter-latency saccades by providing a "head start" to the neural population ultimately responsible for triggering the saccade.
It would also appear that when the auditory stimulus appeared in the opposite hemifield from the visual target, a proportion of saccades was triggered to this distracting stimulus, eliminating any benefit of the additional auditory stimulus (Fig. 4). The presence of activity in the opposite dSC brought about by the misaligned auditory stimulus produced a competition between neurons in the 2 dSC (Munoz and Istvan 1998), potentially contributing to the longer SRTs observed after the misaligned audiovisual stimulus.
These data show how the combination of an auditory and visual stimulus could affect neural activity in the dSC, leading to a change in behavior. This type of mechanism could represent a relatively low-level example of crossmodal integration where the response to one stimulus modality directly affects the response to another modality within the same structure. It could easily account for the "best-of-both-worlds" result seen in crossmodal behavioral studies (e.g., Corneil et al. 2002), where subjects benefit from the spatial accuracy provided by the visual component and the shorter movement onset triggered by the auditory component of a spatially aligned audiovisual target.
One might expect a similar mechanism to influence the generation of express saccades to the high-intensity stimuli. However, this was not the case. The onset of the visual response occurred so soon after stimulus presentation (Fig. 8) that there was little opportunity for the auditory stimulus to bias the previsual activity before the arrival of the visual response in the dSC. Essentially, all 3 high-intensity stimulus conditions were evoking responses too soon after target onset to evoke a differential proportion of express saccades under the conditions used in this study.
Audiovisual stimuli enhance premotor activity in the dSC
As mentioned previously, saccades are believed to be triggered when neural activity exceeds saccadic threshold. It has been demonstrated in the dSC (Paré and Hanes 2003) and frontal eye fields (Hanes and Schall 1996) that variance in SRT is linked to changes in the rate of rise of premotor activity among saccade neurons: the steeper the rise, the sooner the threshold will be crossed and the shorter the SRT. These observations were made in a countermanding task, which produces a broad distribution of SRTs. An attempt to calculate the rate of rise of premotor activity (i.e., the slope of the spike density function preceding saccade onset) with our data set proved extremely difficult. Specifically, there was insufficient variance in SRT to allow subtle changes in slope to show meaningful correlations. In addition, because SRTs in our task were so short (Fig. 2), it was difficult to locate a period of time that consistently offered a stable linear rise in neural activity from trial to trial.
We therefore used a different approach to characterize the premotor activity of neurons in the dSC (Fig. 11). Using a floating correlation analysis, we identified a premotor epoch that was significantly correlated with SRT. High-intensity audiovisual stimuli evoked significantly greater premotor activity compared with the unimodal visual condition, thus facilitating the earlier crossing of saccadic threshold (Fig. 12). This significant enhancement of neural activity was strongest for spatially aligned audiovisual stimuli, likely representing a crossmodal interaction between the visual and auditory stimuli. Although the initial sensory response did not show a significant enhancement (Fig. 9), which could indicate a ceiling effect for such highly salient visual stimuli, these results show that the benefit of crossmodal interactions between auditory and visual stimuli can extend beyond the initial sensory response and affect premotor processing as well.
Interestingly, the misaligned audiovisual stimulus showed a similar trend that failed to reach significance (Fig. 12). This somewhat counterintuitive result suggests that the benefit of auditory stimuli could, in fact, be 2-fold. Presenting an auditory stimulus in combination with a visual target can provide additional spatial information (if presented in spatial alignment with the visual stimulus), in essence serving as a redundant target (Forster et al. 2002; Miller 1991; Murray et al. 2001). The auditory stimulus can also provide a potent nonspatial "alerting effect" (e.g., Farah et al. 1989; Ross and Ross 1981) that can further serve to facilitate orienting. These 2 effects are consistent with a recently proposed 2-stage model of crossmodal integration (Arndt and Colonius 2003; Colonius and Arndt 2001). These authors propose that the influence of stimulus intensity and location on SRT constitutes 2 separable mechanisms linked to the early unimodal pathways and later integrative processes, respectively. In the misaligned audiovisual condition, monkeys are still able to benefit from the nonspatial alerting effect, particularly with a highly salient auditory stimulus. The misaligned auditory stimulus likely served as a nonspatial "alerting" cue to trigger saccades with shorter SRTs, which may account for the increased premotor activity compared with the unimodal visual stimulus (Fig. 12).
One interesting question still remains: what is driving the observed effect of stimulus modality on premotor activity? Although our data do not address this question directly, one logical possibility is that the cerebral cortex is somehow involved. The importance of cortical inputs for crossmodal integration in the cat dSC has been demonstrated (e.g., Jiang et al. 1999, 2001; Wallace and Stein 1997, 1999; Wilkinson et al. 1996). Perhaps the effect of stimulus modality on premotor activity depends on descending cortical inputs that are absent or less active under unimodal conditions. Further study will be necessary to better understand the relationship between the SC, cortex, and crossmodal integration in awake monkeys.
Interaction between spatial location and stimulus intensity
Consistent with previous studies (Frens and Van Opstal 1998; Meredith and Stein 1996; Stein et al. 1989; Wallace et al. 1998), the behavioral and neuronal consequences of audiovisual stimulation depended on the spatial alignment and relative intensity of the stimuli. The interaction between these 2 variables, however, revealed an additional interesting point. The majority of direction errors in the high-intensity stimulus sessions were generated to the misaligned audiovisual stimulus, whereas those generated in the low-intensity stimulus sessions were equally distributed among the 3 stimulus conditions. Furthermore, if the saccade was directed toward the visual target correctly in the misaligned condition, monkeys did not show a significant increase in SRT for the high-intensity stimuli but did for the low-intensity stimuli (Figs. 3 and 4). Thus it is possible that in the high-intensity case, the monkeys either were able to ignore the spatial ambiguity of the auditory stimulus or were distracted by it to the point that they oriented toward it incorrectly (often correcting themselves in a subsequent saccade). In the case of the low-intensity stimuli, monkeys appeared less prone to orient to the auditory stimulus but still showed a negative effect on behavior. This effect of stimulus intensity is unlike the principle of inverse effectiveness previously described in the anesthetized preparation (Meredith and Stein 1986; Wallace et al. 1996) and thus represents one more example of how crossmodal integration in the awake monkey is subject to different or additional factors compared with the anesthetized animal (e.g., state of visual fixation: Bell et al. 2003; spatial attention: Bell and Munoz 2002).
In conclusion, we have shown that aligned audiovisual stimuli facilitate shorter latency saccades in at least 2 ways. In the case of low-intensity stimuli, aligned audiovisual stimuli reduced the mean SRT by increasing the proportion of shorter-latency saccades through a reduction in the ROLs of neurons in the dSC. In the case of high-intensity stimuli, the aligned audiovisual condition increased the premotor activity of dSC neurons, facilitating the generation of regular-latency saccades with shorter latencies. Thus our results demonstrate that crossmodal interactions in the dSC go beyond the sensory response to also influence premotor processing and orienting behavior.
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: D. Munoz, Centre for Neuroscience Studies, Queen's University, Kingston, Ontario, Canada K7L 3N6 (E-mail: doug{at}eyeml.queensu.ca).
| REFERENCES |
|---|