|
|
||||||||
Institute of Neuroscience, University of Oregon, Eugene, Oregon
Submitted 23 January 2006; accepted in final form 28 April 2006
|
|
ABSTRACT |
|---|
|
|
|
INTRODUCTION |
|---|
|
A certain amount of improved performance in multisensory tasks may be due to simple probability. In the same way that flipping two coins instead of one increases the chances of obtaining at least one "head," the probability of either hearing or seeing a weak multisensory stimulus is greater than the probability afforded by either modality alone. Raab (1962)
extended this idea of statistical facilitation to reaction times. According to his "race model, " the two sensory signals race along separate pathways toward a common response generation site, and the winning modality triggers the response. If stimulus conditions are such that the two modalities have an equal chance of winning the race, the race model predicts that the multisensory reaction time will be, on average, 0.6 SD earlier than the mean unisensory reaction time (Raab 1962
). Multisensory reaction times that are quicker than predicted are said to positively violate the race model (Hughes et al. 1994
; Miller 1982
). Positive violations have been taken as evidence that instead of remaining separate, the two modalities "converge," giving rise to reaction times that are quicker than the race model allows (Arndt and Colonius 2003
; Harrington and Peck 1998
; Hughes et al. 1994
, 1998
; Nozawa et al. 1994
). The convergence model is thus distinguished from the race model by its assumption that the two modalities converge to either form a new internal sensory representation, enhance the efficiency of the ensuing motor response, or both.
The study of audiovisual responses in mammalian systems has elucidated three general principles of multisensory integration (for review, Calvert et al. 2004
; Spence and Driver 2004
; Stein and Meredith 1993
). First, when the auditory and visual components of an audiovisual stimulus are aligned in space, the probability of correctly localizing the target is increased relative to trials in which they are misaligned (Jiang et al. 2002
; Stein et al. 1988
, 1989
). Conversely, spatial misalignment lengthens saccadic reaction times beyond what would be seen with one stimulus alone (Corneil and Munoz 1996
; McSorley et al. 2005
). Second, auditory and visual information must be aligned in time. If the timing of the auditory and visual stimuli compensates for the difference in processing times between the two modalities, facilitated behavioral (Corneil et al. 2002
) or neural responses (Meredith et al. 1987
) are more likely. Third, multisensory facilitation is more likely to occur when the unisensory components are presented at low-amplitudes or signal-to-noise ratios. This has been termed "inverse effectiveness" (Stein and Meredith 1993
). Most recently, Stanford et al. (2005)
showed that neural responses to audiovisual stimuli were most likely to be superadditive (i.e., facilitated beyond the predicted linear sum of the unisensory components) when near-threshold unisensory stimuli were presented. Similarly, feline detection/orientation behavior in response to audiovisual stimuli was most enhanced when unisensory stimuli of low efficacies (dim and quiet) were combined (Stein et al. 1989
). Moreover the saccades of human subjects to auditory, visual, and audiovisual stimuli in the presence of background auditory and visual noise were initiated as rapidly with saccadic reaction times (SRTs) typical of unisensory auditory trials and high accuracy typical of unisensory visual trials (Corneil et al. 2002
). To our knowledge, this tendency to observe the "best" features of acoustically and visually guided saccades had not been previously noted in studies using higher signal-to-noise ratios.
The present study takes an ethological approach to multisensory integration using the barn owl (Tyto alba) as a model. When the owl hears or sees an object of interest, it naturally turns its head to face the object. This stereotyped saccadic behavior, like primate eye movements, can be measured with precision (Knudsen and Konishi 1979
; Knudsen et al. 1979
). Moreover, the barn owl, renown for its ability to localize sound, has a precise spatiotopic map of auditory space in the external nucleus of its inferior colliculus (ICx) that projects topographically to the retinotopic map in its optic tectum generating a physiologically accessible multisensory map of auditory and visual space (Knudsen 1982
; Knudsen and Konishi 1978
). The barn owl thus provides us with a model system in which to explore the integration of information from two modalities in the context of its natural behavior.
Here, we measured SRT and saccadic error in auditory, visual, and simultaneous, spatially aligned audiovisual conditions. The targets spanned a wide range of stimulus locations and strengths, including those unisensory stimuli that were significantly less effective in eliciting fast, accurate saccades. For most audiovisual combinations, SRTs were no earlier than the earlier of the two modalities, typically, audition, and were no more accurate than the more accurate of the two, typically, vision. However, saccades to audiovisual targets had the short latencies typical of saccades to auditory targets and the accuracy typical of those to visual targets, thus revealing the best features of both modalities, in agreement with a previous human audiovisual study (Corneil et al. 2002
). These results are not consistent with the race model which predicts that the winning modality would determine both the SRT and error. Instead, our results support an "updating" hypothesis of sensory convergence in which the first modality to reach threshold triggers the saccade, whereas information from the second modality is available to update and refine the target position (Corneil et al. 2002
; Van Opstal and Munoz 2004
).
|
|
METHODS |
|---|
|
Apparatus
All experiments were conducted in a double-wall, sound-isolating anechoic chamber (Industrial Acoustics) the properties of which have been described previously (Spitzer et al. 2003
). The ambient noise inside the chamber was <18 dB SPLA, between 2 and 10 kHz. Head-aim was recorded and sampled at 468 Hz with a custom-built magnetic search coil system (Remmel Labs, Ashland, MA). The head coil was mounted on a post that was cemented to the skull prior to training. The head coil system was calibrated over a ±50° range in azimuth and elevation at the beginning of each session. This compensated for any day-to-day changes in the imposed magnetic field and enabled the system to remain accurate within 12°.
ACOUSTICAL STIMULI. One of 30 presynthesized, 100-ms auditory stimuli was randomly presented in each auditory or audiovisual trial. Stimuli were broad band noises, flat between 2 and 12 kHz and differed only in fine structure. The noises were digitally synthesized, ramped on and off by a 5-ms cosine envelope, and sampled at 30 kHz (Power DAC PD1, Tucker-Davis Technologies). The analog signal was fed to a programmable attenuator (PA4, Tucker Davis Technologies), amplified (HB6, Tucker-Davis Technologies), and presented through one of seven lightweight speakers (Peerless, 5.08-cm cone tweeters). The speakers' frequency response characteristics were flat (within 3 dB) between 2 and 12 kHz. Auditory levels ranged from 12 to 6 dB, in increments of 3 dB on an A-weighted SPL scale. The lower levels were extrapolated from attenuation/SPL relationships recorded at detectable levels with a 1/2 inch microphone (Brüel and Kjaer model 1760) and sound-level meter (Brüel and Kjaer model 2235).
VISUAL STIMULI. The visual stimulus was a single, stationary 100-ms flash from a red (635 nm) light-emitting diode (LED), presented in the otherwise completely darkened, light-sealed chamber. Luminous intensities were manipulated by ranging the applied voltage. Six intensities were used (5.2, 0.052, 0.0053, 0.0013, 0.00077, and 0.00031 cd). Each LED was suspended directly in front of one of the seven speakers.
Behavioral paradigm
Two adult barn owls (Tyto alba) were hand-reared and trained to make head saccades to auditory, visual, and simultaneously presented audiovisual stimuli. Because the owl's eyes are relatively immobile (du Lac and Knudsen 1990
), gaze was estimated by head-aim. Stimulus synthesis, data acquisition, and data analysis were all carried out with Matlab 6.5 (The Mathworks). Animals were given food rewards and maintained at 85% free-feeding body weight.
Trials were conducted while the bird was tethered to a perch located near the center of an isolated, completely darkened room. Food rewards (mouse bits) were presented from a remotely controlled feeder located in front of the bird, at the level of the perch. Each session, seven targets were pseudo-randomly positioned on either a T-shaped or inverted-V-shaped frame, separated by
7° (Fig. 1A, black and gray orientations, respectively). The frame was alternated between the T and inverted-V orientations on a session-by-session basis. Target locations ranged from 30 to 30° in azimuth and 0 to 30° in elevation with 0° azimuth and 0° elevation being at the intersection of the owl's midsagittal plane and eye-level.
|
4 mo. During initial training, only clearly audible and visible stimuli were used (30 dB, 5.2 cd), and the birds were rewarded for localization attempts directed toward the quadrant of space containing the target. As performance gradually improved, the reward criteria became more stringent. Eventually, the birds localized clearly audible and visible stimuli within 5° on >90% of the trials. Weaker stimuli (lower SPL or luminous intensity) were then introduced. The training period ended when the accuracy and precision of the birds remained stable for each given modality and stimulus-strength. Data from the training period are not reported here.
The 11 mo after training yielded 97 and 94 test sessions for birds N and J, respectively. There were 40 possible rewards within a given session. The session ended when all rewards were dispensed. Sometimes the bird's performance would warrant a reward on every trial. In these sessions, there were only 40 trials. Other times, the bird's behavior fell short of the reward criteria and would not be rewarded for
15 trials. Such sessions were ended at trial number 65. Analysis included both rewarded and unrewarded trials. In general, rewards were given for head turns terminating within 5° of the target. Trials with very low level stimuli, however, were rewarded when localization attempts were directed in the quadrant of the target so as to maintain motivation. Unlike earlier head-turn experiments in owls (Knudsen et al. 1993
; Poganiatz et al. 2001
), we did not require the initial head-aim to be within some radius of a central starting location. Instead, a trial was manually triggered when the bird's spontaneous head turns ceased and its head came to rest, typically within 40° of being aligned with its body, as seen with an infrared camera. Post hoc analysis discarded any trials during which the head was moving faster than 10o/s prior to stimulus onset, thus avoiding saccades with head motion immediately prior to and during stimulus presentation. The angular distance to the target is expressed relative to the initial head-aim and is referred to as target eccentricity.
Head-aim was recorded for 3 s, a time span comprising a 200-ms prestimulus window, the 100-ms stimulus, and a 2,700-ms poststimulus window. The inter-stimulus interval varied from 15 to 90 s. A total of 9,872 trials was recorded, 2,779 of which were excluded from analysis as described in the following text.
Of excluded trials, 66% were those in which the head moved >2° in azimuth or elevation in the 200-ms prestimulus window or before 40 ms poststimulus onset. Head turns initiated before 40 ms poststimulus onset were almost never directed toward the target and thus not stimulus driven. Trials were also excluded if, at any point during the 3-s recording, the head was outside the calibrated region of our coil system (100 x 100° perimeter, Fig. 1A). Seventeen percent of excluded trials were outside of the calibrated region. The remaining exclusions were those trials in which the targets were located above the initial head-aim and/or either too close or too far away; In training and preliminary testing, both birds showed a bias against responding to stimuli closer than 12° and farther than 55° away as well as those targets presented in the upper visual hemi-field.
The 7,093 accepted trials were categorized as responses or nonresponses. Nonresponses were relevant in measuring the probability of response, especially in the trials with very low stimulus intensities. Each trial contained an auditory, visual, or audiovisual stimulus, and therefore any head turn with a SRT falling between 40 and 600 ms of stimulus onset was scored as a response. More than 99% of scored responses were directed toward the quadrant of space containing the target. Those trials in which the SRT was >600 ms were instances in which the birds typically did not make any head turn at all and were thus scored as nonresponses.
We tested seven sound pressure levels (SPLs, henceforth also referred to as levels for simplicity) and six luminous intensities, resulting in 13 unisensory conditions and 42 audiovisual combinations. For each bird, 70120 trials were recorded for each unisensory condition and 3060 trials for each audiovisual condition. In each session, trials varied randomly in stimulus-strength, location, and modality (auditory, visual, and audiovisual). Each session had a random proportion of unisensory and multisensory trials; however the average session consisted of 30% unisensory and 70% multisensory trials. The luminances, SPLs, and combinations of luminance and SPL also pseudorandomly varied on a session by session basis, and each session included very strong as well as very weak stimuli.
Data analyses
SRT. Figure 1B shows 20 randomly selected, visually evoked, saccades from both birds N and J. Instantaneous speed from one 40° saccade (Fig. 1C) was computed by dividing the distance between consecutively sampled locations by the time between sampling points. SRT was defined relative to stimulus onset as the first point in time at which the instantaneous head speed continuously exceeded 2 SD of the average speed measured in the 200-ms prestimulus window (Fig. 1C, open circle).
Preliminary analysis showed a negative correlation between target eccentricity and SRT for auditory trials in bird N [r (566) = 0.1228, P < 0.01] and bird J [r (550) = 0.3126, P < 0.0001]. However, correlations between SRT and target eccentricity were not significant in visual trials for bird N [r (579) = 0.005, P = 0.9135] and bird J [r (604) = 0.0627, P = 0.1608]. No significant correlation was found in bimodal trials between SRT and target eccentricity for bird N [r (1,514) = 0.0087, P = 0.73], but this correlation was significant in bimodal trials for bird J [r (1,558) = 0.1465, P < 0.0001]. Accounting for this correlation in an analysis of covariance, however, did not affect the overall trends reported in the following text. Thus trials were pooled across target eccentricities in all further SRT analysis. An ANOVA was used to test for main effects of stimulus-strength on SRT in unisensory conditions. Mean SRT changed across conditions in a sigmoidal fashion and was fit by the least squares method for display. A priori pair-wise comparisons (auditory vs. audiovisual, visual vs. audiovisual) of mean SRTs were conducted using the Bonferroni correction for nonorthogonal test design [
= (0.05/number of comparisons)].
Observed audiovisual SRT distributions were compared with theoretical race model distribution boundaries computed assuming: 1) the auditory and visual information remain separate. 2) Either the auditory or the visual stream may trigger a response. 3) The first signal to arrive at the motor control center will determine the SRT (Raab 1962
). In addition, any particular model from the class of models based on these basic race-model assumptions will also assume a certain level of unisensory dependence or independence (Hughes et al. 1994
). Regardless of the level of dependence, however, the upper boundary of SRT facilitation is statistically predicted by the Miller Inequality, which sums the unisensory cumulative probabilities at each SRT (Miller boundary: Eq. 1, Fig. 2, dot-dash line) (Hughes et al. 1994
; Miller 1982
; Townsend and Wenger 2004
)
![]() | (1) |
|
![]() | (2) |
Race-model predictions and observed audiovisual SRT distributions were compared using a one-tailed, one-dimensional, two-sample Kolmogorov-Smirnov test (KS test) at
= 0.05. Bonferroni corrections were applied for multiple comparisons. Observed distributions were subtracted from theoretically predicted distributions and plotted as difference graphs (insets, ![]()
![]()
![]()
![]()
Figs. 8 and 9). Significantly positive differences between the Miller boundary and observed SRT distributions indicate conditions in which the observed audiovisual SRT was shorter than predicted by the race model. Significantly negative differences between the Grice boundary and observed SRT distributions represent conditions in which the observed audiovisual SRT was longer than predicted by the race model.
|
|
|
|
|
|
|
![]() | (3) |
LOCALIZATION ERROR. The endpoint of a saccade was designated when the head speed dropped <5°/s, (Fig. 1). Across saccades, this criterion for head speed accurately delineated the end position of the saccade, (e.g., Fig. 1C). Errors in azimuth and elevation were measured, respectively, as the horizontal and vertical distance between the saccade endpoint and the target. Total error was computed by vector summation of the horizontal and vertical components.
ANCOVA's were conducted to assess total errors co-varying with target eccentricity, as a function of stimulus-strength. Slopes and marginal means for the fitted lines are reported for each condition. The marginal mean error is the mean of the total error, adjusted for the linear relationship between total error and target eccentricity. The marginal mean errors for each condition were computed at a value of 28° from the target, the general midpoint of tested target eccentricities. Marginal means were plotted against stimulus-strength and fit by least squares with a sigmoid. Pair-wise comparisons of marginal means were conducted for unisensory conditions, with Bonferroni corrections at
= 0.05.
|
|
RESULTS |
|---|
|
|
|
Auditory and visual unisensory trials were randomly interleaved with multisensory trials in each daily test session. The measures of response percentage, SRT, and error were monitored for each head saccade and individually reported in the following text. The effectiveness of stimuli at evoking a response of the seven auditory levels and six visual intensities ranged from near to well above threshold. At the highest auditory and visual stimulus levels, both birds responded
99.9% of the time. The weakest auditory stimuli were minimally effective, in that both birds responded to the stimulus only 20% of the time. The weakest visual stimulus evoked a response 75% and 35% of the time for birds J and N, respectively. Predictably, response percentage in unisensory trials increased as auditory and visual stimulus-strength increased (Fig. 3).
Mean unisensory SRTs decreased toward an asymptote as auditory and visual stimulus-strength increased (Fig. 4, A and C, respectively; lighter shades = weaker stimuli). At the highest levels tested, SRTs of auditory trials were significantly shorter than those of visual trials (bird N, auditory: 72 ± 2 ms; visual: 146 ± 2 ms; t-test, P < 0.001). Figure 4, B and D, shows the cumulative probability distributions for auditory and visual SRTs for each condition. For both modalities, the variance, which may be visualized by the slopes and ranges of the cumulative distribution functions, increased for weaker stimuli. Response percentages, taken from the cumulative response probability at 600 ms poststimulus onset, also decreased systematically with stimulus-strength.
Figure 5 shows the analysis of the head-turn accuracy to auditory (AC) and visual (DF) targets. Figure 5, A and D, demonstrates that saccade error depended on target eccentricity (angular distance to the target, relative to the initial head aim; see METHODS) as well as stimulus strength. We therefore report the marginal means (B and E) and the slopes (C and F) of the regressions shown in Fig. 5, A and D. Just as SRT and response percentage improved systematically with stimulus strength, saccade error decreased as SPL and luminous intensity increased (B and E). The error for visual trials, however, reached an asymptote more quickly than those of auditory trials for the range of magnitudes tested. The degree to which error depended on target eccentricity also diminished as stimulus strength increased (slope; Fig. 5, C and F).
To better understand the nature of the errors, saccade total error was divided into its azimuthal and elevational components. Figure 6 shows the azimuthal and elevational components of saccade length for strong and weak unisensory stimuli plotted against target eccentricity. Perfect performance in either dimension would thus yield points along the unity slope line (dashed black line). Stronger auditory stimuli, (black dots, see Fig. 4 for relative stimulus effectiveness), consistently resulted in accurate saccades (Fig. 6, A and B). Interestingly, when auditory targets of small eccentricity and high SPL were presented, this bird had a distinct overshoot in elevation (Fig. 6B). The regression line for the azimuthal and elevational components of the saccades to the weaker auditory stimuli (gray circles) tended to have a shallower slope compared with the regression line for the louder condition and to the unity line. As stimulus-strength decreased, this tendency to undershoot was more pronounced for elevation than for azimuth (Fig. 6, A vs. B, but note differences in scales). Furthermore, the scatter about the regression line was larger for elevation than for azimuth, especially for the quieter sound (Fig. 6, r2 values in insets). Thus accuracy and precision in both azimuth and elevation were better with louder stimuli.
In contrast, for visual intensities that elicited equivalent response percentages, there was no significant difference between head turns to dim and bright stimuli in azimuth or elevation (compare open vs. closed circles in Fig. 6, C and D; note: the dim visual condition plotted here was not the dimmest condition tested). There was, however, a slightly greater tendency to undershoot in elevation than in azimuth. These results are also consistent with the data in Fig. 5, E and F, which shows a significant increase in error for only the dimmest stimulus (at 0.00033 cd, pair-wise Bonferroni corrected, P < 0.05). Thus while visual SRT increases systematically with decreasing stimulus-strength, visual accuracy remains virtually unaffected.
Responses to audiovisual stimuli
In the following text, we compare results from the multisensory trials with those of unisensory trials. Measured audiovisual response percentages, SRTs, and errors are then compared with race-model predictions.
When weaker auditory and visual stimulus conditions were combined in audiovisual trials, the probability of observing a head turn often increased as compared with unisensory levels (Fig. 3,
). Given the auditory and visual unisensory response probabilities, predicted audiovisual response percentages were computed assuming the race model (Eq. 3) and plotted (Fig. 3,
). In general, the observed probability of responding to an audiovisual stimulus was indistinguishable from the theoretical probability of responding to the light, the sound, or both. Only one stimulus combination yielded a response probability that was noticeably higher than predicted out of 42 conditions for two birds (Fig. 3A, 9 dB, 3.1 x 10-4 cd, bird N).
SRTs in audiovisual trials
Mean audiovisual SRTs typically followed the shorter of the two unisensory SRT means. In Fig. 7, AF, the mean SRT (±SE) is plotted against SPL for auditory (short-dash) and audiovisual trials (solid). The long-dashed horizontal line in each graph indicates the mean visual SRT (±SE) for the luminance shown. The figure thus represents 42 audiovisual combinations (7 auditory; 6 visual). When the visual stimulus was dim, as in Fig. 7A, most auditory SRTs were shorter, and the audiovisual SRTs closely approximated the auditory SRTs. With brighter visual stimuli, the audiovisual SRTs followed the visual SRT values at low SPL combinations and the auditory values at higher SPL combinations (Fig. 7, DF). There was one condition for each bird in which the mean audiovisual SRT was significantly shorter than the shortest unisensory SRT (black arrow, Fig. 7E). All of these results are qualitatively consistent with the race model. On the other hand, one stimulus combination in each bird (example bird N, gray arrow; Fig. 7F, bird J, same stimulus combination, data not shown) resulted in a mean audiovisual SRT that was significantly longer than the shortest unisensory SRT. The following distribution analysis addresses whether these differences in mean SRT significantly violate race-model predictions.
Figure 8 shows cumulative probability functions for audiovisual trials at a single low level SPL (9 dB) and six luminances (AF) for bird J. Under these conditions, the distribution of auditory and visual SRTs often overlapped. For conditions in which the visual stimulus was fairly dim (Fig. 8, AD) the effectiveness of either unisensory stimulus alone was less than optimal in evoking a quick, reliable response. Thus these combinations theoretically provided the best opportunity for observing positive race-model violations. The insets plot the difference between the upper boundary of race model predictions (Miller boundary: see METHODS) and observed cumulative probability functions. Upward excursions in the inset graphs indicate positive distribution deviations from the upper boundary of the race model. The statistical significances of the excursions were computed with the KS-test (see METHODS). As exemplified in this set of conditions, all observed SRTs were consistent with the predictions of the race model, and no significant positive race violations were found for bird J. There was only one audiovisual combination in one bird (9 dB and 5.2 cd, bird N) in which the SRT was significantly shorter than race-model predictions and thus qualified as a positive violation. This condition is shown in Fig. 9B (note: the inset difference graph for this figure is computed using the Grice boundary, but pound symbols mark significant positive violations as compared with the Miller boundary).
Negative violations, where the observed SRTs were significantly longer than race-model predictions, were somewhat more frequent and found with brighter (5.2 and 0.052 cd) and louder (3 through 6 dB) stimulus combinations. The condition marked by the gray arrow in Fig. 7F is an example of a negative violation severe enough to significantly affect mean values. There were multiple conditions, however, that showed significant negative violations in the distribution analysis. Figure 9 plots SRT cumulative probability distributions for a single high-intensity visual stimulus (5.2 cd) and seven SPLs (AG). As shown in Fig. 9, DG, four of the loud/bright combinations resulted in distributions that fell to the right of the Grice boundary (data shown for bird N; bird J gave similar results.)
Saccade error in audiovisual trials
The race model is based on the competition between two separate streams of sensory information. Does the modality that "wins" the race determine all characteristics of the ensuing saccade or does the presence of the stream that "lost" have an influence? Specifically do audiovisual saccades with auditory-like SRTs also have auditory-like errors? In this section, we compare audiovisual errors with unisensory error in the context of race-model predictions. Error and SRT are then considered simultaneously for one exemplary audiovisual combination.
Figure 10 plots marginal mean error against SPL for all 42 stimulus combinations. The conventions are the same as those of Fig. 7. The insets show the slope of the regression between total error and target eccentricity (see Fig. 5). The stimulus combinations shown in Fig. 10, AC, typically yielded audiovisual SRTs similar to auditory SRTs (see Fig. 7, AC). Yet, even in these cases where the auditory stream presumably "won" the race to trigger a saccade, the audiovisual errors approached visual values. This would not have been the case were the winning modality to control all aspects of the saccade.
When brighter stimuli (Fig. 10, E and F) were paired with sounds that were less than 5 dB SPL, visual SRTs were shorter than the auditory SRTs (see Fig. 7, E and F). For these combinations, the audiovisual errors were, again, closer to the visual errors. In bird N, the results of which are shown, the audiovisual error was slightly but significantly larger than the visual error in 4 of the 42 combinations tested (pound symbol in Fig. 10, C, E, and F). In bird J, the audiovisual errors were indistinguishable from visual errors in all conditions. As seen with SRT, audiovisual accuracy was never better than that seen in the most accurate unisensory condition (t-test, Bonferroni corrected, P > 0.05, birds N and J).
The results in the preceding text are inconsistent with the idea that the winning modality in the race model controls all aspects of the saccade. To test this more directly, we compiled a theoretical distribution of audiovisual saccades for each stimulus combination with a Monte Carlo simulation based on the race model. This simulation utilized the basic race-model assumptions: 1) the two modalities raced to the saccade generator, 2) the winner of the race controlled both the SRT and the error of the theoretical audiovisual saccade. For simplicity, we assumed independence of SRTs between the two unisensory streams. For each multisensory combination, one randomly selected auditory saccade SRT was compared with one randomly selected visual saccade SRT. The characteristics of the saccade with the shortest SRT (including error and SRT) were then pooled in a theoretical audiovisual distribution. If the two unisensory SRTs were equal, a modality was randomly selected and the corresponding saccade SRT and error was added to the predicted audiovisual distribution. Both auditory and visual saccades were then replaced in their original unisensory pools and the selection process was repeated 100 times. The SRT and error of the resulting pool of 100 "race-winning" auditory and visual saccades were analyzed as a theoretical audiovisual distribution.
As expected, the Monte Carlo simulation resulted in audiovisual SRTs and marginal mean errors that reflected the winning modality. For many audiovisual combinations, the errors predicted by the simulation were significantly greater than those observed in the actual audiovisual trials (7/42, bird N; 9/42, bird J, t-test, Bonferroni corrected, P < 0.05).
An example of an audiovisual condition in which the observed error was less than that predicted by the Monte Carlo simulation is illustrated in Fig. 11 (3 dB; 5.3e3 cd, birds N and J combined). These stimuli were of moderate effectiveness; unisensory trials with these stimulus-intensities individually evoked a response >90% of the time; however, both auditory and visual mean SRTs were significantly longer than those seen at the highest intensities (see Fig. 4). Figure 11, A and B, shows the distance traveled in azimuth and elevation as a function of target eccentricity in azimuth and elevation, respectively. In azimuth (A), the unisensory and audiovisual trials had similar errors (deviation from the unity line), although the auditory trials had slightly more scatter at the larger target eccentricities than did the visual trials. In elevation (B), the auditory trials had appreciably more scatter than the visual trials, and notably, the errors in the audiovisual trials were more like those in visual trials.
Total audiovisual errors also closely followed the visual error trends (Fig. 11C), with the slope and marginal mean error of audiovisual trials being significantly different from auditory but not visual values (pair wise comparison, Bonferroni corrected, P < 0.05, birds N and J). However, KS -tests of the SRT cumulative probability distributions, shown in Fig. 11D, indicate that the audiovisual SRT was significantly different from the visual distribution but not the auditory (Bonferroni corrected, P < 0.05, birds N and J). Figure 11E is a bivariate summary of these findings, plotting each head turn's SRT (abscissa) and total error (ordinate). The plot suggests that the audiovisual distribution has a shorter SRT than the visual distribution and a smaller error than the auditory distribution. The bivariate distribution of the audiovisual trials was significantly different from both the auditory and visual distributions (2-dimensional KS test, P < 0.025, birds N and J).
Figure 12 is a summary figure showing mean SRT and error across all unisensory and multisensory conditions for each bird (N: AF; J: GL). In each panel, the data are normalized to the mean visual values (V-norm, gray circle, dashed lines), at the specified luminous intensity. This normalization was done to simplify comparisons across modalities and stimulus-strengths. The same unisensory auditory data (gray diamonds) are reproduced in every panel for comparison to audiovisual data (black squares), and the SPLs are labeled in F and L.
|
SRTs also benefited from the inclusion of a second modality in a way that is consistent with the principle of inverse effectiveness. In each panel of Fig. 12, the horizontal disparity between the black squares and V-norm represents the improvement in SRT afforded by the addition of an auditory stimulus. Across the auditory conditions, this improvement in SRT is more evident in low-intensity visual combinations (AC and GI) than high-intensity ones (F and L). For instance, B and H represent conditions with lower-intensity stimuli, and each show four audiovisual combinations with SRTs less than half the value of the visual SRT (0.5 on x axis). Audiovisual combinations incorporating the highest-intensity stimuli, as shown in F and L, have SRTs that cluster to the right of 0.5 on the x axis. Therefore the shortening of SRT afforded by adding an auditory stimulus to a visual stimulus is most apparent when the visual stimulus is weaker.
Finally, Fig. 12 demonstrates that when a very strong stimulus is paired with a very weak one, the audiovisual SRTs and errors both take on the values of the stronger modality. This can be seen for high luminance, low SPL combinations in F and L, by comparing the right-most black squares to their unisensory counterparts. In these cases, the audiovisual responses resembled the visual responses in both SRT and error. Conversely, when the unisensory strengths in the audiovisual combinations were reversed, as in the left-most conditions in A and G, audiovisual responses resembled the auditory in both SRT and error. When both modalities in an audiovisual combination were strong, as in the left-most squares in F and L, the audiovisual error cannot be classified as being more like auditory or visual because the owls localized higher SPL sounds with near-visual accuracy. However, the audiovisual SRTs did seem to be more like the shorter auditory SRTs. Therefore any combination incorporating strong unisensory stimuli produce very little evidence of audiovisual integration.
Across stimulus conditions, the data exemplify a general definition of the inverse effectiveness principle; a best-of-both-worlds audiovisual integration is most apparent when stimuli are near behavioral threshold. When such stimuli are used, it becomes clear that the owl's saccade system uses information from both streams to optimize orienting behavior.
|
|
DISCUSSION |
|---|
|
Probability of response
Audiovisual response probability was indistinguishable from the predicted probability of responding to vision, audition, or both. To our knowledge, most published psychophysical studies of multimodal integration tend to employ stimuli at amplitudes well above behavioral threshold, thus ensuring detection of nearly all of the stimuli regardless of modality. By contrast, the behavioral studies of Stein and colleagues did test peri-threshold stimuli. They showed that cats, trained to approach a visual stimulus and press a bar directly beneath the source, were more likely to orient correctly toward a dim light when a faint sound accompanied it (Stein et al. 1988
, 1989
). The percentage of correct trials was, furthermore, enhanced beyond statistical predictions (Burnett et al. 2004
; Stein et al. 1988
, 1989
; Wilkinson et al. 1996
). Although our study included stimuli of comparable magnitudes, it is not directly comparable with these studies of the cat because Stein and colleagues required not only a reaction to the stimuli but also that the response be to the correct location. By contrast, we considered all saccade-like movements (within criteria; see METHODS), regardless of accuracy and precision. The question remains, whether the feline detection of stimuli regardless of correctness would be enhanced to levels beyond that predicted by a reaction to either modality alone or to both.
SRT
Because saccades to audiovisual targets usually had shorter, auditory-like SRTs, our results are consistent with a number of studies suggesting that a visual SRT will be facilitated by the addition of an auditory component (Arndt and Colonius 2003
; Colonius and Arndt 2001
; Corneil and Munoz 1996
; Corneil et al. 2002
; Engelken and Stevens 1989
; Frens et al. 1995
; Harrington and Peck 1998
; Hughes et al. 1994
, 1998
; Kirchner and Colonius 2004
; Nozawa et al. 1994
). Performance meeting this broad definition of enhancement is also predicted by the race model, and our results did not show substantial evidence of SRT facilitation beyond race-model predictions, in contrast to previous results (Arndt and Colonius 2003
; Harrington and Peck 1998
; Hughes et al. 1994
, 1998
; Nozawa et al. 1994
).
Instead, the lack of SRT facilitation beyond race model predictions in the owl is consistent with a recent study on multisensory integration in human saccadic behavior (Corneil et al. 2002
), wherein SRTs to simultaneous, spatially aligned audiovisual stimuli were not enhanced beyond race-model predictions. There are three main similarities between the study by Corneil and colleagues and our own that may help explain the lack of SRT facilitation beyond race predictions. First, both studies implemented a "divided-attention" task in which subjects were instructed or trained to localize both auditory and visual targets. In focused-attention tasks, by contrast, subjects are instructed to ignore one modality and must therefore identify the modality of the target before initiating a saccade. Because our divided-attention task did not require the modality identification step, unisensory responses and thus predictions made by the race model were quicker, making the observation of positive race violations less likely (Corneil and Munoz 1996
). Second, the difficulty of the task was set at a high level in both sets of studies by presenting stimuli with a low signal-to-noise ratio. Third, for any given trial, the stimulus-strength, location, and modality were randomly selected and thus unpredictable. Previous studies have linked response facilitation to a priori knowledge of stimulus characteristics (Kirchner and Colonius 2004
; Mordkoff and Yantis 1991
; Schwarz 1996
).
SRTs are also known to be affected by the temporal alignment of the auditory and visual components in a multisensory target (Corneil et al. 2002
; Diederich and Colonius 2004
). Corneil and colleagues (2002)
demonstrated positive violations of the race model when a light led the sound by 100 ms. Because SRTs in unisensory auditory trials of that study were on average 100 ms earlier than those in visual trials, presenting the light 100 ms earlier than the sound would presumably align the visual and auditory information streams within the saccade-control circuitry. Electrophysiological evidence from the mammalian superior colliculus suggests that this is the condition in which positive violations of the race model are most likely to occur (Meredith et al. 1987
). Although the visual and auditory streams were similarly aligned in our study by ranging SPL and luminous intensity (Figs. 4 and 7), we did not see significant positive race-model violations. It is worth noting, however, that aligning the unisensory components in this way during multisensory trials changes not only the processing time but the strength of the inputs. Future studies in which the temporal alignment of auditory and visual information is achieved by asynchronous presentation of the stimuli, independently of stimulus strength, may yet reveal positive race model violations in the owl's saccade control system.
Negative race-model violations in SRT
SRTs that were longer than predicted by the race model were more frequent in our study than in earlier studies of humans. These negative violations were consistently observed when the strongest visual stimulus was utilized in the audiovisual combination (Fig. 9). Interestingly, when these bright visual stimuli were presented alone, the SRTs were still longer than those from auditory-alone trials, and therefore the visual-information stream would not have been expected to win a race to saccade initiation in an audiovisual trial. Nevertheless, a significant proportion of SRTs in both birds had vision-like values that skewed the overall multisensory distributions to the right of the Grice boundary. Data suggesting negative violations have also been reported in focused-attention tasks where human subjects are instructed to generate ocular saccades to stimuli of one modality while ignoring another (Hughes et al. 1994
). Negative violations cannot be explained solely by a race between separate sensory streams (Grice et al. 1984
; Hughes et al. 1994
).
Negative violations suggest that under certain circumstances, the process of multimodal integration incorporate factors other than timing, such as the strength or precision with which the stimulus is represented within the nervous system. When the two information streams arrive within a time window, the "stronger" or more "focal" representation may control saccade initiation even though it was not the first to arrive at the saccadic trigger site. It is premature, however, to propose the neural mechanism by which the quality of the neural representation affects SRTs. The role of stimulus salience would be better tested by manipulating stimulus onset asynchrony independently of amplitude.
Negative violations may also be due to a perceived spatial misalignment of the auditory and visual sources. Previous studies have shown that SRTs to targets in the presence of a spatially misaligned second stimulus are significantly longer than in trials with only one stimulus (Corneil et al. 2002
; McSorley et al. 2005
; Spitzer and Takahashi 2006
). In our study, the LEDs and speakers were co-localized. However, in trials where a quiet sound was presented alone, we saw that the owls' saccades typically fell short of the target (Figs. 5 and 6). Hypometria of this magnitude was not observed for visual-alone trials. This difference in saccade metrics in auditory and visual trials may suggest that the neural representation of a quieter sound is shifted toward the midline relative to the visual representation. In other words, although the LEDs and speakers were aligned in space, the visual and auditory neural images may have been misaligned, effectively forcing the owl into a spatially misaligned condition, which is known to lengthen SRTs.
The lengthening of SRT due to misalignment may stem from a network of local inhibitory neurons in the superior colliculus (Honda 2005
; Munoz and Istvan 1998
; Olivier et al. 1999
). In general, these studies have shown that the presence of a misaligned distracter decreases buildup of activity in bursting superior colliculus neurons prior to saccade initiation (Olivier et al. 1999
) and also lengthens SRTs beyond what is seen for either modality alone (Munoz and Istvan 1998
). The negative violations seen here, however, are due to a skew of the distribution toward values more common in visual trials rather than toward SRT values longer than those observed in unisensory trials. Whether or not lateral inhibition may still play a role in the negative violations observed with our paradigm is empirical and should be addressed in future electrophysiological studies.
Finally, prior experience may also have played a part in inducing the negative violation. The present study incorporated an accuracy criterion for food rewards. Although owls are renown for their ability to localize sounds, their ability to localize a visual stimulus across intensities is better still, as measured by error. When auditory stimuli were quieter, there was a lesser probability of reward, even with our relaxed criteria in these cases (see METHODS). Therefore it is possible that when a visual stimulus was added, the bird weighed the visual input more heavily because it allowed the bird to perform to criterion and receive rewards more consistently. Thus in some or all of the stimulus combinations, we cannot rule out the possibility that the birds may have been incorporating an additional decision step in the planning the saccade.
Error
Audiovisual error at different stimulus-strengths mirrored the more accurate of the two unisensory trends and did not improve beyond the level of the most accurate/precise modality, usually vision (Fig. 10). Others have attributed this lack of enhancement to a ceiling effect, arguing that vision provides such accurate localization that the addition of auditory information cannot further improve performance (Corneil et al. 2002
; Hairston et al. 2003
; Welch and Warren 1986
). Our study, however, extended the stimulus range to include visual trials with intensities so low, that errors were significantly larger than the asymptotic value (Fig. 5), i.e., accuracy was below the hypothetical ceiling, leaving room for improvement. Adding an auditory component to these dim stimuli however, did not improve accuracy beyond the level of the better modality (Fig. 10A). An absolute accuracy ceiling, therefore cannot entirely explain the lack of accuracy/precision enhancement beyond unisensory levels in the owl.
We also compared observed audiovisual errors with the prediction that the winning modality of the race to saccade initiation determined other features of the ensuing saccade, i.e., accuracy. Interestingly, when an auditory stimulus evoking characteristically shorter SRTs was paired with a visual stimulus evoking characteristically more accurate saccades, the ensuing audiovisual saccade was generally initiated as early as the auditory saccades and terminated as accurately as the visual. This observation is inconsistent with the idea that when an object is audible and visible, the owl acts on either the visual or auditory component to generate a saccade, as is the basic assumption of the race model. Our results support Corneil et al. (2002)
in suggesting that an audiovisual saccade is often neither purely auditory nor purely visual (whichever won the race) but an optimal combination of both. Furthermore, this effect was seen primarily for stimulus combinations in which neither unisensory component was loud or bright. This is consistent with the inverse effectiveness rule of multisensory integration. Thus the two modalities do not function independently in the multisensory trial, but rather, are ultimately integrated to enhance the behavioral response.
Neural mechanisms
Studies of the mammalian superior colliculus have typically shown that auditory and visual stimuli co-localized at the center of a neuron's spatial receptive field evoke responses that are greater than the response to either modality alone (Frens and Van Opstal 1998
; King and Palmer 1985
; Meredith and Stein 1986
; Populin and Yin 2002
; Stanford et al. 2005
; Stein and Meredith 1993
; Wallace et al. 1998
). Although there is some controversy whether the audiovisual response is greater than the sum of the responses to the visual and auditory components (Populin and Yin 2002
), this increased neuronal response parallels the improved performance in the detection/orientation studies cited in the preceding text (Stein et al. 1988
; Stein et al. 1989
). Stein and colleagues have pointed out that in both neuronal and behavioral responses, multisensory advantages are greatest when the stimuli are weak, i.e., the principle of inverse effectiveness applies. Moreover, the proper alignment of the stimuli in time and space are required to achieve the improved performance in behavior and the increased response in the neurons.
The recent study of Bell and colleagues (2005)
that examined spike timing in addition to spike rate in the superior colliculus of the behaving monkey sheds light on the improvement of SRTs observed with bimodal stimuli. They showed a significant correlation between the latency of the first spike evoked by weak stimuli and mean SRTs. Importantly, the addition of a second modality decreased both the first-spike latency and SRT. It is difficult, however, to relate these findings to the current study. The paradigm implemented by Bell and colleagues (2005)
was a focused-attention task in which the visual signal was always the target and the auditory signal was either a distracter or an enhancer, depending on spatial orientation. The auditory stimulus was never presented alone as a target. Therefore the decrease in SRT could only be compared with the visual-alone trials and not to race-model predictions. Likewise, the decrease in first-spike latency was again relative to visual-alone trials, even though a number of the cells pooled into the analysis were responsive to the auditory stimulus. Our behavioral results predict that the latency of the first spike evoked by AV stimuli will be similar to those evoked by the auditory component alone, if indeed, first spike latencies of neurons in the superior colliculus or optic tectum determine SRTs.
The observation that the owl's saccades have the combination of speed and accuracy characteristic of the auditory and visual modalities, respectively, suggest an avenue of exploration of neural mechanism in addition to spike rate and latency. This approach would examine the hypothesis that the first modality triggers the saccade, and the other refines the target location sometime before saccade termination (Corneil et al. 2002
; Van Opstal and Munoz 2004
). In the initial phase of this "updating" model of sensory convergence, separate auditory and visual sensory responses race to a common saccade generation site, such as the mammalian superior colliculus or avian optic tectum. Then if the auditory sensory representation brings the saccade generation mechanism to threshold before the visual, the SRT assumes an auditory-like value. Once the saccade is triggered, the target location may be updated and refined by newly arriving visual information. The updating model thus stated, however, does not address negative violations, which may require assumptions about perceived location as well as the incorporation of themes from decision theory, such as prior experience and stimulus salience (Carpenter and Williams 1995
).
Human psychophysical experiments have shown that when two targets are presented in sequence, we are able to update the trajectory of a saccade even after the response has begun (Vliegen et al. 2004
). Behavioral experiments on the precedence effect demonstrate that such a mid-course modification of trajectories is also possible in owls. The precedence effect comprises a cluster of phenomenon related to spatial hearing in echoic environments in which the location of a sound coming directly from the source dominates perception, and an echo, which arrives later, is poorly localized if at all, for review (Litovsky et al. 1999
). Owls turn their heads toward the leading source when the delay between the leading and lagging sounds is less than
10 ms but begin to localize the lagging sound when the delay is >20 ms (Keller and Takahashi 1996
; Spitzer and Takahashi 2006
). In many of the trials at delays >20 ms, owls first turned their heads toward one source and then abruptly changed their head trajectory to localize the later source. Evidence for updating was also observed in the course of the experiments conducted in the present study: In those trials where stimuli were inadvertently presented during a "spontaneous" head turn (66% of excluded trials, see METHODS), the owls often made appropriate mid-course corrections. The owl is therefore neither blind nor deaf during a saccadic head turn, and the sensory information arriving later can affect the final gaze position.
Neurophysiological studies also suggest that an updating model of sensory convergence is plausible. In single multisensory units in the intermediate and deep layers of the barn owl's optic tectum, the auditory first-spike latency is
1020 ms, whereas the first-spike latency to a light flash is
5080 ms (Knudsen 1982
; unpublished observations). Although the visual and auditory receptive fields generally overlap in space, the visual receptive field is considerably finer than the auditory receptive field (Knudsen 1982
; unpublished observations). The timing and spatial resolution of responses in the optic tectum are therefore consistent with the idea that audition often triggers the saccade while later arriving visual cues refine it. To further study the neuronal correlates of the updating model, it will be necessary to observe the complete sensory receptive field as it evolves over the course of an audiovisual stimulus. This may be possible in the barn owl, wherein computer-synthesized visual stimuli and virtual auditory space techniques now allow the rapid assessment of spatial receptive fields (Keller et al. 1998
).
|
|
GRANTS |
|---|
|
|
|
ACKNOWLEDGMENTS |
|---|
|
|
|
FOOTNOTES |
|---|
Address for reprint requests and other correspondence: E. A. Whitchurch, Institute of Neuroscience, 1254 University of Oregon, Eugene, OR 97403-1254 (E-mail: lwhitchu{at}uoneuro.uoregon.edu)
|
|
REFERENCES |
|---|
|
Bell AH, Meredith MA, Van Opstal AJ, and Munoz DP. Crossmodal integration in the primate superior colliculus underlying the preparation and initiation of saccadic eye movements. J Neurophysiol 93: 36593673, 2005.
Burnett LR, Stein BE, Chaponis D, and Wallace MT. Superior colliculus lesions preferentially disrupt multisensory orientation. Neuroscience 124: 535547, 2004.[CrossRef][Web of Science][Medline]
Calvert GA, Spence C, and Stein BE. The Handbook of Multisensory Processes. Cambridge, MA: The MIT Press, 2004.
Carpenter RH and Williams ML. Neural computation of log likelihood in control of saccadic eye movements. Nature 377: 5962, 1995.[CrossRef][Medline]
Colonius H and Arndt P. A two-stage model for visual-auditory interaction in saccadic latencies. Percept Psychophys 63: 126147, 2001.[Web of Science][Medline]
Colonius H and Diederich A. Multisensory interaction in saccadic reaction time: a time-window-of-integration model. J Cogn Neurosci 16: 10001009, 2004.[CrossRef][Web of Science][Medline]
Corneil BD and Munoz DP. The influence of auditory and visual distractors on human orienting gaze shifts. J Neurosci 16: 81938207, 1996.
Corneil BD, Van Wanrooij M, Munoz DP, and Van Opstal AJ. Auditory-visual interactions subserving goal-directed saccades in a complex scene. J Neurophysiol 88: 438454, 2002.
Diederich A and Colonius H. Bimodal and trimodal multisensory enhancement: effects of stimulus onset and intensity on reaction time. Percept Psychophys 66: 13881404, 2004.[Web of Science][Medline]
Diederich A, Colonius H, Bockhorst D, and Tabeling S. Visual-tactile spatial interaction in saccade generation. Exp Brain Res 148: 328337, 2003.[Web of Science][Medline]
du Lac S and Knudsen EI. Neural maps of head movement vector and speed in the optic tectum of the barn owl. J Neurophysiol 63: 131146, 1990.
Engelken EJ and Stevens KW. Saccadic eye-movements in response to visual, auditory, and bisensory stimuli. Aviat Space Envir Md 60: 762768, 1989.
Frens MA and Van Opstal AJ. Visual-auditory interactions modulate saccade-related activity in monkey superior colliculus. Brain Res Bull 46: 211224, 1998.[CrossRef][Web of Science][Medline]
Frens MA, Van Opstal AJ, and Van der Willigen RF. Spatial and temporal factors determine auditory-visual interactions in human saccadic eye movements. Percept Psychophys 57: 802816, 1995.[Web of Science][Medline]
Grice GR, Canham L, and Boroughs JM. Combination rule for redundant information in reaction-time tasks with divided attention. Percept Psychophys 35: 451463, 1984.[Web of Science][Medline]
Hairston WD, Laurienti PJ, Mishra G, Burdette JH, and Wallace MT. Multisensory enhancement of localization under conditions of induced myopia. Exp Brain Res 152: 404408, 2003.[CrossRef][Web of Science][Medline]
Harrington LK and Peck CK. Spatial disparity affects visual-auditory interactions in human sensorimotor processing. Exp Brain Res 122: 247252, 1998.[CrossRef][Web of Science][Medline]
Honda H. The remote distractor effect of saccade latencies in fixation-offset and overlap conditions. Vision Res 45: 27732779, 2005.[CrossRef][Web of Science][Medline]
Hughes HC, Nelson MD, and Aronchick DM. Spatial characteristics of visual-auditory summation in human saccades. Vision Res 38: 39553963, 1998.[CrossRef][Web of Science][Medline]
Hughes HC, Reuter-Lorenz PA, Nozawa G, and Fendrich R. Visual-auditory interactions in sensorimotor processing: saccades versus manual responses. J Exp Psychol Hum Percept Perform 20: 131153, 1994.[CrossRef][Web of Science][Medline]
Institute of Laboratory Animal Resources and NetLibrary. Guide for the Care and Use of Laboratory Animals. Washington, DC: National Academy Press, 1997.
Jiang W, Jiang H, and Stein BE. Two corticotectal areas facilitate multisensory orientation behavior. J Cogn Neurosci 14: 12401255, 2002.[CrossRef][Web of Science][Medline]
Keller CH, Hartung K, and Takahashi TT. Head-related transfer functions of the barn owl: measurement and neural responses. Hear Res 118: 1334, 1998.[CrossRef][Web of Science][Medline]
Keller CH and Takahashi TT. Responses to simulated echoes by neurons in the barn owl's auditory space map. J Comp Physiol [A] 178: 499512, 1996.[Medline]
King AJ and Palmer AR. Integration of visual and auditory information in bimodal neurones in the guinea pig superior colliculus. Exp Brain Res 60: 492500, 1985.[Web of Science][Medline]
Kirchner H and Colonius H. Predictiveness of a visual distractor modulates saccadic responses to auditory targets. Exp Brain Res 155: 257260, 2004.[CrossRef][Web of Science][Medline]
Knudsen EI. Auditory and visual maps of space in the optic tectum of the owl. J Neurosci 2: 11771194, 1982.[Abstract]
Knudsen EI, Blasdel GG, and Konishi M. Sound localization by the barn owl (Tyto alba) measured with the search coil technique. J Comp Physiol [A] 133: 111, 1979.[CrossRef]
Knudsen EI, Knudsen PF, and Masino T. Parallel pathways mediating both sound localization and gaze control in the forebrain and midbrain of the barn owl. J Neurosci 13: 28372852, 1993.[Abstract]
Knudsen EI and Konishi M. Mechanisms of sound localization in the barn owl (Tyto alba). J Comp Physiol [A] 133: 1321, 1979.[CrossRef]
Knudsen EI and Konishi M. A neural map of auditory space in the owl. Science 200: 795797, 1978.
Litovsky RY, Colburn HS, Yost WA, and Guzman SJ. The precedence effect. J Acoust Soc Am 106: 16331654, 1999.[CrossRef][Web of Science][Medline]
McSorley E, Haggard P, and Walker R. Spatial and temporal aspects of oculomotor inhibition as revealed by saccade trajectories. Vision Res 45: 24922499, 2005.[CrossRef][Web of Science][Medline]
Meredith MA, Nemitz JW, and Stein BE. Determinants of multisensory integration in superior colliculus neurons. I. Temporal factors. J Neurosci 7: 32153229, 1987.[Abstract]
Meredith MA and Stein BE. Visual, auditory, and somatosensory convergence on cells in superior colliculus results in multisensory integration. J Neurophysiol 56: 640662, 1986.
Miller JO. Divided attention: Evidence for coactivation with redundant signals. Cognit Psychol 14: 247279, 1982.[CrossRef][Web of Science][Medline]
Mordkoff JT and Yantis S. An interactive race model of divided attention. J Exp Psychol Hum Percept Perform 17: 520538, 1991.[CrossRef][Web of Science][Medline]
Munoz DP and Istvan PJ. Lateral inhibitory interactions in the intermediate layers of the monkey superior colliculus. J Neurophysiol 79: 11931209, 1998.
Nozawa G, Reuter-Lorenz PA, and Hughes HC. Parallel and serial processes in the human oculomotor system: bimodal integration and express saccades. Biol Cybern 72: 1934, 1994.[CrossRef][Web of Science][Medline]
Olivier E, Dorris MC, and Munoz DP. Lateral interactions in the superior colliculus, not an extended fixation zone, can account for the remote distracter effect. Behav Brain Sci 22: 694695, 1999.[CrossRef][Web of Science]
Poganiatz I, Nelken I, and Wagner H. Sound-localization experiments with barn owls in virtual space: influence of interaural time difference on head-turning behavior. J Assoc Res Otolaryngol 2: 121, 2001.[Medline]
Populin LC and Yin TC. Bimodal interactions in the superior colliculus of the behaving cat. J Neurosci 22: 28262834, 2002.
Raab DH. Statistical facilitation of simple reaction times. Trans NY Acad Sci 24: 574590, 1962.[Web of Science][Medline]
Schwarz W. Further tests of the interactive race model of divided attention: the effects of negative bias and varying stimulusonset asynchronies. Psychol Res 58: 233245, 1996.[CrossRef][Web of Science]
Spence C and Driver J. Crossmodal Space and Crossmodal Attention. New York: Oxford, 2004.
Spitzer MW, Bala AD, and Takahashi TT. Auditory spatial discrimination by barn owls in simulated echoic conditions. J Acoust Soc Am 113: 16311645, 2003.[CrossRef][Web of Science][Medline]
Spitzer MW and Takahashi TT. Sound localisation by barn owls in a simulated echoic environment. J Neurophysiol In Press, 2006.
Stanford TR, Quessy S, and Stein BE. Evaluating the operations underlying multisensory integration in the cat superior colliculus. J Neurosci 25: 64996508, 2005.
Stein BE, Huneycutt WS, and Meredith MA. Neurons and behavior: the same rules of multisensory integration apply. Brain Res 448: 355358, 1988.[CrossRef][Web of Science][Medline]
Stein BE and Meredith MA. The merging of the senses. In: Cognitive Neuroscience Series, edited by Gazzaniga MS. Cambridge, MA: MIT Press, 1993, p. xv, 211.
Stein BE, Meredith MA, Huneycutt WS, and McDade L. Behavioral indices of multisensory integration: orientation to visual cues is affected by auditory stimuli. J Cogn Neurosci 1: 1224, 1989.[Medline]
Todd JW. Reaction to multiple stimuli. Arch Psychol 25: 165, 1912.
Townsend JT and Wenger MJ. A theory of interactive parallel processing: new capacity measures and predictions for a response time inequality series. Psychol Rev 111: 10031035, 2004.[CrossRef][Web of Science][Medline]
Van Opstal AJ and Munoz DP. Auditory-visual interactions subserving primate gaze orienting. In: The Handbook of Multisensory Processes, edited by Calvert GA, Spence C, and Stein BE. Cambridge, MA: The MIT Press, 2004, p. 373393.
Vliegen J, Van Grootel TJ, and Van Opstal AJ. Dynamic sound localization during rapid eye-head gaze shifts. J Neurosci 24: 92919302, 2004.
Wallace MT, Meredith MA, and Stein BE. Multisensory integration in the superior colliculus of the alert cat. J Neurophysiol 80: 10061010, 1998.
Welch R and Warren D. Intersensory interactions. In: Handbook of Perception and Human Performance, edited by Boff KR, Kaufman L and Thomas JP. New York: Wiley, 1986, p. 2136.
Wilkinson LK, Meredith MA, and Stein BE. The role of anterior ectosylvian cortex in cross-modality orientation and approach behavior. Exp Brain Res 112: 110, 1996.[Web of Science][Medline]
This article has been cited by other articles:
![]() |
Y. Zahar, A. Reches, and Y. Gutfreund Multisensory Enhancement in the Optic Tectum of the Barn Owl: Spike Count and Spike Timing J Neurophysiol, May 1, 2009; 101(5): 2380 - 2394. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. R. Pluta and M. Kawasaki Multisensory enhancement of electromotor responses to a single moving object J. Exp. Biol., September 15, 2008; 211(18): 2919 - 2930. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. A. Rowland, S. Quessy, T. R. Stanford, and B. E. Stein Multisensory Integration Shortens Physiological Response Latencies J. Neurosci., May 30, 2007; 27(22): 5879 - 5884. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | SEARCH RESULT |
| Visit Other APS Journals Online |