|
|
||||||||
1Jenks Vestibular Physiology Lab, Massachusetts Eye and Ear Infirmary, Boston, Massachusetts; 2Department of Otology and Laryngology, Harvard Medical School, Boston, Massachusetts; and 3Neurotology Research, Legacy Clinical Research and Technology Center, Portland, Oregon
Submitted 31 August 2004; accepted in final form 13 February 2005
| ABSTRACT |
|---|
|
|
|---|
0.2 Hz. | INTRODUCTION |
|---|
|
|
|---|
Internal models describe an explicit set of neural mechanisms2 by which information from disparate sensory systems (e.g., semicircular canals, otolith organs, retina) and nonsensory systems (e.g., efferent copy, cognitive contributions) can be combined and fused by the nervous system. The end products provided by these neural processes of sensory fusion are central estimates of spatial orientation (e.g., central estimates of gravity, central estimates of linear acceleration, central estimates of angular velocity). Even though fusion mechanisms other than internal models may exist, we will use the more specific term, internal models, throughout most of the paper because internal models describe an explicit set of testable hypotheses and because some earlier studies have demonstrated the contributions of internal models to sensory fusion (e.g., Angelaki et al. 1999
; Merfeld and Young 1995
; Merfeld et al. 1999
).
There is no question that sensory fusion contributes to the neural processing of perceived tilt. First consider visual influences; human perception of tilt has long been known to be influenced by visual orientation cues (Asch and Witkin 1948a
,b
) and visual motion cues (Dichgans et al. 1972
, 1974
; Howard and Childerson 1994
; Zupan and Merfeld 2003
) in combination with the otolithic measures of gravity.
It has been hypothesized that internal models explain these well-known visual influences on tilt perception (Young 1984
). We have hypothesized that the nervous system also has an internal model of the physical law that specific gravito-inertial force is the sum of gravity minus linear acceleration (e.g., Merfeld 1990
, 1995a
,b
; Merfeld and Zupan 2002
; Merfeld et al. 1993a
). Putting these two internal models together, we predicted that visual roll rotation cues also contribute to the horizontal translational VOR via internal models (Zupan and Merfeld 2003
), which has been confirmed experimentally (Zupan and Merfeld 2003
). Visual roll rotation cues may similarly influence translation perception via internal models (Merfeld and Zupan 2003
).
The dynamics of human tilt perception as a function of frequency during sinusoidal translation in the dark suggested that human tilt perception is governed by simple low-pass filtering of otolith signals (Seidman and Paige 1996
; Seidman et al. 1998
). However, published internal model predictions for similar translation paradigms (Merfeld and Zupan 2002
) predicted tilt perception dynamics that resemble the output of simple low-pass filters even without the explicit use of low-pass filters. This modeling demonstrates that one cannot determine whether simple filtering contributes to a response by simply measuring the frequency response during Translation. Furthermore, unlike the output of a simple low-pass filter, the phase of perceived tilt has been shown to be relatively constant across a broad frequency range (Glasauer 1995
). Also inconsistent with the hypothesis that tilt perception is elicited by simple low-pass filtering are earlier findings that show that roll cues from the semicircular canals contribute to roll tilt perception (Stockwell and Guedry 1970
; von Holst and Grisebach 1951
). Therefore, our first goal was to determine definitively whether or not simple filtering and/or internal models contribute to perceptions of tilt and/or translation in humans. To achieve this goal, we measured perceptions of both tilt and translation while using both a roll tilt motion paradigm and an inter-aural translation motion paradigm. We performed these motion paradigms using sinusoids across a broad range of frequencies (0.0050.7 Hz) because sinusoids provide the most direct way to test the predictions of the simple filtering hypothesis.
The second goal was to investigate whether human translational VOR responses are influenced by canal cues or are elicited by simple filtering. We investigated this goal by testing across a broader range of frequencies than earlier studies. Specifically, a number of studies using smaller frequency ranges have demonstrated the influence of canal cues on translation VORs (Angelaki et al. 1999
, 2001
; Green and Angelaki 2003
; Merfeld and Young 1995
; Merfeld et al. 1999
, 2001
; Peterka et al. 2004
; Zupan et al. 2000
). In one of these studies (Merfeld and Young 1995
), squirrel monkeys were quickly tilted in roll such that a high-frequency inter-aural otolith cue was provided by gravity; little or no horizontal VOR was observed. This result was inconsistent with the contributions of simple filtering. This finding was confirmed and substantially extended by a study published by Angelaki et al. (1999)
in which rhesus monkeys were roll tilted, translated along their inter-aural axis, or simultaneously roll tilted and translated. The inter-aural shear force during roll tilt alone and translation alone were nearly identical. Only a small horizontal VOR was observed during the roll tilt stimulation, while a substantial horizontal VOR was observed whenever an inter-aural translation was present, showing that the animals used the rotational cue provided via the canals to help resolve the ambiguous gravito-inertial force (GIF) cue. Another set of studies complemented these findings by showing that translational VORs could be elicited via internal models even in the absence of actual translation (Merfeld et al. 1999
; Zupan et al. 2000
). These findings clearly demonstrate the influence of canal cues on the translation VOR response, but do not rule out the possibility that simple filtering (i.e., simple high-pass filtering) might also contribute to the translational VOR.
The third, and most important, goal of this investigation was to measure perception (perceived roll tilt and perceived inter-aural translation) and action (VOR) simultaneously to determine if the same (or similar) neural mechanisms contribute to perceptual and reflexive responses in humans. It has long been known that there are quantitative differences between eye movement and perceptual responses in humans. For example, the amount of tilt perceived during earth-horizontal inter-aural translation (Glasauer 1995
) is not matched by the amount of ocular torsion (Lichtenberg et al. 1982
; Merfeld et al. 1996b
) for similar stimuli nor is the phase of these two tilt responses the same. A limited number of earlier studies investigated tilt/translation resolution by simultaneously measuring perceptual responses and the VOR (Merfeld et al. 2001
; Zupan and Merfeld 2003
). Findings from these studies suggest that the translational VOR may share some common mechanisms with perception.
The three goals described in the preceding text led us to design and perform a comprehensive study in which we measured the VOR and perceptions of tilt and translation during sinusoidal translations and sinusoidal tilts across a broad range of frequencies (0.0050.7 Hz).
| METHODS |
|---|
|
|
|---|
|
Tilt paradigm
Sinusoidal head-centered roll tilts ("Tilt" paradigm, Fig. 1A) were accomplished by placing the ear level of the subject near the rotation axis of an earth-horizontal rotator (e.g., Angelaki et al. 1999
; Merfeld and Young 1995
; Stockwell and Guedry 1970
). More specifically, the rotation axis was located such that the linear accelerations (both tangential and centripetal accelerations) at the otolith organs fell below the human acceleration threshold of 0.05 m/s2 (Benson et al. 1986
; Melvill Jones and Young 1978
). For the highest frequency stimuli (0.7 Hz), where the tangential acceleration was largest, this meant that ear placement had to be accurate within
1 cm. This placement accuracy was verified using two redundant measurements of the location of the external auditory meatus with respect to two distinct landmarks previously located with respect to the rotational axis. (Head placement prohibited direct measurement of ear location with respect to rotation axis.)
To avoid discontinuous angular acceleration at motion onset, which becomes substantial at higher frequencies, the angular velocity was linearly increased over an integer number of sinusoidal cycles for all frequencies >0.02 Hz to yield the steady-state sinusoidal stimuli. (While an example of the linear increase for Tilt is not shown, Fig. 2, B and C, shows a similar linear increase for the Translation paradigm.) The steady-state sinusoidal roll tilt (0.005 to 0.7 Hz, Table 1) was left/right symmetric about the upright position and had a peak amplitude (
max) of 20°. For this paradigm, the steady-state inter-aural (y axis) gravitational force measured by the otolith organs was gy(t) = Gsin[
maxsin(2
ft)]
Gsin(
max)sin(2
ft), where G is gravitational force. The inter-aural gravity component had a peak amplitude of 3.3 m/s2.
|
|
For the Translation paradigms (Fig. 1, B and C), the subject's chair was mounted on a platform that could translate horizontally along the rotational arm of a centrifuge (Merfeld et al. 2001
). For the highest stimulus frequency (0.7 Hz), the upright subject was simply sinusoidally translated from side to side (Fig. 1B) such that the peak inter-aural linear acceleration was 3.3 m/s2. For this condition, the translational linear acceleration, ay =
= D(2
f)2sin(2
ft), was the only inter-aural otolith stimulus (D is peak displacement).
Due to limited track length, for all other frequencies of Translation (0.005 to 0.5 Hz), the subject was first seated at the center of rotation of the centrifuge and rotated in yaw about an earth-vertical axis for
5 min, allowing the yaw canal cue to decay, before proceeding with sinusoidal radial translations (Fig. 1C). This is the same technique used in previous studies (e.g., Glasauer 1995
; Merfeld et al. 2001
; Paige and Seidman 1999
; Seidman et al. 1998
) to yield low-frequency linear acceleration in the absence of a very long track. The centrifuge yaw angular velocity (
) was trapezoidal, with an angular acceleration of 25°/s2 to a constant velocity of 250°/s, which was maintained for
3045 min. During the sinusoidal radial translation, centripetal acceleration (d x
x
), radial acceleration (
), and Coriolis acceleration (2
x
) were present (Fig. 2, Table 1), where d is the displacement vector from the center of rotation and x represents the standard vector cross product operator. Centripetal acceleration and radial acceleration were both sinusoidal and naturally synchronized because they were both phase-locked to radial displacement and were both always aligned with the inter-aural axis. We chose the amplitude of the sinusoidal radial displacement (Table 1), such that the centripetal acceleration and radial acceleration summed to yield a peak inter-aural otolith cue of 3.3 m/s2, matching the inter-aural gravitational force during a 20° tilt. As for the roll tilt sinusoids, acceleration discontinuities were eliminated by linearly increasing the sinusoidal radial velocity (Fig. 2) over an integer number of cycles for all frequencies >0.02 Hz (Table 1).
The sinusoidal inter-aural otolith cue provided by the precise combination of centripetal and radial accelerations mimics the sinusoidal inter-aural otolith cue present during a sinusoidal roll tilt. Therefore only the Coriolis acceleration is a potential contaminant. As such, it is important to note several facts. First, there was no Coriolis acceleration at the highest frequency (0.7 Hz). Second, the Coriolis acceleration was always smaller than the inter-aural acceleration (Table 1). Third, the Coriolis acceleration was at or below the human acceleration detection threshold (Benson et al. 1986
; Melvill Jones and Young 1978
) for the two lowest frequencies. Fourth, the Coriolis acceleration was more than an order of magnitude smaller than the inter-aural acceleration for half the frequencies. Furthermore, the Coriolis acceleration was always orthogonal to the inter-aural cue in two ways. It was always physically orthogonal because it was always aligned with the naso-occipital axis (x axis), and it was always orthogonal in time because it was 90° out of phase with the inter-aural acceleration. Given these considerations, we do not believe that the Coriolis acceleration is likely to have a governing influence on the measured responses.
In addition to these theoretical considerations, we evaluated the influence of Coriolis acceleration on the measured responses directly by reversing the rotation, which reverses the direction of the Coriolis force without changing the total inter-aural linear acceleration. Half of the subjects underwent a clockwise rotation and half a counterclockwise rotation. We found no substantial influence of rotation direction (and hence, direction of Coriolis force) on the measured responses, so all data shown include pooled clockwise and counterclockwise responses.
Analog data acquisition
Analog data were acquired at a rate of 60 Hz via a 12-bit analog to digital converter. For the Tilt paradigm, the roll tilt angle was measured via a potentiometer placed on the tilt rotation shaft. For the Translation paradigm, the radial position of the subject's chair with respect to the centrifuge arm was measured with a potentiometer placed on the linear drive motion shaft.
Subjects and instructions
Eight healthy subjects were prescreened as "normal" via several, standard, clinical, vestibular tests. All subjects signed an informed consent, consistent with institutional procedures prior to participation. The subjects (7 males and 1 female) were between 22 and 60 yr of age. Only data from subjects who successfully completed all trials for a given testing paradigm are included. All eight subjects successfully completed the Tilt paradigm. Seven of eight subjects successfully completed the Translation paradigm; one subject experienced nausea and other symptoms of motion sickness.
Throughout the trials, always performed in darkness, subjects were instructed to open their eyes and look straight ahead without focusing on any point, real or imagined. They were also instructed to remember their perceptions of roll-tilt and inter-aural linear translation so that they could provide verbal reports of their perceived motion after each trial. We used proximal vergence (Jaschinski-Kruza 1990
; Schor et al. 1992
; Wick and Bedell 1989
) to help stabilize and control gaze distance because it is known that human vergence is generally maintained, even in darkness, to be roughly consistent with the distance to surroundings previously viewed in the light. Specifically, we used a dim light to illuminate a poster
1 m in front of the subject for
15 s between trials. Consistent with the published proximal vergence literature, our measurements showed that vergence was maintained relatively constant for each subject for each test condition.
EYE-MOVEMENT ACQUISITION AND ANALYSIS. Binocular eye movements were video-taped with a frame-rate of 60 Hz using an infrared video-oculography (VOG) system (SensoMotoric Instruments). Video cameras were attached to the subject's head via a facemask and a strap around the back of the head. We further stabilized the goggles by adding a bite bar assembly, providing a stable attachment of the camera to the maxilla (upper jaw), using a custom mold of each subject's upper jaw made with dental impression compound (3M Express STD). Infrared light-emitting diodes (LEDs), which were invisible to the subject, provided lighting for the cameras. The weight of the goggle assembly was partially supported by a pair of bungee cords attached to the structure moving with the subject. Eye position was calibrated by having subjects sequentially direct their gaze at 13 targets (1 center, 6 horizontal, and 6 vertical). Targets were separated by roughly 5°, with actual angular displacement dependent on exact eye position in space, which was carefully measured prior to each test session, providing a thorough calibration over approximately ±15° in horizontal and vertical dimensions.
Three-dimensional (3D) geometric projection corrections were used to calculate 3D eye position in Fick coordinates (Haslwanter and Moore 1995
; Moore et al. 1996
). The Fick angles were digitally filtered and differentiated to yield Euler rates, which represent the angular velocity of the eye. These Euler rates were then processed to yield 3D angular velocity of the eye in standard, orthogonal, head-fixed coordinates (Merfeld and Young 1992
). Torsional, vertical, and horizontal eye velocities refer to the eye movements about rotation axes aligned with the head x axis (naso-occipital axis), y axis (inter-aural axis), and z axis, respectively. Slow phase velocity (SPV) was calculated using a computer algorithm based on peak acceleration detection to remove the fast phases, with some manual editing by experienced personnel to verify that the automatic algorithm worked properly.
Eye data from five of the seven subjects who successfully completed testing are reported herein. Video data for two of the seven subjects are not reported. Eye images from one of these two subjects showed a small motion artifact on a few trials, indicating that the subject was not properly biting down on the bite bar used to secure the video cameras with respect to the head. The second subject showed continuous variations in pupil diameter (i.e., "hippus"), which made analysis both noisy and inaccurate.
Verbal reports of tilt and translation
The subjects were trained to answer, at the end of each trial, three specific questions focused on head tilt and bridge-of-the-nose translation.3 The three specific questions were: what was the maximal, side-to-side, bridge-of-the-nose translation that you experienced? What was the maximal head tilt to the left? And what was the maximal head tilt to the right? Using the right-hand-rule, rightward tilts were defined as positive and leftward tilts as negative. Note that peak-to-peak translation was recorded because translation, unlike tilt, lacks an absolute reference. After answering these 3 questions, subjects were asked to describe any other sensation of motion that they experienced. Not once in 96 trials, did any subject report pitch tilt during the roll Tilt condition, and, in only 3 of the 84 Translation trials did subjects report a noticeable pitch sensation.
Data analysis
Sinusoidal fits were used to calculate amplitudes and phases of the motion variables and eye movement responses. Steady-state sinusoidal data were fit using a least-mean-square linear regression to the equation, x(t) = B + Accos(2
ft) + Assin(2
ft), where B is a DC bias, Ac is the amplitude of the cosine component, and As is the amplitude of the sine component. This fit was performed on a cycle-by-cycle basis. The cycle-by-cycle sine and cosine components were averaged to calculate the mean values for that trial, which were then averaged across subjects. Amplitude (A) and phase (
) were then calculated, A = As2+Ac2 and
= tan1(As/Ac). To calculate variability of amplitude and phase, we calculated the two-dimensional covariance ellipses of the sine and cosine components (e.g., Johnson and Wichern 1982
). Error bars shown in all VOR figures represent the SE obtained from the covariance ellipse.
Modeling
Two model structures, "simple filtering" and "internal models," were investigated to explain the measured responses and to examine physiologic explanations that are consistent both with known neural responses and behavioral data. While our modeling treated these model structures as independent sets of hypotheses, this treatment is not meant to preclude the possibility that both mechanisms may contribute simultaneously to any given response. For example, the canals could contribute to the horizontal translational VOR via internal models while simple filters could simultaneously contribute to the VOR. Simple filters were implemented as first-order filters. Since the characteristics of low- and high-pass filters are well known, we did not perform simple filtering simulations before beginning our experiments. The simulated responses shown (Fig. 3, A and B) were obtained by choosing a cut-off frequency that was consistent with the experimental data and performing the filtering simulations only after the data were in hand. The internal model approach was implemented using a previously published model (Merfeld and Zupan 2002
). We performed model predictions using the published model prior to experimentation (solid black lines, Fig. 3, C and D). We also simulated responses after making two parameter changes to the model to fit the experimental data better (dotted gray lines, Fig. 3, C and D). All modeling was done using MATLAB Simulink (v. 2.1 by the Mathworks). Details regarding the structure and implementation of both models are included in the APPENDIX.
|
| RESULTS |
|---|
|
|
|---|
Low-pass filtering of inter-aural otolith cues has been hypothesized to explain roll tilt responses (e.g., ocular torsion, perceived roll tilt) and is consistent with the fact that roll tilt with respect to gravity introduces inter-aural otolith cues. Complementary to this prediction, high-pass filtering of inter-aural otolith cues has been hypothesized to lead to inter-aural translational responses (e.g., horizontal translational VOR, perception of inter-aural translation, etc.), which is consistent with the fact that inter-aural linear acceleration introduces inter-aural otolith cues. To efficiently investigate these hypotheses, we used models to predict the outputs of fixed-parameter filters. Low-pass (Fig. 3A) and high-pass (Fig. 3B) model simulations are shown for both the Translation (empty circle) and Tilt (x) paradigms. Because the output of each of the simple filters depends only on the otolith input and does not depend on canal cues or on any variable other than the inter-aural force measured by the otolith organs, the simple filter predictions are identical for Translation and Tilt.
Internal model predictions
The internal model predictions (Fig. 3, C and D) demonstrate several key points. First, the predicted tilt response during Translation (empty circle, Fig. 3C) looks qualitatively similar to the low-pass filter predictions (empty circle, Fig. 3A), even though this model does not include an explicit low-pass filter yielding this tilt estimate. Similarly, the predicted translation response during Translation (empty circle, Fig. 3D) is qualitatively similar to the high-pass filter predictions (empty circle, Fig. 3B), even though this model does not include an explicit high-pass filter to yield this translation estimate. However, unlike the predicted responses of simple filters, the predicted tilt (x, Fig. 3C) and translation (x, Fig. 3D) responses for this model during the Tilt condition are substantially different from those predicted for the Translation condition (empty circle, Fig. 3, C and D). Specifically, the predicted tilt responses remain relatively constant during Tilt. Furthermore, the predicted translation responses remain relatively small during Tilt compared with the translation responses during Translation, even though the inter-aural otolith cues are identical. In both cases, the difference between the predicted responses to Translation and Tilt is due to the presence of roll canal cues during Tilt and the absence of a change in roll canal cues during Translation.
Eye movements
Figure 4 shows eye movements during the Translation and Tilt paradigms for one subject. A substantial torsional velocity was always present during Tilt with little or no torsional velocity during Translation. This was true for all subjects. In addition, consistent vertical responses were seldom evident. A substantial horizontal response was observed during both Translation and Tilt with the derivative of horizontal eye position having a substantially larger amplitude during Tilt than during Translation (Fig. 4, 4th row). Application of 3D kinematic corrections yielded the horizontal velocity shown in the bottom row, where the amplitude of the horizontal responses during Translation and Tilt were nearly the same. This demonstrates the importance of the full 3D kinematic corrections (Hess et al. 1992
; Merfeld and Young 1992
; Tweed et al. 1990
) to eye movement analysis because reflexive eye responses have been shown to vary with gaze direction (Seidman et al. 1995
). Therefore all eye data discussed from this point onward will only include data that are fully corrected using the 3D kinematic corrections (Merfeld and Young 1992
) described in detail in METHODS.
|
|
The average torsional eye velocity increased as a function of frequency during Tilt (Fig. 5C), which was expected because the peak angular velocity of the stimulus increased linearly with frequency (Table 1). In contrast, a tiny torsional velocity response was observed during the Translation condition, which was also expected, because there was no roll angular velocity stimulation. This means that there is a substantial difference in the torsional VOR during Translation and Tilt, which helps explain why the horizontal response during Tilt exceeds that during Translation at frequencies >0.1 Hz. (see DISCUSSION for details regarding this cross-talk.)
Tilt perception
Verbal reports of tilt perception were roughly constant at lower frequencies (<0.05 Hz) and decreased at higher frequencies for both the Translation and Tilt trials but with a very substantial decrease for the Translation trials (Fig. 6A). For the Tilt trials, the magnitude of the tilt response was overestimated at lower frequencies and converged to the actual tilt angle of 20° at higher frequencies while it decreased to near-zero for the Translation trials. Because the inter-aural otolith cues were nearly identical during the Translation and Tilt trials, this difference between the perceptual tilt responses during roll Tilt and during inter-aural Translation demonstrates the influence of roll canal cues on tilt perception.
|
Verbal reports of translation perception during tilt trials were very small across all frequencies, rising slightly at higher frequencies, consistent with published modeling predictions (Merfeld and Zupan 2002
) and those shown in Fig. 3D. In comparison, the subjects reported substantial translation for all Translation trials regardless of the stimulus frequency (Fig. 6B). The translational perception reported during Translation trials showed great variability for all frequencies and was substantially greater than the translation perception during the Tilt trials.
| DISCUSSION |
|---|
|
|
|---|
VORs compensatory for translational motion occur in many species (Angelaki et al. 2000
; Baloh et al. 1988
; McCabe 1964
; Niven et al. 1966
; Paige 1989
; Paige and Tomko 1991
; Schwarz and Miles 1991
; Schwarz et al. 1989
). Because VOR magnitude increases with frequency during translation, it has been suggested that high-pass filtering ("frequency segregation") elicits these responses (Paige and Tomko 1991
). However, published "internal model" simulations (Merfeld and Zupan 2002
) predict similar frequency response characteristics during translation without explicit high-pass filtering of otolith cues. Therefore unlike any earlier study, we measured VORs across a broad range of frequencies using both Tilt stimuli and Translational stimuli.
We found a substantial human horizontal VOR response at high frequencies (>0.2 Hz) during Tilt stimulation (Fig. 5A) and found that the horizontal VOR behaved qualitatively similarly as a function of frequency during Translation and Tilt. These findings are consistent with the hypothesis that simple filtering (Fig. 3B) yields a translational VOR whenever an inter-aural otolith cue is present. This finding is not consistent with earlier internal model predictions that little (Merfeld and Zupan 2002
) or no (Angelaki et al. 1999
) horizontal VOR ought to be evident during roll tilts (x, Fig. 3D), when the only inter-aural otolith cue is due to gravity.
Vergence is known to modulate the gain of the horizontal VOR (Paige 1989
; Paige et al. 1998
), so it is important to note that the internal model hypotheses predict little or no horizontal VOR during Tilt, even if vergence is maintained at a state consistent with a very near target. Therefore the increasingly large horizontal VOR at frequencies above
0.2 Hz during Tilt cannot be due to internal models and must, at least in large part, be due to the contributions of simple filters.
This finding that simple filters contribute to the human horizontal translational VOR is consistent with earlier reports suggesting that simple filtering contributes to the human (Wood 2002
) and squirrel monkey (Paige and Seidman 2001
; Paige and Tomko 1991
) translational VORs. It also complements earlier findings that the predominant ocular tilt response, ocular torsion, results from simple filtering (Angelaki 1998
; Merfeld et al. 1996b
; Paige and Tomko 1991
; Telford et al. 1997
).
It is interesting to note that the horizontal VOR during Tilt was slightly greater than during Translation. This finding deserves further consideration because this is not consistent with simple filtering predictions that the response should be the same during Translation and Tilt. It appears that this response difference during Translation and Tilt can at least partially be explained by the presence of physiological "cross-talk" that has previously been reported in humans (Tweed et al. 1994
). This earlier study showed that a small horizontal response is measured in humans when theoretically a purely torsional response would be expected during roll stimulation about an earth-vertical axis. This noncompensatory horizontal response was
10% of the torsional response; this physiological cross-talk of unknown origin was present for rotations about an earth-horizontal axis (like our Tilt condition) and for rotations about an earth-vertical axis and was not substantially different for earth-vertical or -horizontal rotations.
At 0.7 Hz, where a 50°/s torsional response was measured in our study, cross-talk of
10% would be expected to yield about a 5°/s horizontal response component, which is about the same as the difference found between the Translation and Tilt responses (Fig. 5A). At 0.5 Hz, such cross-talk would be expected to yield a horizontal response between 3 and 4°/s, which again is consistent with the observed difference between our Translation and Tilt horizontal responses. At lower frequencies, the torsional response becomes smaller (Fig. 5C), so the cross-talk, while still present, would yield a smaller influence on the horizontal VOR as frequency decreases. All of these expected characteristics of the physiological cross-talk are consistent with our findings (Fig. 5A), consequently, physiological cross-talk probably explains, at least to some extent, why the horizontal eye responses were larger during Tilt than during Translation.
Because previous studies have shown that human translational VORs are larger with near vergence than far vergence (Paige 1989
; Paige et al. 1998
), it is also possible that vergence could contribute to the measured horizontal VOR difference during Tilt and Translation. Figure 7 shows vergence versus time for one typical subject for both Tilt and Translation at the highest frequency (0.7 Hz), which was chosen because the influence of vergence is known to increase with frequency and also because the difference in horizontal VOR magnitude between the Tilt and Translation paradigms was the largest at 0.7 Hz (Fig. 5A). While small intra-trial variations are evident, the vergence responses were maintained relatively constant even at this maximum frequency of 0.7 Hz, where the VOR responses were largest. All subjects maintained vergence relatively constant during and across trials. As shown in Fig. 7, this subject demonstrated slightly greater vergence during Tilt (5.1°) than during Translation (4.4°). A similar difference in vergence during Tilt and Translation was also observed across all five subjects with the mean vergence maintained nearer for Tilt [7.0 ± 2.2° (SE)] than for Translation (4.1 ± 1.13°) at 0.7 Hz; this difference was not statistically significant (P = 0.27, t-test). Because the poster distance was the same for both Tilt and Translation, this measured difference in vergence is probably because the size of the Translation room was larger than the Tilt room, which probably factored into setting the proximal vergence state.
|
Given these undesired influences on the horizontal VOR, we cannot rule out the possibility that internal model mechanisms may have also contributed to the horizontal VOR. Hypothetically, with everything else constant, such an influence of internal models would cause the horizontal VOR to be smaller during Tilt than during Translation. As discussed previously, the torsional cross-talk and increased vergence might then act to increase the horizontal response during Tilt such that the total observed horizontal response was greater during Tilt than during Translation.
Although our data unequivocally show that simple filtering contributes to the human horizontal VOR (as evidenced by the increasing response with frequency during Tilt that is not predicted by internal models), we cannot use these data to rule out the possibility that internal models might also contribute to the horizontal VOR, with these contributions masked by cross-talk and vergence variations. Therefore in a companion study,4 we chose to investigate these VOR responses further using a new motion paradigm (Merfeld et al. 2005
).
The finding that simple filtering contributes to the human translational VOR appears to conflict with earlier monkey findings (Angelaki et al. 1999
, 2001
) suggesting that simple filtering does not contribute to monkey translational VORs. More specifically, this finding in humans conflicts with nearly identical measurements made in rhesus monkeys (Angelaki et al. 1999
) and similar measurements made in rhesus monkeys (Green and Angelaki 2003
) and squirrel monkeys (Merfeld and Young 1995
). Similar, and possibly related, differences in how monkey responses (Angelaki and Hess 1994
; Merfeld et al. 1993b
; Wearne et al. 1999
) and human responses (Fetter et al. 1992
; Merfeld et al. 1999
, 2001
) differ during motion paradigms that elicit a sensory fusion of canal and otolith cues have previously been reported. See the companion paper (Merfeld et al. 2004) for a more detailed discussion of these potential species-related influences.
The finding that simple filtering contributes to the human horizontal translational VOR would appear to contradict some earlier studies showing that internal models contribute to the human translational VOR (Merfeld et al. 1999
, 2001
; Peterka et al. 2004
; Zupan and Merfeld 2003
; Zupan et al. 2000
). These studies were performed using low-frequency stimuli provided via trapezoidal stimulation, and the data showed that internal models contribute to the human horizontal translational VOR; simple filters could not reproduce the observed responses. However, these earlier studies did not refute the possibility that simple filtering might also contribute to the human translational VOR (e.g., at higher frequencies) as demonstrated via the translational VOR responses reported herein.
It is also worth noting that the contributions of the internal models may not be observable in the data presented herein because of the presence of the relatively large simple filter response components and other potential contaminants discussed previously. Therefore we conclude that simple high-pass filtering contributes substantially to the human translational VOR while simultaneously acknowledging that other studies have shown that internal models contribute to the human horizontal translational VOR.
Because both simple filtering and internal models contribute to the human translational VOR, it would be parsimonious to suggest that both response components are also present in monkeys with the internal model response components larger than the simple filtering response components. While somewhat speculative, there is some evidence that simple filtering might make a small contribution to monkey translational VOR responses because a small horizontal VOR appears to be present in monkeys during roll tilt (e.g., Fig. 4 in Angelaki et al. 1999
), which could include and/or indicate a very small contribution of simple filtering to the monkey responses.
Internal models contribute to tilt perception
Previous studies have investigated the dynamics of roll tilt perception using low-frequency fixed-radius centrifugation and found that perceived tilt gradually aligns with the resultant force in
1 min (Clark and Graybiel 1963
, 1966
; Graybiel and Brown 1951
). A later study showed that changes in perceived tilt occurred more rapidly (taking just a few seconds) when subjects were tilted in roll (Stockwell and Guedry 1970
). Stockwell and Guedry compared their experimental finding to the earlier findings during centrifugation and concluded that roll canal cues during tilt contribute to rapid shifts in the perception of tilt. However, a recent study showed that the yaw rotational cues experienced during fixed-radius centrifugation delay the alignment of perceived tilt with the resultant force (Merfeld et al. 2001
), raising questions about the validity of this comparison and the associated conclusion.
We found that perceived tilt was relatively constant at low frequencies and decreased as the frequency increased above
0.05 Hz, which is consistent with an earlier report (Glasauer 1995
). Taken alone, these tilt amplitude data are consistent with low-pass filtering of the inter-aural otolith cues. However, we also found substantial differences in perceived tilt during the Translation and Tilt paradigms. Because the difference between these paradigms was the presence (Tilt) or absence (Translation) of a dynamic roll canal cue, the measured response difference leads to the conclusion that roll canal cues influence tilt perception and that the canals contribute to the perceptual processes used by the nervous system to resolve the ambiguous otolith cues. While at least one earlier study showed that yaw canal cues influence roll tilt perception (Merfeld and Zupan 2002
), to our knowledge, this is the first study showing definitively that roll canal cues influence roll tilt perception, though Stockwell and Guedry (1970)
had suggested this to be the case.
Our data show that rotational cues from the semicircular canals influence perceptual tilt responses across a range of frequencies where the canals are sensitive. This finding is inconsistent with predicted contributions of a simple filtering mechanism (Fig. 3A) but is consistent with internal model predictions (e.g., Fig. 3C).
At lower frequencies, subjects overestimated self-tilt (Fig. 6A). This is consistent with the observation that humans often overestimate self-tilt for small tilts during static tilt in the dark (Howard and Templeton 1966
). It is worth noting that visual cues normally predominate at low frequencies but were not available during this study because all measurements were made in the dark. Low-frequency visual rotational motion cues (Dichgans et al. 1972
; Zupan and Merfeld 2003
) and visual orientation (Asch and Witkin 1948a
; Howard and Childerson 1994
) cues are known to have a strong influence on roll tilt perception. The frequency response of the visual (low-frequency) and canal (higher-frequency) components complement one another, analogous to what has previously been found for neurons sensitive to angular velocity in the vestibular nuclei (Henn et al. 1974
). It is reasonable to suggest that these (or similar) angular velocity neurons may contribute to the mechanisms that elicit tilt responses.
Internal models contribute to translation perception
We also found substantial differences in perceived translation during the Translation and Tilt paradigms. These translation reports (Fig. 6B) are qualitatively similar to internal model predictions (x, Fig. 3D). This similarity is surprising because the subjects reported peak-to-peak inter-aural translational displacement while the model predicts peak inter-aural linear acceleration. Linear acceleration should be doubly integrated to yield displacement, which would be predicted to yield much larger displacement reports during Translation at low frequencies than at high frequencies. This prediction is inconsistent with our finding that perceived displacement is about the same (within a factor of 34) across more than a two-decade range of frequencies, suggesting that perfect double integration of inter-aural acceleration (modeled or real) was not achieved by our subjects. This is consistent with some earlier studies suggesting that the nervous system performs inaccurate integration, sometimes referred to as "leaky" integration (e.g., Becker and Klein 1973
; Raphan et al. 1979
). However, another study has suggested that linear acceleration cues are doubly integrated to yield displacement predictions (Israel et al. 1993
). Our conclusion that perfect double integration is not achieved conflicts with this earlier report, but a number of methodological differences could explain this conflict.5 Because relatively few data are available to help us understand the dynamics of how the nervous system might process linear acceleration estimates to yield displacement estimates, we decided not to curve-fit our translation displacement data because such a post hoc fit might be misleading, especially because our experiments were not designed to investigate this integration process.
We found no other human translation displacement perception data taken across a broad range of frequencies to compare with our findings. While translation perception has been previously studied, most human studies of translation perception have focused on detection thresholds (e.g., Arrott et al. 1990
; Benson et al. 1986
; Carpenter-Smith et al. 1995
; Gianna et al. 1996
; Melvill Jones and Young 1978
; Merfeld et al. 1994
), manual control of translational motion (e.g., Huang and Young 1987
; Merfeld et al. 1994
, 1996a
), eye tracking of translational displacement (e.g., Berthoz et al. 1987
, 1988
; Israel and Berthoz 1989
; Israel et al. 1993
), or translation reproduction (e.g., Berthoz et al. 1995
; Israel et al. 1997
).
Our finding that perceived translation differs during Translation and Tilt supports the conclusion that canal cues influence translation perception and that the canals contribute to the perceptual processes by which the nervous system resolves the ambiguous otolith cue. This is the first study to show that canal cues influence translation perception, though one earlier report (Merfeld and Zupan 2003
) suggested that visual rotational cues might influence translation perception. Our perceptual findings are consistent with internal model predictions (e.g., Fig. 3, C and D) and inconsistent with predicted contributions of a simple filtering mechanism (Fig. 3, A and B).
Summary
The measured eye responses demonstrate that simple filtering contributes to the human horizontal translational VOR. Although these eye-movement data do not show evidence that internal models contribute to the translational VOR, they do not preclude that possibility. Our perceptual data clearly show that internal models contribute to perceptions of both roll tilt and inter-aural translation. Although these perceptual data do not show evidence that simple filtering contributes to perceptions of roll tilt and inter-aural translation, these perceptual data do not preclude the possibility that filtering could make a small contribution.
The finding that internal models govern human perception of both tilt and translation at first appears inconsistent with the finding that simple filtering contributes to human translational VORs. Taken together, these findings lead to the principal conclusion of this study: that qualitatively different mechanisms contribute to human perception and action. This is true even when these various behavioral measures are recorded simultaneously. While human perception and eye movements may sometimes share common neural pathways, the reported differences between eye movements and perception clearly show that one cannot simply measure eye movements and assume that this measure is representative of perception nor simply measure perception and assume that this measure is representative of reflexive responses. Taken together, these behavioral measures (both eye movements and perception) can be used to determine what strategies the nervous system uses to process the incoming sensory information that elicit these behavioral responses and to guide neural recordings that elucidate how these strategies are implemented by the nervous system.
| APPENDIX |
|---|
|
|
|---|
t'
(t) x g(t)dt, where
is the angular velocity of the head. The second physical effect was that specific gravito-inertial force (f), which is force per unit mass, as measured by the otolith organs (or any other graviceptor/linear accelerometer), resulted from the combination of gravity (g) and linear acceleration (a). Mathematically, this was captured by the equation, f = g a.
|
|