|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1Department of Computer and Information Science, Brooklyn College of the City University of New York, Brooklyn; and 2Department of Neurology, Mt. Sinai School of Medicine, New York, New York
Submitted 24 April 2006; accepted in final form 7 September 2006
|
|
ABSTRACT |
|---|
|
|
|
INTRODUCTION |
|---|
|
Modeling of aVOR function has proved useful for revealing the organization of the aVOR (for review, see Raphan and Cohen 2002
) and in understanding the physiological basis of aVOR gain adaptation (Albus 1971
; Highstein et al. 2005
; Hirata and Highstein 2001
, 2002
; Ito 1984
, 2002
; Lisberger et al. 1994a
,b
,c; Marr 1969
).
Using a matrix of gain values, another model was previously implemented in three dimensions to explain the contribution of the individual canals to the gain of the aVOR (Yakushin et al. 1998
). This model is physiologically based, using a canal projection matrix (Tcan) to reflect the projection of the head velocity in head coordinates into a canal coordinate frame. A gain matrix (G) then projects the canal-based vector into the head coordinate frame, driving the oculomotor system. This latter transformation can be represented by a matrix Thead (Yakushin et al. 1998
). In other studies, it was shown that the gravity-dependent adaptation data in three dimensions can be represented by a double sinusoid (Yakushin et al. 2005c
). Such a fit assumes that gain adaptation falls off as a sinusoid from some peak value, regardless of the direction of head orientation away from the point of maximum gain, suggesting that the otolith organs tune the gain adaptation relative to gravity (Yakushin et al. 2005c
). A probable basis for gravity-dependent adaptation is the extensive convergence of otolith inputs onto semicircular canal recipient neurons in the vestibular nuclei (Baker et al. 1984
; Brettler and Baker 2001
; Curthoys and Markham 1971
; Dickman and Angelaki 2002
; Duensing and Schaefer 1958
; Fukushima et al. 1990
; Graf et al. 1993
; Perlmutter et al. 1998
; Yakushin et al. 2005a
, 2006
). The purpose of this study was to implement a mapping of the elements of the gain matrix (Yakushin et al. 1998
) to the structure of the canalotolith convergence in the central vestibular system using an artificial neural network. Such a physiologically based neural network would demonstrate the feasibility of this hypothesis and give insight into the realization of the gravity-dependent adaptation in three dimensions.
|
|
METHODS |
|---|
|
Experimental data used for comparison with model predictions came from five cynomolgus monkeys. The surgical and experimental protocols were described in previous publications (Yakushin et al. 2000b
, 2003c
) and were approved by the Institutional Animal Care and Use Committee (IACUC) of the Mount Sinai School of Medicine. Briefly, one scleral search coil measured the horizontal and vertical components of eye position (Judge et al. 1980
; Robinson 1963
) and a second coil was used to measure the torsional component of eye position (Cohen et al. 1992
). During testing, the animals were in darkness and sat in a primate chair in a four-axis vestibular stimulator surrounded by an optokinetic drum. The diameter of the drum surrounding the animal is 91 cm. Thus the distance between the visual surround and the monkey was 45 cm. Gains were decreased by rotating the animal and visual surround in the same direction and increased by rotating the animal and the visual surround in opposite directions. Adaptation was carried out over a 4-h period in each instance. See Xiang et al. (2004)
and Yakushin et al. (2003c
, 2005b
) for a complete description of the protocol.
Data used for comparison with the model predictions were obtained following vertical aVOR gain adaptation for single-state (Monkeys 1 and 2), dual-state (Monkeys 3 and 4; Yakushin et al. 2003c
), and triple-state conditions (Monkeys 1 and 5). Only data on the dual-state adaptation were obtained from a previous study (Yakushin et al. 2003c
). For the single-state condition, Monkeys 1 and 2 were adapted in left-side-down (LSD) and right-side-down (RSD) head positions. For the dual-stateadapted condition (data obtained from a previous study for Monkeys 3 and 4), the vertical aVOR was adaptively decreased in one side-down position while being increased in the opposite side-down position. The triple-state adaptation condition was implemented for the vertical aVOR by decreasing the gain in the LSD position for 20 min, decreasing the gain in the upright position for 20 min, and then increasing the gain in the RSD position for another 20 min. This was repeated four times over a 4-h cycle and data were collected from Monkeys 1 and 5.
Changes in the vertical aVOR gain were measured for each adapted state, with the head tilted from LSD to RSD in 10° increments. For measuring three-dimensional (3D) gain changes, the animal's head was tilted from 90 to 90° in four sequences: prone-supine and LSDRSD, and in two intermediate planes that were 45° from the pronesupine and LSDRSD planes. Three-dimensional gain surfaces were then created using a spline-interpolation of the data. The neural network was trained using only the gain values before and after the gain modification at the head orientation where adaptation took place. After training, the neural network model predictions were compared with the spatial gain distribution obtained from the experiments. We also compared the predictions of the neural network model, which was trained at a single head orientation, to single-sinusoid fits to data in one plane and to double-sinusoid fits to data in multiple planes that minimized errors between the experimental data and the fits at every test position after adaptation (Yakushin et al. 2005c
).
Paired Student's t-tests with a 5% significance level were used to statistically analyze differences between the neural network model predictions and sinusoidal fits to the experimental data.
Conceptual basis of model development
Artificial neural networks are composed of weighted excitatory and inhibitory connections that sum at processing units or nodes (Anderson 1995
; Bishop 1995
; Rumelhart and McClelland 1986
). These networks have been extensively used to explain the behavior of neural systems that require motor learning (Anastasio 1992
; Anastasio and Robinson 1989
, 1990
; Quinn et al. 1998
; Zipser and Andersen 1988
). A number of problem-specific learning rules were used in training networks, depending on whether the learning is supervised or unsupervised (Hebb 1949
; Kohonen 2000
; Rumelhart et al. 1986
; Widrow and Hoff 1960
).
Because VOR adaptation is driven by an error signal between eye velocity and surround velocity, we assumed that the learning is supervised and is based on a delta rule (Rumelhart et al. 1986
; Widrow and Hoff 1960
), in which the weights are updated in proportion to the gradient of error function. Because the network is distributed in angular space coordinates, we also implemented a local learning rule for each of the cells within the distributed network that represent canalotolith-convergent neurons within the vestibular nuclei (Baker et al. 1984
; Brettler and Baker 2001
; Curthoys and Markham 1971
; Dickman and Angelaki 2002
; Duensing and Schaefer 1958
; Fukushima et al. 1990
; Graf et al. 1993
; Perlmutter et al. 1998
; Yakushin et al. 2005a
, 2006
). Therefore the neural network implementation tested the hypothesis that the network of canalotolith-convergent neurons learns by a supervised delta rule using the error between eye velocity and surround velocity. We further hypothesized that the learning is implemented locally on each of the gain elements, i.e., weights that realize the 3D gain matrix of the aVOR, based on projections of otolith polarization vectors1 on canal recipient central vestibular neurons.
The aVOR gain matrix was characterized as a collection of distributed components, each composed of a bias value and a sum of weighted contributions from central neurons receiving canal input and input from the otolith organs with otolith polarization vectors assumed to be lying in canal planes (Raphan and Schnabolk 1988
; Schnabolk and Raphan 1992
; Sheliga et al. 1999
). The model predictions were statistically compared with data from monkeys with respect to horizontal, vertical, and torsional gain adaptation, obtained over a wide range of head orientations to show the feasibility of the proposed neural organization. We further tested the model by comparing its predictions of gain changes with experimental data as a function of head orientation relative to gravity after adaptation at two and three different head positions, i.e., multistate adaptation.
Model overview
A schematic of the neural network model for adapting the gains during gravity-dependent adaptation is shown in Fig. 1A. The network of canal-related neurons have specific input corresponding to the plane of the canals. These neurons process the canal input and project this information to the oculomotor system with a specific gain implementing each of the gij in the gain matrix (Yakushin et al. 1998
). The gain matrix therefore operates on the canal signal and drives the oculomotor system in three dimensions. In addition, these canal-related neurons receive input from otolith polarization vectors that are weighted and can modify the canal transduction gains gij. Modification of these gains is accomplished by a weighted sum of 108 otolith-related neurons as well as a bias input in a particular canal plane that modify the canal transduction. Each plane in the model modifies three gain elements in a column, representing how that class of canal neurons activates roll, pitch, and yaw eye velocity. This model structure is supported by the fact that there are canal-related cells in the vestibular nuclei that receive otolith input over a wide range of polarization angles (Dickman and Angelaki 2002
).
|
We have represented the connectivity as a weighted summation of these cells whose weights adapt with visual-vestibular mismatch. The question we sought to answer was whether the weights of the 108 neural inputs to the canal transduction gains could be adapted to match the data at one head position and then fit the data at all other head positions in 3D space. Whether the canal sensitivity of neurons in the vestibular nuclei is modified in a gravity-dependent way by this weighted otolith input as predicted by the model will be answered by neural recordings in the vestibular nuclei during and after gravity-dependent adaptation.
The weights connecting the input and the output of the artificial neurons represent the modifiable elements governing the gravity-dependent aVOR gain adaptation, whereas the bias parameter implements the gravity-independent gain adaptation of the aVOR. The computed eye velocity using the neural network (Fig. 1, Eye Vel) is subtracted from the product of the head velocity and experimentally obtained gain values determined at the adaptation site (Fig. 1, Target Eye Vel). The difference is the velocity error, which is used in training the weight and bias of the neurons, by two separate learning rules, which modify the gij values (Fig. 1).
Headcanal coordinate frames
Head and eye position and velocity were referenced to a head coordinate frame, defined by the roll (XH), pitch (YH), and yaw (ZH)axes of the head (Fig. 2A). Normal vectors to the anterior, posterior, and lateral semicircular canal planes defined the canal coordinate frame (XC, YC, and ZC) and were used to describe the activation of the canals. The relationship between head and canal coordinates has been derived (Yakushin et al. 1998
) as
![]() | (1) |
a and
a are the second and third Euler angle rotations (Goldstein 1980
p and
p are the second and third Euler angle rotations of the head pitch axis (YH), and
l and
l are the first and second Euler angle rotations of the head yaw axis (ZH). By choosing these angles appropriately, the axes of the rotated coordinate vectors are aligned with the normals to the anterior, posterior, and lateral canals, respectively (Fig. 2B). Based on experimental data (Yakushin et al. 1995
a =
p = 40°,
a =
p = 135°,
l = 0°, and
l = 30°. Thus given a velocity vector in the head frame in roll, pitch, and yaw, Eq. 1 will convert the vector into the canal frame so that the adaptation related to the anterior, posterior, and lateral canals can be separated. If we denote the inverse of Tcan as
![]() | (2) |
![]() | (3) |
![]() | (4) |
|
Because each gij represents the contribution to the ith direction of the head from the jth canal, each otolith polarization vector is assumed to contribute activity in a particular canal plane. Thus in the neural network model (Fig. 1), the gain parameters gij were implemented as a parallel distributed network in which each gij is a weighted sum of the projections of a unit vector along the acceleration of the gravity on the individual polarization vectors
l, plus a bias bij, which represents the part of the gain that is independent of gravity, given as
![]() | (5) |
The unit vector âg is in the direction of the equivalent acceleration of gravity, pointing upward from the earth, whereas
l is a unit vector in the direction of the polarization for a particular neural unit. For a particular head orientation,
âg,
l
is the inner product (dot product) between the acceleration of gravity and individual polarization vectors, which implements a cosine tuning of activity. The more a unit's vector coincides with the direction of the equivalent acceleration of gravity, the larger the positive stimulus. When the unit's polarization vector is oriented opposite the direction of the acceleration of gravity, the projection is negative. When the head is tilted to various orientations, projections will be sinusoidally modulated. The index l runs from 0 to n 1 and enumerates all the units. Because of the central convergence of otolith units onto canal-related neurons, it was operationally equivalent to consider the n polarization units divided into three groups, each associated with a specific canal plane.
We developed an updating (learning) rule that is very similar to the generalized delta rule, given by
![]() | (6) |
The parameter k is an adjustable parameter determining the speed of the learning process. The error Ei is the ith component of the velocity error in head coordinates. These errors are represented as angular velocities around the roll, pitch, and yaw axes of the head, with E0, E1, and E2 representing the torsional, vertical, and lateral velocity errors, respectively, which are computed as follows.
Let
denote the head velocity in the head coordinate frame, with the superscripts 0, 1, and 2 representing the roll, pitch, and yaw components, respectively. Then the corresponding velocity transformed into the canal frame can be determined by
![]() | (7) |
e) in the head frame, given by
![]() | (8) |
e(target) was simulated by multiplying the target gain by the head velocity
h. The difference between
e(target) and the eye velocity calculated from Eq. 8 forms the error vector
![]() | (9) |
The parameter sij, in Eq. 6, is the amount of command velocity produced along the ith head coordinate arising from the velocity induced along the jth canal. This parameter thus represents the canal input during the adaptation. It is independent of the gain matrix and a larger value of sij induces larger weight modification. We define the parameter matrix S = [sij] as
![]() | (10) |
cj. Multiplication of sij by gij would give the actual eye velocity contributed by
cj. Thus sij forms a matrix of computed values that does not require adaptation. It is equivalent to the derivative of the activation function in a generalized delta-rule learning scheme. For example, g00s00 = g00A
c0 is the component of roll eye velocity induced by velocity along the anterior canal normal, which is initiated by the head rotation. In the case of gain increase where the error is always positive, a positive s00 increases g00 to reduce the error. If s00 is negative, g00 must decrease its value to reduce the error. In addition to the sign of sij, the magnitude of sij also contributes to the rate of gain change.
From Eq. 6 it is thus suggested that a weight change over time is proportional to the velocity error (
Ei/
t), the projection of the acceleration of gravity onto the polarization vector, and the magnitude of the projected head velocity in canal coordinates. This corresponds to the components implemented in learning using the generalized delta rule, i.e., the error, the input, and the derivative of the activation function (Rumelhart et al. 1986
). The bias values bij can be trained in a similar way based on canal input and velocity error, except that they are not affected by the gravity-encoded projection
![]() | (11) |
bij) is equal to that for gravity-dependent gain (contributed by
wijl) when h and k are the same. The value of f has been set at 4 by trial and error.
Thus units having the largest projection of the acceleration of gravity will be trained the most in a gravity-dependent manner, whereas other units will be trained, dependent on the individual degree of the projection of gravity on the polarization vector. Once trained, the gain change in gij will be fully represented by the array of weight values for the specific head position in which adaptation took place. When the head turns to any new positions, the weights will remain the same, whereas the array of projections
âg,
l
change. Thus units that originally had the largest projection will have smaller projections, reducing the contribution of the units that were maximally trained previously. Other units that are now in line with the gravity vector will now have a larger projection, but will have reduced associated weights. Therefore the overall gain values will be less than the maximal value, predicting the gravity-dependent adaptation on head orientation.
Choice of neural network units
The number of neural units used to simulate the combined contribution of the semicircular canals and otolith is not a critical aspect of the model. We have chosen 36 polarization vectors evenly distributed at 10° intervals in each of the three approximately orthogonal canal planes, giving a total of 108 units that were used for simulation. The angular resolution is consistent with the experimental resolution, which was in 10° steps. The 108 units spatially reside in planes coinciding with the canal planes and, within each of the three planes, units are distributed omnidirectionally the same way as the otolith units. This provides the flexibility to represent both the otolith organs and how they interact with the canal system in exerting their contribution to the gravity-dependent adaptation. The positions and orientations of the 108 polarization vectors were represented in canal planes so that they could easily interact with the canal-recipient units (Fig. 3A). Their gravitational projection profiles when the head is upright are shown in Fig. 3B. The polarization vectors distributed within the anterior canal plane were symmetrical with those distributed within the posterior canal plane and were more vertically tilted, whereas the polarization vectors distributed within the lateral canal plane were tilted by 30° around the interaural axis. Therefore the projection profile of polarization vectors in the anterior/posterior canal planes are sinusoids with higher amplitude, and thus exert more weight in gravity-dependent gain adaptation than those in the lateral canal plane (Fig. 3B). When the head is put into different orientation, the projection profiles of the units will be changed and influence the adaptation.
|
The neural network first underwent an initialization process where both the bias values and the weights were trained to produce the initial state of the system. This initialization procedure was motivated by the fact that in an unadapted animal the gain is close to one at all head orientations with small variation with respect to gravity. We also sought to establish the initial state in an unbiased manner, so as not to influence the succeeding gravity-dependent adaptation process. We thus initialized the neural network weights to zero and bias values to 1. We then trained the network by adapting the bias values for the units to a mean value of preadapted gain. This represented the "normal" gain of the aVOR. Consistent with the data, the preadapted gains were not uniformly distributed when the head was oriented into various positions, possibly from residues from previous adaptations (Yakushin et al. 2003b
,c
). We then adapted the weights only, holding the bias values constant to simulate the slight variation in the gain of preadapted normal monkeys. This was the initial preadapted state from which we performed the concurrent gravity-dependent and gravity-independent gain adaptation simulations of the bias and weights at a particular head orientation, using the actual measured gain value at the head position where the adaptation occurred as the target value.
For vertical gain adaptation, we set the head velocity
h for training as
![]() | (12) |
For simulation, the weights and bias values remained fixed throughout, whereas the distributed gain matrix G was changed as the head tilt angle changed. For every new tilt angle, the polarization vector
l of the individual units was reoriented and the new projection
âg,
l
was calculated. The matrix G was then updated following Eq. 5 and the eye velocity was calculated according to Eq. 8. The computed aVOR gain was then obtained by taking the ratio of eye velocity over head velocity.
The neural network program was developed using Matlab 7.0 (The MathWorks, Natick, MA). The neural network simulation was tested for the single-state, dual-state, and triple-state adaptation. The training usually required <100 iterations. In all cases, the predicted changes in gain compared favorably to the data.
|
|
RESULTS |
|---|
|
Experimental changes in vertical aVOR gains produced by out-of-phase stimulation (gain increases) for Monkey 1 are shown in Fig. 4, A and B (filled symbols) at head tilt angles from the position of adaptation to the opposite-sidedown position. The model predictions for the same data, (Fig. 4, A and B, open symbols) were fit by a sinusoid
![]() | (13) |
|
A general property of the learning was that when the learning speed of weights (k) and that of bias values (h) were the same, the side where the adaptation took place would have the same amount of gain change in both the gravity-independent and gravity-dependent components, with the summation equal to the adapted gain. However, with the head positioned to the opposite side, the gravity-dependent and -independent components subtracted and produced a gain close to the preadapted value. The model therefore supports the idea that gain changes constitute a sinusoidal gravity-dependent component as a function of gravity, which modulates around a gravity-independent bias gain change (Yakushin et al. 2003c
).
Dual-state adaptation
The model was also tested to determine whether it could predict dual-state adaptation (Yakushin et al. 2003a
,c
). It was previously found that when adaptation was alternately executed in two states, the spatial gain distribution encoded both adaptive states. For example, if the gain was decreased in the LSD position and increased with the animal right-side down, the gain change distribution reached the maximum negative values left-side down and maximum values right-side down, whereas the gravity-independent components of the adaptations cancelled. Data from four experiments for Monkeys 3 and 4 were collected in previous studies, where each animal was adapted in two dual-state conditions: the first condition was LSD gain increase and RSD gain decrease. The second condition was LSD gain decrease and RSD gain increase. The data from the first condition were flipped horizontally so that they could be combined with the second case. The SDs of the four experiments are shown as a shaded region (Fig. 5A). The thick dotted line is the averaged model predictions (k = 1, h = 1 for all dual-state adaptations). The neural network predicted the simultaneous gain decreases for LSD and gain increases when RSD with no bias component, consistent with the experimental results (Yakushin et al. 2003c
).
|
To further test the validity of the model, we predicted the distribution of gain changes after triple-state adaptation for Monkeys 1 and 5. The neural network was trained simultaneously for a gain decrease in the LSD position (90°), a gain decrease at the upright (0°), and a gain increase in the RSD position (90°), with both learning rates set to be 1 (k = 1, h = 1). The model predicted the distribution of gain changes over all head positions for both animals, although it was trained at only three head positions (Fig. 5B, diamonds for Monkey 1 and circles for Monkey 5). The amount of gain change after adaptation varied between the two animals. Monkey 1 had significant gain changes, where the largest gain decrease occurred in the LSD position (40%), a maximal gain increase in the RSD position (10%), and a gain decrease (10%) in the upright position (Fig. 5B). Monkey 5 had smaller gain changes, with 17, 5, and 10% for the LSD, RSD, and upright positions, respectively. Regardless of the magnitude of the gain changes, the model predictions matched experimental data in both animals. Weight distributions (wijl) before (Fig. 5C) and after (Fig. 5D) training for Monkey 1 were approximately sinusoidal, reflecting the projection profile of the direction of acceleration of gravity onto the unit polarization vectors. Thus the neural network was flexible in morphing its weights to accommodate this complex set of adaptation states and closely predicted the actual data.
Temporal evolution of gain adaptation
The temporal evolution of the gain adaptation was investigated by comparing gain changes computed at every iteration during the training process with those measured experimentally by testing the animals after every 0.5 h of adaptation. Both the gravity-dependent and gravity-independent components of the gain changes had a rising exponential profile as a function of time (Fig. 6, B and C, respectively, open circles for the experimental data, solid lines for the model prediction). As a result, the composite gain changes followed the same exponential relationship (Fig. 6A). The similarity in the gain change versus time profiles between model predictions and data suggests that the learning rules incorporated in the model may be close to the physiological mechanisms that implement the adaptive process in aVOR adaptation.
|
One of the benefits of using a 3D neural structure to model the combined canalotolith influence on aVOR was that we could explore the gain change distribution of the aVOR after adaptation in 3D space. For example, in one experiment with Monkey 1, the vertical aVOR gain was decreased in the LSD position and the gain changes were tested in four sequences: from supine to prone, LSD to RSD, right-posterior to left-anterior, and left-posterior to right-anterior. The gain changes at other head-tilt positions were interpolated with a spline interpolation (Sandwell 1987
) and the resultant surface formed the 3D gain change distribution shown in Fig. 7A. The model was trained using a single target gain value for the LSD position. The gain change predictions for all other head-tilt positions in 3D space were then computed from the model simulation (Fig. 7B). The model prediction reflects the ideal aVOR gain change in a 3D space. There was a good match between the predicted and experimental data.
|
![]() | (14) |
For comparison in a single plane, two versions of the single-sinusoid model were used, one with the phase (B) included, together with the amplitude (A) and bias (C) in the parameter fit (single-sinusoid 1), the other with the phase fixed (single-sinusoid 2). We considered the latter because the neural network model is essentially a phase-fixed model with the assumption that the gain change at the head position of adaptation will always be the maximum. Therefore the comparison between the fit of the phase-fixed sinusoidal and the neural network models has more relevance. Gain changes predicted by the neural network were calculated as the differences between the predicted gain values and the mean value of the preadapted gain values, whereas the gain changes from the sinusoidal models were from Eq. 13 with the equation parameters optimized to minimize the mean square error. Similarly, two versions of the double-sinusoid models were used in 3D comparisons. In the first of the double-sinusoid models, all four parameters, the amplitude (A), bias (C), and phases (B1 and B2) were fit to minimize the mean square error, whereas in the second of these models, the phases (B1 and B2) were fixed to be the theoretical values and were not included in parameter optimization to minimize the mean square error. The 3D surface of the gain changes in the neural network was obtained by subtracting the averaged value of the preadapted gain values from the model-predicted, 3D surface of gain values. For the double-sinusoid models, the 3D gain change surfaces were created from Eq. 14, which was fit to a spline-interpolated 3D surface of the original gain change data in four planes (for details see Yakushin et al. 2005c
).
The root mean square error and correlation coefficients were compared between the model fits and the experimental data of gain changes. The results from tests in the plane where the adaptation took place are shown in Table 1 and the 3D results in Table 2. Because the goal of the sinusoidal fits was to minimize the mean square errors, both the single- or double-sinusoid models performed better than the neural network model for root mean square error (Tables 1 and 2). However, when correlation coefficients, which represent the similarities in the overall shapes of the gain changes, were compared, the neural network model performed on a par with or better than the sinusoidal models (Tables 1 and 2). The reason that the variable-phase sinusoidal fits performed better than the fixed-phase sinusoidal models was that the variable-phase models had more latitude to adjust their parameters.
|
|
|
|
DISCUSSION |
|---|
|
The spatial distribution of gain changes after the neural network parameter adjustments (Figs. 4 and 5) was the result of minimizing the eye velocity error at a particular head orientation. When quantitatively correlated to the least-square-error fits of the data in one dimension (Table 1) and in three dimensions using a double-sinusoid fit (Table 2), the neural network model predictions of gain changes over all space were remarkable. Because the neural network, which constitutes the sum of a large number of sinusoids that had been adapted at one head orientation, closely matched the optimal fits to the data over all head orientations, we concluded that the learning, which we have modeled, may be fundamental to the actual learning rule that is centrally implemented.
A significant characteristic of the neural network model is that it is not a conventional feedforward neural network whose weights are determined by a training algorithm under the influence of a wide range of inputs. Rather, the weights are physiologically constrained by the relative orientations of the polarization vectors with gravity when the head is in a given orientation. This constraint requires that regardless of error, every weight value represents a single point on a sinusoid, and the entire spatial gain distribution is a summation of these sinusoids having the same spatial frequency, although with different amplitudes and phases. Therefore the training results in a prediction of a sinusoid based on the state of the polarization distribution at the head position during adaptation.
The learning rule we implemented was driven by the eye velocity error and thus was related to a generalized delta learning rule (Rumelhart et al. 1986
). In addition, the amount of activation of the polarization vectors, i.e., the relative difference between the polarization vector and gravity, also played an important role in the learning. The stronger the activation of the particular polarization vector, the greater the rate at which the adaptive changes of the particular unit will take place. Thus the units in the neural network that receive the largest projection of the acceleration of gravity will be adapted the most over a particular time, producing the gravity-dependent adaptation. Once adapted the gain change will be localized to the specific head position in which adaptation took place. Moving the head to a new position will reduce the projection of the acceleration of gravity on that polarization vector, with a consequent reduction of the contribution of the maximally adapted network element to the gain of the aVOR, thus producing a sinusoidal spatial gain distribution.
The closeness of both the double-sinusoid and the neural network models to the data supports our theory that gravity-dependent gain adaptation in three dimensions consists of two components: a gravity-independent component and a gravity-dependent one (Xiang et al. 2004
; Yakushin et al. 2000a
, 2003c
). The gravity-independent component is likely produced by alteration of the gain of cells that receive canal but not otolith input, whereas the gravity-dependent component is probably produced by those neurons that receive convergent canal/otolith input. From this, it would be predicted that adjusting the relative learning rates for the weights and the bias would have an important influence of the final shape of the gain profile. For example, an asymmetry in all single-state adaptation tests was that the data for gain reduction did not fit as well as the data for gain increases. Specifically, the gains from head positions on the side opposite to the position of adaptation were smaller than the model predictions. Variations in learning rates are well known, having commonly been encountered when comparing gain increases and decreases in many studies. The gain decreases occur at a much faster rate than the increases (Cohen et al. 1992
; Melvill Jones 1996
; Miles and Eighmy 1980
). Our hypothesis to explain this is that in the cases of gain reduction, the bias (gravity-independent components) adapted faster than the gravity-dependent counterpart. If a greater rate of bias training was implemented in the model by increasing the value of h, or decreasing the value of k, a better fit to the data was achieved (Fig. 4F).
This prediction is simulated in Fig. 8. The learning rate for the bias component was set to be faster than that for the weights that modulate the gravity-dependent component. Consequently, the amplitude of the gravity-dependent component was diminished when it reached the end of the training period, and the combined gain in the opposite side position was adjusted further toward the direction of the gain decrease, as in Fig. 4. In the simulation of Fig. 8, the model was trained to decrease the gain at the RSD position by 40%. When the learning rates for the bias and weights were set to be the same as the default value, the amount of gravity-independent component (bias) and the amplitude of the gravity-dependent component were the same. Thus the gain change at the opposite side (LSD) would be 0%. However, if the learning rate of the bias was increased to threefold the learning rate for the weights, the gravity-independent component constituted 75% of the composite gain change at the adaptation side. Because the amplitude of the sinusoidal gravity-dependent component was reduced, the composite gain change at the opposite side (LSD) was adjusted in the direction of gain change (gain decrease) to reach a value of 20%. Our results have shown that when the gravity-dependent and the gravity-independent components were adapted at different rates, the model better predicted the experimental data. This lends support to the idea that the gravity-dependent and -independent components were separate processes that evolved along their own characteristic timescale.
|
There was generally a good match of the model predictions and the experimental data, although in some cases, the model fits did not accurately predict the data. As shown in Fig. 4B (Monkey 1), the preadapted gains had large variances (not shown) and the afteradaptation point of maximal gain change in the RSD position was not at the position of adaptation. The discrepancy between the model and data could be explained by the intrinsic fixed spatial frequency of the model. Although a better fit could be obtained by doubling the learning rate of the weights while maintaining the learning rate of bias values intact, as shown in Fig. 4E, it should be noted that the physiological constraints imposed by our model will not produce a gain change distribution that would fit an arbitrary profile.
Effects of translational eye movements in space (Medendorp et al. 2000
) were not considered in this development because the neural network model was conceived for adaptation using targets at or close to the horopter or beyond, which would not involve changes in the gain of the aVOR. However, it should be noted that, although the gain of the aVOR was adapted in light, pre- and postadaptive testing was done only in darkness, and that these were the experimental results that were modeled.
To what extent does the model predict how individual neurons in the vestibular nuclei implement the gravity-dependent adaptation? Cells with convergent inputs from both the semicircular canals and the otolith organs as well as canal-onlydependent cells are likely to be the site of the processes modeled in this study. Such cells have been demonstrated in the vestibular nuclei (Baker et al. 1984
; Brettler and Baker 2001
; Curthoys and Markham 1971
; Dickman and Angelaki 2002
; Duensing and Schaefer 1958
; Fukushima et al. 1990
; Graf et al. 1993
; Perlmutter et al. 1998
; Yakushin et al. 2005a
, 2006
). Figure 1B shows how otolithcanal and canal-only cells might implement the gravity-dependent and gravity-independent components of the aVOR gain at the neuronal level. The amplitude of the canal response of the otolithcanal units would increase/decrease maximally after adaptation when the head is rotated in a position such that the polarization vector is aligned with gravity during adaptation. The canal-only units would have canal-related responses independent of head position. The model also predicts that there should also be classes of neurons that summate the gravity-dependent and -independent components. The vestibular nuclei contain a wide range of otolithcanal convergent neurons, however, and whether and how these different classes of vestibular neurons participate in the adaptation process are currently not known.
Other cellular structures might also be responsible for the adaptation and maintenance of the weights and bias values. Cells in the fastigial nuclei (Shaikh et al. 2005
; Siebold et al. 1997
, 2001
; Zhou et al. 2001
) and the nodulus (Sheliga et al. 1999
) also have considerable otolithcanal convergence. Removal of the nodulus and uvula did not alter gravitational dependency of the aVOR adaptation (Yakushin et al. 2003a
), however, so the other structures are more likely sites of the gravity-dependent process. The flocculus, which plays a powerful role in modulating the gain of the aVOR (Hirata and Highstein 2001
; Ito 1984
; Lisberger et al. 1994a
,b
,c; Zee et al. 1981
) could contribute to the gravity-independent component of adaptation, which could then be modulated by the direct otolithcanal convergence as predicted by this model. Regardless of the site(s) involved in processing, the changes in the unit activity identified in this study should be present in secondary neurons in the vestibular nuclei.
In summary, data presented in this study show that a neural network model based on rather simple otolithcanal convergence together with a localized learning rule is sufficient to simulate the concurrent modulation of both the gravity-dependent and the gravity-independent gain changes of the aVOR. The close match between simulations and data in one- and three-dimensional space as well as for single-, dual-, and triple-state adaptation, supports the idea that the neural structure and learning process presented accurately model how the vestibular system implements gravity-dependent adaptation.
|
|
GRANTS |
|---|
|
|
|
FOOTNOTES |
|---|
1 An otolith polarization vector is a direction in head coordinates that maximally activates an otolith cell (Fernández and Goldberg 1976
). ![]()
Address for reprint requests and other correspondence: T. Raphan, Department of Computer and Information Science, Brooklyn College of CUNY, 2900 Bedford Avenue, Brooklyn, NY 11210 (E-mail: raphan{at}nsi.brooklyn.cuny.edu)
|
|
REFERENCES |
|---|
|
Anastasio TJ. Simulating vestibular compensation using recurrent back-propagation. Biol Cybern 66: 389397, 1992.[CrossRef][Web of Science][Medline]
Anastasio TJ and Robinson DA. The distributed representation of vestibulo-oculomotor signals by brain-stem neurons. Biol Cybern 61: 7988, 1989.[Web of Science][Medline]
Anastasio TJ and Robinson DA. Distributed parallel processing in the vertical vestibulo-ocular reflex: learning networks compared to tensor theory. Biol Cybern 63: 161167, 1990.[CrossRef][Web of Science][Medline]
Anderson JA. An Introduction to Neural Networks. Cambridge, MA: MIT Press, 1995.
Baker J, Goldberg J, Hermann G, and Peterson B. Spatial and temporal response properties of secondary neurons that receive convergent input in vestibular nuclei of alert cats. Brain Res 294: 138143, 1984.[CrossRef][Web of Science][Medline]
Baker JF, Perlmutter SI, Peterson BW, Rude SA, and Robinson FR. Simultaneous opposing adaptive changes in cat vestibulo-ocular reflex direction for two body orientations. Exp Brain Res 69: 220224, 1987a.[CrossRef][Web of Science][Medline]
Baker JF, Wickland C, and Peterson B. Dependence of cat vestibulo-ocular reflex direction adaptation on animal orientation during adaptation and rotation in darkness. Brain Res 408: 339343, 1987b.[CrossRef][Web of Science][Medline]
Bishop CM. Neural Networks for Pattern Recognition. Oxford, UK: Oxford Univ. Press, 1995.
Brettler SC and Baker JF. Directional sensitivity of anterior, posterior, and horizontal canal vestibulo-ocular neurons in the cat. Exp Brain Res 140: 432442, 2001.[CrossRef][Web of Science][Medline]
Cohen B and Gizzi M. The physiology of the vestibulo-ocular reflex. In: Textbook of Audiological Medicine, edited by Luxon L, Martini A, Furman J, and Stephens D. Oxford, UK: Oxford Univ. Press, 2003, chap. 41, p. 701716.
Cohen B, Kozlovskaya I, Raphan T, Solomon D, Helwig D, Cohen N, Sirota M, and Yakushin S. Vestibuloocular reflex of rhesus monkeys after flight. J Appl Physiol 73: 121S131S, 1992.[Abstract]
Curthoys IS and Markham CH. Convergence of labyrinthine influences on units in the vestibular nuclei of the cat. I. Natural stimulation. Brain Res 35: 469490, 1971.[CrossRef][Web of Science][Medline]
Dickman JD and Angelaki DE. Vestibular convergence patterns in vestibular nuclei neurons of alert primates. J Neurophysiol 88: 35183533, 2002.
Duensing F and Schaefer KP. Die Aktivität einzelner Neurone im Bereich der Vestibulariskerne bei Horizontal-beschleunigungen unter besonderer Berucksichtigung des vestibulären Nystagmus. Arch Psychiat Nervenkr 198: 225252, 1958.[CrossRef][Medline]
Fernández C and Goldberg JM. Physiology of peripheral neurons innervating otolith organs of the squirrel monkey. I. Response to static tilts and to long duration centrifugal force. J Neurophysiol 39: 970984, 1976.
Fukushima K, Perlmutter SI, Baker JF, and Peterson B. Spatial properties of second order vestibulo-ocular relay neurons in the alert cat. Exp Brain Res 81: 462478, 1990.[Web of Science][Medline]
Goldstein H. Classical Mechanics. Reading, MA: AddisonWesley, 1980.
Graf W, Baker J, and Peterson BW. Sensorimotor transformation in the cat's vestibuloocular reflex system. I. Neuronal signals coding spatial coordination of compensatory eye movements. J Neurophysiol 70: 24252441, 1993.
Hebb O. Organization of Behavior. New York: Wiley, 1949.
Highstein SM, Porrill J, and Dean P. Report on a workshop concerning the cerebellum and motor learning, held in St. Louis October 2004. Cerebellum 4: 140150, 2005.[CrossRef][Web of Science][Medline]
Hirata Y and Highstein SM. Acute adaptation of the vestibuloocular reflex: signal processing by floccular and ventral parafloccular Purkinje cells. J Neurophysiol 85: 22672288, 2001.
Hirata Y and Highstein SM. Plasticity of the vertical VOR: a system identification approach to localizing the adaptive sites. Ann NY Acad Sci 978: 480495, 2002.[CrossRef][Web of Science][Medline]
Ito M. The Cerebellum and Neural Control. New York: Raven Press, 1984.
Ito M. Historical review of the significance of the cerebellum and the role of Purkinje cells in motor learning. Ann NY Acad Sci 978: 273288, 2002.[CrossRef][Web of Science][Medline]
Judge SJ, Richmond BJ, and Chu FC. Implantation of magnetic search coils for measurement of eye position: an improved method. Vision Res 20: 535538, 1980.[CrossRef][Web of Science][Medline]
Kohonen T. Self-Organizing Maps. Berlin: Springer-Verlag, 2000.
Lisberger SG, Pavelko TA, Bronte-Stewart HM, and Stone LS. Neural basis for motor learning in the vestibuloocular reflex of primates. II. Changes in the responses of horizontal gaze velocity Purkinje cells in the cerebellar flocculus and ventral paraflocculus. J Neurophysiol 72: 954973, 1994a.
Lisberger SG, Pavelko TA, and Broussard DM. Neural basis for motor learning in the vestibuloocular reflex of primates. I. Changes in the responses of brain stem neurons. J Neurophysiol 72: 928953, 1994b.
Lisberger SG, Pavelko TA, and Broussard DM. Responses during eye movements of brain stem neurons that receive monosynaptic inhibition from the flocculus and ventral paraflocculus in monkeys. J Neurophysiol 72: 909927, 1994c.
Marr D. A theory of cerebellar cortex. J Physiol 202: 437470, 1969.
Medendorp WP, Van Gisbergen JA, Van Pelt S, and Gielen CC. Context compensation in the vestibuloocular reflex during active head rotations. J Neurophysiol 84: 29042917, 2000.
Melvill Jones G. How and why does the vestibulo-ocular reflex adapt? In: Disorders of the Vestibular System, edited by Baloh RW and Halmagyi GM. New York: Oxford Univ. Press, 1996, p. 8592.
Miles FA and Eighmy BB. Long term adaptive changes in primate vestibuloocular reflex. I. Behavioral observations. J Neurophysiol 43: 14061425, 1980.
Perlmutter SI, Iwamoto Y, Baker JF, and Peterson BW. Interdependence of spatial properties and projection patterns of medial vestibulospinal tract neurons in the cat. J Neurophysiol 79: 270284, 1998.
Quinn KJ, Didier AJ, Baker JF, and Peterson BW. Modeling learning in brain stem and cerebellar sites responsible for VOR plasticity. Brain Res Bull 46: 333346, 1998.[CrossRef][Web of Science][Medline]
Raphan T and Cohen B. The vestibulo-ocular reflex (VOR) in three dimensions. Exp Brain Res 145: 127, 2002.[CrossRef][Web of Science][Medline]
Raphan T and Schnabolk C. Modeling slow phase velocity generation during off-vertical axis rotation. Ann NY Acad Sci 545: 2950, 1988.[Medline]
Reed R. Pruning algorithmsa survey. IEEE Trans Neural Networks 4: 740747, 1993.[CrossRef]
Robinson DA. A method of measuring eye movement using a scleral search coil in a magnetic field. IEEE Trans Biomed Eng BME 10: 137145, 1963.
Rumelhart DE, Hinton GE, and Williams RJ. Learning internal representations by error propagation. In: Parallel Distributed Processing: Explorations in the Microstructure of Cognition: Foundations, edited by Rumelhart DE and McClelland JL. Cambridge, MA: MIT Press, 1986, p. 318362.
Rumelhart DE and McClelland JL. Parallel Distributed Processing: Explorations in the Microstructure of Cognition: Foundations. Cambridge, MA: MIT Press, 1986.
Sandwell DT. Biharmonic spline interpolation of GEOS-3 and SEASAT altimeter data. Geophys Res Lett 2: 139142, 1987.
Schnabolk C and Raphan T. Modelling 3-D slow phase velocity estimation during off-vertical axis rotation (OVAR). J Vestib Res 2: 114, 1992.[Medline]
Shaikh AG, Ghasia FF, Dickman JD, and Angelaki DE. Properties of cerebellar fastigial neurons during translation, rotation, and eye movements. J Neurophysiol 93: 853863, 2005.
Sheliga BM, Yakushin SB, Silvers A, Raphan T, and Cohen B. Control of spatial orientation of the angular vestibulo-ocular reflex by the nodulus and uvula of the vestibulocerebellum. Ann NY Acad Sci 871: 94122, 1999.[CrossRef][Web of Science][Medline]
Siebold C, Anagnostou E, Glasauer S, Glonti L, Kleine JF, Tchelidze T, and Buttner U. Canalotolith interaction in the fastigial nucleus of the alert monkey. Exp Brain Res 136: 169178, 2001.[CrossRef][Web of Science][Medline]
Siebold C, Glonti L, Glasauer S, and Buttner U. Rostral fastigial nucleus activity in the alert monkey during three-dimensional passive head movements. J Neurophysiol 77: 14321446, 1997.
Tan HS, Shelhamer M, and Zee DS. Effect of head orientation and position on vestibuloocular reflex adaptation. In: Annals of the NY Academy of Sciences, edited by Cohen B, Tomko D, and Guedry F. New York: New York Academy of Sciences, 1992, p. 158165.
Tan S and Mavrovouniotis ML. Reducing data dimensionality through optimizing neural network inputs. AIChE J 41: 14711480, 1995.[CrossRef]
Tiliket C, Shelhamer M, Tan HS, and Zee DS. Adaptation of the vestibulo-ocular reflex with the head in different orientations and positions relative to the axis of body rotation. J Vestib Res 3: 181195, 1993.[Medline]
Widrow B and Hoff ME. Adaptive switching circuits. IRE WESCON Convention Record Part 4: 96104, 1960.
Xiang Y, Raphan T, Cohen B, and Yakushin SB. Gravity-dependent and gravity-independent gain changes during vertical vestibulo-ocular reflex (VOR) adaptation. J Gravit Physiol 11: 912, 2004.
Yakushin SB, Bukharina SE, Raphan T, Buttner-Ennever J, and Cohen B. Adaptive changes in the angular VOR: duration of gain changes and lack of effect of nodulo-uvulectomy. Ann NY Acad Sci 1004: 7893, 2003a.[CrossRef][Web of Science][Medline]
Yakushin SB, Dai MJ, Suzuki J-I, Raphan T, and Cohen B. Semicircular canal contribution to the three-dimensional vestibulo-ocular reflex: a model-based approach. J Neurophysiol 74: 27222738, 1995.
Yakushin SB, Palla A, Haslwanter T, Bockisch CJ, and Straumann D. Dependence of adaptation of the human vertical angular vestibulo-ocular reflex on gravity. Exp Brain Res 152: 137142, 2003b.[CrossRef][Web of Science][Medline]
Yakushin SB, Raphan T, Büttner-Ennever J, Suzuki J-I, and Cohen B. Spatial properties of central vestibular neurons of monkeys after bilateral lateral canal nerve section. J Neurophysiol 94: 38603871, 2005a.
Yakushin SB, Raphan T, and Cohen B. Context-specific adaptation of the vertical vestibuloocular reflex with regard to gravity. J Neurophysiol 84: 30673071, 2000a.
Yakushin SB, Raphan T, and Cohen B. Gravity-specific adaptation of the angular vestibuloocular reflex: dependence on head orientation with regard to gravity. J Neurophysiol 89: 571586, 2003c.
Yakushin SB, Raphan T, and Cohen B. Spatial properties of central vestibular neurons. J Neurophysiol 95: 464478, 2006.
Yakushin SB, Raphan T, Suzuki J-I, Arai Y, and Cohen B. Dynamics and kinematics of the angular vestibuloocular reflex in monkey: effects of canal plugging. J Neurophysiol 80: 30773099, 1998.
Yakushin SB, Reisine H, Buttner-Ennever J, Raphan T, and Cohen B. Functions of the nucleus of the optic tract (NOT). I. Adaptation of the gain of the horizontal vestibulo-ocular reflex. Exp Brain Res 131: 416432, 2000b.[CrossRef][Web of Science][Medline]
Yakushin SB, Xiang Y, Raphan T, and Cohen B. The role of gravity in adaptation of the vertical angular vestibulo-ocular reflex. Ann NY Acad Sci 1039: 97110, 2005b.[CrossRef][Web of Science][Medline]
Yakushin SB, Xiang Y, Raphan T, and Cohen B. Spatial distribution of gravity-dependent gain changes in the vestibuloocular reflex. J Neurophysiol 93: 36933698, 2005c.
Zee DS, Yamazaki A, Butler PH, and Gucer G. Effects of ablation of flocculus and paraflocculus on eye movements in primate. J Neurophysiol 46: 878899, 1981.
Zhou W, Tang BF, and King WM. Responses of rostral fastigial neurons to linear acceleration in an alert monkey. Exp Brain Res 139: 111115, 2001.[CrossRef][Web of Science][Medline]
Zipser D and Andersen RA. A back-propagation programmed network that simulates response properties of a subset of posterior parietal neurons. Nature 331: 679684, 1988.[CrossRef][Medline]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |