|
|
||||||||
The Journal of Neurophysiology Vol. 79 No. 6 June 1998, pp. 3168-3188
Copyright ©1998 by the American Physiological Society
Department of Physiology, Northwestern University Medical School, Chicago, Illinois 60611
| |
ABSTRACT |
|---|
|
|
|---|
Beiser, David G. and James C. Houk. Model of cortical-basal ganglionic processing: encoding the serial order of sensory events. J. Neurophysiol. 79: 3168-3188, 1998. Several lines of evidence suggest that the prefrontal (PF) cortex and basal ganglia are important in cognitive aspects of serial order in behavior. We present a modular neural network model of these areas that encodes the serial order of events into spatial patterns of PF activity. The model is based on the topographically specific circuits linking the PF with the basal ganglia. Each module traces a pathway from the PF, through the basal ganglia and thalamus, and back to the PF. The complete model consists of an array of modules interacting through recurrent corticostriatal projections and collateral inhibition between striatal spiny units. The model's architecture positions spiny units for the classification of cortical contexts and events and provides bistable cortical-thalamic loops for sustaining a representation of these contextual events in working memory activations. The model was tested with a simulated version of a delayed-sequencing task. In single-unit studies, the task begins with the presentation of a sequence of target lights. After a short delay, the monkey must touch the targets in the order in which they were presented. When instantiated with randomly distributed corticostriatal weights, the model produces different patterns of PF activation in response to different target sequences. These patterns represent an unambiguous and spatially distributed encoding of the sequence. Parameter studies of these random networks were used to compare the computational consequences of collateral and feed-forward inhibition within the striatum. In addition, we studied the receptive fields of 20,640 model units and uncovered an interesting set of cue-, rank- and sequence-related responses that qualitatively resemble responses reported in single unit studies of the PF. The majority of units respond to more than one sequence of stimuli. A method for analyzing serial receptive fields is presented and utilized for comparing the model units to single-unit data.
The serial order of events and actions is critical in cognition and behavior. In addressing this issue more than four decades ago, Lashley (1951) The encoding model presented here is an implementation of the conceptual model of cortical-basal ganglionic processing proposed by Houk and Wise (1995)
Sequence encoding with an array of modules
The delayed sequence task begins with an instructional period during which three cues are illuminated in a particular serial order (Barone and Joseph 1989
Neurons were modeled as single membrane-bound compartments with passive leakage conductances. A first-order differential equation relates the membrane leakage current and synaptic currents to the membrane potential for a neuron, j (Eq. 1)
Alternate model assumptions
Most of the simulations reported in this paper used the model of the synaptic current detailed above to calculate excitatory and inhibitory synaptic currents from synaptic weights and presynaptic firing rates. This approach ignores the nonlinear effects of membrane potential on synaptic current values and thus treats the synapse as a "current source." To explore the limitations of the "current-source" synapse assumption, simulations were run with a more physiological synaptic model that treats the weighted sum of the presynaptic firing rates as a synaptic conductances. These excitatory (Eq. 7) and inhibitory (Eq. 8) conductance values are converted into currents by multiplying the difference between the membrane potential and the applicable synaptic reversal potential (Eq. 9)
Simulation methods
An object-oriented simulator was written using the C++ programming language. The simulations were performed using batch processes running across a group of 30 Hewlett-Packard workstations (HP 712/80 i, HP 715/50, and HP 715/33). The nonoverlapping cue presentation paradigm was modeled after the approach used in the caudate studies by Kermadi et al. (Kermadi and Joseph 1995 Glossary
Responses of an isolated module
The response of an isolated module (Fig. 1) to a single cue input serves to illustrate the model's basic processing operations. In its initial resting state, the module's units are quiescent except for the GPi unit, which exists in a tonic state of moderate activation (Fig. 3, GPi). A single event-related input is pulsed on and then off to simulate the onset and offset of an instructional cue (Fig. 3, Cue). This input induces the spiny unit in the CD layer to fire phasically. The short burst of CD activity (Fig. 3, CD) produces a momentary pause in GPi activity (Fig. 3, GPi), thus releasing the T unit from a state of tonic inhibition. Transient removal of pallidal inhibition produces a slow depolarization of the T unit with dynamics initially dominated by the passive properties of the unit (Fig. 3, T). This slow depolarization allows the activation variable, m, to increase, creating an inward calcium spike, which then quickly depolarizes the T unit Interacting array of modules: emergence of spatial patterns
Sensory inputs produce sustained activations within cortical-thalamic loops and thus alter the internal states of individual modules, as illustrated in the previous section (Fig. 3). In an array of modules (Fig. 2), internal state information serves as an additional input modality to the CD layer. These recurrent (R) inputs provide information about past events that can influence future states of the model, thus providing a way of linking temporally spaced sensory inputs.
Tolerance to random corticostriatal weights
The corticostriatal weights of the network discussed in the previous section were selected randomly from a uniform distribution of values spanning the closed-positive interval defined by a maximum (MAX) and range (RANGE) of weight values. The network was instantiated and tested at several combinations of MAX and RANGE until it produced an unambiguous set of PF patterns in response to the 15 sequential contexts. An instantiation of the network that produced such "perfect" performance of the task was found within the first few instantiation attempts. Given the ease by which appropriate distribution parameters were established in this initial study, it appeared quite likely that other combinations of MAX and RANGE combinations also might produce perfect networks.
Analysis of receptive fields
Let us return to the PF activation diagram (Fig. 5) introduced earlier to illustrate the model's ability to encode sequential inputs within spatial patterns of sustained activation. It is also interesting to examine Fig. 5 in a column-wise fashion
Alternative modeling assumptions
To better understand the dependence of the simulation results on our modeling assumptions, we repeated the group of simulations from Fig. 6A with an alternative synaptic model, decreased activation function slope, and increased membrane time constants. In general, we found that our results were quite insensitive to changes in these parameters within the GPi, T, and PF layers. However, they were quite sensitive to changes in the CD layer model, and these findings are reported here. Finally, we repeated simulations using a CD layer with different levels of collateral inhibition and also a feed-forward model of caudate inhibition.
ACTIVATION FUNCTION SLOPE.
We explored the sensitivity of our results to the slope parameter, b, of the sigmoidal activation function of the units in the CD layer. The preceding studies all used an extremely steep slope parameter of 50 in the CD layer. Repeating the study of Fig. 6A with slopes of 5 demonstrated no significant qualitative or quantitative differences in results. However, further reducing the slope to 1 qualitatively changed the shape of the effective coding area and decreased the number of perfect networks to 10 (Fig. 6C). Note that the network fails to encode any sequences at low to moderate values of RANGE.
TIME CONSTANT.
The results in Fig. 6A were also sensitive to the time constant of the CD layer units. Increasing the membrane time constant from 15 to 50 ms (by increasing the membrane capacitance from 0.5 to 1.67 nF) produced a considerably better performance (Fig. 6D). In addition to displaying a qualitatively larger effective coding region, this set of studies produced 460, as compared with 270, perfect networks.
ALTERNATE SYNAPTIC MODELS.
As outlined in METHODS, the model assumes a "current-source" representation of synaptic action. We repeated the random corticostriatal weight studies of Fig. 6A with a more realistic synaptic model in the CD layer. In this alternative synaptic model, excitatory (Eq. 7) and inhibitory (Eq. 8) conductance values of the CD layer are converted into currents by multiplying the difference between the membrane potential and the applicable synaptic reversal potential (Eq. 9). Figure 6E displays the results of a study using an excitatory reversal potential, Eex, of 0 mV and an inhibitory reversal potential, Einh, of LEVELS OF COLLATERAL INHIBITION.
In both the current-source and more realistic synaptic models, adjustments in the level of inhibition have a scaling effect on the number, location, and spread of perfect networks within the MAX-RANGE parameter space. Increases in the level of collateral inhibition produce increases in the number of perfect networks and area of the effective region. However, because increases in inhibition also shift the region to larger values of RANGE, and thus a larger area of the MAX-RANGE parameter space, the overall proportion of perfect networks to imperfect networks stays approximately the same.
FEED-FORWARD INHIBITION.
Although collateral inhibition is strongly suggested by the existence of GABAergic spiny neuron collaterals as well as by a limited amount of physiological evidence (Groves 1983 These simulation results suggest that the circuits linking the basal ganglia, thalamus, and cortex have an inherent capacity for encoding the serial order of events. Whereas we specifically modeled the encoding of a sequence of simple visual inputs, the same mechanisms are equally applicable to the encoding of the serial order of other sensory or internal events, as may occur, for other cortical-basal ganglionic loops, in the haptic recognition of an object or in registering the sequence of words in a sentence. This special computational property does not require adaptive training mechanisms of any kind, although adaptive tuning might improve its encoding efficiency. Another important feature of the model is that the receptive fields of its units bear a close qualitative resemblance to the receptive fields observed in single unit studies.
Computational elements of the model
The ability of the model to encode the serial order of sequential events stems from three computational elements that combine in a cooperative manner due to the structure of the cortical-basal ganglionic network. The computational elements are: working memory, competitive pattern classification, and recursion. In this section, we review the origin and analyze the role of each of these elements.
WORKING MEMORY.
The model's cortical-thalamic loops support self-sustained activations that provide working memories of the results of prior processing. Two features contribute to this operation: focused positive feedback and low-threshold calcium channels. Positive feedback, focused within individual loops, endows the bistability that permits activations to persist after the return of tonic pallidal inhibition. Bistability substantially decouples the dynamics of working memory from operations in the CD layer. Calcium channels inactivate too rapidly to play a role in maintaining either of the stable states. Instead, they are important for initiating loop activity through a postinhibitory rebound of thalamic membrane potential. Rebound is necessary for initiating loop activity, given the double inhibitory pathway through the basal ganglia. Without a rebound mechanism, loop activation would be solely dependent on an excitatory input such as a cortico-cortical projection to PF units.
COMPETITIVE PATTERN CLASSIFICATION.
The model's CD units perform competitive pattern classification on a vector of corticostriatal inputs. Their random synaptic weights provide each unit with a unique perspective on the state of event and recurrent PF activity. Some units react strongly to certain combinations of input whereas others do not react at all. Successful pattern classification does not require training but does require weight matrices with sufficient diversity and a gain balanced with the degree of striatal inhibition.
RECURSION.
The units in the CD layer perform competitive pattern recognition on a spatial array of PF inputs and, in this sense, are similar to units in traditional feed-forward networks. These inputs are composed of a mixture of sensory event-related and recurrent state-related inputs. The latter, being working memories of the results of prior processing, provide a special context for the interpretation of new event inputs. They provide a continuity with the past that is not found in feed-forward networks and is critical for sequence encoding.
COMPUTATIONAL ELEMENTS OF OTHER MODELS OF SEQUENTIAL PROCESSING.
Models of sequential or temporal processing have been proposed for the hippocampus (Granger et al. 1994 Role of learning and modulation
LEARNING.
Many neural network models rely on adaptive mechanisms such as Hebbian learning, reinforcement learning, or backpropagation for training their synapses to participate in useful computations. For example, competitive Hebbian learning has been used to explain the formation of orientation-specific neurons in the visual cortex (Bienstock et al. 1982 MECHANISMS FOR NEUROMODULATION.
Only a portion of MAX-RANGE parameter space produced perfect networks (Fig. 6). If corticostriatal synaptic weights do not naturally exist in this region, it might be functionally advantageous to have a mechanism for guiding them into it. A number of modulatory mechanisms that have been shown to affect corticostriatal interactions could provide such a mechanism (Kitai and Surmeier 1993 Understanding the model's receptive fields
The receptive fields of the model units are remarkably similar to the receptive fields observed in single-unit studies. Still, several differences between the model and experimental data must be addressed. Before addressing these differences, it is helpful to clarify a few of our assumptions regarding receptive fields.
PARADIGM-RELATED ISSUES.
Receptive fields are largely artificial constructs, reflections of an experimental paradigm and necessarily constrained by the design of the experiment. Different behavioral paradigms reveal different cross-sections of a unit's response characteristics. This point is especially true when the concept of receptive fields is applied to sequential tasks and provides the key for resolving many of the apparent discrepancies between our model data and experiment.
MODEL-RELATED ISSUES.
In several cases, differences between the model behavior and single-unit data can be traced to the simplified structure and physiology of the model. For example, in the model's PF layer, we have modeled E units as pure sensory neurons without any ascending input from the thalamus and R units as pure context units without any cortical input from sensory association areas. This approach was taken for the sake simplicity even though such a dichotomous input arrangement is not found in the PF. Single-unit recordings demonstrate that, although some neurons have mainly event-related response components and others have mainly the sustained discharges associated with working memory, many have mixtures of these two components (Funahashi and Kubota 1994 Extensions to other cortical areas
Because the distributed neuronal architecture of Fig. 2 is shared by several other areas of the cerebrum, the mechanisms proposed in the present paper might generalize to additional cognitive-motor processing stages. The abbreviated discussion provided in this section will be limited to comparing PF with two cortical areas known to participate in the execution of sequential limb movements, the supplementary motor area (SMA) and the primary motor cortex (M1). Figure 12, left, schematically illustrates that both SMA and M1 have loops through the basal ganglia that form networks analogous to the PF-basal ganglionic modular array analyzed in the present paper, based on the transsynaptic transport of viral markers (Middleton and Strick 1997a
Conclusions
In this paper, we have shown how a mechanism for encoding the serial order of events could emerge from known interactions between the prefrontal cortex, basal ganglia, and thalamus. This sequence encoding ability is a result of the macro-organization of these circuits rather than the organization of individual synapses. Accordingly, the model's synapses do not need to be individually adapted through training. Rather, the synapses of the model's striatal layer simply require global adjustment into a favorable region of a random weight space. At the behavioral level, this result implies that the brain is capable of creating working memory representations of sequential stimuli without previous training or exposure.
![]()
INTRODUCTION
Abstract
Introduction
Methods
Results
Discussion
References
postulated that the brain analyzes and controls serial order by creating and using a spatial pattern of neural activity, which he referred to as a "determining tendency" or idea. To control sequential actions, this spatial pattern would require translation into expressive action in the time domain through a process he likened to the application of "syntax" in the formation of language from ideas. The inverse transformation also must exist to transform temporally spaced sensory experiences into a sustained spatial pattern of brain activity, for example, to construct a concept from sequential sensations during haptic manipulation or visual survey.
; Wiegersma et al. 1990
), serial-order recognition (Kesner et al. 1994
), or recency judgments (Milner et al. 1991
). Monkeys subjected to bilateral lesions of areas 46 and 9 have difficulty monitoring sequences of novel stimuli (Petrides 1991
). The basal ganglia also are implicated in serial processing through the impairments of cognitive and motor skills in Parkinson's (Brown and Marsden 1990
; Harrington and Haaland 1991
) and Huntington's disease (Gabrieli 1995
; Willingham and Koroshetz 1993
). Some of these deficits are strikingly similar to the ordering deficits of frontal patients (Sagar et al. 1988
; Sullivan and Sagar 1989
; Willingham and Koroshetz 1993
).
; Funahashi et al. 1993
; Kermadi and Joseph 1995
; Kermadi et al. 1993
). Responses that are initiated by the instructions and sustained through the delay period could represent conversions of temporal sequences of sensory input into spatial patterns of neural activation. Similarly, some motor-preparation units in the frontal eye fields, caudate nucleus, and globus pallidus are related to the serial order of the subsequent sequential actions (Barone and Joseph 1989
; Kermadi and Joseph 1995
; Kermadi et al. 1993
; Mushiake and Strick 1995
; Tanji and Shima 1994
). Such activity could represent commands for the conversion of a spatial pattern of activation into the temporal domain of movement. Together, these studies provide persuasive evidence for the existence of conversion mechanisms bridging the temporal domain of sensory input, the spatial domain of Lashley's "determining tendency," and the temporal domain of behavioral expression.
, 1990
; Fuster and Alexander 1971
; Goldman-Rakic 1995
; Goldman-Rakic et al. 1990
; Petrides 1991
). Evidence for working memory activity within analogous areas of the human prefrontal cortex comes from functional imaging studies (Fiez et al. 1996
; Jonides et al. 1993
; McCarthy et al. 1994
). Discharge that is sustained through the delay period also has been identified in the caudate (Hikosaka et al. 1989b
; Schultz and Romo 1992
) and SNr (Hikosaka and Wurtz 1983
) and in the thalamus (Fuster and Alexander 1973
). Evidently neural correlates of spatial working memory and serial processing are found in many of the same areas of the CNS. Indeed, it has been suggested that the mechanisms providing temporal integration in sequencing tasks be viewed as extensions of those providing working memory representations in delayed-response tasks (Fuster 1985
; Goldman-Rakic 1987
).
![]()
MODEL
. These authors based their conceptual model on the modular anatomic organization of "parallel loops" linking the frontal cortex, basal ganglia, and thalamus, originally conceived by Alexander, DeLong, and Strick (1986) and supported by recent transsynaptic labeling studies (Middleton and Strick 1997a
). The present encoding model deals specifically with the loop through area 46 in the prefrontal cortex, through caudate nucleus (CD), internal segment of the globus pallidus (GPi), thalamus (T), and back to the PF. We follow Wise and Houk (1994)
in assuming that this macroscopic module is itself composed of an array of similarly organized microscopic modules. Thus the (microscopic) module illustrated in Fig. 1 follows the basic anatomic plan of the prefrontal cortical-basal ganglionic loop.

View larger version (12K):
[in a new window]
FIG. 1.
Individual cortical-basal ganglionic module (adapted from Houk and Wise 1995
). Convergent projections from many cortical cells (C) make excitatory synapses (depicted as
, where the distribution of dot sizes represents a distribution of synaptic weights) with a spiny neuron in the caudate nucleus (CD). This CD unit sends an inhibitory projection (depicted as "a") to a unit in the internal segment of the globus pallidus (GPi), which in turn inhibits a thalamic relay unit (T). Thalamic unit makes a reciprocal excitatory connection with a cortical unit to complete the module's recurrent loop.
, 1988
). Each medium spiny neuron receives input from ~10,000 different corticostriatal afferents (Wilson 1995
). This highly convergent neuronal architecture, together with the physiological properties of the cells, led Houk and Wise (1995)
to suggest that spiny neurons are positioned ideally for detecting contextual events of behavioral significance. With respect to the instructional phase of a delayed-response task, contextual event detection might involve the recognition of stimulus-related signals conveying an instructional cue's spatial position, identity, or other physical characteristics. In a serial task, context also would include intrinsic signals such as working memory representations of previous stimuli.
). One hypothesis favors convergent input from cells in functionally related, yet distinct, cortical areas (Flaherty and Graybiel 1993
; Parthasarathy et al. 1992
; Yeterian and Van Hoesen 1978
), whereas another favors convergence from neighboring cells in a single cortical area (Selemon and Goldman-Rakic 1985
; Strick et al. 1995
). Either anatomic arrangement would provide the convergence of sensory and recurrent projections onto the CD layer as required by the model. Corticostriatal projections from the prefrontal cortex and several of its reciprocally linked areas (e.g., posterior parietal, orbitofrontal, anterior cingulate, and superior temporal cortex) converge in a general way onto the same volume of caudate, although the predominate pattern is one of segregation or interdigitation of terminal fields as opposed to frank intermixing (Selemon and Goldman-Rakic 1985
). Alternatively, cue-related sensory signals in posterior parietal might be relayed to CD units via the sensory-related cells in the PF through cortical-cortical projections (Bates and Goldman-Rakic 1993
; Selemon and Goldman-Rakic 1988
). What is important to note here is that either mechanism of convergence could be used to provide the model's caudate layer with sensory-related input information.
), which in turn project to nuclei of the thalamus including ventralis anterior (VA) and ventralis lateral (VL) (DeVito and Anderson 1982
). Neurons in the GPi are characterized by a high rate of tonic activity interspersed with momentary pauses due to spiny neuron firing episodes (Wilson 1990
). The tonic activity inhibits projection targets in the thalamus, and the pauses produce a disinhibition of thalamic neurons (Deniau and Chevalier 1985
). This disinhibition initiates a postinhibitory rebound discharge response within thalamic relay neurons that is mediated, in part, by low-threshold T-type calcium channels (Wang et al. 1991
). Thus the dual inhibitory action of this pathway serves to activate thalamic discharge through disinhibition (Deniau and Chevalier 1985
).
). An additional loop is formed by neurons in area 46 of the PF that project in a reciprocal manner back to several thalamic nuclei including MD and VA (Jacobson et al. 1978
; Siwek and Pandya 1991
). It has been suggested that such a cortical-thalamic loop has the potential, given sufficient gain, for sustaining activations, like those thought to be correlates of working memory, through positive feedback (Dominey and Arbib 1992
; Hikosaka 1989
; Houk and Wise 1995
).
; Funahashi et al. 1993
; Kermadi and Joseph 1995
; Kermadi et al. 1993
). After a short delay period, the subject is required to touch the cues in the same order in which they were illuminated. Because the present model focuses on the encoding problem, we will only consider the instruction and delay phases of the task.
). Neurons in area 7a respond to the retinal location of a visual stimulus with receptive fields that are typically unimodal and broadly tuned (Robinson et al. 1978
). Such cue-related signals could be conveyed to cells of the prefrontal cortex via corticocortical projections. Clearly, the model's labeled-line inputs do not exploit much of the rich information contained in parietal responses; however, this simplification allows us to focus on the ordinal, rather than spatial, aspects of the encoding task.

View larger version (26K):
[in a new window]
FIG. 2.
Array of cortical-basal ganglionic modules. Three modules of the type shown in Fig. 1 are combined to illustrate the organization of a modular array regulating prefrontal (PF) cortex activity. C units of Fig. 1 are divided into 2 categories. Those that receive recurrent input via the basal ganglia and thalamus are designated R (recurrent) units, whereas those receiving cue-related input from posterior parietal cortex are designated event (E) units. CD units receive convergent input from many R and E units and themselves are interconnected by inhibitory collaterals to form a competitive network (shown symbolically by the shaded gray area).

View larger version (19K):
[in a new window]
FIG. 3.
Response of an isolated module to a cue input. Single instructional Cue is pulsed on and off. Input from this cortical event unit depolarizes the CD unit, leading to its activation. Inhibitory input from the CD unit causes the tonically active GPi unit to hyperpolarize and pause. This pause in GPi inhibition produces a rebound response in the T unit. Activation of the T unit is relayed to the cortical unit that participates in this module, and its activity is sustained by positive feedback between the T and C units. This illustrates the bistable nature of the model's cortical-thalamic loop. Solid and dashed traces depict membrane potential and activation, respectively.
); this serves to distribute information to CD units across the entire modular array. There is a mixture of input from the E units described in the previous paragraph and input from the R units in the PF cortex, so named because they receive recurrent input from the model's processing modules. The R inputs provide each module access to sustained cortical-thalamic activity, representing results obtained from the processing of prior events. Thus each CD unit is presented a spatial pattern of input representing both present events and context signals based on the processing of prior events.
; Katayama et al. 1981
; Rebec and Curtis 1988
; Wilson 1995
). Wickens (1993)
has modeled spherical zones of mutual inhibition that he calls inhibitory domains. We instead model competitive interactions with a fully connected network of inhibitory CD units. The use of a single domain is a simplification that neglects the potential for more complex interactions.
![]()
METHODS
Abstract
Introduction
Methods
Results
Discussion
References
The passive electrical properties of the model's neurons are representative of those reported for the cortex, striatum, and thalamus (Connors et al. 1982
(1)
; McCormick and Huguenard 1992
; McCormick et al. 1985
; Wilson 1990
). A membrane capacitance (C) value of 0.5 nF and leakage conductance (gL) of 0.0333 µS gives each neuron a time constant of 15 ms. Resting potential (EL) was set to
60 mV. The membrane leakage currents are defined by Eq. 2
The model represents synapses as scalar weights (wj,k) between neurons k and j. Making the simplifying assumption that inputs sum in a linear fashion, we lump the action of many synapses into a single current. The weighted sum of presynaptic firing rates gives the synaptic current (Eq. 3)
(2)
A sigmoidal activation function (Eq. 4) with a threshold (Vth) of
(3)
55 mV is used to convert membrane potential into an output firing rate within a normalized range between 0 and 1. In the CD layer, a large slope parameter, b in Table 2, was used to model the sharp transitions between "up" and "down" states displayed by striatal spiny neurons (Wilson 1995
)
The caudate layer of the module receives convergent excitatory inputs from neurons of the PF cortex, modeled by Eq. 3. In addition, CD units compete through the inhibitory action of GABAergic collaterals. The total inhibitory current for each CD unit is determined by scaling the sum of the activations of all other CD units in the layer; CD units do not receive self-inhibitory input.
(4)
View this table:
TABLE 2.
Synaptic input weights and activation function parameters
0.1665 nA) that depolarizes the membrane potential to Vth. At Vth, the output of the GPi unit is maximally responsive to inhibitory input from the CD layer. The synaptic weights between CD and GPi layers were adjusted such that each CD input strongly inactivated its GPi target.
). This rebound current permitted firing in response to pauses in the inhibitory input from GPi. It was modeled as specified by Wang et al. (1991)
The voltage dependence a of the steady-state activation and inactivation gates m and h was modeled with the Boltzman equation (Eq. 6)
(5)
The constants for these curves were set at physiologically plausible values noted in Table 1. The kinetics of the channel's gating variables both follow first-order differential equations with voltage-dependent time constants (Wang et al. 1991
(6)
).
View this table:
TABLE 1.
T-type calcium channel parameters
76 mV under inhibition from tonically active pallidal units. This hyperpolarized membrane potential results in a strong rebound response from the calcium channel. The recurrent excitatory weights from T to PF and back were selected such that they would produce sustained cortical-thalamic firing rates once the PF unit was activated. All synaptic weights are listed in Table 2.
(7)
(8)
(9)
; Kermadi et al. 1993
). The task is simulated by sequentially toggling the activation of the model's event-related (E) neurons on and then off (Fig. 2). In the Kermadi paradigm, consecutive cues are illuminated for 800 ms at 1,500-ms intervals. However, the time necessary for the network to reach equilibrium was much less than the 800 ms between changes in the state of the cues and varied considerably according to the magnitude of the corticostriatal weights. To minimize the amount of wasted simulation time, the original paradigm was modified so that the three onsets and offsets of the cue sequence were varied to trigger as soon as the network settled into a stable equilibrium.
PF
prefrontal cortex
CD
caudate nucleus
GPi
internal segment of globus pallidus
T
thalamus
MAX
maximum random synaptic weight
RANGE
range of random synaptic weight distribution
V
membrane potential
C
membrane capacitance
IL
leakage current
gL
leakage conductance
EL
membrane resting potential
Isyn
synaptic current
wex
excitatory synaptic weight
winh
inhibitory synaptic weight
Vth
threshold potential
Z
presynaptic firing rate
b
slope of sigmoidal activation function
IT
low-threshold calcium T-type current
ECa2+
calcium reversal potential
m
activation gating variable
h
inactivation gating variable
gT
maximum T-type conductance
a
steady-state activation/inactivation of m and h gates
Vh
half-maximal voltage
k
Boltzman equation slope parameter
gex
excitatory synaptic conductance
ginh
inhibitory synaptic conductance
Eex
excitatory synaptic reversal potential
Einh
inhibitory synaptic reversal potential
![]()
RESULTS
Abstract
Introduction
Methods
Results
Discussion
References
driving it into an activated state (Fig. 3, T). The C, which receives an excitatory input from the T layer, subsequently begins to fire ~32 ms after the CD unit first crossed its firing threshold (Fig. 3, C). Most of the signal pathway's 32-ms delay is due to the kinetics of the thalamic T-type calcium channel. Reciprocal excitatory inputs from the C unit stabilize the membrane potential of the T unit at a level above threshold as it begins to repolarize. The reciprocal system quickly latches into a state of sustained activation that is maintained even after the return of tonic GPi inhibition. The cortical-thalamic loop is a bistable system because it has two stable equilibrium states (activated and inactivated) at moderate levels of pallidal input. Corticothalamic bistability is one of the key computational features of the Houk and Wise module. The transition from the activated state back to the inactive state requires a burst of inhibitory input to this bistable loop. Such a burst response could be effected by a burst of excitatory input to the GPi from the subthalamic nucleus (STN) of the indirect pathway. Presumably, this burst of STN activity would reflect activation of other spiny units within the striatum. The present simulation does not include this mechanism.

View larger version (34K):
[in a new window]
FIG. 4.
Response of CD and PF layers to a sequence (ABC) of cues. Onset of cue A produces brief phasic and longer-duration burst responses among the competing CD units. Bursting CD units trigger bistable cortical-thalamic loops, leading to sustained activations of the corresponding R units in PF cortex (Fig. 3). Recurrent input from R units provides a context that influences the responses of the CD layer during subsequent cue presentation. By the end of the period of cue presentation, the particular temporal sequence of events is effectively encoded by the spatial pattern of sustained activity in the R units of the PF layer.
) and SNr (Hikosaka and Wurtz 1983
) reporting that 10% (80/867) and 16% (15/95) of task-related neurons in these areas, respectively, display sustained responses.

View larger version (51K):
[in a new window]
FIG. 5.
Distinctive spatial patterns of sustained PF activity generated in a perfect network by the 15 sequential contexts that result from the presentation of 6 test sequences of cues (ABC, ACB, BAC, BCA, CAB, and CBA). In each row, the darkened (gray) squares indicate which PF units were active after each period of cue presentation. Note that each row is characterized by a different spatial pattern, indicating that each sequential context is encoded by a unique spatial pattern of PF activity. Columns indicate how each module participates in this encoding task.

View larger version (51K):
[in a new window]
FIG. 6.
Sequence encoding performance of networks instantiated with random corticostriatal weights. Each pixel represents a network instantiated with a uniform distribution of random corticostriatal weights defined by combinations of MAX and RANGE value. Pixel color indicates the number (0-15) of distinct PF patterns produced by that network in response to 15 sequential contexts. A: network with current-source synapses in CD layer. B: current-source networks that produced 15 PF codes in
1 of 10 instantiations. C: current-source networks with decreased activation function slope; D: with increased CD layer time constant. E: network with reversal potential synapses in CD layer. F: network with feed-forward CD layer inhibition.
for a total of 55,640 network instantiations. Figure 6B indicates all parameter combinations which produced "perfect" performance in
1 of 10 instantiations. Note that the parameter combinations producing perfect performance do not fall along the line of maximum RANGE (i.e., the main diagonal of the figure). This result may be due to the fact that maximal RANGE values produce a greater proportion of near-zero weight values. Rather than contributing to distinct network responses, these near-zero weights might be largely useless.
the perspective of a physiologist recording the receptive fields of single units. However, first it is advantageous to make a slight modification to the diagram. Recall that PF units, once activated, remain active through the remainder of the sequence due to sustained cortical-thalamic feedback. Accordingly, with the exception of rows A, B, and C, the rows of Fig. 5 represent responses to the current cue as well as sustained activations produced by previous cues within the sequence. For example, unit 2 in Fig. 5 begins firing when cue C arrives first in a sequence and sustains this activation through the presentation of subsequent cues A and B. These "working memory" squares of the plot, in this case those corresponding to CA, CB, CAB, and CBA, are redundant and thus obscure the responses defining a unit's receptive field. In Fig. 7, the working memory activations are eliminated, and thus each column of the plot can be thought of as a binary vector defining the "receptive field" of a PF unit.

View larger version (43K):
[in a new window]
FIG. 7.
Receptive fields in a network of 30 units. Each column defines the receptive field of an individual unit. Units 2 and 9 display context-dependent Rank1(C) and Seq3(ABC) behavior. Unit 18 displays Cue(A) behavior that is similar to working memory activity. Most units respond to several different serial contexts.
) and frontal eye field (FEF) (Barone and Joseph 1989
), caudate (Kermadi and Joseph 1995
; Kermadi et al. 1993
), and GP (Mushiake and Strick 1995
) during the presentation phase of delayed sequencing experiments. Unit 9, which responds to cue C when it is preceded by the sequence AB, displays a sequence dependence we term Seq3(XYZ). Other instantiations produced units with Seq2(XY) responses. This type of unit has been identified experimentally in the FEF (Barone and Joseph 1989
) and caudate (Kermadi and Joseph 1995
; Kermadi et al. 1993
).
we refer to these as "compound" receptive fields. For example, unit 18 responds to cue A, independent of serial rank or context. This compound receptive field, which we term Cue(X), is composed of a mixture of Rank1(X), Seq2(YX), Seq2(ZX), Seq3(YZX), and Seq3(ZYX) responses. Such a response is similar to the spatial working memory responses of the dorsolateral PF (Funahashi et al. 1989
, 1993
). Cue-related responses also have been recorded in the caudate during spatial-delayed sequencing; however, in contrast to their PF counterparts, they have phasic activations (Kermadi and Joseph 1995
; Kermadi et al. 1993
) .

View larger version (41K):
[in a new window]
FIG. 8.
Thirty-five most-common receptive fields displayed by 20,640 units. Receptive fields are sorted by decreasing frequency beginning at bin 1. Simple receptive fields represented include Rank1(B) in bin 3, Seq2(CB) in bin 6, pure Rank1 in bin 8, pure Rank2 in bin 19, and Cue(B) in bin 35. Note that the vast majority of bins represent complex fields.
). Similarly, units (bin = 19, n = 243) responding to all six Seq2(XY) contexts (i.e., AB, AC, BA, BC, CA, and CB) can be classified as pure Rank2 and units (not shown, n = 27) responding to all six Seq3(XYZ) contexts as pure Rank3. However, Rank2 and Rank3 responses have not yet been described in the literature.
). Several other compound receptive fields resemble those observed in single-unit studies. For example, two types of units, one (not shown, n = 27) displaying both Rank1(X) and Rank2(X) and the other (not shown, n = 32) displaying a mixture of Rank2(X) and Rank3(X) activity, resemble units reported in the FEF (Barone and Joseph 1989
). Units with a combination of Rank2(X) and Rank3(X) activity also have been reported in the caudate (Kermadi and Joseph 1995
; Kermadi et al. 1993
).
90 mV. Although the results of this group of simulations (Fig. 6E) looks qualitatively similar to the current-source results of Fig. 6A, only 64 perfect networks, compared with 270 in the current-source case, were produced. A further decrease in coding ability was observed in simulations using an inhibitory reversal potential of
70 mV, which only produced 34 perfect networks.
; Katayama et al. 1981
; Rebec and Curtis 1988
; Wilson 1995
), there is also clear evidence for feed-forward inhibition via GABAergic interneurons (Kitai and Surmeier 1993
). To address this possibility, we constructed an alternative model incorporating feed-forward, rather than collateral, inhibition. As before, the feed-forward model was composed of 30 modules coursing through the PF, CD, GPi, and T layers; however, the inhibitory collaterals of the CD layer were omitted. In their place, an additional, and separate, layer of 30 interneuron units was added to provide feed-forward inhibition to the original CD layer. Like the original CD layer, each unit in the interneuron layer received corticostriatal projections from the entire PF layer; however, instead of making inhibitory projections to the GPi layer, each unit sent inhibitory projections to all of the units of the original CD layer.
1 of 10 instantiations. Comparing Fig. 6, B and F, note that although the feed-forward version of the model is capable of encoding sequences, it does so within a region of the parameter space that is smaller and distinctly different in shape. Like the case with low activation function slopes (Fig. 6C), the feed-forward model fails to resolve small differences in corticostriatal weight and thus fails to produce perfect networks in regions of low RANGE. In addition, fewer instantiations of the feed-forward model produce perfect performance on the encoding task. Both of these results suggest that the range of appropriate corticostriatal weights is much smaller for the feed-forward version.
![]()
DISCUSSION
Abstract
Introduction
Methods
Results
Discussion
References
): cortical-cortical loops with PP cortex, cortical-cortical loops within PF, cortical-cerebellar loops between PF and dentate nucleus, and trans-striatal loops through basal ganglia. It is likely that each of these loops contributes to the net gain of positive feedback, thus contributing to sustained activity in PF neurons. For simplicity, we focused here on just one loop, the cortical-thalamic. Although not the emphasis of our study, sustained trans-striatal activity did occasionally arise within the network.
; Fuster and Alexander 1971
). Other models of sequence encoding have used decaying working memory profiles (e.g., Wang and Arbib 1990
). Although decaying profiles have certain computational advantages, within a complex recurrent network they have the potential to produce limit cycles or chaotic states. Sustained working memory traces, by contrast, tend to stabilize the overall network and ensure that it settles into a stable equilibrium. Sustained traces also create PF codes that are relatively insensitive to the rate of cue presentation. Accordingly, they encode the serial order rather than the timing of the cue presentation. Although this lack of timing information might pose a problem for a network attempting to encode a musical phrase (Cummins et al. 1993
), it can be an asset in skill learning. For example, motor responses in a delayed-sequencing task may be performed slowly initially. As the speed of the cue presentation and motor performance is increased, the representations in the PF would remain unchanged. This should simplify learning because what is learned during a slow-motion rehearsal remains relevant to performance at faster speeds.
; Katayama et al. 1981
; Park et al. 1980
; Rebec and Curtis 1988
); 2) inputs from the cerebral cortex to GABAergic interneurons produce feed-forward inhibition (Jaeger et al. 1994
; Kitai and Surmeier 1993
); or 3) both mechanisms coexist (Kita 1993
). Our simulations, which contrast the performance of collateral and feed-forward inhibition, indicate that both help to regulate the extent of CD layer activation, which ultimately results in sparser patterns of PF layer activation. Sparse PF patterns allow room for a greater number of sequences to be stored within the PF layer, whereas excessive CD layer activation leads to a completely activated PF layer
a useless state from an encoding standpoint. This effect is particularly important as increasing numbers of PF units become activated by successive cues in the sequence.
). Spiny units display a characteristic mixture of burst durations in the simulations with collateral inhibition, in good agreement with single unit data (Wilson 1990
). Feedforward inhibition, on the other hand, produces spiny unit activations that move in unison rather than in competitive opposition. Instead of differentiating between similarly activated units, feed-forward inhibition simply reduces the activation of all units through global downregulation. The simulations indicate that feed-forward inhibition (Fig. 6F) only produces perfect networks at large values of RANGE, whereas collateral inhibition is effective across a larger portion of the parameter space (Fig. 6, A and B). The greater magnification of small differences in collateral networks is accentuated in current-source type synapses as opposed to synapses with reversal potentials because the former do not suffer from shunting or saturation. Activation functions with steep slopes, justified on the basis of the abrupt transitions between "up" and "down" states observed in membrane potentials (Wilson 1995
), also enhance the CD layer's ability to magnify small differences in the input patterns (Fig. 6, A vs. C).
independent of the initial state of the network. However, Fukai and Tanaka (1997)
proved that collateral inhibitory networks with small or zero self-inhibition are sensitive to initial conditions and thus do not always select the unit with the strongest inputs. Consider the simple case of a two-neuron inhibitory network (A and B) without self-inhibition where unit A starts in an active state (and thus inhibits unit B). If equal inputs then are applied to the network, unit A will remain the winner because of inhibition to unit B. To win, unit B must receive inputs greater than unit A's by an amount large enough to overcome this inhibition. Thus a collateral network may not resolve small differences in input patterns.
1) inhibitory inputs. Thus active and inactive units differ by only a single inhibitory input. In the current-source network, the magnitude of this difference is always a constant. However, with reversal-potential synapses, this difference becomes increasing small as additional inhibitory inputs are added. Combining the effects of excitatory and inhibitory inputs, current-source synapses tend produce large excursions in membrane potential for winning (and losing) units. This results in equilibrium states where units are quite depolarized (or hyperpolarized) with respect to firing thresholds. This highly polarized state requires larger differences in synaptic input to change state than a state where units are all close to threshold. By contrast, the difference in membrane potential between winning and losing CD units with reversal-potential synapses becomes smaller with increasing numbers of inputs. Accordingly, the membrane potential of these units tends to stay quite close to the threshold potential providing for effective WTA computations.
it works as a low-pass filter. In a slow network, fast changes in input do not result in changes in output firing. This effect is potentiated in the highly polarized states produced by current-source synapses. This effect is diminished in networks with reversal potential synapses because the units stay closer to threshold. Somewhat paradoxically, the poor WTA performance can increase coding performance by limiting the number of CD layer state changes, which ultimately reduces the degree of PF-layer activation.
or, metaphorically speaking, reinterpreted
by the CD layer. In this way, the internal state of the network evolves into a higher-order representation that registers not only the occurrences of events but also the order in which they occurred. Creating higher-order representations through recurrence differs substantially from the idea of hierarchical processing commonly ascribed to the visual system. In hierarchical visual processing, low-level information is transformed into progressively more and more complex representations through successive processing of the current events in different neural regions (Ungerleider 1995
). Recursion has the advantage of allowing a single neural region to successively process, and thus reinterpret, the results of its own processing.
; Reiss and Taylor 1991
), central pattern generators in tritonia (Kleinfeld and Sompolinsky 1989
), and basal ganglia (Dominey 1995
; Dominey et al. 1995
; Mitchell et al. 1991
; Wickens and Arbuthnott 1993
). Each of these networks derives a portion of its processing capabilities from computational elements similar to those found in our model, such as lateral inhibition (Dominey 1995
; Granger et al. 1994
; Wickens and Arbuthnott 1993
), working memory loops (Dominey 1995
), and recursion (Dominey 1995
; Mitchell et al. 1991
; Reiss and Taylor 1991
). However, these models also rely on additional elements such as synaptic eligibility traces (Granger et al. 1994
; Wickens and Arbuthnott 1993
), refractory periods (Wickens and Arbuthnott 1993
), widely distributed neural time constants (Dominey 1995
; Reiss and Taylor 1991
), and transmission delays (Kleinfeld and Sompolinsky 1989
; Wickens and Arbuthnott 1993
). Of these models, the model of saccade generation by Dominey (1995)
bears closest structural resemblance to ours; however, the Dominey model focuses primarily on the decoding, rather than the encoding and registration, phases of the delayed-spatial sequencing task.
; Linsker 1986
; von der Malsburg 1973
). In another example, a recurrent network by Jordan (1986)
, which bears passing resemblance to our model, uses a variant of backpropagation to transform spatial inputs from "plan" units into complex sequences of phonemes. Our model, which uses random rather than trained synapses, is a departure from this general approach. Rather than focusing on the learning, we have focused on the inherent sequence encoding abilities of the cortical-basal ganglionic architecture. The implication is that the brain has an innate ability to encode serial events into integrated concepts represented in spatial patterns of neural activity. This is not to say that adaptive mechanisms do not play a role; for example, they would be quite useful for tuning the competitive pattern classification stage so as to improve the encoding performance of the network. This might provide a more efficient code or it might emphasize serial events that are of particular relevance to the organism.
). Neuromodulators generally are thought to effect changes extrasynaptically. Thus rather than adjusting synaptic efficiency, many mechanisms appear to modulate the excitability and bias of the pre- and postsynaptic neurons by altering ionic conductances. The computational question is can changes in excitability be equated with changes in MAX-RANGE parameters? To a first approximation, the answer to this question is probably yes. For example, consider the neuromodulatory action of either a pre- or postsynaptic muscarine-sensitive potassium channel (Calabresi et al. 1993
; Caulfield et al. 1993
; Lovinger and Tyler 1996
). Modulation of potassium channels also can be effected by dopaminergic inputs via the D1 or D2 families of receptor subtype (Surmeier and Kitai 1993
). Serotonin, a global modulator, also could play an important role in such a modulatory system because it appears to exert both an excitatory action on the striatum and stimulate dopamine release from terminals (Jacobs and Azmitia 1992
).
; Ward and Brown 1996
), this could guide corticostriatal weights into an effective region of the parameter space during an encoding task.
). These channels are affected by changes in the dopamine-cholinergic balance via a mechanism involving the D2 receptor subtype found in cholinergic interneurons. The latter exist in a state of tonic inhibition (Lehmann and Langer 1983
), and a decrease in striatal dopamine levels releases them to produce an increase in acetylcholine (ACh). Simulations suggest that an increase in potassium conductance, produced by high levels of ACh, serves to deemphasize the effects of GABAergic synapses and thus lowers the level of striatal competition (Wickens et al. 1991
). Conversely, decreased potassium conductances, produced by high levels of dopamine, enhances competition. Although important details such as the sign and magnitude of this effect remain the subject of debate, the end result appears somewhat equivalent to that of modulating the synaptic weights of the CD layer's GABAergic synapses.
defined by spatial patterns of PF activation. This assumption ignores firing rate codes and temporal firing codes that also could be used by an assembly of neurons to encode information. However, in attempting to capture the essential character of system interactions, we chose to simplify many of the physiological and anatomic features that would support such alternative codes. For example, the model's PF units respond in a manner that is essentially binary. Accordingly, no information can be coded in their firing frequencies. In addition, our analysis only considers the steady-state portion of the PF response: thus ignoring any information that might be contained in the nuances of the transient states of the units. Even with these two major simplifications, there are
215 potential ways in which a neuron could respond to this set of 15 contexts. The bistable design of the cortical-thalamic loops, which sustain PF activations once elevated, reduces the realm of possibilities further to 1,000 different sets of responses, of which only 190 are operationally distinct.
, 1990
; Fuster and Alexander 1971
; Goldman-Rakic 1995
; Goldman-Rakic et al. 1990
; Petrides 1991
). The activity does not decay and, in fact, often shows a slight increase over the delay period (Funahashi et al. 1989
). Area 46 neurons displaying delay period activity are activated preferentially by targets at specific retinotopic positions termed memory fields (Goldman-Rakic 1995
). Similar sustained delay-period activity that is highly selective for particular shapes, colors, and scenes also have been reported in the temporal cortex during similar short-term memory tasks (Miyashita and Chang 1988
).

View larger version (31K):
[in a new window]
FIG. 9.
Model receptive fields analyzed with respect to a spatial working memory task. Columns of shaded squares define the 4 operationally distinct receptive fields of a working memory task with gray squares indicating the activation of PF units. Figure lists the number and percent of 9,076 task-related model units displaying each receptive field.
). This paradigm used two cues, A and B, presented in the order AB or BA. The authors found that 58% of their task-related units displayed what could be considered Rank1(X) behavior, 21% Cue(X), and 21% pure Rank1. By contrast, the model units displayed these fields 6, 0.82, and 2.9% of the time, respectively. However, reanalyzing the portion of the model data spanned by this two-cue paradigm, the fractions adjust considerably. First, note that in the two-cue paradigm, only nine different receptive fields are possible. Of these, only six are operationally distinct as listed and defined in Fig. 10. Using the receptive field definitions of Fig. 10 to reclassify the model units, we see a substantial increase in the number of units fitting the Cue(X), Rank1(X), and pure Rank1 descriptions. Note that while rank-dependent responses to the second cue are common in the model, none were observed in the experimental study by Funahashi et al. (1993)
. One possible explanation for this is that in the two-cue paradigm, the second cue does not provide any additional information to the monkey. In other words, once the first cue is presented, the identity of the second cue is strictly determined and need not be encoded by the monkey. A decrease of attentional modulation during the second cue could result in a decrease or elimination of Seq2(XY) and pure Rank2 responses.

View larger version (25K):
[in a new window]
FIG. 10.
Model receptive fields reanalyzed with respect to a 2-cue sequential paradigm. Columns of shaded squares define the 6 operationally distinct receptive fields of a 2-cue paradigm. The figure lists the number and percent of 12,170 task-related model units displaying each receptive field as well as that of experimentally observed units for comparison (source: Funahashi et al. 1993
).
; Kermadi and Joseph 1995
; Kermadi et al. 1993
). Similar to the paradigm applied to the model, each of these studies used six sequences of three cues. However, because the paradigm only involves three different cues, once the first two cues of the sequence arrive, the third is determined. Although the model pays equal attention to all three cues in the sequence, the monkey need not. Thus, experimentally, only 9 of the 15 contexts are salient. When responses are concentrated on the first two cues, then there are only 29 (i.e., 512) possible receptive fields. Adding the restriction imposed by the latching cortical-thalamic loops, this leaves 125 classes, of which, only 30 are operationally distinct. Unlike the working memory and two-cue tasks, many of these 30 classes are not readily describable by simple names such as Cue(X) or Rank2; rather, the response contingencies represented by these classes are often quite complex. When the model units are reclassified according to these 30 classes, 46% of the responses fall into experimentally observed classes. Figure 11 lists these experimentally observed receptive fields along with the number and percent of model units displaying them.

View larger version (33K):
[in a new window]
FIG. 11.
Model receptive fields reanalyzed with respect to a 3-cue sequential paradigm where the 3rd target is not salient. Columns of shaded squares define experimentally observed receptive fields during 3 target delayed sequencing task. Figure lists the number and percent of 15,756 task-relatedmodel units displaying these receptivefields. Note that Cue(X) is equivalent to Rank1(X) + Rank2(X) in this task.
; Funahashi et al. 1990
). Such mixed responses of the PF, which could be produced in units receiving both recurrent and event-related inputs, have been left out of the model for simplicity.
), also would promote phasic-tonic membrane potential profiles.
). This departure reflects the model's simplified input structure and threshold activation functions. We chose to use labeled-line inputs instead of spatially tuned responses because we wanted to emphasize the serial, as opposed to spatial, aspects of the task. Certainly, spatial-tunings would arise if a course-coded input layer was used instead: modeled, perhaps, after the retinotopic responses of area 7a of the posterior parietal cortex. Once again, as mentioned above, linear activation functions allow such graded responses.
; Strick et al. 1995
). In Fig. 12, right, we illustrate the additional observations (Middleton and Strick 1997b
) that PF and M1 also have loops through the cerebellum. Note that the three loops through basal ganglia and the two through cerebellum each involve segregated groups of cells in GPi and in the dentate nucleus (DN), respectively.

View larger version (73K):
[in a new window]
FIG. 12.
A schematic illustration of how the model's architecture could be generalized to other areas of the cortex and also integrated into a larger schema of sensorimotor processing. Left: 3 segregated cortical-basal ganglionic loops through the PF, supplementary motor area (SMA), and primary motor cortex (M1). Right: 2 segregated channels for information processing connecting the dentate nucleus (DN) of the cerebellum with the PF and M1. Signals within segregated channels could be shared via cortical-cortical connections.
). M1 neurons recorded in the same task exhibited yet shorter bursts that were sequence independent and movement specific. Evidence that basal ganglionic networks might participate in the generation of the SMA bursts comes from the recording in GPi neurons of pausing patterns of discharge with similar sequence-dependent properties (Mushiake and Strick 1995
). The short bursts of sustained discharge in M1 correspond to the relatively brief durations of the movements, although some units also discharge during the waiting periods that were imposed between the individual movements of a sequence. We propose that the brief, movement-related bursts in M1 are analogous, though on a shorter time scale, to the long-duration sustained discharges that encode serial order in PF. We further propose that the intermediate-duration, sequence-specific discharge is the appropriate analogy in SMA. In addition to being of shorter duration than in PF, the bursts in M1 and SMA are movement related and are not involved in encoding serial order. Instead, they seem to be involved in the generative decoding process mentioned in the INTRODUCTION.
). In addition, the substantial working memory discharge found in the dorsomedial thalamic nucleus (Fuster and Alexander 1973
), which relays cerebellar input to PF (Kuroda et al. 1993
), suggests that the cerebellar loop also may be important in producing sustained PF activity. In contrast, a cerebellar channel subserving the SMA currently lacks demonstration (Middleton and Strick 1997a
), so the sequence-specific sustained discharge in SMA probably relies more on the alternative mechanisms.
). In addition, 25% of the population discharges during the waiting period, and this discharge specifies the next movement in a sequence-independent manner. We suggest that the striatal target of M1 uses convergent input from SMA (Inase et al. 1996
) to classify, and thus select, the appropriate cells to discharge in the waiting period. These discharges, combined with another convergent input reflecting the auditory "go" signal, then might be used to initiate the more intense movement-related burst discharge. Recursion in the cortical-basal ganglionic loop could facilitate the recruitment of a sufficient and appropriate population of neurons to command the specified movement.
; Houk et al. 1993
). This may reflect a degree of redundancy in the overall system or simply errors in our interpretation. We wish to stress that the concepts formulated here should be treated as testable working hypotheses, advanced to illustrate the substantial potential of cortical-basal ganglionic architectures for analyzing and controlling serial motor behaviors.
| |
ACKNOWLEDGEMENTS |
|---|
The authors are grateful to Drs. Andrew Barto and Sara A. Solla for comments on the manuscript.
This work was supported by National Institute of Mental Health Grant P50-MH-48185.
| |
FOOTNOTES |
|---|
Address for reprint requests: J. C. Houk, Dept. of Physiology, M211, Ward Building 5-315, 303 E. Chicago Ave., Chicago, IL 60611-3008.
Received 4 November 1996; accepted in final form 12 February 1998.
| |
REFERENCES |
|---|
|
|
|---|
basal ganglia system in primates. Crit. Rev. Neurobiol. 317-356, 1996.This article has been cited by other articles:
![]() |
K. W. McCairn, M. Bronfeld, K. Belelovsky, and I. Bar-Gad The neurophysiological correlates of motor tics following focal striatal disinhibition Brain, June 8, 2009; (2009) awp142v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. P. Blomeley, L. A. Kehoe, and E. Bracci Substance P Mediates Excitatory Interactions between Striatal Projection Neurons J. Neurosci., April 15, 2009; 29(15): 4953 - 4963. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. G. ASHBY and J. G. WALDSCHMIDT Fitting computational models to fMRI data Behav Res Methods, August 1, 2008; 40(3): 713 - 721. [Abstract] [PDF] |
||||
![]() |
S. Taverna, E. Ilijic, and D. J. Surmeier Recurrent Collateral Connections of Striatal Medium Spiny Neurons Are Disrupted in Models of Parkinson's Disease J. Neurosci., May 21, 2008; 28(21): 5504 - 5512. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. T. Moyer, J. A. Wolf, and L. H. Finkel Effects of Dopaminergic Modulation on the Integrative Properties of the Ventral Striatal Medium Spiny Neuron J Neurophysiol, December 1, 2007; 98(6): 3731 - 3748. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.C Houk, C Bastianen, D Fansler, A Fishbach, D Fraser, P.J Reber, S.A Roy, and L.S Simo Action selection and refinement in subcortical loops through basal ganglia and cerebellum Phil Trans R Soc B, September 29, 2007; 362(1485): 1573 - 1583. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J Frank, A. Scheres, and S. J Sherman Understanding decision-making deficits in neurological conditions: insights from models of natural action selection Phil Trans R Soc B, September 29, 2007; 362(1485): 1641 - 1654. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Botvinick and T. Watanabe From Numerosity to Ordinal Rank: A Gain-Field Model of Serial Order Representation in Cortical Working Memory J. Neurosci., August 8, 2007; 27(32): 8636 - 8642. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. D. Humphries, R. D. Stewart, and K. N. Gurney A Physiologically Plausible Model of Action Selection and Oscillatory Activity in the Basal Ganglia J. Neurosci., December 13, 2006; 26(50): 12921 - 12942. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. J. Plessen, R. Bansal, H. Zhu, R. Whiteman, J. Amat, G. A. Quackenbush, L. Martin, K. Durkin, C. Blair, J. Royal, et al. Hippocampus and Amygdala Morphology in Attention-Deficit/Hyperactivity Disorder. Arch Gen Psychiatry, July 1, 2006; 63(7): 795 - 807. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. R. Marchand and V. Dilda New Models of Frontal-Subcortical Skeletomotor Circuit Pathology in Tardive Dyskinesia Neuroscientist, June 1, 2006; 12(3): 186 - 198. [Abstract] [PDF] |
||||
![]() |
A. Leblois, T. Boraud, W. Meissner, H. Bergman, and D. Hansel Competition between feedback loops underlies normal and pathological dynamics in the basal ganglia. J. Neurosci., March 29, 2006; 26(13): 3567 - 3583. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Shima and J. Tanji Binary-Coded Monitoring of a Behavioral Sequence by Cells in the Pre-Supplementary Motor Area J. Neurosci., March 1, 2006; 26(9): 2579 - 2582. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. S. Simo, C. M. Krisky, and J. A. Sweeney Functional Neuroanatomy of Anticipatory Behavior: Dissociation between Sensory-driven and Memory-driven Systems Cereb Cortex, December 1, 2005; 15(12): 1982 - 1991. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. C. Price, A. L. Jefferson, J. G. Merino, K. M. Heilman, and D. J. Libon Subcortical vascular dementia: Integrating neuropsychological and neuroradiologic data Neurology, August 9, 2005; 65(3): 376 - 382. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Koos, J. M. Tepper, and C. J. Wilson Comparison of IPSCs Evoked by Spiny and Fast-Spiking Neurons in the Neostriatum J. Neurosci., September 8, 2004; 24(36): 7916 - 7922. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Jaeger and H. Haas Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless Communication Science, April 2, 2004; 304(5667): 78 - 80. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Shohamy, C. E. Myers, S. Grossman, J. Sage, M. A. Gluck, and R. A. Poldrack Cortico-striatal contributions to feedback-based learning: converging data from neuroimaging and neuropsychology Brain, April 1, 2004; 127(4): 851 - 859. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Taverna, Y. C. van Dongen, H. J. Groenewegen, and C. M.A. Pennartz Direct Physiological Evidence for Synaptic Connectivity Between Medium-Sized Spiny Neurons in Rat Nucleus Accumbens In Situ J Neurophysiol, March 1, 2004; 91(3): 1111 - 1121. [Abstract] [Full Text] [PDF] |
||||
![]() |
R Vergara, C Rick, S Hernandez-Lopez, J A Laville, J N Guzman, E Galarraga, D J Surmeier, and J Bargas Spontaneous voltage oscillations in striatal projection neurons in a rat corticostriatal slice J. Physiol., November 15, 2003; 553(1): 169 - 182. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. N. Guzman, A. Hernandez, E. Galarraga, D. Tapia, A. Laville, R. Vergara, J. Aceves, and J. Bargas Dopaminergic Modulation of Axon Collaterals Interconnecting Spiny Neurons of the Rat Striatum J. Neurosci., October 1, 2003; 23(26): 8931 - 8940. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Fujii and A. M. Graybiel Representation of Action Sequence Boundaries by Macaque Prefrontal Cortical Neurons Science, August 29, 2003; 301(5637): 1246 - 1249. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Gruber, S. A. Solla, D. J. Surmeier, and J. C. Houk Modulation of Striatal Single Units by Expected Reward: A Spiny Neuron Model Displaying Dopamine-Induced Bistability J Neurophysiol, August 1, 2003; 90(2): 1095 - 1114. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Centonze, C. Grande, A. Usiello, P. Gubellini, E. Erbs, A. B. Martin, A. Pisani, N. Tognazzi, G. Bernardi, R. Moratalla, et al. Receptor Subtypes Involved in the Presynaptic and Postsynaptic Actions of Dopamine on Striatal Interneurons J. Neurosci., July 16, 2003; 23(15): 6245 - 6254. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. K. Goerendt, C. Messa, A. D. Lawrence, P. M. Grasby, P. Piccini, and D. J. Brooks Dopamine release during sequential finger movements in health and Parkinson's disease: a PET study Brain, February 1, 2003; 126(2): 312 - 325. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Czubayko and D. Plenz Fast synaptic transmission between striatal spiny projection neurons PNAS, November 26, 2002; 99(24): 15764 - 15769. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Tunstall, D. E. Oorschot, A. Kean, and J. R. Wickens Inhibitory Interactions Between Spiny Projection Neurons in the Rat Striatum J Neurophysiol, September 1, 2002; 88(3): 1263 - 1269. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Centonze, E. Saulle, A. Pisani, G. Bernardi, and P. Calabresi Adenosine-mediated inhibition of striatal GABAergic synaptic transmission during in vitro ischaemia Brain, September 1, 2001; 124(9): 1855 - 1865. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Dagher, A. M. Owen, H. Boecker, and D. J. Brooks The role of the striatum and hippocampus in planning: A PET activation study in Parkinson's disease Brain, May 1, 2001; 124(5): 1020 - 1032. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Dave and D. Margoliash Song Replay During Sleep and Computational Rules for Sensorimotor Vocal Learning Science, October 27, 2000; 290(5492): 812 - 816. [Abstract] [Full Text] |
||||
![]() |
J. G. Partridge, K.-C. Tang, and D. M. Lovinger Regional and Postnatal Heterogeneity of Activity-Dependent Long-Term Changes in Synaptic Efficacy in the Dorsal Striatum J Neurophysiol, September 1, 2000; 84(3): 1422 - 1429. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Xing and R. A. Andersen Memory Activity of LIP Neurons for Sequential Eye Movements Simulated With Neural Networks J Neurophysiol, August 1, 2000; 84(2): 651 - 665. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Lawrence, L. H. A. Watkins, B. J. Sahakian, J. R. Hodges, and T. W. Robbins Visual object and visuospatial cognition in Huntington's disease: implications for information processing in corticostriatal circuits Brain, July 1, 2000; 123(7): 1349 - 1364. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Hikosaka, Y. Takikawa, and R. Kawagoe Role of the Basal Ganglia in the Control of Purposive Saccadic Eye Movements Physiol Rev, July 1, 2000; 80(3): 953 - 978. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Flores-Hernandez, S. Hernandez, G. L. Snyder, Z. Yan, A. A. Fienberg, S. J. Moss, P. Greengard, and D. J. Surmeier D1 Dopamine Receptor Activation Reduces GABAA Receptor Currents in Neostriatal Neurons Through a PKA/DARPP-32/PP1 Signaling Cascade J Neurophysiol, May 1, 2000; 83(5): 2996 - 3004. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Luo and D. J. Perkel A GABAergic, Strongly Inhibitory Projection to a Thalamic Nucleus in the Zebra Finch Song System J. Neurosci., August 1, 1999; 19(15): 6700 - 6711. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. F. Carpenter, A. P. Georgopoulos, and G. Pellizzer Motor Cortical Encoding of Serial Order in a Context-Recall Task Science, March 12, 1999; 283(5408): 1752 - 1757. [Abstract] [Full Text] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |