|
|
||||||||
1Department of Technology, Computational Neuroscience, Institució Catalana de Recerca i Estudis Avançats, Universitat Pompeu Fabra, Barcelona, Spain; and 2Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
Submitted 19 October 2004; accepted in final form 27 January 2005
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
New observations from a number of cognitive neuroscience experiments led to a promising account of attention termed the "biased competition hypothesis, " which aims to explain the computational processes governing visual attention and their implementation in the brain's neural circuits and neural systems. According to this hypothesis, attentional selection operates in parallel by biasing an underlying competitive interaction between multiple stimuli in the visual field toward one stimulus or another, so that behaviorally relevant stimuli are processed in the cortex whereas irrelevant stimuli are filtered out (Chelazzi 1998
; Chelazzi et al. 1993
; Duncan 1996
; Reynolds and Desimone 1999
). Thus attending to a stimulus at a particular location or with a particular feature biases the underlying neural competition in a certain brain area in favor of neurons that respond to the location, or the features, of the attended stimulus. This attentional effect is produced by generating signals in areas outside the visual cortex that are then fed back to extrastriate visual cortical areas, where they bias the competition such that when multiple stimuli appear in the visual field, the cells representing the attended stimulus win, thereby suppressing the firing of cells representing distracting stimuli (Desimone and Duncan 1995
; Duncan 1996
; Duncan and Humphreys 1989
; Reynolds et al. 1999
). According to this line of work, attention appears as a property of competitive/cooperative interactions that work in parallel across the cortical modules. Neurophysiological experiments are consistent with this hypothesis in showing that attention serves to modulate the suppressive interaction between the neuronal firing elicited by 2 or more stimuli within the receptive field (Chelazzi 1998
; Miller et al. 1993
; Motter 1993
; Reynolds and Desimone 1999
; Reynolds et al. 1999
). Further evidence comes from functional magnetic resonance imaging (fMRI) in humans (Kastner et al. 1998
, 1999
), which indicates that when multiple stimuli are present simultaneously in the visual field, their cortical representations within the object-recognition pathway interact in a competitive, suppressive fashion, which is not the case when the stimuli are presented sequentially. It was also observed that directing attention to one of the stimuli counteracts the suppressive influence of nearby stimuli.
Neurodynamical models providing a theoretical framework for biased competition have been proposed and successfully applied in the context of attention and working memory. In the context of attention, Usher and Niebur (1996)
introduced an early model of biased competition to explain the attentional effects in neural responses observed in the inferotemporal cortex, and this was followed by a model for V2 and V4 by Reynolds et al. (1999)
based on the shunting equations of Grossberg (1988)
. Deco and Zihl (2001)
extended Usher andNiebur's model to simulate the psychophysics of visual attention by visual search experiments in humans. Their neurodynamical formulation is a large-scale hierarchical model of the visual cortex whose global dynamics is based on biased-competition mechanisms at the neural level. Attention then appears as an effect related to the dynamical evolution of the whole network. This large-scale formulation has been able to simulate and explain in a unifying framework visual attention in a variety of tasks and at different cognitive neuroscience experimental measurement levels: single cells (Deco and Lee 2002
; Rolls and Deco 2002
), fMRI (Corchs and Deco 2002
, 2004
), psychophysics (Deco and Rolls 2004
; Deco et al. 2002
), and neuropsychology (Deco and Rolls 2002
; Heinke et al. 2002
). In the context of working memory, further developments (Deco and Rolls 2003
; Deco et al. 2004
; Szabo et al. 2004
) managed to model in a unifying form attentional and memory effects in the prefrontal cortex integrating single-cell and fMRI data, and different paradigms in the framework of biased competition.
In spite of the successful application of the biased-competition principle in large-scale cognitive neuroscience modeling, a detailed dynamical analysis of the underlying synaptic and spiking mechanisms is still missing. The existing models cited above are all based on rate models [apart from the one-layer model of Usher and Niebur (1996)
, considered further in the DISCUSSION], which simplify and capture the qualitative behavior of the dynamics, but are not quantitatively directly related with the underlying synaptic and spiking neural activity. Moreover, because the biased competition effect is essentially a dynamical effect emerging from a complex system (even in the most simple and minimal case), a mere qualitative description of the dynamics could be very wrong and misleading (e.g., the whole system could show different behavior if the underlying nonlinearity describing the single neurons and their interconnections is wrong or has incorrect time constants and delays). A detailed parameter space exploration of the different dynamical attractors of a neural system, even for the most simple and minimal neural circuit involved in biased competition as set out by Reynolds et al. (1999)
, is also still missing and could yield extremely relevant information about the role of cooperation mechanisms, the roles of feedforward and feedback interactions between neural layers (with each layer typically a different cortical area in the brain), and the role of the different synaptic components [such as
-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid (AMPA) vs. N-methyl-D-aspartate (NMDA)] in biased competition.
The aim of this paper is to perform a detailed analysis of the synaptic and neural spiking dynamics underlying biased competition. We study a biologically plausible minimal cortical circuit in the framework of the standard biased-competition experiments (Reynolds et al. 1999
). We perform a detailed analysis of the dynamical capabilities of the system by exploring the stationary attractors in the parameter space by a mean-field reduction consistent with the underlying synaptic and spiking dynamics. The nonstationary dynamical behavior, for direct comparison with neuronal recording experiments, is studied by a dynamically realistic synaptic and spiking model. This allows us to discover: 1) the particular role of cooperative and competitive effects in the dynamics of biased competition, 2) the role of feedforward and feedback interactions between cortical modules, 3) why the feedback connections are weaker by a factor of about 2.5 than the feedforward connections, and 4) the role of NMDA synaptic connections for biased competition and attention. Furthermore, we are particularly interested in investigating the interaction between (top-down) attention and (bottom-up) stimulus contrast effects in the light of the recent experimental results obtained by Reynolds and Desimone (2003)
and Martinez-Trujillo and Treue (2002)
. These results were obtained in a paradigm that allowed bottom-up and top-down cortical interactions to be analyzed because both were altered parametrically. A detailed and biologically plausible investigation of the contrastattention modulation paradigm gives us insight into the implementation of attention, and in particular about whether attention results from an effect caused by an explicitly multiplicative contrast gain modulation effect at the neuronal level [as suggested by Martinez-Trujillo and Treue (2002)
], or can be explained just by external biasing synaptic interactions operating on neurons with nonlinear response functions.
| METHODS |
|---|
|
|
|---|
Several experimental results of single-cell recording studies in monkeys are consistent with the biased-competition hypotheses in showing that attention serves to modulate the suppressive interaction between 2 or more stimuli within the receptive field (Chelazzi 1998
; Chelazzi et al. 1993
; Miller et al. 1993
; Moran and Desimone 1985
; Motter 1993
, 1994
; Reynolds and Desimone 1999
; Sato 1989
; Spitzer et al. 1988
). Moran and Desimone (1985)
showed that the firing activity of visually tuned neurons in the cortex depended on the location of the target stimulus to which the monkeys were instructed to attend. Based on results of this type, the spatial attentional modulation could be described as a gain change and shifting of V4 receptive fields depending on the locus of attention (Connor et al. 1997
; see also Luck et al. 1997
and Reynolds et al. 1999
for V4 and V2).
In particular, we will first concentrate here on the experimental protocol of Reynolds et al. (1999)
because they performed single-cell recordings of V2 and V4 neurons in a behavioral paradigm that explicitly separated sensory processing mechanisms from attentional effects, to test the biased-competition hypothesis more directly. They first examined the presence of competitive interactions in the absence of attentional effects within the receptive field by having the monkey attend to a location far outside the receptive field of the neuron that they were recording. They used oriented bars as visual stimuli. They compared the firing activity response of the neuron, when a single reference stimulus was within the receptive field, with the response when a second, "probe," stimulus was added to the field. When the probe was added to the field, the activity of the neuron was shifted toward the activity level that would have been evoked if the probe had appeared alone. When the reference was an effective stimulus (high response) and the probe was an ineffective stimulus (low response), the firing activity was suppressed after adding the probe. On the other hand, the response of the cell increased when an effective probe stimulus was added to an ineffective reference stimulus. Thus the response of a V4 neuron to 2 stimuli in its field is not the sum of its responses to both, but rather is a weighted average of the responses to each stimulus alone. Attentional modulatory effects have been independently tested by repeating the same experiment but now having the monkey attend to the reference stimulus within the receptive field of the recorded neuron. The effect of attention on the response of the V2 or V4 neuron was to almost compensate the suppressive or excitatory effect of the probe. That is, if the probe caused a suppression of the neuronal response to the reference when attention was outside the receptive field, then attending to the reference restored the neuron's activity to the level corresponding to the response of the neuron to the reference stimulus alone. Symmetrically, if the probe stimulus had increased the neuron's level of activity, attending to the reference stimulus compensated the response by shifting the activity to the level that had been recorded when the reference was presented alone.
In a second step, we will also consider manipulation of the contrast of the visual stimuli as used by Reynolds and Desimone (2003)
and Martinez-Trujillo and Treue (2002)
to analyze the mutual interaction between top-down attentional effects and (bottom-up) stimulus contrast effects. They used the biased-competition design explained above presenting 2 stimuli (reference and probe) simultaneously. The experiment of Reynolds and Desimone (2003)
(again using oriented bars as stimuli) showed that in the absence of attention, increasing the physical contrast of one stimulus caused V4 neurons to respond preferentially to that stimulus, and reduced their responses to the competing stimulus. On the other hand, when attention was directed to the lower-contrast stimulus, it partially overcame the influence of a competing, higher-contrast stimulus. Furthermore, with a similar design, but using middle temporal (MT) neurons and presenting simultaneously both a preferred and a nonpreferred stimulus that consisted of moving dots in a given direction, Martinez-Trujillo and Treue (2002)
showed that when attention is directed to the nonpreferred competing stimulus, the response of the neuron decreased with respect to the case when attention was not allocated on either of both stimuli. This attentional effect was maximal at intermediate contrast of the preferred stimulus.
The network model
We analyze the synaptic and spiking mechanisms underlying biased competition for the experimental design described in the previous section by introducing a minimal model of the dynamics between the 2 cortical brain areas involved (see Fig. 1). These 2 cortical areas correspond to V2 and V4 for the Reynolds et al. design (Reynolds and Desimone 2003
; Reynolds et al. 1999
) and to V1 (or V3) and MT for the Martinez-Trujillo and Treue (2002)
design. Both cortical areas have the same internal architecture, and implement a dynamical competition between different neurons. Each cortical area contains NE (excitatory) pyramidal cells and NI inhibitory interneurons. In our simulations, we use NE = 800 and NI = 200, consistent with the neurophysiologically observed proportion of 80% pyramidal cells versus 20% interneurons (Abeles 1991
; Rolls and Deco 2002
). In each cortical area, the neurons are fully connected (with synaptic strengths as specified below). Neurons in each cortical area of the network shown in Fig. 1 are clustered into populations or pools.
|
Because in our model the specific V4 pools have overlapping receptive fields, whereas the specific V2 pools do not have overlapping receptive fields, we consider that the level of competition in V4 is higher than that in V2. This is because the inhibition in both layers is local and therefore the stronger the neighborhood relationship, the stronger the inhibition. Consequently, in topographically organized layers (such as visual cortex layers), the greater degree of overlapping of the receptive fields, the stronger the competition. We thus use in our simulations wI = 1 and wI' = 1.25. The connection strength between 2 neurons in 2 different specific excitatory pools in the same layer is weak and given by w = 1 f(w+ 1)/(1 f), so that the overall recurrent excitatory synaptic drive in the spontaneous state remains constant as w+ is varied (Brunel and Wang 2001
). Neurons in a specific excitatory pool are connected to neurons in the nonselective pool in the same layer with a feedforward synaptic weight w = 1 and a feedback synaptic connection of weight wn = (fJb fKb)/(1 2f) + w in layer V4 and wn' = (fJf fKf)/(1 2f) + w in layer V2, and these connections normalize each layer so that the overall recurrent excitatory synaptic drive in the spontaneous state remains constant as the external cortical connections Jf, Jb, Kf, and Kb are varied.
Each neuron (pyramidal cells and interneurons) receives Next = 800 excitatory AMPA synaptic connections from outside the network. The external inputs are given by a Poisson train of spikes. To model the background spontaneous activity of neurons in the network (Brunel and Wang 2001
), we assume that Poisson spikes arrive at each external synapse with a rate of 3 Hz, consistent with the spontaneous activity observed in the cerebral cortex (Rolls and Treves 1998
; Wilson et al. 1994
). In other words, the effective external spontaneous background input rate of spikes across the relevant synapses to each cell is
ext = Next x 3 Hz = 2.4 kHz. The presentation of a stimulus is simulated by selectively increasing the external rates afferent to the corresponding specific population in layer V2,
ext =
ext +
in. In our experiments, both S1 and S2 pools in V2 were simultaneously exposed to a stimulus. Attentional biasing is also simulated by selectively increasing the external rates afferent to the corresponding specific population,
ext =
ext +
att, in layer V2 for spatial attention and in layer V4 for object attention. In our simulations we use
in = 250 Hz and
att = 10 Hz.
This cortical architecture implements competition within each cortical layer. In each layer, the competition is biased by the external inputs that could originate either from the external visual stimuli presented (to V2) or from attention. The competition in both layers is also mutually biased by the excitatory connections between brain areas. The whole system is therefore a dynamical system with a complex dynamics resulting from local competition mechanisms at each layer and cooperative mechanisms between cortical layers. A thorough analysis thus requires a biologically plausible implementation of the underlying synaptic and neural mechanisms, incorporating the realistic nonlinearities, and synaptic and membrane time constants. The question now is how to implement this, and which methodology to use to analyze the complex dynamics, to extract the conditions for having a combination of cooperative and competitive neural and synaptic mechanisms so that whole dynamics behaves in the same way as is found in the neurophysiological results.
The synaptic and spiking mechanisms
We assume that a proper level of description at the microscopic level is captured by the spiking and synaptic dynamics of one-compartment, pointlike models of neurons, such as integrate-and-fire models. The realistic dynamics allows the use of realistic biophysical constants (such as conductances, delays, etc.) in a thorough study of the realistic timescales and firing rates involved in the evolution of the neural activity underlying cognitive processes, for comparison with experimental data. We believe that it is essential of a biologically plausible model that the different timescales involved are properly described because the system that we are describing is a dynamical system that is sensitive to the underlying different spiking and synaptic time courses, and the nonlinearities involved in these processes. For this reason, it is convenient to include a thorough description of the different time courses of the synaptic activity, by including fast and slow excitatory receptors (AMPA and NMDA) and
-aminobutyric acid (GABA)inhibitory receptors.
An integrate-and-fire neuron can be described by a basic circuit consisting of a capacitance (the cell membrane capacitance Cm) in parallel with a resistance (the cell membrane resistance Rm) driven by input currents coming from connected neurons. When the voltage across the membrane capacitance reaches a given threshold the circuit is shunted and the neuron generates a spike that is then transmitted to other neurons. The spikes arriving to a given neuron produce postsynaptic excitatory or inhibitory potentials (basically through a low-pass filter formed by the synaptic and membrane time constants) and constitute the incoming input to the neuron. We also include spike-frequencyadapting mechanisms, by Ca2+-activated K+ hyperpolarizing currents (Liu and Wang 2001
), but these are for biophysical realism and to make the time courses shown similar to those measured neurophysiologically, and are not essential to any of the effects described. In the integrate-and-fire neuronal model used the recurrent excitatory postsynaptic currents have 2 components: a fast one mediated by AMPA receptors and a slow one mediated by NMDA receptors. We consider that the NMDA currents have a voltage dependency that is controlled by the extracellular magnesium concentration (Jahr and Stevens 1990
). The inputs from neurons not explicitly modeled in the network considered are mediated by AMPA receptors. The inhibitory currents into both excitatory and inhibitory cells are GABAergic. The mathematical formulation of the integrate-and-fire neurons and synaptic currents as well as the values of the constants used are given in APPENDIX A.
Stationary and nonstationary dynamics
The simulation of a network of integrate-and-fire neurons allows the study of the dynamical behavior of the neuronal spiking rates. However, these simulations are computationally expensive and their results probabilistic, which makes them rather unsuitable for systematic parameter explorations. The standard strategy to solve this problem is to simplify the dynamics by the mean-field approach at least for the stationary conditions (i.e., for periods after the dynamical transients) and to analyze there the different dynamical states. The essence of the mean-field approximation is to simplify the integrate-and-fire equations by replacing, after the diffusion approximation (Tuckwell 1988
), the sums of the synaptic components by the average DC component and a fluctuation term. The stationary dynamics of each population can be described by the population transfer function, which provides the average population rate as a function of the average input current. The set of stationary, self-reproducing rates
i for the different populations i in the network can be found by solving a set of coupled self-consistency equations. This enables a posteriori selection of the parameter region that shows in the dynamics the behavior that we are looking for (e.g., biased competition).
After that, with this set of parameters, we perform the full nonstationary simulations using the true dynamics described only by the full integrate-and-fire scheme. The mean-field study ensures that the dynamics will converge to a stationary attractor that is consistent with what we were looking for (Brunel and Wang 2001
; Del Giudice et al. 2003
). Therefore we used a mean-field approximation to explore how the different operational regimes of the network depend on the values of certain parameters. The mean-field analysis performed in this work uses the formulation derived by Brunel and Wang (2001)
, which is consistent with the network of neurons used. Their formulation departs from the equations describing the dynamics of one neuron to reach a stochastic analysis of the mean first-passage time of the membrane potentials, which results in a description of the population spiking rates as functions of the model parameters. The mathematical framework is summarized in APPENDIX B.
| RESULTS |
|---|
|
|
|---|
We consider first a detailed parameter analysis of the possible stationary states of a simplified model of the attentional effects on competing visual stimuli as found by Reynolds et al. (1999)
and Reynolds and Desimone (1999)
. For this we use the consistent mean-field approximation described in the preceding section and in APPENDIX B. We explore the behavior of the network as a function of the feedforward and feedback synaptic connections between the 2 cortical brain areas described in the model (i.e., as a function of Jf and Jb). With this analysis we aim to characterize the different modes of operation of the network and their robustness, which arise from the complex dynamical interplay between the 2 cortical modules, with cooperation between cortical areas and competition within a cortical area mutually biasing each other.
In the standard experimental design both stimuli are presented simultaneously. We consider this by externally and simultaneously exciting the 2 specific pools S1 and S2. This is done by selectively increasing the external rates afferent to both specific pools S1 and S2 in layer V2, i.e.,
extS1 =
extS1 +
in and
extS2 =
extS2 +
in. (The supraindex denotes the name of the pool.) Let us denote with
Sinoatt the stationary values of the averaged population activity in pool Si under this condition of simultaneous presentation of both visual stimuli in the absence of attention. To examine the effects of attention across neurons, the experimental work computed a change measurement M, in which the difference between the attended (att) and not-attended (noatt) responses is scaled by the size of the not-attended responses. If spatial attention is allocated to the preferred stimulus, the neural activity is enhanced. On the other hand, if spatial attention is allocated to the nonpreferred stimulus the neural activation is partially suppressed. To consider both effects, we computed the same attentional change measurement M on all 4 specific pools in both cortical modules. For this, we also consider the stationary values of the averaged population activity in all specific pools under the condition where spatial attention is allocated to the location corresponding to the stimulus associated with the pool S1 and both stimuli are simultaneously presented. This is done by selectively increasing the external rates afferent to specific pool S1, taking into account the external stimulus and the extra external attentional bias (i.e.,
extS1 =
extS1 +
in +
att) and increasing the external rates afferent to specific pool S2, taking into account only the presence of the external stimulus (i.e.,
extS2 =
extS2 +
in). Let us denote with
Siatt the stationary values of the averaged population activity in pool Si under this condition of simultaneous presentation of both visual stimuli, with spatial attention allocated to stimulus S1. The enhancement effect of attention on the activity of pools S1 in V2 and S1' in V4 is then given by
![]() |
![]() |
![]() |
![]() |
The experimental values reported for attentional enhancement modulation in V2 are about 10% and in V4 about 30% (Reynolds et al. 1999
). On the other hand, the experimental values reported for attentional suppressive modulation are in V2 about 8%, and in V4 about 25% (Reynolds and Desimone 1999
, 2003
; Reynolds et al. 1999
). To consider all attentional effects in one measure MBC, we define a modulation index that incorporates all these experimental quantitative values into one, which is given by
![]() |
Figure 2 shows the parameter exploration for the connection strengths between cortical areas plotting the attentional modulation measure MBC for the stationary states as a function of the feedforward and feedback V2V4 synaptic connections Jf and Jb. Figure 2A shows a 3-dimensional (3D) plot that reflects a narrow parameter region where a delicate dynamical equilibrium between intracortical competition (in each layer) and mutual cooperation between cortical areas yields biased competition according to the quantitative experimental observation. This narrow region, where MBC is close to 1, is around the point "A" where the optimal value of MBC is with Jf = 1.5 and Jb = 0.6. This result allows us to conclude 2 important facts. First, the region of the parameter space for the connection strengths between cortical areas where the system shows biased competition according to the experimental modulation and response values is very narrow. This implies a delicate dynamical interplay between interarea cortical cooperation and intraarea cortical competition. Second, these results show that the feedback interarea cortical interactions (at least in the visual cortex) must for optimal performance be weaker (by a factor of about 1.5/0.6 = 2.5) than the feedforward connections, which is a frequent assumption in the neurophysiological literature but not based on quantitative analysis of the dynamics. Moreover, it is with this ratio for feedforward strength/top-down strength of about 2.5 that the interaction effects between top-down attention and bottom-up contrast changes are found (described in the next section).
|
Figure 2C shows the attentional modulation measure M plotted separately for V4 (top, indicated by a ') and for V2 (bottom), for both the neuronal pools selective for S1 (left) and for S2 (right) (see RESULTS for definitions). Figure 2C thus shows the 4 components of MBC, as defined above, and shows that attentional enhancement is largely restricted to a C-shaped region in which the feedforward connection strength is nearly twice the feedback connection strength. At those values, for our modeled neurons responding to stimulus 1, we see that there is up to a 30% increase in the responses of V2 cells and up to a 25% increase in the responses of V4 cells, consistent with reported attentional effects. Figure 2C thus confirms that the attentional modulation is operating correctly in both V4 and V2, and for both the neuronal populations selective to S1 and those selective to S2. The attentional modulation is qualitatively similar in V4 and V2, consistent with the fact that both areas operate with internal competition, which is influenced by both bottom-up and top-down external inputs.
Figure 3 shows the nonstationary behavior of the neurodynamical activity in the full spiking and synaptic simulation of the network for the particular point "A" of the region showing biased competition. The simulation corresponds to the experimental design of Reynolds et al. (Reynolds and Desimone 1999
; Reynolds et al. 1999
). After a period of spontaneous activity of 100 ms without stimulation, the stimuli are presented for 250 ms. After that the stimuli disappear again and a period of 250 ms is shown. Figure 3A plots the development of the firing rate activity for V4 specific neurons tuned to the preferred stimulus showing that the attended stimulus controls the response of the neuron. The rates were calculated by averaging the responses over 20 trials of all the neurons (80) in the pool of specific V4 neurons responding to the preferred stimulus. The line in the middle shows the response when the 2 stimuli are shown, a preferred (good or effective) stimulus and a nonpreferred (poor) stimulus, with attention directed away from the receptive field ("Pair" condition). The line at the top shows the response when the 2 stimuli are presented together, with attention directed to the good stimulus ("Pair+Attend Good" condition). An attentional enhancement is observed. The line at the bottom shows the response when the 2 stimuli are presented together, with attention directed to the poor stimulus ("Pair+Attend Poor" condition). An attentional suppression is observed. Figure 3B plots the rastergrams of randomly selected neurons for each pool in the network (5 neurons for each pool). Two conditions are shown: the Pair condition (Without Attention) and Pair+Attend Good condition (With Attention). The spatiotemporal spiking activity shows the up-regulation of the spiking activity in the V2 and V4 neurons whose preferred stimulus (labeled Ref) is attended, and also simultaneously the down-regulation of spiking activity in the V2 and V4 neurons whose nonpreferred stimulus (labeled Probe) is attended (see Fig. 3B). The corresponding plots for the nonstationary behavior of the neurodynamical activity of the network for the particular point "C" of the region showing no biased competition are shown in Fig. 4.
|
|
To investigate the interactions between bottom-up visual salience information (such as that influenced by stimulus contrast) and attentional top-down information, Reynolds and Desimone (2003)
and Martinez-Trujillo and Treue (2002)
performed variations of the standard biased-competition design by manipulating the contrast of one of the competing stimuli, as described above (see METHODS). The special relationship found in the neurophysiological experiments between contrast and attention suggested (Martinez-Trujillo and Treue 2002
) that attention provokes a multiplicative change of the neural contrast gain [see Figs. 1 and 3 of Reynolds and Desimone (2003
)]. To understand the interactions found, we simulated both experiments as follows.
The design of the experiments and the different measures as implemented in our simulations are shown graphically in Fig. 5. Figure 5A shows the design of Reynolds and Desimone (2003)
. They measured the neuronal response in V4, manipulating the contrast of the nonpreferred stimulus and comparing the response to both stimuli when attention was allocated to the poor stimulus. They observed that the attentional suppressive effect of the competing nonpreferred stimulus is higher when the contrast of that stimulus increases. In our simulations we measured neuronal responses from neurons in pool S1' in V4 to both preferred and nonpreferred stimuli simultaneously presented within the receptive field. We manipulated the contrast of the stimulus that was nonpreferred for the neurons S1' (in the simulation by altering
in to S2). We analyzed the effects of this manipulation for 2 conditions: without spatial attention or with spatial attention on the nonpreferred stimulus S2, implemented by adding an extra bias
att to S2.
|
att on the responses of neurons S1' of the competing nonpreferred stimulus is higher when the contrast of the nonpreferred stimulus increases, as in the original neurophysiological experiments. The top figure shows the response of a V4 neuron to different log contrast levels (abscissa) in the no-attention condition (AO: attending outside the receptive field) and in the attention condition (AI: attending inside the receptive field). The top right part of Fig. 6 shows the difference between both conditions. As in the experimental observations the suppressive effect of the competing nonpreferred stimulus is higher when the contrast of that stimulus increases but, at higher levels of salience (contrast), the top-down attentional effect disappears.
|
We further studied the relevance of the NMDA receptors by repeating the analysis in Fig. 6 (top), but with the time constants of the NMDA receptors set to be the same values as those of the AMPA receptors [and as in Fig. 6 (middle) with the NMDA voltage-dependent effects disabled by setting Mg2+ = 0]. (We again compensated for the effective change of synaptic strength by rerunning the mean-field analysis to obtain the optimal parameters for the simulation.) The results are shown in Fig. 6 (bottom), where it is shown that top-down attentional effects are now substantially reduced. [That is, there is very little difference between the no-attention condition (AO: attending outside the receptive field), and the attention condition (AI: attending inside the receptive field).] This effect is not just because the NMDA receptor system with its long time constant may play a generic role in the operation of the integrate-and-fire system, by facilitating stability and helping to prevent oscillations, because a similar failure of attention to operate normally was also found with the mean-field approach, in which the stability of the system is not an issue. Thus the long time constant of NMDA receptors does appear to be an important factor in enabling top-down attentional processes to modulate correctly the bottom-up effects to account for the effects of attention on neuronal activity. This is an interesting result that deserves further analysis. The mean-field Eq. B2 of APPENDIX B effectively defines the nonlinear transfer function of the neurons, and this will be affected by the long time constant of the NMDA receptors, as can be seen in the preceding equations of APPENDIX B.
Figure 5B shows the design of Martinez-Trujillo and Treue (2002)
. They measured the neuronal response in MT, manipulating the contrast of the preferred stimulus and comparing the response when attention was allocated on that competing nonpreferred stimulus. They observed that the attentional suppressive effect of the competing nonpreferred stimulus is higher when the contrast of the preferred stimulus is at intermediate values. In our simulations we measured neuronal responses in MT from neurons S1' to 2 moving patterns, manipulating the contrast of the preferred stimulus (for neurons S1), and comparing the response from neurons S1' when spatial attention was allocated to the competing nonpreferred stimulus implemented by adding
att to alter the responses of neurons S2.
Figure 7 shows the simulation results for the design of Martinez and Treue. We again found that the attentional suppressive effect implemented through
att of the competing nonpreferred stimulus is higher when the contrast of the preferred stimulus is at intermediate values, as in the original neurophysiological experiments. Part of the significance of this result is that the model parameters including Jf and Jb for this simulation of biased-competition effects in 2-layer networks were constrained to be those discovered to be effective based on the quantitative analyses described above of the model developed to account for the data of Reynolds et al. in V2 and V4. Thus the identical dynamical system can account quantitatively for the MTV1 results of Martinez-Trujillo and Treue (2002)
and also for the V4V2 results of Reynolds and Desimone (2003)
. The results shown in Fig. 7 were replicated in further simulations when the NMDA receptors were made inactive by setting Mg2+ = 0. Thus the nonlinearity of NMDA receptors is not necessary for the gain modulationlike effects of top-down attentional bias.
|
att and of the contrast of one of the stimuli (normalized so that a contrast of 1 refers to the maximum modulation). The attentional modulation effect found is maximal for intermediate levels of contrast.
|
in to S2). We analyzed the effects of this manipulation for 2 conditions: without object attention or with top-down object attention on the nonpreferred stimulus S2', implemented by adding an extra bias
att to S2'. The results of the simulation shown in Fig. 9 lead to the prediction that the attentional-suppressive effect implemented through
att on the responses of neurons S1' of the competing nonpreferred stimulus (S2) is higher when the contrast of the nonpreferred stimulus (S2) is at intermediate values.
|
| DISCUSSION |
|---|
|
|
|---|
The computational model we analyzed here can be used to make specific neurophysiological predictions. For example, one prediction already made relates to the contrastattention paradigm. Instead of manipulating spatial attention as in the experimental designs considered so far in this paper, object attention in a spatial search task could be investigated using the general paradigm described by Chelazzi et al. (1993)
. Introducing into that paradigm extra manipulations of stimulus contrast for one of the competing stimuli (the distractor or the target in the visual search task) should also interact with object attention. The design for this prediction was specified in Fig. 5C, and the predictions of our computational neuroscience model are shown in Fig. 9. The results showed that, again, attentional modulation interacts maximally with the saliency of the input at intermediate contrast. This design is extremely interesting, however, because object attention interacts at the V4 level (consistent with the top-down inputs from representations of objects in the inferior temporal visual cortex; see Rolls and Deco 2002
), and contrast at the V2 level, which is different from the other designs of Reynolds and Martinez where spatial attention also interacted at the V2 level.
The dynamical interplay evident in Fig. 2 and the narrow region of parameter space in which biased-competition effects are found is not intuitive at all and results from the dynamical interactions of a complex system, which can be analyzed only with the theoretical tools and with the biologically plausible and realistic modeling elements that we assumed (i.e., realistic synaptic and spiking dynamics, realistic time constants, and realistic nonlinearities). This is thus a good example of the relevance of computational neuroscience in the analysis of experimental data. For example, the results shown in Fig. 2 indicate that the feedback interarea cortical interactions (at least in the visual cortex) must be weaker (optimally by a factor of about 2.5) than the feedforward connections, which is a frequent assumption in the neurophysiological literature but not based on quantitative analysis of the dynamics.
The analyses presented here extend previous concepts of the role of biased competition in attention (Desimone and Duncan 1995
; Duncan 1996
; Reynolds et al. 1999
; Usher and Niebur 1996
) by providing the first analysis we know at the integrate-and-fire neuronal level that allows the neuronal nonlinearities in the system to be explicitly modeled, to investigate realistically the processes that underlie the apparent gain modulation effect of top-down attentional control. In the integrate-and-fire model, the competition is realized realistically by the effects of the excitatory neurons on the inhibitory neurons, and their return inhibitory synaptic connections. This is also the first integrate-and-fire analysis of top-down attentional influences in vision that explicitly models the interaction of several different brain areas (including V2, V4, IT, V3, and MT in the different simulations). Interesting earlier work by Reynolds and Desimone (1999)
was based on Grossberg's shunting equations for analyzing competition and cooperation (see Grossberg 1988
) and investigated feedforward saliency effects and competition. We considered top-down, biased-competition attentional effects (also considered by Spratling and Johnson 2004
), but also introduced spiking neurons, synaptic dynamics, and a consistent mean-field analysis, and so were able to perform analyses of the contributions of different synaptic mechanisms and different nonlinearities in the system. The interesting work of Usher and Niebur (1996)
reported model-biased competition and attention with a spiking network in a single-layer network, and we introduced here a model that has more than one layer so that the interaction between layers can be modeled, and moreover has synaptic dynamics and a mean-field analysis derived from the spiking formulation [introduced for a one-layer network by Brunel and Wang (2001)
] developed for multiple-layer networks as described in APPENDIX B.
A further part of the originality and interest of the model described here is that in the form in which it can account for attentional effects in V2 and V4 in the paradigms of Reynolds et al. (1999)
in the context of biased competition, the model with the same parameters effectively makes predictions that show that the "contrast gain" effects in MT of Martinez-Trujillo and Treue (2002)
can be accounted for by the same model. In addition, the spiking model allows comparisons of the change in the spiking activity after the stimulus is presented during the transient period before the stable state of the mean field is reached. The rastergrams show simulated neuronal response onsets that are similar to those found neurophysiologically. These detailed and quantitative analyses of neuronal dynamical systems are an important step toward understanding the operation of complex processes such as top-down attention, which necessarily involve the interaction of several brain areas.
| APPENDIX A |
|---|