|
|
||||||||
The Journal of Neurophysiology Vol. 85 No. 5 May 2001, pp. 2130-2149
Copyright ©2001 by the American Physiological Society
1Department of Physiology, 2Department of Otolaryngology, 3Neuroscience Graduate Program, 4W. M. Keck Center for Integrative Neuroscience, and 5Sloan Center for Theoretical Neurobiology, University of California, San Francisco, California 94143-0444
| |
ABSTRACT |
|---|
|
|
|---|
Kayser, Andrew, Nicholas J. Priebe, and Kenneth D. Miller. Contrast-Dependent Nonlinearities Arise Locally in a Model of Contrast-Invariant Orientation Tuning. J. Neurophysiol. 85: 2130-2149, 2001. We study a recently proposed "correlation-based," push-pull model of the circuitry of layer 4 of cat visual cortex. This model was previously shown to explain the contrast-invariance of cortical orientation tuning. Here we show that it can simultaneously account for several contrast-dependent (c-d) "nonlinearities" in cortical responses. These include an advance with increasing contrast in the temporal phase of response to a sinusoidally modulated stimulus; a change in shape of the temporal frequency tuning curve, so that higher temporal frequencies may give little or no response at low contrast but reasonable responses at high contrast; and contrast saturation that occurs at lower contrasts in cortex than in the lateral geniculate nucleus (LGN). In the context of the model circuit, these properties arise from a mixture of nonlinear cellular and synaptic mechanisms: short-term synaptic depression, spike-rate adaptation, contrast-induced changes in cellular conductance, and the nonzero spike threshold. The former three mechanisms are sufficient to explain the experimentally observed increase in c-d phase advance in cortex relative to LGN. The c-d changes in temporal frequency tuning arise as a threshold effect: voltage modulations in response to higher-frequency inputs are only slightly above threshold at lower contrast, but become robustly suprathreshold at higher contrast. The other three nonlinear mechanisms also play a crucial role in this result, allowing contrast dependence of temporal frequency tuning to coexist with contrast-invariance of orientation tuning. Contrast saturation, and the observation that responses to stimuli of increasing temporal frequency saturate at increasingly high contrasts, can be induced both by the model's push-pull inhibition and by synaptic depression. Previous proposals explained these nonlinear response properties by assuming contrast-invariant orientation tuning as a starting point, and adding normalization by shunting inhibition derived equally from cells of all preferred orientations. The present proposal simultaneously explains both contrast-invariant orientation tuning and these contrast-dependent nonlinearities and requires only processing that is local in orientation, in agreement with intracellular measurements.
| |
INTRODUCTION |
|---|
|
|
|---|
The response properties of
simple cells in layer 4 of cat primary visual cortex (V1) serve as a
model system for studying the mechanisms underlying cerebral cortical
processing. These cells are perhaps the best-studied cortical cells and
are the site of emergence of the strong selectivity for stimulus
orientation seen throughout visual cortex (Hubel and Wiesel
1962
).
One of the defining characteristics of simple cells is the largely
linear nature of their responses. Their responses to arbitrary stimuli
can be reasonably well predicted from a weighted sum of stimulus
intensity, where the weighting is given by the cell's receptive field
and negative values of the weighted sum are taken to yield zero
response (DeAngelis et al. 1993
; Hubel and Wiesel 1962
; Jones and Palmer 1987
). As predicted by a
linear response model, the shape of a simple cell's orientation tuning
curve is invariant to changes in stimulus contrast (Sclar and
Freeman 1982
; Skottun et al. 1987
): a change in
contrast scales all responses by a constant, rather than changing the
form of the response tuning curve.
However, other aspects of simple cell responses show a nonlinear
dependence on stimulus contrast (reviewed in Carandini et al.
1998
). In this paper we will examine three such properties: 1) contrast-dependent phase advance: as the contrast of a
sinusoidal grating stimulus increases, the response of a cortical cell
occurs earlier in the stimulus cycle (Albrecht 1995
;
Dean and Tolhurst 1986
); 2)
contrast-dependent temporal frequency tuning: higher temporal
frequencies that yield small or zero responses at low contrast yield
reasonable responses at high contrast (Albrecht 1995
;
Holub and Morton-Gibson 1981
); and 3)
contrast saturation: the change in response amplitude with contrast has
a sigmoidal rather than linear dependence on contrast, saturating at
intermediate contrasts (e.g., Albrecht 1995
). The third
property involves the nonlinear dependence of scaling on contrast. The
first two involve changes in response more complex than a simple
scaling by contrast: responses either move earlier in time (property 1)
or increase differentially across the tuning curve (property 2).
In this paper, we address the question of how a single model circuit, consistent with existing experimental knowledge of cat visual cortex, can simultaneously account for both the linear-like response scaling of contrast-invariant orientation tuning and the above three nonlinear response properties. In principle, accounting for nonlinear response properties in isolation may not be difficult, given the many inherently nonlinear properties of the synapses, cells, and circuits involved. We suggest that the true difficulty lies in simultaneously accounting for both linear-like and nonlinear response properties: how can the underlying nonlinear mechanisms be manifest in some aspects of response and yet simultaneously be hidden in other aspects? Indeed, the difficulty of generating any linear-like responses at all is well illustrated by the contrast-invariance of orientation tuning in response to drifting sinusoidal luminance gratings. LGN cells do not provide linear input to simple cells, because their response rates cannot decrease below zero. As a result, LGN mean firing rates increase with contrast. Under a linear response model, an increase in stimulus contrast would increase the amplitude of temporal modulation of firing rates without affecting mean rates. Furthermore, cortical cells integrate this input through the nonlinearity of a nonzero spike threshold. Due to the increase both in modulations and means of LGN firing rates, a broader range of stimulus orientations should produce suprathreshold LGN input at higher contrasts. Thus the orientation tuning of the LGN input to a simple cell should widen with increasing stimulus contrast.
We have recently demonstrated (Troyer et al. 1998
) that
the contrast-invariance of orientation tuning can be accounted for by
the combination of 1) a simple model intracortical circuit motivated by numerous intracellular studies (e.g., Anderson et al. 2000
; Chung and Ferster 1998
; Ferster
1986
, 1988
; Ferster et al. 1996
;
Hirsch et al. 1998
; Nelson et al. 1994
)
and 2) a "Hubel-Wiesel" (1962)
arrangement of lateral geniculate nucleus (LGN) inputs to simple cells,
in which oriented bands of ON- or OFF-center
LGN inputs provide input to the ON- or
OFF-subregions, respectively, of the simple cell's
receptive field. Here we demonstrate, for the first time, a unified
mechanistic account of both the linear and nonlinear aspects of simple
cell responses. Our previous model incorporated a number of nonlinear
mechanisms, including spike-rate adaptation, contrast-induced changes
in cellular conductance, and the nonzero spike threshold. We now add
one additional nonlinear mechanism, short-term synaptic depression
(Abbott et al. 1997
; Tsodyks and Markram
1997
). We show that the resulting model explains the three
nonlinear properties noted above, while retaining contrast-invariant orientation tuning.
Importantly, this is the first explanation of these properties using a
model circuit that is purely local in orientation (see DISCUSSION for other models). That is, both the excitatory
and the inhibitory intracortical input received by a simple cell comes primarily from cells having similar preferred orientation, as suggested
by numerous experiments in cat V1 (Anderson et al. 2000
; Chung and Ferster 1998
; Ferster 1986
,
1988
; Ferster et al. 1996
; Hirsch
et al. 1998
).
Some of these results have appeared in abstract form (Priebe et
al. 1997
).
| |
MODELING FRAMEWORK |
|---|
|
|
|---|
We begin by summarizing the essential information about our model needed to understand our results. Full details sufficient to replicate our work are in APPENDIX A.
Intracortical circuit
We study a circuit (Troyer et al. 1998
) in which
1) geniculocortical synaptic weights to a cell are described
by Gabor functions (Jones and Palmer 1987
), with
ON-center (OFF-center) inputs corresponding to
positive (negative) portions of the Gabor; and 2)
intracortical connections are made between cortical cells based on the
correlations between their receptive fields (RFs), i.e., between the
geniculocortical synaptic weights they receive. An excitatory cell
makes strong connections onto other excitatory cells with which it is
strongly correlated; an inhibitory cell makes strong connections onto
excitatory cells with which it is strongly anticorrelated. The dominant
resulting connections follow a "push-pull" scheme and are
illustrated in Fig. 1. A crucial
requirement is that inhibition be dominant: the feed-forward inhibitory
pathway LGN
I
E must have stronger overall gain than the
feed-forward excitatory pathway LGN
E (where E and I indicate
excitatory and inhibitory cortical cells, respectively), as assessed by
the mean feed-forward inhibition exceeding mean feed-forward excitation
over a cycle of response to a sinusoidal stimulus. More specifically,
the mean conductance opened by the two pathways over a cycle must have
a sufficiently subthreshold reversal potential to prevent spiking to a
stimulus with orientation orthogonal to a cell's preferred
orientation.
|
This architecture can account for cortical orientation tuning and its
contrast invariance (Troyer et al. 1998
). How does this circuitry account for orientation tuning? For a stimulus at a cell's
preferred orientation and spatial phase, other neurons with similar
preferred orientation and spatial phase (both excitatory and
inhibitory) are strongly activated. However, the inhibition is directed
onto cells with similar preferred orientation but antiphase (opposite
spatial phase) RFs. In the case of a drifting sinusoidal grating of the
preferred orientation, the resulting inhibition received by a cell
comes out-of-phase with its excitation, permitting excitatory cells to
respond during the temporal phase in which more excitation is received
than inhibition (Fig. 1A). As the orientation is shifted
away from the preferred, temporal modulation of both feed-forward
excitation and feed-forward inhibition decreases. Since inhibition is
dominant in the mean, at some orientation the modulation is small
enough that inhibition is dominant at all times, and the cell cannot
fire. In particular, for a stimulus at a cell's null orientation
(perpendicular to the preferred), there is essentially no modulation,
inhibitory neurons of both the cell's preferred phase and the opposite
phase are continuously activated, and thus excitatory cells of both
phases are continuously inhibited (Fig. 1B).
The contrast-invariance of orientation tuning arises because an
increase in contrast equally increases the geniculocortical drive to a
given cell and to the anti-phase cells from which it receives
inhibition. Thus the cutoff orientation (the orientation for which
input modulation is sufficiently small that inhibition dominates
throughout the cycle) remains essentially invariant across contrast. A
more detailed analysis is given in Troyer et al. (1998)
.
Rate model
We studied two forms of model: a conceptual rate model and a more biophysically accurate spiking model. The rate model allowed exploration of the cortical circuit and its elements within a simple framework. This allowed us both to work out the basic mechanisms underlying circuit properties, and to explore a significant portion of the given parameter space, thereby establishing the robustness of these insights. The spiking model, on the other hand, allowed us to establish that the insights gained from the rate model translated to a more detailed, more biophysically realistic setting, and thus provided a verification of the rate model findings. The spiking model also allowed us to examine the role of spike-rate adaptation, which was not easily accommodated in the rate model.
The rate model consisted of 96 excitatory and 96 inhibitory neurons, with RFs of 12 different orientations and 8 different spatial phases, all centered at the same retinotopic point. Connections between cortical neurons were made deterministically based on the correlation between their RFs, as described above. Model neuron firing rates were calculated as the weighted sum of all the input firing rates from geniculocortical, intracortical excitatory, and intracortical inhibitory sources, rectified at a threshold; hence the term, "rate model." The model was described by eight parameters: the thresholds and membrane time constants of excitatory and inhibitory cells, the gains of geniculocortical (G), intracortical inhibitory-to-excitatory (I), and excitatory-to-excitatory (E) cell connections, and a lower bound on the membrane voltage. Appropriate values for these eight variables were obtained by constraining the output of the circuit to match a set of experimental findings, including the width and contrast-invariance of orientation tuning (see APPENDIX A); this set did not include the nonlinear responses properties studied here. In addition, two parameters describe synaptic depression, as described below. For each choice of synaptic depression parameters, we typically show average results over all sets of the other parameters that met these criteria, thus examining the robustness of the results across experimentally reasonable model parameters that are consistent with contrast-invariant tuning.
Spiking model
To expand on the insights obtained from the rate model in a more
biophysically realistic framework, we used the spiking model of
Troyer et al. (1998)
. One thousand six hundred
excitatory and 400 inhibitory neurons were laid out in a 

as explained in Troyer and Miller
(1997a
,b
). Excitatory neurons had spike-rate adaptation
currents. We included only fast
[
-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid (AMPA) and
GABA-A] synaptic currents, deferring examination of slow currents
[e.g., N-methyl-D-aspartate (NMDA) and GABA-B] to future work (e.g., Krukowski 2000
). Again, parameters
were chosen to achieve appropriately narrow, contrast-invariant
orientation tuning, and nonlinear response properties were then studied
(see APPENDIX A). Due to the complexity of the model, we
present results for only a single set of circuit parameters for each
set of synaptic depression parameters used.
Visual stimuli and LGN inputs
Visual inputs to the models were drifting full-field sinusoidal
gratings. LGN responses were assumed to arise from a spike rate that
was the sum of a linear stimulus-induced temporal modulation and a
constant background rate, with rates rectified at zero. Amplitudes of
the stimulus modulation were matched to LGN data on X-cell responses
across contrast and temporal and spatial frequency (Sclar
1987
), as described in APPENDIX A. The rate model used this rate directly as the LGN response, while the spiking model
used Poisson spike trains sampled from these rates.
The geniculocortical synaptic weights to the simple cells in the model layer 4 were described by Gabor functions, with parameters matched to experimental measurements of simple cell RFs. In the rate model, the geniculocortical (G) weights were defined deterministically by the Gabor distribution, with negative Gabor values indicating OFF weights; the spiking model RFs were established probabilistically by sampling from the Gabor distribution.
Synaptic depression
Synaptic depression is a use-dependent decrease in synaptic
efficacy (Abbott et al. 1997
; Markram and Tsodyks
1996
); as the firing rate of a presynaptic neuron increases,
the influence of single synapses from that cell onto the postsynaptic
neuron declines. Intuitively, this relationship holds because higher
firing frequencies prevent recovery from depression between input
spikes, as discussed below.
One can characterize synaptic depression by two parameters:
f, the ratio of the synaptic efficacy immediately after a
presynaptic spike to the efficacy before the spike (0
f
1), and
, the time constant of recovery from
depression. Smaller values for f lead to a greater loss of
synaptic efficacy after every spike; smaller values of
cause faster
recovery from this depression. In both the rate and spiking models,
like forms of depression are used: the rate-model depression equation
is equal to the average, over Poisson-sampled spike trains, of the
spiking-model depression equation (see APPENDIX B), and
their behavior in simulations is qualitatively and quantitatively quite similar.
In the experimental literature, two classes of data appear to be
present: one in which synaptic depression is studied through the use of
paired-pulse stimuli, and one in which depression is characterized by
probing with trains of stimuli (S. Nelson, personal communication).
These two types of experiment result in different measured values for
f and
, which we call the "pulse" and "train" parameters, respectively (Table 1). Given
this experimental uncertainty in parameter values, we examined all
results under both choices of parameters.
|
Contrast-invariance of orientation tuning
As we have mentioned, one of the criteria for selection of our
model parameters was that the resulting circuit should have contrast-invariant orientation tuning. More generally, we have found
that the principles outlined in Troyer et al. (1998)
suffice to robustly produce contrast-invariant tuning across temporal frequencies and in the presence of synaptic depression, two issues not
addressed in the previous work, although we do not discuss this point
further here.
| |
EXPERIMENTAL FINDINGS ADDRESSED |
|---|
|
|
|---|
Having summarized the model circuit, we now summarize the experimental data on response nonlinearities that we will address with this model.
Contrast-dependent phase advance
Simple cells respond earlier in time to drifting gratings as the
contrast of those gratings increases, as quantified by the difference
in the phase of the first harmonic (F1) of the cortical spiking
responses at each contrast (Albrecht 1995
; Dean
and Tolhurst 1986
). We reviewed the literature to determine the
size of this contrast-dependent (c-d) phase advance (Fig.
2). We examined both V1 and LGN c-d phase
advance, because only the difference between these values needs to be
accounted for by cortical mechanisms. In all cases we report the
advance over three octaves of contrast (e.g., the relative advance
between 10 and 80% contrast).
|
For V1 simple cells in the cat, c-d phase advance has been measured for
approximately 30 cells (Dean and Tolhurst 1986
) in one
study, and for over 100 cells in another (Albrecht
1995
). Mean c-d phase advances were comparable: 42° for a
2-Hz grating in the former study, 47 and 49° for 2- and 8-Hz
gratings, respectively, in the latter. In the LGN, X cells show 25°
mean c-d phase advance in response to 8-Hz (Sclar 1987
)
and 3-Hz (Saul and Humphrey 1990
) gratings, while Y
cells demonstrate as much or more c-d phase advance as cortical simple
cells. Both the LGN and cortical measurements are characterized by
large standard deviations. Without a knowledge of the X or Y nature of
the geniculocortical inputs to the cortical cells studied previously,
it is difficult to know how much c-d phase advance the cortex must add,
or even whether it adds any at all. An additional uncertainty is raised
by the fact that we are modeling layer 4, where the first
transformation of LGN inputs occurs. Further cortical transformations
could add more c-d phase advance, so layer 4 might show less c-d phase
advance than the cortical mean; however, the data on cortical cells
were not broken down by layers.
We make perhaps the simplest assumption: cortical layer 4 should
account for the mean difference in c-d phase advance between X cells
and V1 simple cells. This is based in part on observations suggesting
that X cells are the physiologically dominant input in V1
(Ferster 1990a
,b
; Ferster and Jagadeesh
1991
). Thus we assume that layer 4 must account for roughly
20° of c-d phase advance over 3 octaves of contrast. Note that we do
not include LGN c-d phase advance in our simulations, so the
simulations should be compared only to this difference between
experimentally observed LGN and V1 c-d phase advance.
Contrast-dependent changes in temporal frequency tuning
In response to an increase in stimulus contrast, cortical
temporal-frequency tuning curves change their shape. Higher
temporal-frequency stimuli that yield small or zero responses at low
contrast yield reasonable responses at higher contrast. One measure of
this is given by comparing the ratios, at each temporal frequency, of the response at high contrast to the response at low contrast. In data
taken from an LGN X cell [Fig. 3,
top; replotted from Sclar (1987)
], this
ratio is relatively constant across temporal frequencies, although
slightly larger at higher frequencies. This behavior was fairly typical
of 27 X cells studied in Sclar (1987)
. In two cortical
simple cells reported in Albrecht (1995)
, however (one
replotted in Fig. 3, bottom), this ratio increases sharply with increasing temporal frequency: higher temporal frequencies give
very small responses at low contrast, but reasonable responses at
higher contrast. The cortical data for cats is very sparse: we are
aware of only the two cells from Albrecht (1995)
and one additional cell in Holub and Morton-Gibson (1981)
for
which temporal frequency tuning at multiple contrasts is reported; all
three cells show this effect. The effect is also common, although not universal, in monkey V1 cells [M. Hawken, private communication; Hawken et al. 1992
; of 3 published tuning curves, effect
is seen in Carandini et al. (1997)
, Fig. 6 but not Fig.
9 and not seen in Albrecht (1995)
, Fig. 11], suggesting
that a relative boosting with contrast of the high temporal-frequency
portion of the temporal tuning curve may be a common V1 property.
However, there are no data as to whether, or how strongly, this effect
is seen in layer 4 neurons. Moreover, LGN Y cells show a more
pronounced c-d boosting of the high-frequency portion of the tuning
curve (Sclar 1987
) than do X cells. Just as for c-d
phase advance, without knowledge of the relative X and Y cell input to
studied simple cells, it is unclear how much of this boost, if any, is
accomplished by the cortex. We again make the assumption that the
cortex must account for the difference in response between X cells and
V1 simple cells. Last, these data also suggest, as does one published cell in monkeys (Carandini et al. 1997
, Fig. 6), that
increases in contrast might also shift the peak of the temporal
frequency response curve to higher frequencies.
|
It is important to note that the relative boosting of high-frequency
responses by contrast does not correspond to an increase in contrast
gain (the slope of response versus contrast) at higher temporal
frequencies. Plotting the responses at each temporal frequency versus
contrast (Fig. 4) makes clear that this
slope is not enhanced at higher frequencies and, if anything, is
reduced. Another of the three cells in the literature (Holub and
Morton-Gibson 1981
) showed similar contrast gain at high and
low temporal frequencies, but an elevated threshold contrast for higher
temporal-frequency responses. Thus the greater relative amplification
with contrast of responses to higher temporal frequencies arises
because low-contrast responses at higher frequencies are very small,
due to lower contrast gain and/or to elevated contrast threshold, and
not because high-contrast responses show an elevated contrast gain.
|
Saturation of responses with increasing contrast
Simple-cell responses tend to reach a plateau with increasing stimulus contrast (Fig. 4, bottom); this is known as contrast saturation. This cannot be explained by intrinsic saturation of the cell's ability to fire. As evidenced, for example, by the contrast-invariance of orientation tuning, saturation does not occur at a fixed response level, but rather at different responses levels for different stimuli (so that orientation tuning curves are similar in shape at saturating contrasts and at low contrasts). LGN inputs show contrast saturation as well (Fig. 4, top). If LGN input firing does not change with increasing contrast, neither will cortical firing. Thus the question arises of whether cortical saturation level is independent of LGN saturation level.
While the LGN X cell in Fig. 4 indeed saturates at higher contrasts
than the cortical cells in that figure, it is not clear whether this is
a general phenomenon. Contrast saturation can be measured by a
parameter C50: the contrast at which response is
half of the maximal, saturating response (determined from a fit of the
Naka-Rushton equation, Eq. A1 in APPENDIX A, to
the contrast-response curve). In Table 2,
we show the value of C50 for the cell of Fig. 4
and for 1 additional cortical and 5 additional LGN X cells for which we
found contrast response curves in the literature, along with the mean
value reported for over 100 cat cortical simple cells in
Albrecht (1995)
. From these values, it is not obvious
whether cortical cells saturate earlier than LGN cells. The same
uncertainty applies in monkey, where V1 cells saturate over a range of
contrasts similar to the combined saturation ranges of magnocellular
and parvocellular LGN cells (Allison et al. 2000
;
Sclar et al. 1990
).
|
However, the phenomenon of contrast adaptation (Albrecht et al.
1984
; Ohzawa et al. 1985
) strongly suggests a
cortical role in setting contrast saturation levels. Sustained
presentations of low (high) contrast stimuli shift the cortical
response functions to lower (higher) C50s,
without corresponding shifts in the LGN response functions. Both
threshold and saturating contrasts are shifted by adaptation. This
indicates that the cortex can set its saturation level independently of
the level at which LGN responses saturate, and motivates us to explore
the effects of our model circuit mechanisms on cortical contrast
saturation. The strongest components of adaptation operate over time
scales longer than that of any mechanism incorporated in our network
(mean time to 
), so we cannot address these effects.
However, contrast adaptation or related phenomena are seen on multiple
time scales, including short time scales (Bonds 1991
;
Geisler and Albrecht 1992
; Nelson
1991a
,b
) that are within the range of mechanisms studied here
(spike-rate adaptation, synaptic depression, recruitment of dominant
opponent inhibition). Here we address the contributions of these
mechanisms to contrast saturation, while noting that other mechanisms
might be involved in both saturation and adaptation over longer time scales.
The data on contrast saturation also suggest an additional point that
we will address: simple cell responses saturate at higher contrasts as
temporal frequency increases. This effect was noted by Albrecht
(1995)
in discussing the two cells for which temporal frequency
tuning was studied at multiple contrasts, and is shown particularly
prominently by the cortical cell of Fig. 4. Similar findings have been
noted in monkeys (Carandini et al. 1997
).
| |
RESULTS |
|---|
|
|
|---|
Contrast-dependent phase advance
At least three mechanisms can contribute to cortical c-d phase
advance beyond that of the LGN inputs: synaptic depression, spike-rate
adaptation, and contrast-dependent increases in conductance. Synaptic
depression is evoked by the presynaptic spiking response to the grating
stimulus, and differentially suppresses the later portions of the
input, and thus of the postsynaptic response, over each stimulus cycle.
As illustrated in Fig. 5, this shifts the
response peak forward in time. Because the effect of synaptic depression grows with presynaptic firing rate, and thus with contrast, this shift increases with stimulus contrast, yielding a c-d phase advance. Spike-rate adaptation is evoked by postsynaptic rather than
presynaptic spiking response, but otherwise it causes c-d phase advance
for the same reasons as synaptic depression. Finally, as emphasized in
studies of the normalization model (e.g., Carandini et al.
1998
; see DISCUSSION), increases in postsynaptic
conductance cause a decrease in membrane time constant, and this
decrease in integration time causes the phase of responses to advance. If conductance grows with stimulus contrast, this also yields a c-d
phase advance.
|
We first examined the role of synaptic depression. We began by studying
the effects of the depression parameters, f (the fraction of
synaptic strength remaining after each presynaptic action potential) and
(the time constant of depression; Fig.
6). Parametric variations of f
and
were carried out only for the geniculocortical synapses: we
examined the c-d phase advance of the total geniculocortical input to
simple cells in response to optimally oriented spatial gratings
drifting at three temporal frequencies. Depression at geniculocortical
synapses yields c-d phase advances of 5-10° across a broad range of
parameters. We show only rate model results in Fig. 6, as spiking model
results are virtually identical.
|
The dependence of c-d phase advance on f and
can be
understood as follows. A smaller f, representing stronger
depression, induces stronger c-d phase advance, up to a point. Once
f becomes small enough that the synaptic efficacies are
close to zero within the stimulus cycle at some contrast, further
increases in contrast have less and less additional effect, so too
great a reduction in f can decrease the c-d phase advance
(Fig. 6A). Smaller
yields greater recovery from
depression between spikes, hence less depression and less c-d phase
advance. As
increases, the depression becomes stronger and the
phase advance increases, until
becomes comparable to the period of
the stimulus cycle. At this point,
is preventing recovery of
synaptic efficacy between response cycles. Further increases in
have little effect on c-d phase advance: such increases change the
dynamic range over a cycle, lowering the mean synaptic efficacy and
mean response, but do not seem to appreciably alter the time course of
depression and recovery within that dynamic range (of course, as
, the steady-state response level will go to zero, and c-d phase
advance will become undefined). Finally, an increase in temporal
frequency is roughly equivalent to moving the graphs down and to the
left: at higher temporal frequency, there is less time in each cycle
for depression to occur, so a larger f is needed to get an
equivalent amount of depression, and there is less time in each cycle
to recover from depression, so a smaller
gives an equivalent amount
of recovery.
Next, for fixed f and
(set either according to the pulse
or train parameters, Table 1), we examined the relative contributions of synaptic depression at different synaptic loci in the full model
circuit, using the rate model. This model has no spike-rate adaptation
and has a fixed membrane time constant, so only depression should
contribute to the c-d phase advance. Synaptic depression can be found
in any of three locations: in the geniculocortical synapses (G), in the
intracortical excitatory synapses (E), and in the intracortical
inhibitory synapses (I). This yields eight possible configurations for
the locations of depressing synapses. Depression in the I synapses had
little effect on phase advance, so we illustrate the c-d phase advance
produced by the four configurations not involving I as well as for the
case of depression at all locations (Fig.
7). Matching these data across the
different depression conditions is not trivial; one must ensure that
the data are comparable by matching firing rates, for example, or by
using the same set of parameters in all cases. We chose to show the
distribution of results for all model parameter sets that satisfied the
known experimental constraints (see APPENDIX A) at a given
temporal frequency. Similar plots in which we include only model
parameter sets that fit the constraints at all temporal
frequencies give similar results with less variability, but there are
no such parameter sets within our search range for some cases (both
sets of "G" cases, and the train "E" case).
|
As evidenced by Fig. 7, depression of either geniculocortical or intracortical excitatory synapses can induce approximately 5° of c-d phase advance, and these advances sum when depression is present in both locations. In the absence of any depression, there is no c-d phase advance, as expected. These general results are for the most part similar across temporal frequency of the input and choice of synaptic depression parameters (pulse vs. train), except that the train parameter set tends to produce somewhat larger phase shifts than the pulse set, as is also evident in Fig. 6.
To consider the additional effects of spike-rate adaptation and of contrast-dependent changes in membrane time constant, we turn to the spiking model. In this model, depression was included only at geniculocortical synapses, for reasons described in APPENDIX A. In the absence of depression ("D") or adaptation ("A"), a c-d phase shift of 3-4° appears, increasing slightly with temporal frequency (Fig. 8, "No A, No D"). This is roughly consistent with the observed contrast-induced decreases in membrane time constant.1 Adding either adaptation alone ("A, no D") or geniculocortical depression alone adds roughly another 5°, and the effects of these two mechanisms together are additive.
|
With all three mechanisms present, the spiking model shows mean c-d phase advance of 13-15°, relative to LGN, for either set of depression parameters (Fig. 8). Depression in intracortical excitatory synapses can easily add another 5° (Fig. 7). This suggests that these mechanisms may be sufficient to account for the roughly 20° difference between LGN X cell and V1 c-d phase advances that have been observed in cats (Fig. 2). However, while we have found that the effects of geniculocortical depression add with those of intracortical E depression (Fig. 7) and with those of adaptation (Fig. 8), we have not studied the three together. We tried modeling adaptation in the rate model, but did not see an effect on c-d phase advance. In our simple rate model, adaptation was proportional to the rate, and therefore was active even at low rates. In reality and in our spiking model, the net effect of adaptation increases faster than linearly with firing rate: the mean adaptation current increases proportionally to the rate, but the effect of this current on spiking increases with rate, because at higher rates (smaller interspike intervals), there is less time for the spike-induced current to decay between spikes. This difference appears to be critical to the c-d phase advance induced by adaptation. Rather than include a more complicated (and underconstrained) dependence of adaptation on rate, we elected to study only the effects of synaptic depression in the rate model, and to study adaptation only in the spiking model. Conversely, as discussed in APPENDIX A, for reasons of computational complexity, we did not study depression of intracortical synapses in the spiking model.
We also examined the dependence of phase shift on stimulus orientation in the rate model (data not shown). c-d phase advance remains essentially constant across orientations that give reasonable response.
Contrast-dependent changes in temporal frequency tuning
We next studied the contrast dependence of temporal frequency
tuning. As in our studies of c-d phase advance, we wanted to isolate
the cortical contribution to temporal frequency tuning; in this case,
to understand the cortical response in the absence of any incoming
temporal information beyond the stimulus-driven temporal modulation of
the input rates. Experimentally, the LGN inputs show temporal-frequency
dependence in the amplitude of their rate modulations (response F1;
Fig. 3, top). Thus we found it convenient to consider an
even simpler model of LGN responses, in which the LGN response F1 was
constant across temporal frequencies at a given contrast, with larger
F1s representing higher contrast. We refer to such an LGN response
profile as "flat," in distinction to the experimental tuning of
Fig. 3, top, which we refer to as "Sclar" tuning
[because the experimental data are from Sclar (1987)
]. Using flat LGN tuning, we can examine cortical contributions to temporal tuning; we can then examine full cortical responses using Sclar LGN tuning.
Assuming flat LGN tuning, there are at least four cortical factors that contribute to temporal frequency tuning and its contrast dependence: 1) the cellular time constant and its decrease with increasing stimulus contrast; 2) the spike-threshold nonlinearity; 3) spike-rate adaptation; and 4) synaptic depression. We consider the effects of each of these in turn.
Cellular (and synaptic) time constants act as low-pass filters, causing
the modulation of the simple cell's voltage response (the 1st harmonic
of F1 of the voltage response) to decrease with increasing temporal
frequency.2 As we
have already noted, the average membrane time constant of a cortical
cell shrinks as the amount of synaptic input to the cell increases,
because increasing synaptic drive increases membrane conductance. As a
result, at higher contrasts the voltage responses to higher temporal
frequencies are less attenuated by cellular filtering than at lower
contrasts (Carandini and Heeger 1994
). This effect is
captured in the spiking model, but not in the rate model which has a
fixed time constant. The effect is modest: the mean time constant in
the spiking model shrinks from 12.5 to 8 ms between the low (F1 = 30) and high (F1 = 90) flat input levels (further details in
footnote 1). Assuming a linear model of voltage response, this yields
about an 18% increase in the high-contrast voltage F1 at 12 Hz
relative to that expected from the low-contrast time constant.
However, this modest effect can become significant when combined with
the nonlinearity of a nonzero spiking threshold: the threshold gives
rise to an "iceberg" effect. Figure
9A shows responses in the
spiking model to four levels of flat LGN input, when adaptation but not
depression is present. At 12-Hz input frequency, the response is close
to zero for input F1s of 15 or 30 spikes per second, but thereafter
grows with increasing input F1, suggesting a threshold effect. This can
be confirmed by viewing the corresponding intracellular voltage traces
for a randomly chosen cell with spiking turned off (Fig.
9B); the spike threshold of
52.5 mV is indicated as a
dashed line. The modest attenuation of voltage modulation due to
membrane filtering is, on average, sufficient to keep voltage responses
subthreshold at the lower input levels. Higher input modulation levels,
however, yield higher voltage modulations that consistently cross
threshold. This threshold effect depends on our circuit model, in which
inhibition is dominant so that the mean response to a sinusoidal
grating is always subthreshold and spiking occurs only on voltage
modulations (Troyer et al. 1998
); in a model in which
the mean input to a preferred stimulus was suprathreshold, the modest
affects of cellular filtering on the voltage modulations would have
only modest effects on spike response.
|
To examine the effects of the other mechanisms, we examined temporal frequency tuning curves with and without synaptic depression (Fig. 10: A, no depression; B, pulse depression parameters) and, for the spiking model, with and without spike-rate adaptation currents (Fig. 10, spiking model: middle panels, no adaptation; bottom panels, with adaptation). In all cases, we present data for both flat (dashed lines in Fig. 10) and Sclar (solid lines in Fig. 10) LGN tuning. The top panels of Fig. 10, A and B, show the LGN input to simple cells. LGN cells respond better to high than to low temporal frequencies and show slightly more contrast-dependent enhancement of both high and low temporal frequencies than of middle temporal frequencies (Fig. 10A, Sclar inputs). In the absence of depression or adaptation, the filtering by the cortical cell's membrane time constant, combined with the spike threshold, produces strongly low-pass cortical responses (Fig. 10A, middle panels). Both spike-rate adaptation (Fig. 10, bottom panels) and synaptic depression (Fig. 10B) suppress responses to lower-frequency stimuli much more strongly than responses to higher-frequency stimuli, and can convert low-pass cortical response into a more band-pass response. This property of synaptic depression also virtually eliminates the difference between flat and Sclar inputs (Fig. 10B, top panels). Train parameters for synaptic depression produce results similar to pulse parameters, except that there is less difference between responses to low versus high contrasts (not shown).
|
Both synaptic depression and spike-rate adaptation contribute to the relative enhancement of higher temporal-frequency responses at high contrast. Each is more strongly activated by higher-contrast than by lower-contrast stimuli, and each more strongly suppresses responses to lower-frequency than to higher-frequency stimuli. These contrast-dependent effects are most clear in the "normed" insets in each panel of Fig. 10, which show the ratio of high-contrast to low-contrast responses versus temporal frequency. This ratio strongly increases at higher temporal frequencies for cortical responses in every case except for that of the rate model without depression (Fig. 10A). That case is the only one that lacks any of the three mechanisms of contrast-dependent changes in membrane time constant, synaptic depression, and spike-rate adaptation. Adding depression alone (Fig. 10B, rate model) or membrane time constant changes alone (Fig. 10A, spiking model, no adaptation) suffices to give contrast-dependent enhancement of high-frequency responses. Addition of spike-rate adaptation in the spiking model tends to eliminate any relative enhancement of lower frequencies while preserving such enhancement at higher frequencies. Synaptic depression also suppresses the contrast-dependent differences between LGN input conductances, making different contrasts appear more alike to the cortical cell. This reduces the strength of contrast-dependent response enhancement at all temporal frequencies.
We see at best only a weak shift in the peak of the temporal frequency tuning curve with increasing contrast. At present, there are no experimental data as to whether LGN-recipient cells in cat layer 4 show such a shift in peak. If they do not, but instead show only a relative increase in responses to higher temporal frequencies at higher contrast, this could be sufficient to induce shifts in the tuning peaks of downstream cells.
Saturation of responses with increasing contrast
Last we examined the saturation of cortical responses with increasing contrast (Fig. 11). Even in the absence of depression or spike-rate adaptation, model cortical responses tend to saturate somewhat earlier than their LGN inputs, particularly at lower temporal frequencies (Fig. 11B). If either pulse or train depression is active, saturation occurs significantly earlier than in either the LGN inputs or the models without depression. (The one exception is at the highest temporal frequency of the spiking model, for which responses are small and the measure of saturation probably inaccurate.) Moreover, clearly in the depression cases, and also somewhat in the examples lacking depression, there is a tendency for responses to higher temporal frequencies to saturate later than responses to lower temporal frequencies: for cases with depression, C50 values increase monotonically with temporal frequency if the lowest temporal frequency is excluded. The same pattern is seen in the V1 cell of Fig. 4, although the model C50 values are somewhat lower than those measured by Albrecht.
|
The contrast saturation effects induced by synaptic depression can be
readily understood. As demonstrated by Abbott et al. (1997)
and Tsodyks and Markram (1997)
, in the
presence of depression, as a presynaptic neuron's firing rate
increases to values much larger than 1/
(where
is the time
constant of recovery from depression), the overall postsynaptic effect
of its synapses (proportional to rate times efficacy) saturates at a
plateau value. The postsynaptic cell cannot "see" further increases
in rate. Thus as LGN firing rates increase with contrast, the impact on
the cortical cells will plateau earlier than it would without
depression. This saturation occurs at higher contrasts for higher
temporal frequencies, because depression more strongly suppresses lower
than higher-frequency inputs.
As can be seen in the table, however, cortical responses can saturate at lower contrasts than LGN even when depression is absent. This results from the inhibition in our circuit model. Because the cortical response is determined by a thresholded version of the membrane voltage, for a sinusoidal input grating the response of the cortex can be largely understood from the peak membrane voltage. We estimate this peak as the sum of the mean voltage and the modulation amplitude, or first harmonic, of the voltage. In the absence of inhibition, this peak voltage closely follows the modulation of the LGN input: tuning curves of peak voltage and of LGN modulation show very similar C50s under various conditions (data not shown). However, when inhibition is added, the peak voltage can show C50 values that are lower than the corresponding LGN values, because the inhibition in the model both decreases the slope of, and adds a constant negative DC offset to, the curve of peak voltage versus contrast. The DC offset originates from the background firing of the LGN, which, because the cortex is inhibition dominated, is net inhibitory. By both flattening and shifting the cortical response curve closer to zero, inhibition effectively causes cortical neurons to saturate sooner than their inputs.
| |
DISCUSSION |
|---|
|
|
|---|
We have established that a simple circuit model of cat layer 4 that achieves contrast-invariant orientation tuning can also account for three c-d nonlinearities in simple cell responses to s