|
|
||||||||
The Journal of Neurophysiology Vol. 87 No. 6 June 2002, pp. 2741-2752
Copyright ©2002 by the American Physiological Society
1Department of Psychology, Neuroscience and Cognitive Science Program, University of Maryland, College Park, Maryland 20742; 2National Aeronautics and Space Administration Ames Research Center, Moffett Field 94035-1000; and 3Departments of Physiology and Otolaryngology, W. M. Keck Center for Integrative Neuroscience, Sloan-Swartz Center for Theoretical Neurobiology, University of California, San Francisco, California 94143-0444
| |
ABSTRACT |
|---|
|
|
|---|
Troyer, Todd W., Anton E. Krukowski, and Kenneth D. Miller. LGN Input to Simple Cells and Contrast-Invariant Orientation Tuning: An Analysis. J. Neurophysiol. 87: 2741-2752, 2002. We develop a new analysis of the lateral geniculate nucleus (LGN) input to a cortical simple cell, demonstrating that this input is the sum of two terms, a linear term and a nonlinear term. In response to a drifting grating, the linear term represents the temporal modulation of input, and the nonlinear term represents the mean input. The nonlinear term, which grows with stimulus contrast, has been neglected in many previous models of simple cell response. We then analyze two scenarios by which contrast-invariance of orientation tuning may arise. In the first scenario, at larger contrasts, the nonlinear part of the LGN input, in combination with strong push-pull inhibition, counteracts the nonlinear effects of cortical spike threshold, giving the result that orientation tuning scales with contrast. In the second scenario, at low contrasts, the nonlinear component of LGN input is negligible, and noise smooths the nonlinearity of spike threshold so that the input-output function approximates a power-law function. These scenarios can be combined to yield contrast-invariant tuning over the full range of stimulus contrast. The model clarifies the contribution of LGN nonlinearities to the orientation tuning of simple cells and demonstrates how these nonlinearities may impact different models of contrast-invariant tuning.
| |
INTRODUCTION |
|---|
|
|
|---|
The tuning of visual neurons
for the orientation of contrast edges is the most thoroughly explored
response property of cortical neurons. Cortical layer 4 of cat primary
visual cortex (V1) is composed primarily of simple cells
(Bullier and Henry 1979
; Gilbert 1977
;
Hubel and Wiesel 1962
), i.e., cells with receptive
fields (RFs) containing oriented subregions each responding exclusively to either light onset/dark offset (ON subregions) or dark
onset/light offset (OFF subregions). Hubel and
Wiesel (1962)
proposed that these response properties arose
from a corresponding, oriented arrangement of inputs to simple cells
from ON-center and OFF-center cells in the
lateral geniculate nucleus (LGN) of the thalamus; such an arrangement
is referred to as Hubel-Wiesel thalamocortical connectivity. The
usual formulation of this so-called "feed-forward" model assumes
that simple cell response properties arise from a linear summation of
these LGN inputs, followed by a rectification nonlinearity due to spike
threshold (Carandini and Ferster 2000
; Ferster 1987
; Ferster and Miller 2000
).
The feed-forward model is challenged by the fact that spiking responses
in simple cells are invariant to changes in stimulus contrast
(Sclar and Freeman 1982
; Skottun et al.
1987
): under this model, inputs at nonoptimal orientations are
expected to be subthreshold at low contrast but become
suprathreshold at higher contrast. We have previously presented
simulation results showing that a simple form of strong "push-pull"
inhibition (inhibition induced by light in OFF subregions
or dark in ON subregions), combined with Hubel-Wiesel
thalamocortical connectivity, is sufficient to overcome
this difficulty and robustly yield contrast-invariant orientation
tuning (Troyer et al. 1998
). In this paper, we analyze the conditions required to achieve contrast-invariant orientation tuning in such a push-pull model.
In our previous work, we studied two versions of the push-pull model. In one version ("network model"), the cortex was modeled as a fairly realistic network of spiking neurons, each modeled as a single-compartment conductance-based integrate-and-fire neuron. The LGN responses were modeled as Poisson spike trains sampled from the stimulus-driven LGN firing rates. The second version ("conceptual model") was much simpler. In this model, both cortical and LGN neuronal activities were represented by firing rates, and the only nonlinearity was the rectification of firing rates at some threshold level of input (rates could not go below zero). While the network model also included intracortical connections from excitatory neurons, the conceptual model included only direct thalamic excitation and thalamic-driven feed-forward inhibition (meaning inhibition driven by LGN via inhibitory cortical interneurons).1 Despite the differences, the two models produced quantitatively similar orientation tuning curves.2 This suggests that the simpler conceptual model retained the key elements responsible for contrast-invariant orientation tuning in the more complex model, and in particular that the rectification or threshold nonlinearity is the primary nonlinearity that is essential for an understanding of this tuning.
In this article, we characterize the conditions required for contrast
invariance in the conceptual model, which is simple enough to allow
analysis. We begin by deriving a general equation for the total LGN
input to a cortical simple cell receiving Hubel-Wiesel thalamocortical
connections, making minimal assumptions other than that LGN responses
can be well-described by an instantaneous firing rate and that the
total LGN input to the simple cell is given by an appropriately
weighted sum of the LGN firing rates. We then show that, in the case of
a periodic grating stimulus, this equation is dominated by two terms: a
linear term, representing the sinusoidally modulated part of the input,
and a nonlinear term, representing the mean input. Except at very low
stimulus contrasts, this nonlinear term grows with stimulus contrast
due to the rectification of LGN firing rates. We then examine how the
combination of this LGN input, LGN-driven push-pull inhibition, and a
cortical cell threshold can yield contrast-invariant orientation tuning
in two regimes. In one regime, representing all but very low contrasts,
contrast invariance of orientation tuning can arise if the growth of
the linear and the nonlinear input terms have the same shape as a
function of contrast. Further analysis demonstrates that this condition
should be at least approximately true for a wide range of LGN models.
In the second regime, representing very low contrasts, the nonlinear
input term does not change with contrast, so that the stimulus-induced
input is simply given by the linear term, which scales with contrast.
In this regime, where the total input is small, input noise results in
a smoothed threshold. Over a wide range of thresholds, this smoothing
results in an input/output function that is approximated by a power-law
function (Miller and Troyer 2002
). The combination of
input that scales with contrast and a power-law input/output function
yields contrast-invariance of tuning. Finally, we demonstrate that
these two mechanisms can combine to yield contrast-invariant tuning
over all contrasts.
An abstract of this work has appeared (Troyer et al.
1999
).
| |
RESULTS |
|---|
|
|
|---|
LGN input to simple cells
Previous investigations into the origins of orientation selectivity have made a variety of simplifying assumptions regarding the nature of the LGN input to cortical simple cells. Often, the visual stimulus is transformed directly into a pattern of cortical input, ignoring important nonlinearities contributed by LGN responses. In this paper we focus on periodic gratings, i.e., stimuli that are spatially periodic in one dimension and uniform in the other dimension. Our model is a purely spatial model and ignores cortical temporal integration. We consider an "instantaneous" pattern of LGN activity across a sheet of cells indexed by the two dimensional vector x representing the center of each LGN receptive field. Cortical output is derived as a static function of this pattern of activity.
In this section, we demonstrate that the total LGN input to a simple
cell in response to a grating stimulus with contrast C,
orientation
, and spatial location given as a phase variable
stim, can be well approximated by a function of the
following form
|
(1) |
2 For periodic grating stimuli, the second term represents the mean (DC) level of input averaged over all spatial phases, while the first term represents a sinusoidal (1st harmonic or F1) modulation of the input as a function of the grating's spatial phase.
3 The level of the mean input depends only on contrast. The amplitude of the modulation can be factored into the product of a function that depends only on contrast and a function that depends only on orientation.
Input from a Gabor RF: general expression
We view the LGN as a uniform sheet of ON-center and
OFF-center X cells, and let
LON(x)
[LOFF(x)] denote the response of
the LGN ON (OFF) cell at position vector
x at a particular time t [for simplicity, we
omit the time dependence in LON(x,
t) and LOFF(x,
t)]. Initially, we make no explicit assumptions regarding the relationship between ON and OFF cells
responses. However, we expect ON and OFF cells
at the same location to have roughly opposite responses to changes in
luminance. To extract the ON/OFF difference we let
Ldiff(x) = [LON(x)
LOFF(x)]/2. Letting
Lavg(x) = [LON(x) + LOFF(x)]/2 denote the
average LGN response at position x, we can rewrite
LON(x) = Lavg(x) + Ldiff(x) and
LOFF(x) = Lavg(x)
Ldiff(x).
We assume that the spatial pattern of LGN connections to a simple cell
can be described by a Gabor function (Jones and Palmer 1987b
; Reid and Alonso 1995
). For simplicity, we
will refer to this pattern of LGN connectivity as the receptive field
(RF) of the cell. We let the vector fRF
represent the preferred spatial frequency and orientation of the RF,
and choose our x coordinates so that
fRF is parallel to the
x1 axis and so that the RF center is at the origin. Thus we write the Gabor RF as
|
1 and
2 determine the size, and
RF the spatial
phase of the Gabor RF. Positive values of the Gabor, G+(x) = max
[G(x), 0], give the connection strength from
ON cells, whereas the magnitude of the negative values,
G
(x) = |min
[G(x), 0]|, gives the connection strength from OFF cells. Note that
G(x) = G+(x)
G
(x), while
|G(x)| = G+(x) + G
(x). Using a linear summation
model, the total input to the cortical cell is given by
|
|
|
|
(2) |
In the common linear model of LGN input, the firing rate modulation of OFF cells is assumed equal and opposite to ON cell modulations at a given point x. Therefore Lavg(x) is a constant equal to the average background firing rate of LGN cells, and the ON/OFF-averaged term contributes only a stimulus-independent background input to the cortex. As a result, the linear model only considers the ON/OFF-specific component of the LGN input. However, since LGN firing rates cannot be modulated below 0 Hz, at higher contrasts the balance between ON and OFF cell modulations cannot be maintained, and the ON/OFF-averaged term grows with increasing contrast. Hence it becomes important to retain both terms in modeling LGN input.
Further insight into this decomposition can be gained by considering, for any given cortical cell, the cell's antiphase partner: an imaginary cell that has identical Gabor receptive field except for an overall sign reversal, so that ON subregions (connections from ON cells) are replaced with OFF subregions (connections from OFF cells) and vice versa. Then the ON/OFF-specific term represents the component of LGN input that is equal and opposite to a cell and to its antiphase partner, while the ON/OFF-averaged term represents the input component that is identical to a cell and to its antiphase partner. It is in this sense that we call these the ON/OFF-specific and ON/OFF-averaged input components, respectively. The decomposition into these two components is key to the results presented below.
Input from grating stimuli
In this paper, we focus on responses to periodic gratings, i.e.,
stimuli that are spatially periodic in one dimension and uniform in the
other dimension. The periodicity of the gratings is described by a
two-dimensional spatial frequency vector
fstim, and we assume that LGN cells
have circularly symmetric receptive fields. Therefore the response of
an LGN ON-center cell is determined by its position
relative to the grating: LON(x) = lON(fstim
· x), where lON(
) is some
function of a scalar variable
determining a cell's location
relative to the periodic modulation of the overall LGN activity
pattern. Similarly, LOFF(x) = lOFF(fstim
· x). Throughout we will assume
|fstim| = |fRF|. LGN spatial frequency tuning
can be included by writing
LON(x) = FON(|fstim|)lON(fstim
· x), where
FON(|fstim|)
is the spatial frequency tuning of ON cells scaled so that FON(|fRF|) = 1. Similar definitions apply for OFF cells.
Given that LGN activity is periodic with spatial frequency
fstim, the
Lavg(x) and
Ldiff(x) components of LGN activity
can each be written as a cosine series
|
|


and 0


represent the phase of each component.
Using these expansions, we can evaluate the integral in Eq. 2 and rewrite the total LGN input to a simple cell as the sum of a
pair of cosine series
|
(3) |
(nfstim) and
|
|(nfstim) are the
amplitudes of the Gabor and absolute Gabor filters, respectively,
evaluated at the spatial frequency vector
nfstim (i.e., the amplitudes of
the Fourier transforms of G and |G| at this
frequency). The phases are 















We reiterate that our model is a spatial model. The instantaneous spatial distribution of activity across a two-dimensional sheet of LGN cells (described by adiff and aavg) is multiplied by the appropriate Gabor filters to determine the total LGN input to a cortical cell at a given time, and the cortical response is assumed to depend instantaneously on this input. For temporal patterns of input, such as a drifting or counterphased grating, we simply assume that the cortical responses can be computed as instantaneous responses to a sequence of patterns of LGN activity that change in time. For flashed stimuli, the model will apply to the degree that cortical responses simply track the changing patterns of LGN activity triggered by the flash.
Reduction to two terms
In this subsection, we argue that each of the two sums in
Eq. 3 should be dominated by the contribution of a single
term. In particular, the ON/OFF-averaged sum is
dominated by the zero frequency (DC) term, and the
ON/OFF-specific sum is dominated by the first
harmonic (F1) term. Having shown that this is the case, we will have
demonstrated that
|
(4) |
stim


We consider a simple rectified-linear model of LGN responses.
ON and OFF cells are assumed to have background
firing rates of 10 and 15 Hz, respectively, and response modulations
are assumed to result from linear filter properties of the LGN cells.
Therefore a drifting sinusoidal grating stimulus leads individual LGN
cells to sinusoidally modulate their firing rates with time, rectified at 0 Hz (Fig. 1, A and
B). This means that at each instant the spatial pattern of
response across the sheet of LGN cells is a rectified sinusoidal
modulation. The amplitude of the modulation of activity across LGN
depends on the contrast of the grating (Fig. 1C) and was
calculated using measured contrast response functions from cat LGN
X cells in response to drifting sinusoidal gratings. The
phase of the modulation is assumed opposite (180° apart) for an
ON cell and an OFF cell at the same position.
This ignores the actual spread of response phases for both
ON and OFF cells (Saul and Humphrey
1990
; Wolfe and Palmer 1998
). At low contrasts,
the stimulus-induced modulations of firing rates do not exceed the
background firing rates, and so the mean input does not grow with
contrast. However, once the stimulus-induced modulation is as large as
the background firing rates
which occurs at about 5% contrast
then
the LGN firing rates rectify at 0 Hz for each trough of the modulation
(Fig. 1), and so increases in contrast above 5% lead to an increase in
the mean LGN input.
|
Using the measured contrast response functions, we compute the
magnitudes of a





|
We now turn our attention to the Gabor filters. Since a Gabor function
is obtained by multiplying a sinusoid and a Gaussian, the Fourier
transform of G consists of a Gaussian convolved with a pair
of delta functions located at the positive and negative frequencies of
the sinusoid (Fig. 2C). The specific parameter values used
have been extracted from two sets of experimental data. We started with
Gabor parameters taken as the mean values from measured simple cell RFs
[Jones and Palmer (1987b)
; full parameters are given in
METHODS section of Troyer et al. (1998)
]. The length of each subregion was then reduced so that the orientation tuning width of the F1 of the total LGN input (38.0° half-width at
half-height) matched the mean tuning width of experimentally measured
intracellular voltage (Carandini and Ferster 2000
). This was accomplished by multiplying the standard deviation of the Gaussian
envelope in the direction parallel to the subregion by a factor of
0.58. One important feature to note is that the width of the Gaussian
in Fourier space is similar to the preferred spatial frequency of the
Gabor RF. This condition will hold when the simple cell RF has roughly
two subregions; more subregions will narrow this width. The
absolute value of the Gabor, |G|, is obtained by
multiplying a Gaussian times the absolute value of a sinusoid. Thus the
Fourier transform |
| consists of a Gaussian convolved with the
Fourier series for the absolute value of a sinusoid, which has only
even harmonics (Fig. 2C). We focus on the response at the
optimal orientation, and fix the stimulus spatial frequency to match
the optimal spatial frequency of the Gabor filter, i.e., fstim = fRF. In this case, the multiples of the
stimulus spatial frequency vector,
nfstim, fall on the peaks of the
Gaussian-shaped humps in the Gabor and absolute Gabor filters (Fig. 2,
A and B, marked x). The result is that
the
(nfstim) coefficients are
dominated by the F1 (n = 1) component (Fig. 2A, gray bars), while the
|
|(nfstim) coefficients
are dominated by the DC (n = 0) component (Fig. 2B, gray bars).
Having calculated the values a

(nfstim) and
|
| (nfstim), we need
only multiply the corresponding components to calculate the
ON/OFF-specific and
ON/OFF-averaged components of the total LGN
input using Eq. 3. The results, shown by the black bars in Fig. 2, A and B, demonstrate that only two terms
contribute significantly to the total LGN input. Since both the Gabor
filter
and the Fourier series for
Ldiff are dominated by the F1 component,
99.9% of the total power in the diff terms is concentrated in the
first harmonic. Similarly, the avg terms are dominated by the DC, with
98.6% of the power concentrated in the zero frequency (mean) component
of the input. Thus the approximation given by Eq. 4 is valid
for the rectified-linear model of LGN response used here, with DC = |
|(0)a
) =
(fstim)a
Thus we have shown that the total LGN input to a cortical simple cell in response to a periodic grating (Eq. 3) is expected to be well-approximated by a sinusoidal modulation about some mean level (Eq. 4). This was derived assuming a linear-rectified LGN response model and a sinusoidal grating, but is likely to hold for more general LGN response models and for more general types of gratings. So long as the F1 of Ldiff is at least as large as the other Fourier coefficients, the dominance of the F1 in the Gabor filter will lead the ON/OFF-specific terms to be dominated by the F1. Similarly, the ON/OFF-averaged components will be dominated by the DC so long as the DC of Lavg is at least as large as the other Fourier coefficients.
Orientation tuning
We now examine the orientation and contrast dependence of the DC
and F1 terms of the total LGN input. We express the orientation
in
degrees from the preferred orientation, i.e., 0 degrees is the optimal
stimulus. Since the responses of individual LGN cells are untuned for
orientation, changing the orientation of the stimulus will rotate this
activity pattern, but will leave the shape of the activity modulation
unchanged. Therefore the coefficients a





|
(5) |
We now turn our attention to the F1 term. First, we write the spatial
frequency vector of the stimulus in polar coordinates, fstim = {|fstim|,
} and let
0 =
({|fstim|, 0}). Then we
factor the amplitude of the modulation as follows
|
(6) |
|
|
0a
) =
({|fstim|,
})/
0 captures the orientation tuning of the Gabor
RF, normalized so that at the preferred orientation,
h(0) = 1. Therefore the total LGN input can be written
in its final form
|
(7) |
) is determined by the evaluation of the filter G
along the circle of radius |fstim|
(Fig. 2C). If the Gabor RF has two or more subregions and
the spatial frequency |fstim| = |fRF|, h(
) is
dominated by the contribution from the Gaussian centered at
fRF. In this case
|
(8) |

x and 
y determine the dimensions of the Gaussian
envelope of the Gabor filter in Fourier space. For
|fstim|
|fRF|, h(
) takes on a
more complex form. Qualitatively, tuning narrows for
|fstim| > |fRF| and broadens for
|fstim| < |fRF|.
This concludes our analysis of the LGN input to a simple cell. We now turn to the analysis of the conditions yielding contrast-invariant orientation tuning.
| |
CONTRAST-INVARIANT ORIENTATION TUNING I: HIGHER-CONTRAST REGIME |
|---|
|
|
|---|
Intracortical inhibition
A key to our model of contrast-invariant orientation tuning at
higher contrasts is the inclusion of strong, contrast-dependent feed-forward inhibition. We will take the inhibition a cell receives to
be spatially opponent to, or antiphase relative to, the excitation a
cell receives; this is also known as a push-pull arrangement of
excitation and inhibition. By spatial opponency of inhibition and
excitation, we mean that in an ON subregion of the simple cell RF, where increases in luminance evoke excitation [i.e., where
G+(x) > 0], decreases in
luminance evoke inhibition; the opposite occurs in OFF
subregions [where G
(x) > 0]. Since the LGN projection to cortex is purely excitatory
(Ferster and Lindström 1983
), this inhibition must come from inhibitory interneurons in the cortex. A simple hypothesis that is consistent with a broad range of experimental data is that a
given simple cell receives input from a set of inhibitory simple cells
that collectively 1) have RFs roughly overlapping that of
the simple cell, 2) are tuned to a similar orientation as
the simple cell, and 3) have OFF subregions
roughly overlapping the simple cell's ON subregions, and
vice versa (Troyer et al. 1998
); that is, collectively a
cell's inhibitory input has a receptive field like that of a cell's
"antiphase partner." This circuitry leads to inhibition that, like
the inhibition observed physiologically, is spatially opponent to the
excitation a cell receives (Anderson et al. 2000a
;
Ferster 1988
; Hirsch et al. 1998
), and
has the same preferred orientation and tuning width as a cell's
excitatory inputs from the LGN (Anderson et al. 2000a
;
Ferster 1986
).
The role of antiphase inhibition in cortical orientation tuning can be understood quite generally by returning to our decomposition of the LGN input into an ON/OFF-specific term and an ON/OFF-averaged term. The ON/OFF-specific component typically is tuned for stimulus parameters such as orientation while the ON/OFF-averaged component typically is untuned or poorly tuned. The ON/OFF-specific term represents input that is equal and opposite to a cell and to its antiphase partner. Thus, for this component, the antiphase inhibition goes down when LGN excitation goes up, and vice versa. The net result is that strong antiphase inhibition serves to amplify the well-tuned ON/OFF-specific component of the LGN input. The ON/OFF-averaged term represents input that is identical to a cell and to its antiphase partner. If the antiphase inhibition is stronger than the direct LGN excitation, the ON/OFF-averaged term elicits a net inhibitory input that is poorly tuned for orientation but depends on contrast. Thus antiphase inhibition serves to eliminate the untuned component of the LGN input and replace it with a net inhibition that sharpens the spiking responses driven by the ON/OFF-specific tuned component of the input. This argument generalizes to any type of stimulus, transient or sustained. We now work out the details of this for the case of a drifting sinusoidal grating stimulus.
In our reduced model, we represent the set of inhibitory cells
providing inhibition to a cell by a single inhibitory simple cell. This
cell receives input from the LGN specified by a Gabor function
identical to that which specifies the input to the excitatory cell
except that the sinusoidal modulation of the RF is 180° out of phase.
The total LGN input to the inhibitory cell is then
|
|
|
(9) |
inh is spike threshold, and
binh is the amount of nonspecific input received
at background. For simplicity, we assume that input to the inhibitory
cell never dips below threshold [i.e., DC(100)
F1max(100) + binh
inh
0], allowing us to ignore inhibitory cell
rectification. This assumption is not unreasonable given the
experimental evidence suggesting that some inhibitory
interneurons have high background firing rates (Brumberg et al.
1996It is important to note that the model inhibitory cell responds
to all orientations, although it also shows orientation
tuning. For nonpreferred orientations, for which h(
)
0, the cell still receives positive input due to the term
DC(C) in Eq. 9, which drives inhibitory response.
This inhibition is critical for overcoming the strong,
contrast-dependent LGN excitation that would otherwise drive excitatory
simple cells at the null orientation. The DC term contributes an
untuned platform to the inhibitory cell orientation tuning curve,
identical for all orientations, on which the tuned response component
due to the term containing h(
) is superimposed. That is,
the inhibitory cell tuning follows the tuning of the total LGN input to
a simple cell, which includes both an untuned and a tuned component.
One of the key predictions of our model (Troyer et al.
1998
) was that there should be at least a subset of layer 4 interneurons that show contrast-dependent responses to all orientations
(see DISCUSSION), which is embodied here in the response of
our single model inhibitory neuron.
To facilitate analysis, we assume that excitation and inhibition
combine linearly, at least in their effect on spiking output. Because
this assumption ignores reversal potential nonlinearities, the total
input in our model should not be equated with the membrane voltage,
particularly for voltages near the inhibitory reversal potential (see
DISCUSSION). Letting 
|
|
|
|
|
(10) |

Inhibition has opposite effects on the DC and F1 terms of the total
input: it acts to suppress the mean input [DC
(1
w)DC], while it enhances the modulation [F1
(1 + w)F1]. The increase in the modulation is due to the fact
that, at any given contrast, increases in LGN excitation are
accompanied by a withdrawal of cortical inhibition, while decreases of
excitation are accompanied by an increase of inhibition. A key
requirement of our model of contrast-invariant tuning is that the
inhibition be dominant (w > 1), i.e., the inhibitory
input is stronger than the direct input from the LGN. This implies that
the contrast-dependent increase in the mean input from the LGN is not
only suppressed, it is actually reversed (1
w < 0), so that the mean feed-forward input to the cortical simple cell
decreases with increasing contrast.
Contrast invariance
Given the assumption that output spike rate results from a
rectified linear function of the input, the excitatory spike rate is
|
(11) |
eff =
exc
bexc + w(binh
inh) is
an effective threshold that incorporates tonic inhibitory activity,
nonspecific input to the excitatory cell bexc
and the excitatory-cell spike threshold
exc. Note that,
after specifying the Gabor RF, the cortical component of our model has
only two free parameters, w and
eff;
the gain factor g is simply a scale factor that determines
the magnitude of response without affecting tuning.
For the orientation tuning of the response to be contrast invariant, we
must be able to write r(C,
,
stim) as the product of a contrast response function
p(C) and an orientation tuning function
q(
,
stim). Since the F1 term is the only
term that depends on the orientation and phase of the stimulus, we look for an expression of the form
|
|
(12) |
|
|
(13) |
w), guarantees that the response is contrast invariant
|
(14) |
stim) = 1],
suprathreshold responses require h(
) > k(w
1)/(w + 1). For
larger values of inhibitory strength w, h(
)
has to be closer to its maximum value before threshold is crossed,
i.e., tuning width is narrower.
Equation 14 implies a strong version of contrast-invariant tuning. Not only is the tuning of average spike rate contrast invariant, but the spike rate at any given phase of the response at a given orientation multiplicatively scales with changes in contrast.
Parameter dependence
Equation 14 is equivalent to the statement that the DC
and F1 terms have the same shape as functions of contrast,
up to a scale factor determined by k and an offset
determined by
eff/(w
1). Using
Eqs. 5 and 6, Eq. 14 can be rewritten
|
(15) |







|
Figure 3 shows, for our linear-rectified model of LGN response, that
a

eff, are chosen to satisfy Eq. 15 for some
constant k. To determine the robustness of our model, we
simply varied these two parameters and measured the resulting contrast
dependence of orientation tuning. We consider three levels of
inhibition (left to right columns of Fig.
4) and three levels of effective
threshold (thick, thin, and dashed lines in Fig. 4, where the thick
line represents the optimal threshold for achieving contrast-invariant
tuning for that level of inhibition). The orientation tuning width, as measured by half width at half height of the orientation tuning curve,
is contrast invariant down to about 5% contrast for the optimal
threshold, as expected (Fig. 4B). With nonoptimal
thresholds, modest deviations from contrast invariance arise in the
range of 5-10% contrast.
|
Contrast invariance: an intuitive explanation
From Fig. 4A we can obtain an intuitive picture of how our model achieves contrast-invariant orientation tuning in the higher-contrast regime. Since the magnitude of the input modulation increases with increasing contrast, the tuned component of the total input also increases. However, with dominant inhibition, the mean input decreases with increasing contrast, so that the growing tuned component rides on a "sinking" untuned platform. Thus there will be some orientation where the input tuning curve for a higher contrast stimulus will cross the corresponding curve for a low contrast stimulus (Fig. 4A). Placing spike threshold at the level where the contrast-dependent tuning curves cross then yields contrast-invariant responses (Fig. 4, A and B, thick lines). Note that as the level of inhibition increases, the crossing point of the input tuning curves occurs closer to the peak, resulting in narrower tuning.
The fact that this crossing point occurs at the same orientation across
a range of contrasts is exactly equivalent to the condition that
a











|
Given this intuitive picture, it is clear that the crucial role for
inhibition in our model is to counteract the untuned component of LGN
excitation and convert it into an untuned suppression that acts like a
contrast-dependent threshold. In our push-pull model, this inhibition
comes from the cell's antiphase partner and is directly driven by the
untuned component of the LGN input. Other sources of this inhibition
are possible, such as complex inhibitory cells that are untuned for
orientation (Hirsch et al. 2000
), or inhibition
coming from simple cells having a range of spatial phases
(McLaughlin et al. 2000
; Wielaard et al.
2001
) (see DISCUSSION). The key properties of the
inhibition for our model are that it should be able to cancel the
large, contrast-dependent LGN input expected at the null orientation,
and should result in a net inhibition having a similar contrast
dependence as that of the modulated component of the total LGN input.
| |
CONTRAST-INVARIANT ORIENTATION TUNING II: LOW-CONTRAST REGIME |
|---|
|
|
|---|
Thus far we have shown that the threshold nonlinearity in the LGN, combined with strong push-pull inhibition, can be used to counteract the threshold-induced broadening of orientation tuning in cortical simple cells. Since the model relies in a crucial way on the rectification of LGN response, the model fails at low contrasts where LGN responses do not rectify. However, in this regime, where stimulus-induced LGN input is small and linear in contrast, the effects of noise on the spike threshold can yield contrast-invariant tuning.
We examine a model in which cortical spiking is governed by a rectified
linear output function, and in which additive Gaussian noise is added
to the stimulus-induced input (Amit and Tsodyks 1991
;
see also Abeles 1991
). The output firing rate is then
the mean rate averaged over the distribution of the noise. Adding such
noise has the effect of "smoothing" the neuron's input/output function (Fig. 6A) in the
vicinity of threshold, allowing some responses even when the mean input
is below threshold.3 In the
regime of large inputs (far above threshold), the noise averages out
and does not alter the linear-threshold input-output function.
|
Because the noise is able to boost the response to just-subthreshold inputs, it acts to broaden orientation tuning for low contrast stimuli. Thus comparing the peak input level (thin lines in Fig. 6B) with the resulting spike rate (thick lines), we find that the spike-rate tuning is significantly broader than the rectified input at contrasts below 5%. Plotting orientation tuning halfwidth versus contrast for a range of noise levels (Fig. 6C) demonstrates that this broadening counterbalances the narrowing of tuning seen for low contrast stimuli in the absence of noise and can lead to nearly contrast-invariant responses down to contrasts below 2% (dark line).
Power-law behavior
The mechanism underlying this noise-induced contrast invariance is
that, for perithreshold inputs, the noise smoothing converts the
rectified linear input/output function into a power-law input/output function (Miller and Troyer 2002
)
|
) = p(C)q(
), then so too does the output r
|
(16) |
For low contrasts, before LGN responses rectify, we expect the LGN
input to scale with contrast. This can be seen from the fact that at
these contrasts, the DC component does not change from background, so
the input is given by a function of contrast times a function of
orientation
|
Note that, for n > 1, the In response function sharpens output tuning relative to input F1 tuning, with larger n yielding greater sharpening. For higher contrasts where LGN responses rectify and the input-output function becomes linear, the output tuning is sharpened relative to input F1 tuning by the inhibition, with larger inhibition yielding greater sharpening. For tuning to be contrast invariant across the full range of contrasts, the sharpening induced by these two mechanisms must be equal.
| |
DISCUSSION |
|---|
|
|
|---|
There is widespread theoretical and experimental (Carandini
and Ferster 2000
; Gardner et al. 1999
) agreement
that the measured orientation tuning of simple cells requires
relatively strong nonlinearities of cortical origin to sharpen the
broadly tuned input from the LGN. The debate centers on
which nonlinearities are best able to account for the range
of experimental data. The threshold nonlinearity is perhaps the most
obvious candidate (Carandini and Ferster 2000
;
Ferster 1987
; Hubel and Wiesel 1962
).
However, in a naive threshold model, orientation tuning is expected to broaden with increasing contrast.
We have analyzed a very simple model of orientation tuning in which the threshold (or firing-rate rectification) nonlinearity was the only nonlinearity considered. We began with a careful analysis of the total LGN input to a cortical simple cell in response to a sinusoidal grating, finding 1) this input is well-approximated by a sinusoidal modulation about some mean level; 2) the amplitude of the modulation depends on both orientation and contrast, while that of the mean depends only on contrast; 3) the growth of the mean input with contrast is induced by the rectification of LGN responses; and 4) at contrasts where rectification of LGN responses is significant, both the mean level and the modulation amplitude grow roughly proportionally to the peak response of typical LGN cells.
Based on this analysis of the input, we then analyzed two routes to
contrast-invariant orientation tuning. For a linear-threshold model of
cortical spiking, we showed that contrast invariance requires that the
mean and the modulation amplitude of the LGN input grow with identical
shapes as a function of contrast, which is satisfied in the regime
where rectification of LGN responses is significant. Intuitively,
strong push-pull inhibition converts the mean input into a net
inhibitory input, which eliminates the contrast-dependent broadening of
tuning expected from a simple threshold model and yields sharp,
contrast-invariant tuning. At very low contrasts, LGN rectification
does not occur. However, neuronal noise can smooth the linear-threshold
response function, yielding a power function to a good approximation.
As a result, the model again yields sharp, contrast-invariant tuning
(Miller and Troyer 2002
). If the sharpening of tuning
induced by the approximate power-law nonlinearity at lower contrasts
matches the sharpening induced by inhibition and spike threshold at
higher contrasts, this simple model can account for contrast-invariant
orientation tuning over the full range of contrasts.
Parameter dependence
Our analysis suggests that the contrast invariance of orientation
tuning depends on correct choice of parameters in two respects. First,
the threshold
eff must be chosen to satisfy the analytic
condition for contrast invariance in the high-contrast regime. However,
we showed here in numerical simulations that the resulting contrast
invariance of tuning, as measured by the half-width at half height of
the orientation tuning curve, is relatively insensitive to the exact placement of spike threshold. Second, parameter tuning seems required to match tuning width at very low contrasts, where sharpening of tuning
is determined by the exponent in the power-law response function, to
tuning width at higher contrasts, where sharpening of tuning is
determined by the strength of inhibition.
The analysis of the simple model studied here may underestimate the
robustness with which dominant antiphase inhibition and neural noise
achieve contrast-invariant tuning. In our previous simulations of a
more biophysically realistic network model, we did not tune threshold
or noise levels,4 yet
contrast-invariant tuning resulted down to very low contrasts and
across a range of inhibition strengths (see Fig. 5A of
Troyer et al. 1998
; orientation tuning broadened
slightly at the lowest contrast studied, 2.5%, but such broadening at
very low contrasts is also suggested by the experimental data,
Skottun et al. 1987
).
Alternative models
The two nonlinear mechanisms considered here
1) linear
LGN input and a nonlinear (power-law) cortical transfer function, and 2) nonlinear LGN input (rectified, so that the mean grows
with contrast), strong push-pull inhibition, and a linear-threshold cortical transfer function
can be contrasted with a third class of
nonlinear models. These models rely on the interaction between the
threshold nonlinearity and strong center-surround connectivity within
the cortex to achieve contrast-invariant tuning (Ben-Yishai et
al. 1995
; Somers et al. 1995
). These models use
recurrent dynamics that result in an "activity bubble" of cortical
response, whose shape is determined by the pattern of intracortical
connections in a manner largely independent of the pattern of LGN
input. The shape of this activity bubble in turn determines the shape
of the orientation tuning curve. As a result, orientation tuning is
invariant to changes in the pattern of LGN input, and in particular is
invariant to changes in stimulus contrast. However, these models are
inconsistent with a number of experimental results demonstrating basic
linear elements of simple cell receptive fields. In particular, there
is a correspondence between the linear component of response as
measured with spatially localized stimuli and with drifting sinusoidal
gratings (Gardner et al. 1999
; Jones and Palmer
1987a
; Lampl et al. 2001
; Movshon et al.
1978
). Moreover, orientation tuning is not independent of all
input parameters. Rather, tuning depends on the spatial frequency of
oriented gratings (Hammond and Pomfrett 1990
;
Vidyasagar and Sigüenza 1985
; Webster and De Valois 1985
) and the length of oriented bars (Vogels
and Orban 1990
) in the manner predicted by the present model
(Troyer et al. 1998
): stimuli that evoke narrower tuning
of the LGN input modulation evoke narrower tuning of the cortical spike response.
A recent model of monkey layer 4 (McLaughlin et al.
2000
; Wielaard et al. 2001
) also relies on
strong feed-forward inhibition to cancel the nonlinear component of LGN
input, but assumes that this inhibition has no phase specificity. This
accords with the lack of phase specificity of transient inhibitory
responses reported by some workers (Borg-Graham et al.
1998
) but not by others (Ferster 1988
;
Hirsch et al. 1998
). Phase-nonspecific inhibition would contribute an inhibitory DC component, but no F1 component, to the
response to a periodic grating. Thus the effect of including such
inhibition in our model would be to replace the factor (1
w) multiplying DC(C) in Eq. 10 by
(1
w
wnon), where
w still represents the phase-specific component of
inhibition and wnon represents the
phase-nonspecific component. This in turn would cause the condition for
contrast-invariant tuning, Eq. 14, to be modified simply by
replacing (w
1) with (w + wnon
1). Thus phase-nonspecific
inhibition is equally as effective as phase-specific inhibition in
achieving contrast-invariant tuning. The main difference between the
two is that phase-specific inhibition enhances the response to a
preferred stimulus (it adds to the F1 while subtracting from the DC),
while phase-nonspecific inhibition equally suppresses responses to a
preferred or nonpreferred stimulus (it does not add to the F1 while
subtracting from the DC). Thus substituting phase-nonspecific for
phase-specific inhibition will reduce responses and sharpen orientation
tuning by rendering more orientations subthreshold.
A key prediction of our model is that at least a subset of layer 4 inhibitory cells should respond in a contrast-dependent manner to all
orientations (Troyer et al. 1998
). Inhibition from these
cells is necessary to prevent the untuned component of the LGN input
from driving spiking in tuned simple cells at orientations far from the
cell's preferred orientation. Hirsch et al. (2000)
recently reported two classes of inhibitory cells in cat V1 layer 4:
simple cells that appeared to have good orientation tuning and complex
cells that were essentially untuned for orientation. Such complex cells
could provide the contrast-dependent untuned inhibition required by our
formulation, although in a different form than used in our model. The
inhibition from these complex cells would contribute to the DC but not
to the F1, just as phase-nonspecific inhibition would. The simple
inhibitory cells would provide phase-specific, push-pull inhibition at
orientations around the preferred but would not spike far from the
preferred orientation, presumably because they are inhibited by the
complex cells in a manner similar to excitatory simple cells.
Voltage tuning versus input tuning
Anderson et al. (2000b)
have published experimental
data indicating that the full range of contrast responses lies in a
noise-dominated regime. They found that voltage noise was large, e.g.,
rms 5 mV, comparable with the voltage DC and F1 at high contrast, which were each 5-10 mV. They also found that the mean and modulation of the
voltage response each had similar orientation tuning and that the
tuning of each simply scaled with contrast. They showed that this, the
large noise, and a linear-threshold model for converting voltage
responses to spiking responses could yield contrast-invariant spiking
responses, much as in our low-contrast regime. At first glance it might
appear that this makes our analysis of the high-contrast regime
irrelevant, because the noise-smoothed part of the input/output curve
applies over the full range of contrasts. However, the model proposed
by Anderson et al. begins with the observed voltage responses; it does
not address the mechanisms by which the LGN input drive is converted
into the observed tuning and contrast scaling of the voltage.
In the model analyzed here, spiking output is assumed to be a rectified
linear function of a cell's synaptic input, smoothed by noise. The
intermediate step of conversion of synaptic input into membrane voltage
is not addressed. In particular, the model does not include the
nonlinearity contributed by synaptic reversal potentials. This
nonlinearity should not have much effect on spiking responses, since
deviations from linear behavior are small at voltages near and above
threshold (Holt and Koch 1997
). However, near the
inhibitory reversal potential, inhibitory drive is significantly reduced, but excitatory drive is relatively unchanged. Thus a linear
model would systematically overestimate the degree of membrane hyperpolarization for subthreshold ranges of voltage.
Nonetheless, our analysis of the LGN input has interesting implications
for understanding the results of Anderson et al.
(2000b)
. We focus on two significant results. First,
Anderson et al. (2000b)
found little stimulus-induced
voltage change from rest at any contrast in response to stimuli of the
null orientation (the orientation orthogonal to the preferred), a
result also seen by Carandini and Ferster (2000)
. Our
analysis suggests that null-oriented stimuli should evoke significant
LGN excitation that grows with
contrast.5 To prevent
significant depolarization from rest, this excitation must be balanced
by an equal amount of inhibitory current. However, equivalence of
excitatory and inhibitory currents near rest translates into a
dominance of inhibition for membrane potentials near threshold. Thus
the lack of depolarization to stimuli at the null orientation suggests
that the mean input in response to these stimuli is inhibition dominated, as required by our model.
Second, the mean level of membrane depolarization (the voltage DC) in
simple cells is tuned for orientation and has tuning similar to that of
the voltage modulation (the voltage F1) (Anderson et al.
2000b
; Carandini and Ferster 2000
). The
orientation tuning of the mean voltage response may be induced by two
nonlinearities interacting with the voltage modulation. First, the
inhibitory reversal potential prevents much hyperpolarization from
rest, but does not limit depolarizations, so that voltage modulations will be primarily depolarizing from rest. Second, the tuned spiking response driven by the modulation of the LGN input will be amplified by
input from other cortical cells in the same or nearby orientation columns that are also driven to spike (Anderson et al.
2000a
; Douglas et al. 1995
; Ferster
1986
; Troyer et al. 1998
). The net depolarization from each of these nonlinearities should cause the
voltage DC to inherit orientation tuning similar to that of the voltage F1.
Conclusion
A number of factors can contribute to the orientation tuning of
voltage and spiking in cortical simple cells. These include nonlinearities in the LGN input, the threshold nonlinearity and its
possible smoothing by noise, antiphase inhibition, phase-nonspecific or
orientation-untuned inhibition, reversal potential nonlinearities, and
the input from other cortical excitatory cells. We have addressed the
first three factors in a model that is simple enough to allow analysis,
and that emphasizes the role played by contrast-dependent nonlinearities in the LGN. The results of Anderson et al.
(2000b)
suggest that noise smoothing plays a key role in the
contrast invariance of orientation tuning over the full range of
contrasts, which differs from the model presented here. Nonetheless,
our analysis of the LGN suggests the need for dominant feed-forward inhibition that eliminates response to the DC component of the LGN
input, while also pointing out a route to contrast-invariant tuning for
stimuli that might push a cell beyond the noise-smoothed regime. More
generally, our analysis suggests that decomposing simple cell input
into ON/OFF-averaged and
ON/OFF-specific components may be a useful step
toward a more complete understanding of orientation tuning in layer 4.
| |
ACKNOWLEDGMENTS |
|---|
We gratefully acknowledge support by National Eye Institute Grant EY-11001 (K. D. Miller) and the Howard Hughes Medical Institute predoctoral fellowship program (A. E. Krukowski).
| |
FOOTNOTES |
|---|
Address for reprint requests: T. W. Troyer, Dept. of Psychology, University of Maryland, College Park, MD 20742 (E-mail: ttroyer{at}glue.umd.edu).
1
By referring to feed-forward inhibition, we
refer to that component of the inhibition driven by the LGN. A single
inhibitory cell might be driven both by LGN and by cortical cells, so
that it could contribute both feed-forward and feedback inhibition. In
Troyer et al. (1998)
we showed that the feed-forward
component of the inhibition was necessary and sufficient to yield
contrast-invariant tuning, and that the feedback component had no
effect on orientation tuning in the context of the model circuit
studied there.
2 The two models differed at the lowest contrast studied, 2.5%, where the network model but not the conceptual model maintained contrast-invariant tuning. This difference arose because noise was present in the network model but not in the conceptual model; adding noise to the conceptual model eliminates this difference, see RESULTS.
3
Given the standard deviation of the noise
,
the spike threshold
, and the gain g, one can explicitly
calculate the output firing rate as a function of the mean level of
input I: r=g(I
)/2 [1+erf(I
/

/
(I
)2/2
2,
where erf is the error function, erf (x)=2/


y2.
4
Spike threshold was set to
52.5 mV, and then
sufficient nonspecific background excitatory conductances were injected
to produce low, physiological levels of spontaneous spiking activity in
excitatory cells at the "default" level of inhibition. This amounts
to setting threshold in units of the noise. Neither threshold nor the
amount of injected background conductance were varied as the strength of inhibition was changed.
5
Yet another complication is that the mean LGN
input may be suppressed by frequency-dependent synaptic depression at
lower temporal frequencies, e.g., 2 Hz (Krukowski 2000
).
However, at higher temporal frequencies, e.g., 8 Hz, the effects of the
untuned mean LGN input should be strong. Anderson et al.
(2000b)
and Carandini and Ferster (2000)
studied
only lower temporal frequency stimuli.
Received 8 June 2001; accepted in final form 12 February 2002.
| |
REFERENCES |
|---|
|
|
|---|
.
Proc Natl Acad Sci USA
97:
8087-8092, 2000This article has been cited by other articles:
![]() |
S. E. Palmer and K. D. Miller Effects of Inhibitory Gain and Conductance Fluctuations in a Simple Model for Contrast-Invariant Orientation Tuning in Cat V1 J Neurophysiol, July 1, 2007; 98(1): 63 - 78. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. A. Cleland, B. A. Johnson, M. Leon, and C. Linster Relational representation in the olfactory system PNAS, February 6, 2007; 104(6): 1953 - 1958. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. F. Teich and N. Qian Comparison Among Some Models of Orientation Selectivity J Neurophysiol, July 1, 2006; 96(1): 404 - 419. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. J. Alitto and W. M. Usrey Influence of Contrast on Orientation and Temporal Frequency Tuning in Ferret Primary Visual Cortex J Neurophysiol, June 1, 2004; 91(6): 2797 - 2808. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Z. Lauritzen and K. D. Miller Different Roles for Simple-Cell and Complex-Cell Inhibition in V1 J. Neurosci., November 12, 2003; 23(32): 10201 - 10213. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Shu, A. Hasenstaub, M. Badoual, T. Bal, and D. A. McCormick Barrages of Synaptic Activity Control the Gain and Sensitivity of Cortical Neurons J. Neurosci., November 12, 2003; 23(32): 10388 - 10401. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Ringach, M. J. Hawken, and R. Shapley Dynamics of Orientation Tuning in Macaque V1: The Role of Global and Tuned Suppression J Neurophysiol, July 1, 2003; 90(1): 342 - 352. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |