|
|
||||||||
The Journal of Neurophysiology Vol. 88 No. 3 September 2002, pp. 1363-1373
Copyright ©2002 by the American Physiological Society
1University of California, Davis California 95616; and 2Center for Neural Science, New York University, New York, New York 10003
| |
ABSTRACT |
|---|
|
|
|---|
Sceniak, Michael P., Michael J. Hawken, and Robert Shapley. Contrast-Dependent Changes in Spatial Frequency Tuning of Macaque V1 Neurons: Effects of a Changing Receptive Field Size. J. Neurophysiol. 88: 1363-1373, 2002. Previous studies on single neurons in primary visual cortex have reported that selectivity for orientation and spatial frequency tuning do not change with stimulus contrast. The prevailing hypothesis is that contrast scales the response magnitude but does not differentially affect particular stimuli. Models where responses are normalized over contrast to maintain constant tuning for parameters such as orientation and spatial frequency have been proposed to explain these results. However, our results indicate that a fundamental property of receptive field organization, spatial summation, is not contrast invariant. We examined the spatial frequency tuning of cells that show contrast-dependent changes in spatial summation and have found that spatial frequency selectivity also depends on stimulus contrast. These results indicate that contrast changes in the spatial frequency tuning curves result from spatial reorganization of the receptive field.
| |
INTRODUCTION |
|---|
|
|
|---|
According to previous studies of
visual responses in mammalian primary visual cortex, V1, orientation,
and spatial frequency tuning of single V1 neurons do not change with
stimulus contrast (Albrecht and Hamilton 1982
;
Bradley et al. 1987
; Li and Creutzfeldt 1984
; Movshon et al. 1978
; Sclar
and Freeman 1982
; Sclar et al. 1990
;
Skottun et al. 1987
). Contrast normalization has been
introduced as a mechanism to account for response scaling with contrast
without changing the response tuning for spatial parameters like
orientation and spatial frequency (Bonds 1989
;
Heeger 1992
; Ohzawa et al. 1985
).
Recently, it has been shown that the interactions between the
"classical" receptive field of a V1 neuron and its surroundings are
influenced by the contrast adaptation state of the neuron (Kapadia et al. 1999
; Levitt and Lund
1997
; Sceniak et al. 1999
). Not only is the
classical receptive field's responsiveness influenced by contrast, but
so too is its spatial extent (Sceniak et al. 1999
).
Therefore the contrast adaptation state of the neuron alters the
spatial properties of the receptive field in a way that is more complex
than simply scaling the gain. We investigated whether other spatial
properties of the receptive field might be dependent on the contrast
adaptation state of the neuron. To do this we examined the spatial
frequency tuning of V1 cells that show contrast-dependent changes in
spatial summation. The result of these experiments is that spatial
frequency selectivity also depends on stimulus contrast. Spatial
frequency tuning bandwidth is consistently smaller at low contrast than
at high contrast (our findings agree with some of the data of
Bradley et al. 1987
on cat visual cortex but not with
the paper's summary and conclusions). The reduction occurs predominately on the high end of the spatial frequency tuning curve.
Spatial frequency preference is relatively unaffected by contrast.
By considering models for signal combination in cortical cells, one can
make predictions about whether the previously observed changes in
spatial summation (Sceniak et al. 1999
) are related to
the contrast-dependent changes in spatial frequency selectivity. Changes in spatial summation along the length axis should not affect
the spatial frequency characteristics, although such changes may
influence the orientation tuning bandwidth. Changes in width summation
might be expected to affect spatial frequency tuning. The effects may
involve either a widening of each subunit of the receptive field or an
increase in the number of subunits of a fixed size.
Simple cell receptive fields have been modeled as a Gabor filter
followed by a static nonlinearity (Daugman 1980
, 1984
,
1985
; DeAngelis et al. 1991
, 1993
;
Field and Tolhurst 1986
; Jones and Palmer
1987
; Kulikowski et al. 1982
; Marcelja
1980
; Stork and Wilson 1990
). A model with
multiple spatially overlapping linear Gabor filters that are rectified
and then pooled as a spatial sum has been used to describe complex
cells (Emerson et al. 1987
; Glezer et al.
1980
; Heggelund 1981
; Movshon et al.
1978b
; Spitzer and Hochstein 1985a
,b
, 1988
;
Szulborski and Palmer 1990
). For either simple or
complex cells, the linear first stage filters' spatial properties
determine the spatial frequency tuning (preference and selectivity) of
the neuron (Spitzer and Hochstein 1988
). If the spatial
envelope of the impulse response of a Gabor filter increases keeping
the subunit size constant, the bandwidth of the frequency response
decreases and vice versa (Fig. 1,
A and B). This property holds for both the simple
cell and complex cell models.
|
Although the Gabor filter model seems to describe the basic subunit structure of a cortical simple cell, it suffers from several limitations. First of all, in such a model the receptive field subregions are all spatially identical. This is the result of modeling all of the subunits with a single sine or cosine function. Moreover, the bandwidth reduction of the Gabor filter when its envelope is increased in width must be symmetric about the peak spatial frequency, contrary to what we observed. Therefore the contrast-dependent changes in spatial frequency tuning we observed could not be easily modeled by adjusting the parameters of Gabor filters.
It has been shown previously that V1 simple cells are better described
by difference of Gaussian (DOG) functions than by Gabor functions
(Hawken and Parker 1987
). One advantage of the DOG model is that it allows for independent manipulation of subunit size and
strength. The DOG model provides a wider range of possible changes in
the spatial frequency tuning than the Gabor model (Wallis 2002
). If the receptive field is modeled as a DOG with
two flanking Gaussian subunits, the resulting profile resembles a Gabor
filter with the added feature that we can alter independently the
height and spread of the subunits. By increasing the spread of the
center Gaussian and also increasing the strength of flanking Gaussians, we can produce a spatial filter with a reduction in response amplitude only at high spatial frequencies and little change in the spatial frequency preference (Fig. 1, E and F). This is
the nature of the change we observe in V1 neurons' spatial frequency
tuning when contrast is reduced (shown in
RESULTS).
| |
METHODS |
|---|
|
|
|---|
Standard electrophysiological recording techniques were used in
acute preparation of macaque monkeys (Hawken et al.
1996
, Sceniak et al. 2001
).
Extracellular action potentials were collected from isolated single
neurons using extracellular microelectrodes. Spikes were analyzed both
during experiments and off-line, using standard software packages and
custom software written specifically for this purpose. Details of the
procedures used in the experiments and the data analysis are given below.
Animal preparation
Acute experiments were performed on adult Old World monkeys
(Macaca fascicularis) in strict compliance with the
guidelines for humane care and use of laboratory animals published by
National Institutes of Health and Public Health Service.
Animals were initially tranquilized with acepromazine (50 µg/kg, im).
After administering the tranquilizer (approximately 15 min), we
anesthetized the animal with ketamine (30 mg/kg, im). Additional
ketamine was given as needed during the initial phase of surgery.
Venous cannulation and tracheotomy were carried out under ketamine, and
we then transferred to an opioid anesthetic sufentanyl (sufentanyl
citrate, 6 µg/kg/h, iv) and maintained anesthesia throughout the
experiment with sufentanyl. A broad spectrum antibiotic (Bicillin,
50,000 iu/kg, im) and anti-inflammatory steroid (dexamethasone, 0.5 mg/kg, im) were given at the initial surgery and every other day during
the recording period. Anesthesia level was determined by analysis of
the EEG waveform, heart rate, blood pressure, and
CO2 output. Anesthetic state was judged to be
satisfactory if there was predominant slow wave EEG activity and if
potentially mildly noxious stimuli produced no change in EEG, heart
rate, or blood pressure. Expired CO2 was
maintained close to 5%. Rectal temperature was monitored and kept at a
constant 37.5°C. A small craniotomy was performed over the striate
cortex for recording. Across animals, the craniotomies were positioned so that the recorded receptive fields displayed a parafoveal
eccentricity between 2 and 5° of visual angle. Anesthesia was
administered throughout the recording period with sufentanyl (6 µg/kg/h, iv) and paralysis was induced with pancuronium bromide (0.1 mg/kg/h, iv). The anesthetic and paralytic were administered in
balanced physiological solution at a rate of 10-20 ml/h. Experiments
were terminated with a lethal dose of pentobarbital (60 mg/kg, iv). The
animal was perfused through the heart with a mixture of heparinized saline followed by 2 liters of fixative (4% paraformaldehyde in phosphate buffer, pH 7.4) for later histological reconstruction. Histological reconstruction was performed using the same methods as
described in Hawken et al. (1988
, 1996
).
Optics
The eyes were initially treated with 1% atropine sulfate
solution to dilate the pupils. The eyes were protected by gas-permeable contact lenses. Prior to adding the lenses, the eyes were treated with
a topical antibiotic (gentamicin sulfate, 3%). Foveae were mapped onto
a tangent screen using a reversing ophthalmoscope (Eldridge
1979
). The visual receptive fields of isolated neurons were mapped on the same tangent screen, keeping reference to the foveae. Proper refraction was achieved by placing corrective lenses mounted in front of the eyes on custom designed lens holders. Refraction adjustments were made during the recording session by
stimulating a responsive cell with a grating with a spatial frequency
near the cutoff frequency. The lens power was adjusted to produce a
maximal response.
Extracellular recording
The electrode was advanced through the gray matter via a
stepping motor (1-µm step size) mounted to a microdrive (Narashige, Japan). Single unit recordings were made with glass-coated tungsten microelectrodes with exposed tips of 5-15 µm
(Merrill and Ainsworth 1972
). The signal was
amplified using a Dagan (MN) EX4-400 differential amplifier
and band-pass filtered (0.1-10 kHz). This analog signal was then sent
to an A/D signal processing board of a digital computer (SGI,
Mountain View, CA). Spikes were discriminated and time-stamped using software custom designed for this purpose and running on a
Silicon Graphics O2 computer. Single spikes were isolated from the
recording, using tailored waveform windowing. Spikes were time stamped
with an accuracy of 1 ms. Strict criteria for single-unit recording
included fixed shape of the action potential and the absence of spikes
during the absolute refractory period.
Experimental protocol
All of the experiments discussed here were conducted using drifting sinusoidal gratings. The optimal stimulus parameters for orientation, size, and temporal frequency were estimated prior to conducting the following experiments. These optimal parameters were used to generate grating stimuli for the spatial frequency experiments. Sinusoidal drifting gratings of the preferred orientation and temporal frequency were presented centered over the receptive field. Each stimulus presentation lasted 4 s. Ten spatial frequencies ranging from 0.1 cycles/° to slightly above the cell's cutoff frequency were presented randomly in logarithmic steps. The contrast was routinely set to be high during these experiments (64-90%). During the characterization of each cell's color properties, we collected spatial frequency response curves using cone isolating stimuli as well as equiluminant red-green and luminance stimuli that were set to the same cone contrast. Cone contrast was equal to the maximal achievable cone contrast of our monitor for equiluminant red-green stimuli (equivalent to 20% luminance contrast). Therefore spatial frequency tuning curves for luminance (black-white) stimuli were estimated at a fixed low (20%) and high contrast (64-90%) for each cell.
In separate experiments, to estimate the spatial frequency bandwidth more accurately, spatial frequency was sampled more densely. Drifting sine wave gratings were presented in a rectangular aperture (4° square) oriented parallel to the cell's preferred orientation and centered over the excitatory receptive field. For cells that exhibit substantial end inhibition (50% or more), the rectangle's length was reduced to exclude the inhibitory zone along the length, but the width was kept fixed at 4°. Each grating patch size was presented for 4 s. Blanks (4 s) of the same mean luminance as the grating stimuli were presented interleaved with grating stimuli to determine the spontaneous firing rate and to avoid response adaptation. The spatial frequency of the drifting grating was varied in a random order. Spatial frequency was sampled logarithmically using 10-12 spatial frequencies centered around the preferred spatially frequency (estimated from previous experiments). The low and high cutoff frequencies were tailored to each cell based on previous spatial frequency characterizations as discussed above. Three repeats of the response to each spatial frequency were collected. By collecting several points around the peak spatial frequency, we were able to make precise estimates of the spatial frequency bandwidth of each cell.
We repeated this procedure at two contrast levels. The contrast levels were taken from the sloping region of the contrast response function of each cell. Therefore the contrast levels are chosen based on the cell's response. Low contrasts were chosen such that they were near the low end of the contrast response function, but elicited responses that were significantly greater than the spontaneous firing rate (2 SDs or more). High contrasts were selected to elicit responses that were <90% of the saturation response for each cell.
Each cell was also tested for spatial summation at multiple contrast levels. The center of the receptive field was carefully located using a small (0.2° diam) circular grating patch. Once the center was located, circular patches of drifting sinusoidal grating were presented and centered over the receptive field. Each grating patch size was presented for 4 s. Four-second blanks of the same mean luminance as the grating stimuli were presented interleaved with grating stimuli to determine the spontaneous firing rate and to avoid response adaptation. The patch sizes were presented in a random order. The radius ranged from 0.1° to 5° of visual angle in logarithmic steps. Each summation curve consisted of 10 radii with two repeats at each size. Contrast levels were held constant during repeats to avoid effects of adaptation. Outside each patch, the rest of the screen (12° × 17° visual angle) was kept at the mean luminance of 56 cd/m2. The contrast levels chosen for low and high contrasts were identical to those used in the spatial frequency experiments.
We repeated the summation experiment using rectangular patches that extend independently in the length or width dimension. The patch length was varied randomly in the same manner described above for the circular patch summation experiments. Then we conducted a similar experiment by varying the width in a similar random fashion. Therefore we acquired area, length, and width summation curves at two contrast levels in three temporally separated experiments.
Receptive field model simulations
To determine the possible interactions of contrast on the
receptive field spatial substructure, we modeled the spatial frequency responses of a theoretical neuron. This was accomplished by two separate models. First, we used a Gabor filter (Daugman 1980
, 1984
, 1985
; Kulikowski et al. 1982
). Next, we
repeated the simulation with a model comprised of a difference of
Gaussians with flanking Gaussian subunits (Hawken and Parker
1987
).
The spatial impulse response of the Gabor filter,
g(x), is defined as
|
is the space constant of the Gaussian
envelope, and
is the subunit spatial phase offset.
For a given spatial stimulus input, s(x), where
|
) is
|
) and G(
) are the Fourier
transform of the input signal s(x) and the Gabor
filter g(x), respectively. The convolved input
signal is then subjected to a static nonlinear threshold such that the
output is given by
|
To determine the effects of nonuniform subunits, receptive fields were
also modeled as a difference of Gaussians (DOG) with flanking Gaussians
(Fig. 1, E and F). Here the central subunit is
modeled as a DOG defined by
|
and
are their
space constants. The flanking Gaussians are defined by
|
is the spatial
displacement of the flanking subunits from the center, and
is their
space constant. The resulting spatial profile would be the sum
|
Data analysis
To quantify the spatial frequency responses, each tuning curve
was fitted using the following empirical function
|
e,
I,
and µ were optimized to provide the least squared error fit to the
data. This function is a difference of Gaussians. The spatial frequency
peak and bandwidth were estimated empirically from the fitted curves
for low and high contrast. Peak spatial frequency was taken as the
spatial frequency that elicits maximal response.
Bandwidth estimates were estimated as the log ratio (in octaves) of the spatial frequencies that elicited half the maximal response for the high-frequency cutoff to the low-frequency cutoff [log2(SFhigh cutoff/SFlow cutoff)]. Bandwidth and peak spatial frequency estimates were taken from the fitted curves for the first harmonic response of simple cells and the DC response of complex cells.
Spatial summation tuning curve analysis
Each spatial summation curve was fitted using the following
empirical function
|
Iceberg analysis
To determine the effects of a response threshold on low contrast
responses, we simulated the "iceberg" effect on the empirical data.
The point of the simulation is to compare the actual empirical responses collected at low contrast to those simulated from the high
contrast responses. Initially the spontaneous firing rate was
subtracted from both the high and low contrast responses. The iceberg
responses at low contrast were constructed by subtracting the
difference between the peak response at high
low contrast. This
produced responses that are matched in peak response by a linear
scaling assumed from the contrast response function. Next, the iceberg
responses are subjected to a response threshold at 0 spikes/s. The
resulting iceberg responses are the prediction of the low contrast
responses. These responses are compared with the actual empirical
responses at low contrast.
| |
RESULTS |
|---|
|
|
|---|
Spatial frequency tuning at low and high contrast
We compared the spatial frequency response function at low and high contrast (Fig. 2, A and B). For a significant number of cells, reducing the contrast causes a reduction in the spatial frequency bandwidth (Fig. 2, A and B). However, the spatial frequency peak is relatively unaffected by contrast. Normalizing the response magnitude to equate the response peaks makes the change in spatial frequency bandwidth more obvious. While some cells show equal reduction on both ends of the spatial frequency tuning curve (Fig. 2A), more cells show a reduction which is stronger on the high end of the tuning curve (Figs. 2B and 3). Therefore there is a bias toward preserving low frequencies and reducing the response to high frequencies as contrast is reduced.
|
|
Population estimates of spatial frequency selectivity at low and high contrast
In our initial study, we collected spatial frequency tuning curves at high (64-99% contrast) and low (20%) contrast, using optimal drifting gratings in a circular aperture. Our sample population includes only those cells that gave responses that were significantly above the spontaneous firing rate (>2 SDs above the spontaneous firing rate and more than 10 spikes/s) and had a maximum response well below response saturation (90% or less of the saturating response as judged from the contrast response function). Each spatial frequency response was fitted with a difference of Gaussians (DOG) empirical function (see METHODS). The spatial frequency optimal value was taken as the spatial frequency eliciting maximal response from the fitted DOG curves. Bandwidth estimates were taken as the log ratio (in octaves) of the spatial frequencies that produced responses that were one-half of the maximum response on either side of the peak (the ratio was defined as the log2 of the high cutoff frequency to the low cutoff frequency, SFhigh cutoff/SFlow cutoff). Mean spike rates were used for complex cells and first harmonic responses for simple cells. Across the population (n = 47, cells with peak spatial frequencies at low contrast below 0.4 c/° were excluded), bandwidth estimates are greater at high contrast than low contrast for both simple and complex cells (Fig. 3A). The mean bandwidth ratio for high to low contrast (BWhigh/BWlow) is 1.24 (the mean is significantly different from unity, Wilcoxon ranked sum test, P < 0.01, Fig. 3C) indicating that the bandwidth is roughly 24% larger at high contrast. For the same population of cells (n = 47), although there is not significant difference in the ratio of spatial frequency peak at high to the peak at low contrast (the mean of the ratios is not statistically different from unity, P > 0.05, Wilcoxon ranked sum test, Fig. 3D), there is a slight trend for peak spatial frequency to be higher at high contrast (Peakhigh/Peaklow = 1.10). Cells showing the largest changes in spatial frequency bandwidth are located predominantly in the lower layers, layers 5 and 6 (Fig. 4A).
|
Initially, all spatial frequency estimates for the cells in Fig. 3, A and B were made at a fixed low contrast (20%). To compare contrast-dependent spatial summation changes to the spatial frequency changes presented here, we collected spatial frequency tuning curves at the same contrast used to estimate spatial summation (n = 19). This reduced population shows a similar trend for spatial frequency bandwidth at high to low contrast (BWhigh/BWlow = 1.7, Fig. 3E) and the population mean is significantly different from unity (Wilcoxon ranked sum test, P < 0.01). Spatial frequency peak or optimal tuning does not vary significantly with contrast (BWhigh/BWlow = 1.10), and the mean is not significantly different from unity (P > 0.05, Wilcoxon ranked sum test, Fig. 3F). The contrast-dependent change in spatial frequency tuning bandwidth is also seen in the lower layers in the small sample (Fig. 4B), but we did not sample enough cells in the middle and upper layers to determine the prevalence of contrast-dependent changes in those layers (n = 14). There are fewer cells in Fig. 4A than Fig. 3, E and F, because not all cells could be assigned a laminar position from the histological reconstruction procedure.
For the more thoroughly studied population of neurons
(n = 19), we compared the change in spatial frequency
cutoff with contrast for the high and low end of the spatial frequency
tuning curve (Fig. 5). The absolute
degree of change in the spatial frequency cutoff with contrast
(low-high contrast) is, on average, smaller for the low-frequency
cutoff (mean = 0.16) than for the high-frequency cutoff (mean =
0.49). The asymmetric effect of contrast on cutoff frequency
suggests that the spatial frequency tuning is not well described by a
Gabor filter with a varying envelope size. Furthermore, although the
spatial frequency peaks vary little with contrast, this does not
necessarily suggest that the spatial spread of the subunits is
unaffected by contrast. A Gabor filter would predict that if the
spatial frequency peak is unaffected by contrast, then the subunit size
is also unaffected. The asymmetric change in spatial frequency
bandwidth taken together with the relative contrast-invariance of the
spatial frequency peak suggested that the spatial impulse response is
more complex than a Gabor filter. As was shown in Fig. 1F,
asymmetric changes in spatial frequency bandwidth coupled with little
to no change in the spatial frequency peak can be explained by a model
where the size and number of subunits both vary with contrast.
|
Spatial frequency selectivity and the iceberg effect
It has been suggested that neuronal response tuning may exhibit
sharpening by thresholding through what is known as the iceberg effect
(Carandini and Ferster 2000
; Sompolinsky and
Shapley 1997
; Volgushev et al. 2000
). Shifting
the response gain of a given tuning curve around a fixed response
threshold will cause responses along the tails of the tuning curve to
fall below the firing rate threshold and become subthreshold. The
remaining "tip of the iceberg" is effectively reduced in bandwidth
(Fig. 6A). To determine
whether such an iceberg effect can explain our results, low contrast
responses were calculated on the assumption that the responses scale
proportional to contrast and that there is a response threshold
simulated from the empirical responses collected at high contrast. The
high contrast responses were reduced in magnitude such that the peak
responses at high and low contrast matched (with the spontaneous firing rate subtracted from both curves, Fig. 6D). This was
accomplished by taking the difference between the peak response at
high
low contrast and subtracting this value from the high
contrast response. Next, the responses were thresholded at zero. This
new curve represents the tuning curve that would result from the
iceberg effect, if we assume that the responses are scaled linearly
with contrast and that there is a response threshold. From these new
curves, we can estimate the hypothetical spatial frequency bandwidth at low contrast and compare it to the actual low contrast spatial frequency tuning curve.
|
We examined our population of neurons (n = 19) for the
iceberg effect and found that most show bandwidth changes that are larger than the changes predicted by the iceberg effect. We compared the empirical responses collected at high and low contrast (Fig. 6,
B and C) with responses simulated from the
iceberg model. Figure 6B shows a sample neuron with a
significant change in the bandwidth ratio for high to low contrast
(BWhigh/BWlow = 1.3, Fig.
6B). Normalizing the responses with respect to the peak
response values at low and high contrast makes the change in bandwidth
more obvious (Fig. 6C). To determine the degree of change in
spatial frequency bandwidth resulting from the iceberg effect, we
simulated the low contrast responses from the empirical high contrast
estimates. First, the spontaneous firing rate was subtracted from the
low and high contrast responses. This response can be normalized to its
peak value to compare it to the empirical estimates at low contrast
(Fig. 6D). The iceberg model does cause a reduction in the
spatial frequency bandwidth (iceberg BWlow = 1.9, Fig. 6D) compared with the high contrast spatial frequency
bandwidth (BWhigh = 2.2, Fig. 6D).
However, the bandwidth ratio at high to low contrast (BWhigh/BWlow) is not as
large for the iceberg model
(BWhigh/BWlow = 1.15, Fig.
6D) as it is for the ratio of the actual data
(BWhigh/BWlow = 1.30, Fig.
6C). Therefore, if there were a significant change in the
cell's firing rate with contrast, it could explain part of the
heightened selectivity for spatial frequency at low contrast. However,
there is a significant portion of the bandwidth change that cannot be
accounted for by the iceberg model. In addition, the iceberg gives a
bandwidth reduction on the high and low frequency limbs of the tuning
curve whereas the data most often shows a substantial change on the
high spatial frequency limb but considerably less change on the low
spatial frequency limb of the tuning curve. Because the spatial
frequency peak does not change much with contrast, it is instructive to
view this analysis on a linear scale where the high-frequency limb of
the tuning curve is not logarithmically scaled (Fig. 6,
E and F). The bandwidth estimates were also
calculated as absolute differences between the spatial frequencies that
produce 50% of the peak response on either side of the peak (BW = SFhigh cutoff
SFlow
cutoff). The results on the linear scale confirm that the
iceberg effect cannot explain the complete amount of change in spatial
frequency bandwidth with contrast
(BWhigh/BWlow = 1.6 for the
actual data and
BWhigh/BWlow = 1.2 for the
estimated bandwidth change in the iceberg model).
To determine whether or not the spatial frequency responses were
subject to significant modification from a response threshold, we
estimated spatial frequency tuning at contrast levels that are
significantly above the contrast response threshold (
2 SDs above the
spontaneous firing rate and >10 spikes/s). There was no significant
dependence on contrast of the spontaneous activity sampled during the
blank runs that were interleaved between stimuli (data not shown).
Therefore it is unlikely that a change in response threshold between
contrast levels contributed significantly to enhanced selectivity
through an iceberg effect.
Spatial frequency selectivity and spatial summation
Spatial frequency tuning predominantly depends on the size and
number of subunits aligned orthogonal to the preferred orientation (DeAngelis et al. 1993
). Therefore contrast-dependent
changes in spatial summation along the receptive field width should
have a direct effect on the spatial frequency tuning.
Contrast-dependent changes in length summation do not relate directly
to spatial frequency tuning. Increasing subunit length might sharpen
orientation tuning while leaving spatial frequency tuning unaffected.
For a representative neuron that displays reduction of the spatial frequency bandwidth at low contrast (Fig.
7A), we show the contrast dependence of length and width summation. Although there is no difference in the optimal length of summation with stimulus contrast (Fig. 7B), there is a significant decrease in the optimal
width of summation at high contrast (Fig. 7C). Such a
decrease in width summation at high contrast is consistent with an
increase in spatial frequency bandwidth at high contrast. However, this
particular cell displays a reduction only on the high-frequency end of
the spatial frequency tuning curve. This example is representative of
the population. Such a pattern of contrast-dependent variation of
spatial frequency tuning cannot be explained solely by a change in the
spatial envelope of a Gabor filter (Fig. 1). It is better described by
a spatial filter where the central subunit expands at low contrast and
also the strength of the flanking subunits increases at low contrast
(see Fig. 1, E and F).
|
We compared the relative changes in spatial summation and spatial frequency bandwidth with contrast for area, length, and width summation (Fig. 8, A-C). For the sample of neurons studied (n = 15), area, length, and width summation each show a reduction in the optimal summation radius, half-length, or half-width, respectively, as contrast is increased (there are fewer cells in Fig. 8 than in Fig. 3E, because we were unable to gather data for all of the summation experiments on all of the original 19 cells). Contrast-dependent changes in width summation are significantly correlated with contrast-dependent changes in the spatial frequency tuning bandwidth (R2 = 0.54, P < 0.01, Fig. 8C). Although circular area summation and length summation both show significant change with contrast for the population of neurons studied (n = 15), changes in neither dimension are significantly correlated with contrast-dependent changes in the spatial frequency bandwidth (for area summation R2 = 0.001 and for length summation R2 = 0.05, Fig. 8, A and B). The lack of correlation between changes in area summation and spatial frequency bandwidth likely results from the fact that the circular-patch area experiments include summation along the length and width of the receptive field. Because cells that show changes in length summation are uncorrelated with changes in spatial frequency bandwidth with contrast, the area experiment includes such length summation effects as well as changes in width summation, resulting in a lack of correlation.
|
| |
DISCUSSION |
|---|
|
|
|---|
The major result of this paper is that reduction of stimulus contrast causes significant sharpening of spatial frequency tuning, and that this sharpening is correlated with expansion of spatial summation at low contrast.
Previously, we showed that spatial summation depends on contrast
(Sceniak et al. 1999
). This result is consistent with
the findings of others (Kapadia et al. 1999
). Spatial
summation is, on average, 2.3 times greater at low contrast than at
high contrast. These contrast-dependent changes in spatial summation
can occur at both the ends and sides of the receptive field. Analysis
with an empirical, DOG, model of spatial summation reveals that
contrast-dependent changes in the optimal radius of summation are not
correlated with contrast-dependent changes in surround strength.
Therefore changes in the classical or excitatory receptive field size
result from changes in excitation rather than from inhibitory sharpening.
It has been suggested that the spatial properties of the classical
receptive field such as spatial frequency selectivity or orientation do
not depend on stimulus contrast (Albrecht and Hamilton 1982
; Bradley et al. 1987
; Li and
Creutzfeldt 1984
; Movshon et al. 1978
;
Sclar and Freeman 1982
; Sclar et al.
1990
; Skottun et al. 1987
). However,
Bradley et al. (1987)
did mention that the spatial
frequency tuning bandwidth was systematically smaller at lower
contrast. It is widely believed that contrast scales response magnitude
with no differential effect on particular stimuli (Heeger
1992
; Robson 1975
). Response
normalization has been proposed to account for contrast-invariant
spatial tuning for properties such as orientation and spatial frequency
(Carandini and Heeger 1994
; Carandini et al.
1997
; Heeger 1992
; Tolhurst and
Heeger 1997a
,b
).
If spatial summation is not contrast-invariant, then other spatial
properties of the receptive field should be expected to change with
spatial summation at different contrast levels. Spatial frequency
selectivity and receptive field spatial spread (envelope) are inversely
related. Using spatial frequency analysis, it has been shown that the
inverse Fourier transform of the frequency response of complex cells
gives spatial impulse responses that are similar in shape to simple
cell's spatial weighting functions (Movshon et al.
1978b
; Spitzer and Hochstein 1985a
,b
). This
suggests that complex cells' spatial selectivity is dominated by
subunits that are similar in structure to simple cells' spatial
weighting functions. These subunits determine the dominant
characteristics of the spatial frequency response function despite
significant nonlinearities in summation (Movshon et al.
1978b
).
To account for our findings about spatial frequency tuning and
contrast, one must postulate that there are contrast-dependent changes
in the spatial spread of individual receptive field subunits as well as
the number and strength of flanking subunits. There is evidence that
subthreshold excitatory regions surround the classical excitatory
receptive field and may form the basis for this recruitment
(Bringuier et al. 1999
). What needs to be explained is
that the change in spatial frequency bandwidth tends to be asymmetric
with more bandwidth reduction occurring at the high spatial frequency
cutoff. Also, there is little change with contrast in the location of
the spatial frequency peak. A simulation of a receptive field composed
of a DOG core with flanking Gaussian subunits (Fig. 1, E and
F), predicts that asymmetric reductions in spatial frequency
bandwidth biased for high frequencies might result from increases in
the subunit size as well as an increase in the number of subunits
within the receptive field.
We compared estimates of the change in optimal length and width summation with contrast to changes in the spatial frequency selectivity for each neuron across our population of V1 neurons. Changes in spatial frequency selectivity are significantly correlated with changes in the extent of summation along the receptive field width, but not length. A change in the spatial structure of the receptive field might explain the contrast-dependent change in spatial frequency selectivity reported here, but this change would only be related to changes in summation along the receptive field width. The correlation between changes in spatial frequency selectivity and width summation observed here are consistent with contrast-dependent spatial reorganization of the receptive field subunits.
Many investigators have found that orientation bandwidth does not vary
much with contrast (including our own data that is not shown). The
changes in receptive field structure that we needed to introduce into
the model to explain the pattern of the spatial frequency results
the
associated increase in envelope spread and carrier period of the Gabor
or size and number of DOG subunits
are the kinds of changes that would
tend to leave orientation bandwidth relatively unchanged. Thus,
although contrast modulates spatial frequency bandwidth, because
contrast affects width summation, it tends to leave orientation
bandwidth invariant.
| |
ACKNOWLEDGMENTS |
|---|
We thank Dr. Dario Ringach, E. Johnson, Dr. Isabelle Mareschal, and A. Henrie for help in data collection and L. Smith for assistance in the histological reconstruction work and help during physiology experiments.
This work was supported by National Eye Institute Grants EY-01472 and EY-08300, core Grant P30 EY-13079, and National Science Foundation-Learning and Intelligent Systems Grant IBN-9720305.
| |
FOOTNOTES |
|---|
Address for reprint requests: M. P. Sceniak, Center for Neuroscience, University of California Davis, 1544 Newton Ct., Davis, CA 95616 (E-mail: mpsceniak{at}ucdavis.edu).
Received 26 November 2001; accepted in final form 29 May 2002.
| |
REFERENCES |
|---|
|
|
|---|