JN Add DOIs to your references at manuscript stage!
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Neurophysiol 100: 239-248, 2008. First published May 14, 2008; doi:10.1152/jn.90273.2008
0022-3077/08 $8.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
100/1/239    most recent
90273.2008v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Malone, B. J.
Right arrow Articles by Ringach, D. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Malone, B. J.
Right arrow Articles by Ringach, D. L.

Dynamics of Tuning in the Fourier Domain

Brian J. Malone and Dario L. Ringach

Department of Neurobiology and Psychology, David Geffen School of Medicine, University of California, Los Angeles, California

Submitted 15 February 2008; accepted in final form 8 May 2008


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Neurons in primary visual cortex (area V1) are jointly tuned to the orientation and spatial frequency of sinusoidal stimuli (the Fourier domain). The role that suppressive mechanisms play in shaping the tuning and dynamics of cortical responses remains the subject of debate. Here we used subspace reverse correlation to study the relationship between suppression by nonoptimal stimuli, the spectral-temporal separability of the responses, and their persistence in time. Two clear relationships emerged from our data. First, cells with inseparable responses were often accompanied by suppression to nonpreferred stimuli, while separable responses showed mostly enhancement by their preferred stimuli. Second, inseparable responses were characterized by a longer persistence in time compared with those with separable dynamics. A parametric model that assumes the additive combination of separable enhancement and suppression signals, with suppression constrained to be low-pass in spatial frequency and untuned for orientation, explained the data well. These new findings, in addition to an established correlation between selectivity and suppression for nonoptimal stimuli, clarify how the dynamics and selectivity of cortical responses are shaped by suppressive signals and how their interplay generates the large diversity of responses observed in primary visual cortex.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Neurons in primary visual cortex are tuned to the orientation and spatial frequency of sinusoidal stimuli (K. K. De Valois et al. 1979Go; R. L. De Valois et al. 1982aGo,bGo; Jones et al. 1987Go; Movshon et al. 1978Go; Webster and De Valois 1985Go). How this selectivity arises in V1 from thalamic inputs that are untuned for orientation and low-pass in spatial frequency, and what role cortical inhibition plays in shaping the tuning of cortical cells, remains a central question in visual neuroscience (Ferster and Miller 2000Go; Priebe and Ferster 2008Go; Shapley et al. 2003Go; Sompolinsky and Shapley 1997Go).

It has long been known that nonoptimal stimuli can suppress firing rates in cortical neurons below their spontaneous levels (Bauman and Bonds 1991Go; Blakemore and Tobin 1972Go; De Valois and Tootell 1983Go; De Valois et al. 1982bGo; Deangelis et al. 1992Go; Jones et al. 2001Go; Monier et al. 2003Go; Morrone et al. 1982Go; Nelson and Frost 1978Go; Sato et al. 1996Go; Sillito 1975Go). It has also been proposed that suppression by nonoptimal stimuli is important in establishing sharp neuronal selectivity (Bauman and Bonds 1991Go; Bonds 1989Go; Ringach et al. 2002Go; Shapley et al. 2003Go; Sillito et al. 1980Go). Detecting suppressive effects in extracellular recordings in response to single drifting sinusoidal gratings is difficult due to low spontaneous rates in V1, but can be readily unmasked by elevating the activity of cells pharmacologically or via a conditioning visual stimulus. Another strategy to tease apart suppressive effects has been to look at the dynamics of tuning with the idea that differential dynamics of enhancement and suppression could lead to inseparable behavior that could unveil the underlying components of the response. We adopted this strategy in previous studies that separately measured the dynamics of orientation and spatial frequency tuning (Bredfeldt and Ringach 2000Go; Ringach et al. 1997Go, 2003Go; Shapley et al. 2003Go). In the orientation domain, our results suggested that global and tuned suppression could generate a narrowing of the orientation tuning bandwidth, create Mexican-hat tuning profiles in the late parts of the response, and induce small but detectable shifts in the preferred orientation of neurons (Ringach et al. 1997Go, 2003Go). In the spatial frequency domain, suppression at low spatial frequencies correlated with a shift in the preferred spatial frequencies from low to high and an accompanying decrease of their bandwidth (Bredfeldt and Ringach 2000Go). In all these cases, suppression generates a temporal inseparability in the tuning curve of the cells.

To gain further insight about how suppression by nonoptimal stimuli affects V1 responses, we measured the dynamics of tuning in the Fourier domain, defined jointly by orientation and spatial frequency (Jones et al. 1987Go; Ringach et al. 2002Go; Webster and De Valois 1985Go). Our goals were to characterize the prevalence of suppression by nonoptimal stimuli in V1 neurons and the degree of spectro-temporal separability of their tuning curves and to test the notion that temporal inseparability and suppression may be correlated with each other.

We found a substantial diversity in the responses: some neurons have temporally separable spectral kernels (Mazer et al. 2002Go), but a sizeable fraction exhibit inseparable dynamics. We also find that the extent to which the kernel is dominated by a response of a single sign (i.e., net enhancement or net suppression relative to the baseline response) correlates with the degree to which its tuning in the Fourier domain is temporally separable, as well as its persistence in time (i.e., the length of the response tail). The observed dynamics in the Fourier domain are well accounted by a simple model that linearly combines two signals, enhancement and suppression, each of which is separable on its own.

These new findings add to an already established correlation between selectivity and suppression for nonoptimal stimuli (Ringach et al. 2002Go) and indicate that both the dynamics and the tuning selectivity of cortical responses are shaped by suppressive signals.


    METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Preparation, recording, and optics

All experiments were approved by the Chancellor's Animal Research Committee at UCLA and were carried out following National Institutes of Health's Guidelines for the Care and Use of Mammals in Neuroscience. Acute experiments are performed on anesthetized and paralyzed adult Old-World monkeys (Macaca fascicularis). Initially, the animal is sedated with acepromazine (30–60 mg/kg), anesthetized with ketamine (5–20 mg/kg im), and transported to the surgical suite. Initial surgery and preparation are performed under isofluorane (1.5–2.5%). Two intravenous lines are put in place. A urethral catheter is inserted to collect and monitor urine output, and an endotracheal tube is inserted to allow for artificial respiration. All surgical cut-down sites are infused with local anesthetic [lidocaine (Xylocaine), 2%, sc]. Pupils are dilated with ophthalmic atropine and custom-made gas-permeable contact lenses are fitted to protect the corneas. After this initial surgery the animal is transferred to a stereotaxic frame. At this point anesthesia is switched to a combination of sufentanil (0.15 µg·kg–1·h–1) and propofol (2–6 mg·kg–1·h–1). We then proceed to perform a craniotomy over primary visual cortex. The animal is paralyzed [pancuronium (Pavulon), 0.1 mg·kg–1·h–1] only after all surgical procedures, including the insertion of the electrode arrays, are complete.

To ensure a proper level of anesthesia throughout the duration of the experiment, rectal temperature, heart rate, noninvasive blood pressure, end-tidal CO2, and electroencephalography (EEG) are continually monitored via an Hewlett-Packard Virida 24C neonatal monitor. Urine output and specific gravity are measured every 4–5 h to ensure adequate hydration. Drugs are administered in balanced physiological solution at a rate to maintain a fluid volume of 5–10 ml·kg–1·h–1. Rectal temperature is maintained by a self-regulating heating pad at 37.5°C. Expired CO2 is maintained between 4.5 and 5.5% by adjusting the stroke volume and ventilation rate. The maximal pressure developed during the respiration cycle is monitored to ensure that there is no incremental blocking of the airway. A broad-spectrum antibiotic [penicillin G (Bicillin), 50,000 IU/kg] and anti-inflammatory steroid (dexamethasone, 0.5 mg/kg) are given at the beginning of the experiment and every other day.

In some experiments, a 10 x 10 electrode array (Cyberkinetics, Salt Lake City, UT) with 1- or 1.5-mm-long electrodes was implanted in primary visual cortex. The center of the array was aimed at 6 mm posterior to the lunate sulcus and 8 mm lateral to the midline. In other experiments, extracellular action potentials were recorded with an array of independently movable glass-coated tungsten microelectrodes with exposed tips of 5–15 µm (Alpha-Omega Engineering). Electrical signals were amplified and spikes discriminated using a Cerebus 128-channel system (Cyberkinetics). Spike sorting was performed off-line using principle-component analysis on the waveform shapes with software developed in our laboratory.

Stimuli were generated on a Silicon Graphics O2 and displayed on monitor at refresh rate of 100 Hz and a typical screen distance of 80 cm. The mean luminance was 60 cd/m2. A Photo Research Model 703-PC spectro-radiometer was used for calibration. The eyes were initially refracted by direct ophthalmoscopy to bring the retinal image into focus for a stimulus roughly 80 cm from the eyes. Once neural responses were isolated, we measured spatial frequency tuning curves and maximized the response at high spatial frequencies by changing external lenses in steps of 0.25 D. This procedure was performed independently for both eyes.

Kernel estimation, data selection, and model fitting

We measured the dynamics of tuning in the joint spatial frequency and orientation plane using the subspace reverse correlation technique employed by Ringach et al. (2002)Go. The visual stimulus is a sequence of luminance modulated sine wave gratings varying in orientation, spatial frequency, and spatial phase, presented in pseudorandom order at an effective rate of 50 Hz (each frame is repeated twice with a monitor refresh rate of 100 Hz). Each image in the stimulus set consisted of a Hartley basis function of size M x M pixels, such that Formula for all 0 ≤ l, m ≤ M – 1. Here, cas {theta} Formula sin {theta} + cos {theta} and kx and ky represent the spatial frequency of the grating in units of cycles per stimulus side. The stimulus set consisted of all Hartley basis functions Formula such that |kx| ≤ kmax and |ky| ≤ kmax (with the origin H0,0 excluded). The maximum spatial frequency for the Hartley basis set was chosen to exceed the range of spatial frequencies (SFs) that elicited responses during initial characterizations with drifting gratings. In the Fourier plane, stimuli with the highest SFs lying along the perimeter of the stimulus domain (the edges bounding the kernel) can be used to estimate a baseline firing rate for the cell for noneffective stimuli, as they are outside the "window of visibility" for the neuron at hand (Ringach et al. 2002Go) (Fig. 1B). The maximum tested spatial frequency in these experiments (attained at the corners of the subspace tested, see Fig. 1B) varied from 2.3 to 19.6 cycle/°, with a median of 8.5. The number of unique images in the stimulus set varied from 160 to 5616 with a median of 1,248; with four spatial phases for each orientation/spatial frequency, this means that the typical experiment had roughly 312 unique combinations of spatial frequency and orientation. The stimulus contrast in all experiments was 99%. Stimuli were spatially extended and were about two to three times larger than the classical receptive field of the neurons. In general, the stimulus was 4 x 4° in size to allow the coverage of all receptive fields under measurement with the electrode array. These receptive fields scattered over an area of 1 deg2 and had average linear size of 1/2°.


Figure 1
View larger version (36K):
[in this window]
[in a new window]

 
FIG. 1. Spectro-temporal kernels and a parametric model. A: the relative probability of occurrence of stimuli of varying spatial frequency and orientation is computed for different delay times between a neuronal spike and the stimulus. B: these data can be organized in the Fourier domain. The panel illustrates the resulting calculation for 1 particular time delay. The origin of the Fourier domain coincides with the center of the image. For each pixel, spatial frequency is given by the distance from the origin. The distance from the origin in pixels translates into to the number of cycles per stimulus side. The angle with respect to the positive x axis represents the orientation of the grating. In all depictions of spectral kernels in this paper, the results are symmetric around the origin. C: a 2-component model where a symmetric enhancement signal and a low-pass, nonoriented suppressive signal are summed to approximate the response. In this example, the optimal fit to the data shown in B are shown separately along with the actual prediction resulting from their sum.

 
For each neuron, we generated an estimate of the spectral kernel by computing the probability that a grating with spatial frequency {omega}, and orientation, {theta}, preceded a spike by {tau} ms: p({omega};{theta};{tau}) (Fig. 1, A and B). The responses to the different spatial phases were averaged for each combination of orientation and spatial frequency (Ringach et al. 2002Go). However, we used spatial phase information to assign all of the cells in the database a spatial-phase modulation index as described by Nishimoto et al. (2005)Go. For a given orientation and spatial frequency, all four spatial phases responses are combined to compute the modulation index (MI) MI = 2(|R0 R180| + |R0R180|)/|R0 + R90 + R180 + R270|, where the subscript indicates the spatial phase of the four different gratings for the optimal orientation and spatial frequency combination. Cells with a modulation index >1 were defined as simple cells, whereas cells with a modulation index <1 were defined as complex. A previous study showed that this measure correlates very well with the classical F1/F0 ratio obtained by stimulation with drifting gratings (Nishimoto et al. 2005Go).

A typical spectral kernel obtained via subspace reverse correlation at the optimal time delay is shown in Fig. 1B. Each pixel corresponds to the response to a specific combination of orientation and spatial frequency. The relative probabilities of occurrence were normalized by the median value of those in the perimeter of the region (indicated by "baseline" in Fig. 1B). The logarithm of this ratio was taken to provide a measure of discriminability of each stimuli from those in the baseline set (Ringach et al. 2002Go). The kernels, normalized in this way, are denoted by R({omega};{theta};{tau}) = log[p({omega};{theta};{tau})/p(baseline; {tau})]. Thus one can consider the kernels as representing the log-likelihood of a stimulus preceding a spike at different time delays. In the representation of Fig. 1B the radial distance from the origin (the center of the image) represents spatial frequency and the angle with respect to a line running horizontally through the origin represents orientation. Red regions indicate stimuli that induce the cell to fire more than the baseline, whereas blue regions correspond to stimuli that induce the cell to fire less than the baseline response.

The kernel norm at a fixed time delay is defined as the L2 norm of R({omega};{theta};{tau}) denoted by ||R({omega};{theta};{tau})||. We define the signal-to-noise ratio (SNR) for each kernel as SNR = ||R({omega};{theta};{tau})||/E{||R||}, where E{||R||} is the average baseline value of the norm obtained at time delays that are uncorrelated with the response ({tau} > 150 ms). Only cells with SNR > 7 were included in the population, and the median SNR was 44.1. In addition to the foregoing SNR criterion, we measured the degree to which the responses were well contained within the subspace. For example, if the responses reached the boundaries of the subspace, it would mean that we had poorly chosen our maximal spatial frequency. To obtain a quantitative measure of the spillover of responses onto the boundary, we first computed the sum of the absolute value of R({omega};{theta};{tau}peak) over the boundary region, then we selected from the interior region the stimuli that achieved the top K absolute values (where K is the number of stimuli in the boundary area), and finally we took the ratio of the latter by the former. As a criterion we selected a minimum ratio of 1.75, which resulted in a median of 4.6 for the selected population. Using these criteria, data from 564 cells from 19 animals were included in the analysis.

The optimal time delay, {tau}peak, was defined as the time lag at which the kernel norm peaked. Onset and decay times were defined as the points closest to this optimal time such that the norm exceeded four times the SD of the baseline value. Across the sampled population, the distributions of onset, optimal, and decay times had medians of 36, 58, and 100 ms, respectively. The large value for the median decay time reflects the fact that many kernels exhibited significant response components at long delays (long tails). We refer to the interval bracketed by the early and late delay times as the analysis interval. The selection of the analysis interval was intended to ensure that our analyses spanned the duration of the response but avoided fitting the noise at long or short time delays.

The spectro-temporal separability of the kernels was assessed by singular value decomposition (SVD) of the kernel matrix within the analysis interval (Mazer et al. 2002Go). The three-dimensional spectral kernel R({omega};{theta};{tau}) was rewritten as a matrix with the spectral coefficients at different time delays arranged in different columns. A singular value decomposition was performed on this matrix yielding a spectrum {lambda}i of decreasing eigenvalues. The relative variance accounted for each of the individual components is given by SVDi = {lambda}i2/{Sigma}i{lambda}i. Thus for a kernel that is spectro-temporally separable, a single component is sufficient to account for the response and SVD1 ~ 1. Thus we define the separability index as SVI1. It should be noted that this estimate of separability applies to the entire spectral kernel bracketed by the analysis interval and that the choice of the analysis interval affects the resulting estimates of separability. Delimiting the analysis interval with respect to the noise floor was intended to ensure that changes in the spectral kernel over time reflect genuine response dynamics rather than fluctuations due to noise in the data. The results of the SVD analysis can also be used to estimate the best separable model of the spectral kernel in the least squares sense. As we will see, investigating the structure of the residuals of the optimal separable model provided some insights into the features of the responses that were left unexplained.

Parametric analysis

In addition to the optimal separable model derived from the SVD decomposition, we also used a parametric model to fit the data. The model assumes the responses are the result of a linear combination of two signals. Each signal is temporally separable as they are constructed by the product of a fixed spectral tuning and a temporal response function. One of the signals is positive (enhancement) and the other is negative (suppression). The spectral tuning of enhancement is given by a two-dimensional Gaussian function centered at a given preferred location on the Fourier domain (as would be expected from a 2-dimensional Gabor receptive field; Fig. 1C, left). The suppressive component is constrained to be isotropic and centered at the origin (Fig. 1C, middle). The model also included a baseline constant which was always much smaller than the two other components and is not discussed further here. The sum of these two components provided an adequate fit to most of our data.

In mathematical form, the model is expressed as

Formula

Here {omega} = ({omega}x,{omega}y)T is the spatial frequency vector. The first term represents the enhancement component centered around ±{omega}0 with a covariance matrix of {Sigma}E. The covariance matrix is unconstrained, allowing for the elongation of the enhancement signal along different directions. In a simple linear Gabor receptive field, this could occur if the carrier and the envelope do not share a common axis (Jones and Palmer 1987Go). The amplitude of the enhancement at any one time is given by A({tau}). The second term represents suppression centered at the origin, with a SD given by {sigma}s. The amplitude of the suppressive component at any one time is given by B({tau}). Finally, the third term, C({tau}) represents a baseline.

To fit the model, we first found the optimal parameters of the spectral tuning at the optimal time delay. We then fixed the shape of the two components and found the weights for the enhancement and suppressive components that best accounted for the response at each time slice. This results in a nonparametric estimate of the temporal profiles for each component. We then iterated the procedure by fixing the temporal profiles and finding the best shape for the components that explained the entire kernel. It turned out that, in practice, this step did not usually modify the results of the first iteration significantly, so the first estimates were taken as the best possible fit. The quality of the fit was assessed by computing the fraction of the total kernel variance explained by each model throughout the analysis interval.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
We observed a diversity of dynamical responses in the Fourier domain, as conveyed by the examples in Fig. 2. The kernels are presented at time lags for which their norms are a fixed fraction of the one attained at the optimal delay time. The selected fraction levels appear at the top of Fig. 2, and time proceeds from left to right. The frames occurring after the optimal delay are shown at finer temporal separations because, as we will see in the analysis in the following text, the responses exhibit more complex dynamics in their tails.


Figure 2
View larger version (74K):
[in this window]
[in a new window]

 
FIG. 2. Examples of spectro-temporal dynamics. Each row displays a series of time slices of the spectro-temporal kernel at the delays indicated by the inset. The numbers at the top of each column indicate the values of the kernel norm with respect to the peak value at the optimal delay, normalized to 1. The separability indices for each response are indicated on the right side.

 
Kernels that were dominated by net enhancement in the entire Fourier domain over the full duration of the response were commonly observed (Fig. 2A). Less frequently we also observed responses that were dominated by net suppression (Fig. 2B). It was normal to find areas of net suppression developing later in the response. Figure 2C shows an example where net enhancement dominates early while net suppression dominates in the late phase of the response. The most prevalent shape of the dynamical response is shown in Fig. 2D. Here there is an initial net enhancement in a restricted region of the Fourier domain close to the origin (the center of the image). As the response progresses, one observes net suppression developing at orientations orthogonal to the preferred one and at the origin. At the optimal delay time, both enhancement and suppression have a characteristic bow-tie shape. Later in time, the response is dominated by net suppression. It can also be seen that the peak spatial frequency of enhancement shifts from lower to higher spatial frequencies as if being "pushed away" by the development of suppression at lower spatial frequencies. A similar example is shown in Fig. 2E. From these examples, it appears that situations where the relative dominance of enhancement and suppression shifts during the response (as in these 2 examples) might be related to the inseparability of the responses. Finally, the example in Fig. 2F was included to demonstrate that regions of enhancement can sometimes be surrounded by suppression at higher spatial frequencies.

We computed the separability of V1 spectral kernels via singular value decomposition (SVD) (see METHODS). The method allowed us to compute the percentage of variance in the spectral dynamics that could be accounted for by an optimal separable model (by including only the 1st component) as well as the best possible combination of two separable components (by including the 1st 2 components). We note, however, that the latter method does not necessarily yield components with opposite signs, meaning that they cannot be readily interpreted as separate enhancement/suppression signals. The separability indices for the kernels in Fig. 2 are indicated to the right of each sequence.

Across the population, the mean fraction of variance accounted for the optimal separable model was 81% (Fig. 3A, {blacksquare}). The second component on its own accounted for an additional 11% of the variance on average (Fig. 3A, {blacksquare}). Both components together, therefore, accounted for 92% of the dynamic structure of the kernels (Fig. 3A, {square}). There was no correlation between the relative variance accounted for by the first component and the modulation index of the cell, meaning that the degree of separability was not statistically correlated with cell class (simple or complex; r = –0.07; P > 0.05).


Figure 3
View larger version (14K):
[in this window]
[in a new window]

 
FIG. 3. Separability and response valence ratio correlate. A: the histograms in a illustrate the fraction of the total kernel variance explained by the 1st and 2nd singular values obtained from the singular value decomposition (SVD) of the spectral kernel as well as their sum. B: the relative dominance of enhancement and suppression correlate with the separability of the responses. The response valence ratio (RVI) ranges from –1 to 1, where –1 indicates a kernel comprised entirely by suppression, 1 indicates pure enhancement, and 0 indicates a perfect balance of the two. The less separable responses are the ones where there is a balance between net enhancement and suppression across the response.

 
Our discussion of the examples in Fig. 2 suggested that a possible reason for temporal inseparability is the different tuning and timing of enhancement and suppression. For example, the two kernels that are most separable are either entirely enhanced (Fig. 2A) or entirely suppressed (Fig. 2B), whereas those that show early enhancement and late suppression are more inseparable. We thus asked if spectro-temporal inseparability, as measured by the amount of variance explained by the first principal component, correlates with a measure of overall dominance by one response sign. To do so we defined the response valence index by RVI = (vevs)/(ve + vs), where ve = var{[R({omega};{theta};{tau})]+}, vs = var{[–R({omega};{theta};{tau})]+}, and [x]+ represents half-way rectification. The variance is computed over the entire spectro-temporal analysis window, so these represent measures of total net suppression and enhancement. An RVI near +1 implies that the response is dominated by enhancement; an RVI near –1 implies a response dominated by suppression; and an RVI near 0 implies net enhancement and suppression are well balanced over the entire response period (though one or the other may dominate at different time delays).

When the response valence index is plotted against the separability index (the amount of variance accounted for by the 1st component alone), the points are distributed in the form of a "V" (Fig. 3B), indicating that their kernels are least separable when enhancement and suppression have roughly equivalent magnitudes. In other words, the separability of V1 spectral kernels is correlated with the balance of net enhancement and suppression. This is verified by the fact that the absolute value of the response valance index was significantly correlated with the separability index (r = 0.58; P < 10–10). Also, note that the distribution of the response valance index makes it evident that while there is a wide range of dynamical responses, there is a preponderance of cells dominated by enhancement.

Having shown that the net balance of enhancement and suppression across the entire response is correlated with the separability of the responses, we attempted to characterize the time courses of these components. To do so, we defined a measure of net enhancement in the Fourier domain at a given time delay as {surd}ve and, similarly a measure of net suppression as {surd}vs. Here the quantities were computed across the Fourier domain for each time delay. We independently normalized these values by subtracting a baseline estimated at time delays where the response and the stimulus were uncorrelated, and then divided by the norm of the full kernel at the optimal time. To average across cells, we first aligned the curves at the optimal delay time.

We performed the analysis for our population by splitting responses into those that were dominated by enhancement (RVI > 0; Fig. 4A) and those dominated by suppression (RVI < 0; Fig. 4B). In both groups of cells, it is evident that suppression peaked after enhancement (Fig. 4, A and B). Therefore even when suppression dominates, the response enhancement is seen as preceding.


Figure 4
View larger version (35K):
[in this window]
[in a new window]

 
FIG. 4. Average time course of kernel norms. A and B: average (normalized) time courses of the kernel norm (blue), net enhancement (red), and suppression (green), are shown for cells that were dominated by enhancement (A) or suppression (B). The average behavior shows a delayed suppression for both cell groups. A similar analysis for the optimal separable model is shown in C and D. Error bars are twice SE.

 
The differential timing of enhancement and suppression, of course, cannot be captured by a separable model. This is made clear by repeating these calculations for the optimal separable fits obtained in the SVD analysis (Fig. 4, C and D). As these curves are based on kernels that are by construction separable, the time courses of enhancement and suppression differ only in their magnitude and not their shape. There is, furthermore, an interesting difference between the two cell groups that can be seen in these fits. The optimal separable fits to the kernels dominated by suppression have long tails that extend ≤40 ms after the response peak. In contrast, responses that are dominated by enhancement have a more symmetric temporal profile and tails decay within 25 ms of the peak of the responses. This suggested to us that an increase presence of suppression may cause longer tails and therefore a more asymmetric profile of the temporal time course of the response.

To investigate if the temporal shape of the kernel was indeed more asymmetric and had longer tails for the responses dominated by suppression, we performed an additional analysis. We first found, for each cell, the best Gaussian fit to the kernel norm as function of time delay. The quality of this fit was assessed by the percentage of variance it explained. We observed that the average residuals from these fits showed structure that concentrated in the late part of the response, around 20–60 ms after the optimal time delay (Fig. 5A). This implies that the initial phase of the responses ≤10–20 ms after the peak were roughly Gaussian in shape, but that some cells deviated at longer delays.


Figure 5
View larger version (12K):
[in this window]
[in a new window]

 
FIG. 5. Separability correlates with the shape of the temporal response. A: residuals from a Gaussian fit to the temporal shape of the response (the norm of the kernels as a function of time delay) show maximum deviation in the tails (20–40 ms after the peak response). Error bars are twice SE. B: the quality of the Gaussian fit (and indication of the symmetry of the responses) correlates with the separability index. Cells with heavy tails tend to be more inseparable than those with symmetric temporal responses.

 
Further, the deviation of the temporal kernels from normality, as assessed by quality of the fit, correlated with the separability index of the responses (Fig. 5B; r = 0.49; P < 10–10). In addition, as one would expect from the relationship between separability and the response valance ratio (Fig. 3B), there was also a significant but weaker correlation between the deviation from normality and the response valance ratio (r = 0.22; P < 2·10–6). These results indicate that responses with balanced suppression and enhancement tended to have longer tails and be less separable than responses dominated by one response sign.

It should be noted that regions of enhancement and suppression in our data represent areas of net enhancement and suppression. The tuning of these signals in the Fourier domain can certainly overlap to generate the measured kernels. We wanted to investigate if a parametric model, where enhancement and suppression had simple shapes, could explain some the dynamics of responses in the Fourier domain. The simplest model is one where enhancement has a spectral tuning consistent with that of a Gabor filter. In this case, the spectral tuning is the sum of two Gaussian functions symmetric across the origin. Suppressive influences from within the classical receptive field have often been described as being broadly tuned in spatial frequency and untuned for orientation (Bauman and Bonds 1991Go; Bonds 1989Go; Deangelis et al. 1992Go). Thus we opted to model suppression as a single Gaussian centered at the origin of the Fourier plane. The model assumes that both enhancement and suppression are separable in time and that they add linearly. This model was adequate, on average accounting for 71% of the variance of the responses (Fig. 6).


Figure 6
View larger version (101K):
[in this window]
[in a new window]

 
FIG. 6. Examples of spectral kernels and their separable and parametric fits for 3 neurons. The examples shown in A–C correspond to Fig. 2, C–E, respectively. In each set of panels, the actual kernel is shown in the top row, the separable model in the second, the parametric model as the bottom row. As in Fig. 2, the columns indicate the magnitude of the kernel norm, relative to the peak, for the kernel frame that is shown.

 
The model fits were used to estimate the relative magnitude and time course of the underlying enhancement and suppression components. To do this, we computed the population average of their magnitudes after normalizing to the maximum absolute value between the two. Cells were split into enhancement or suppression dominated as above. This analysis showed that the magnitudes of the two components inferred from the model were well balanced with each other for both groups of cells (Fig. 7). Consistent with the results of the nonparametric analysis, peak enhancement precedes suppression in both cases by a few milliseconds. Similar results were also obtained when the population averages were computed for cells with the highest and lowest separability indices (SI > 0.9 and SI < 0.7; data not shown). This indicates that balance of the magnitudes of suppression and enhancement was rather typical of the responses. The variability in the degree of net suppression and inseparability seen in the data, therefore must be generated by a difference in the tuning and timing of the components in the Fourier domain and not by a strong imbalance in the underlying magnitude of the two signals.


Figure 7
View larger version (13K):
[in this window]
[in a new window]

 
FIG. 7. Temporal profiles of enhancement and suppression components of the parametric model. The coefficients of the parametric models were each normalized to the peak of the larger of the 2 (enhancement or suppression), aligned with respect to the peak of actual kernel norm for the cell, and averaged over the population. A: results from responses dominated by enhancement. B: results from responses dominated by suppression. Error bars are twice SE. It can be seen that the estimated magnitudes of the 2 components are well balanced in both classes of neurons.

 

    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Here we showed two novel relationships between suppressive signals and the dynamics of V1 neurons. First, there is a correlation between temporal separability in the Fourier domain and the dominance of net suppression or enhancement in the responses (Fig. 3B). A different way to express this fact is that when responses are dominated by net enhancement (or, in a minority of cells, by net suppression), they tend to be separable. Inseparable responses, on the other hand, exhibit both net enhancement and suppression in different regions of the Fourier domain. Second, inseparable responses tend to have longer tails than those that are separable (Fig. 5). These new data add to an already established correlation between selectivity in the Fourier domain and the presence of net suppression: cells that are suppressed by nonoptimal stimuli are better tuned than cells with only an enhanced response (Ringach et al. 2002Go). Altogether these findings show that the balance among enhancement/suppression, tuning selectivity, temporal separability, and persistence in time are factors that are closely linked in the dynamics of V1 neurons.

The observed relationships can be understood within the context of the conceptual two-component model where a suppressive component, delayed in time, shapes the initial tuning of the enhancement signal. In the process, suppression not only increases the selectivity of the cell (by restricting the region in the Fourier plane that induces the cell to increase its firing rate) but also introduces, due to its different tuning in the Fourier domain, an associated temporal inseparability. We note that while the majority of the kernels could be adequately accounted for using a broadly tuned suppressive signal, there is no denying that there are clear instances where tuned suppression is necessary to explain the responses. This is particularly the case for extended stimuli that cover both the classical receptive field and its surround (Blakemore and Tobin 1972Go; Ringach et al. 2003Go; Xing et al. 2005Go) and is most prevalent in well-tuned neurons (Ringach et al. 2003Go).

Modeling the responses as the sum of two separable enhancement/suppression components allowed us to tease apart their magnitude and temporal dynamics. This simple model is meant to be interpreted as a convenient and succinct summary of the dynamical responses and cannot be directly mapped onto the different components of the underlying circuitry. In particular, there might be a number of different mechanisms contributing to the generation of a broadly tuned suppression signal. It may originate from saturation in the LGN (Li et al. 2006Go; Priebe and Ferster 2006Go), from short-term synaptic depression in thalamo-cortical synapses (Carandini et al. 2002Go; Freeman et al. 2002Go) [although recent data suggest that synaptic depression is nearly saturated in vivo, limiting its potential role (Boudreau and Ferster 2005Go)] and from intra-cortical inhibition (Bauman and Bonds 1991Go; Bonds 1989Go; Deangelis et al. 1992Go; Hirsch et al. 2003Go; Morrone et al. 1982Go; Troyer et al. 1998Go). These factors are not mutually exclusive and may all combine with different weights. One intriguing clue that points to a cortical contribution is the fact that responses with longest tails, which as we showed here are the ones that tend to be inseparable, have been largely found outside the input layers of V1 (Ringach et al. 1997Go; Schummers et al. 2007Go). This suggests that studying the dynamics in the Fourier domain in the different cortical layers may shed some light on the contributions of feed-forward thalamic input (such as LGN saturation, synaptic depression, and LGN inseparability), and those that require further cortical elaboration (Monier et al. 2003Go; Ringach et al. 1997Go; Williams and Shapley 2007Go). In addition, the dynamics in the Fourier domain may depend on the location of neurons within the orientation map or, more generally, on the tuning properties of neurons within its immediate neighborhood (Nauhaus et al. 2008Go; Schummers et al. 2002Go, 2004Go). A better understanding of how the diversity of dynamical responses depend on laminar location and the tuning properties of neurons in the neighborhood could help specify the contribution of feed-forward inputs and intra-cortical feedback to the dynamics of cortical responses.

Prior studies have confirmed the presence of some of the inseparable features we observed in the Fourier domain, such as the sharpening in orientation bandwidth, shifts in preferred spatial frequency, and shifts in preferred orientation (Chen et al. 2005Go; Mazer et al. 2002Go; Schummers et al. 2007Go), but some conflicting data exist (Gillespie et al. 2001Go; Mazer et al. 2002Go; Sharon and Grinvald 2002Go). One previous study explicitly quantified separability in the Fourier domain, reporting the kernels to be largely separable (Mazer et al. 2002Go). One key difference between the studies is the way the data are normalized (see a detailed discussion in Ringach et al. 2003Go). Indeed when we repeat the SVD analyses on the raw data, p({omega};{theta};{tau}), instead of the normalized log-likelihood responses, R({omega};{theta};{tau}), we obtain separability indices with a mean of 94%, which is very similar to what they reported. We note that despite such high separability indices, these authors also found a clear shift in the optimal spatial frequency of neurons (an inseparable feature of the dynamics). Mazer et al. (2002)Go did not report signs of suppression for nonpreferred stimuli. This prior study was carried out in awake animals, whereas ours was done in anesthetized animals. One possibility is that the presence of fixation noise would make suppression more difficult to detect in the awake preparation. Another possibility is that suppression is enhanced in the anesthetized preparation. A recent study in (anesthetized) cat visual cortex used a dynamic noise pattern to stimulate the cells and computed the average local spectrum of the stimulus that correlated with the spiking of the neuron (Nishimoto et al. 2006Go). While some of the measured responses resemble the ones reported here (see their Fig. 10), they comprised a substantially smaller proportion of the cases. It is likely the differences here are due to differences in the class of stimuli used. The finely grained dynamic noise used in Nishimoto et al. (2006)Go could fail to efficiently drive suppressive mechanisms that, in our hands, are centered at low spatial frequencies.

To summarize, the observed relationships among tuning selectivity (Ringach et al. 2002Go), separability, and persistence in time of V1 responses can be explained by the combination of two signals that are differentially tuned in the Fourier domain and have different temporal dynamics. Further studies are needed to explain the circuitry underlying these two components and to investigate if the substantial variability in the dynamics could be partly explained by the laminar locations of cells or their position within the functional maps of the cortex.


    GRANTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
This work was supported by National Eye Institute National Research Service Award EY015365-01 and National Eye Institute Grants EY-12816, and EY-18322 and Defence Advanced Research Projects Agency FA 8650-06-C-7633.


    ACKNOWLEDGMENTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
We thank I. Nauhaus for helpful comments regarding an earlier version of the manuscript.


    FOOTNOTES
 
The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

Address for reprint requests and other correspondence: D. Ringach, Dept. of Neurobiology, University of California, Los Angeles, CA 90095-1563 (E-mail: dario{at}ucla.edu)


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Bauman LA, Bonds AB. Inhibitory refinement of spatial-frequency selectivity in single cells of the cat striate cortex. Vision Res 31: 933–944, 1991.[CrossRef][Web of Science][Medline]

Blakemore C, Tobin EA. Lateral inhibition between orientation detectors in the cat's visual cortex. Exp Brain Res 15: 439–440, 1972.[Web of Science][Medline]

Bonds AB. Role of inhibition in the specification of orientation selectivity of cells in the cat striate cortex. Vis Neurosci 2: 41–55, 1989.[Web of Science][Medline]

Boudreau CE, Ferster D. Short-term depression in thalamocortical synapses of cat primary visual cortex. J Neurosci 25: 7179–7190, 2005.[Abstract/Free Full Text]

Bredfeldt CE, Ringach DL. Dynamics of spatial frequency tuning in macaque V1. Perception 29: 60–60, 2000.

Carandini M, Heeger DJ, Senn W. A synaptic explanation of suppression in visual cortex. J Neurosci 22: 10053–10065, 2002.[Abstract/Free Full Text]

Celebrini S, Thorpe S, Trotter Y, Imbert M. Dynamics of orientation coding in area V1 of the awake primate. Vis Neurosci 10: 811–825, 1993.[Web of Science][Medline]

Chen G, Dan Y, Li CY. Stimulation of non-classical receptive field enhances orientation selectivity in the cat. J Physiol 564: 233–243, 2005.[Abstract/Free Full Text]

De Valois KK, De Valois RL, Yund EW. Responses of striate cortex cells to grating and checkerboard patterns. J Physiol 291: 483–505, 1979.[Abstract/Free Full Text]

De Valois KK, Tootell RB. Spatial-frequency-specific inhibition in cat striate cortex cells. J Physiol 336: 359–376, 1983.[Abstract/Free Full Text]

De Valois RL, Albrecht DG, Thorell LG. Spatial frequency selectivity of cells in macaque visual cortex. Vision Res 22: 545–559, 1982a.[CrossRef][Web of Science][Medline]

De Valois RL, Yund EW, Hepler N. The orientation and direction selectivity of cells in macaque visual cortex. Vision Res 22: 531–544, 1982b.[CrossRef][Web of Science][Medline]

Deangelis GC, Robson JG, Ohzawa I, Freeman RD. Organization of suppression in receptive fields of neurons in cat visual cortex. J Neurophysiol 68: 144–163, 1992.[Abstract/Free Full Text]

Ferster D, Miller KD. Neural mechanisms of orientation selectivity in the visual cortex. Annu Rev Neurosci 23: 441–471, 2000.[CrossRef][Web of Science][Medline]

Freeman TCB, Durand S, Kiper DC, Carandini M. Suppression without inhibition in visual cortex. Neuron 35: 759–771, 2002.[CrossRef][Web of Science][Medline]

Gillespie DC, Lampl I, Anderson JS, Ferster D. Dynamics of the orientation-tuned membrane potential response in cat primary visual cortex. Nature Neuroscience 4: 1014–1019, 2001.[CrossRef][Web of Science][Medline]

Hirsch JA, Martinez LM, Pillai C, Alonso JM, Wang QB, Sommer FT. Functionally distinct inhibitory neurons at the first stage of visual cortical processing. Nat Neurosci 6: 1300–1308, 2003.[CrossRef][Web of Science][Medline]

Jones HE, Grieve KL, Wang W, Sillito AM. Surround suppression in primate V1. J Neurophysiol 86: 2011–2028, 2001.[Abstract/Free Full Text]

Jones JP, Palmer LA. An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. J Neurophysiol 58: 1233–1258, 1987.[Abstract/Free Full Text]

Jones JP, Stepnoski A, Palmer LA. The two-dimensional spectral structure of simple receptive fields in cat striate cortex. J Neurophysiol 58: 1212–1232, 1987.[Abstract/Free Full Text]

Li B, Thompson JK, Duong T, Peterson MR, Freeman RD. Origins of cross-orientation suppression in the visual cortex. J Neurophysiol 96: 1755–1764, 2006.[Abstract/Free Full Text]

Mazer JA, Vinje WE, McDermott J, Schiller PH, Gallant JL. Spatial frequency and orientation tuning dynamics in area V1. Proc Natl Acad Sci USA 99: 1645–1650, 2002.[Abstract/Free Full Text]

Monier C, Chavane F, Baudot P, Graham LJ, Fregnac Y. Orientation and direction selectivity of synaptic inputs in visual cortical neurons: a diversity of combinations produces spike tuning. Neuron 37: 663–680, 2003.[CrossRef][Web of Science][Medline]

Morrone MC, Burr DC, Maffei L. Functional implications of cross-orientation inhibition of cortical visual cells. I. Neurophysiological evidence. Proc R Soc Lond B Biol Sci 216: 335–354, 1982.[Medline]

Movshon JA, Thompson ID, Tolhurst DJ. Spatial summation in receptive fields of simple cells in cats striate cortex. J Physiol 283: 53–77, 1978.[Abstract/Free Full Text]

Nauhaus I, Benucci A, Carandini M, Ringach DL. Neuronal selectivity and local map structure in visual cortex. Neuron 57: 673–679, 2008.[CrossRef][Web of Science][Medline]

Nelson JI, Frost BJ. Orientation-selective inhibition from beyond the classic visual receptive field. Brain Res 139: 359–365, 1978.[CrossRef][Web of Science][Medline]

Nishimoto S, Arai M, Ohzawa I. Accuracy of subspace mapping of spatiotemporal frequency domain visual receptive fields. J Neurophysiol 93: 3524–3536, 2005.[Abstract/Free Full Text]

Nishimoto S, Ishida T, Ohzawa I. Receptive field properties of neurons in the early visual cortex revealed by local spectral reverse correlation. J Neurosci 26: 3269–3280, 2006.[Abstract/Free Full Text]

Priebe NJ, Ferster D. Mechanisms underlying cross-orientation suppression in cat visual cortex. Nat Neurosci 9: 552–561, 2006.[CrossRef][Web of Science][Medline]

Priebe NJ, Ferster D. Inhibition, spike threshold, and stimulus selectivity in primary visual cortex. Neuron 57: 482–497, 2008.[Web of Science][Medline]

Ringach DL, Bredfeldt CE, Shapley RM, Hawken MJ. Suppression of neural responses to nonoptimal stimuli correlates with tuning selectivity in macaque V1. J Neurophysiol 87: 1018–1027, 2002.[Abstract/Free Full Text]

Ringach DL, Hawken MJ, Shapley R. Dynamics of orientation tuning in macaque primary visual cortex. Nature 387: 281–284, 1997.[CrossRef][Medline]

Ringach DL, Hawken MJ, Shapley R. Dynamics of orientation tuning in macaque V1: the role of global and tuned suppression. J Neurophysiol 90: 342–352, 2003.[Abstract/Free Full Text]

Sato H, Katsuyama N, Tamura H, Hata Y, Tsumoto T. Mechanisms underlying orientation selectivity of neurons in the primary visual cortex of the macaque. J Physiol 494: 757–771, 1996.[Abstract/Free Full Text]

Schummers J, Cronin B, Wimmer K, Stimberg M, Martin R, Obermayer K, Koerding K, Sur M. Dynamics of orientation tuning in cat V1 neurons depend on location within layers and orientation maps. Front Neurosci 1: 145–159, 2007.[CrossRef]

Schummers J, Marino J, Sur M. Synaptic integration by V1 neurons depends on location within the orientation map. Neuron 36: 969–978, 2002.[CrossRef][Web of Science][Medline]

Schummers J, Marino J, Sur M. Local networks in visual cortex and their influence on neuronal responses and dynamics. J Physiol 98: 429–441, 2004.

Shapley R, Hawken M, Ringach DL. Dynamic's of orientation selectivity in the primary visual cortex and the importance of cortical inhibition. Neuron 38: 689–699, 2003.[CrossRef][Web of Science][Medline]

Sharon D, Grinvald A. Dynamics and constancy in cortical spatiotemporal patterns of orientation processing. Science 295: 512–515, 2002.[Abstract/Free Full Text]

Sillito AM. The contribution of inhibitory mechanisms to the receptive field properties of neurones in the striate cortex of the cat. J Physiol 250: 305–329, 1975.[Abstract/Free Full Text]

Sillito AM, Kemp JA, Milson JA, Berardi N. A re-evaluation of the mechanisms underlying simple cell orientation selectivity. Brain Res 194: 517–520, 1980.[CrossRef][Web of Science][Medline]

Sompolinsky H, Shapley R. New perspectives on the mechanisms for orientation selectivity. Curr Opin Neurobiol 7: 514–522, 1997.[CrossRef][Web of Science][Medline]

Troyer TW, Krukowski AE, Priebe NJ, Miller KD. Contrast-invariant orientation tuning in cat visual cortex: Thalamocortical input tuning and correlation-based intracortical connectivity. J Neurosci 18: 5908–5927, 1998.[Abstract/Free Full Text]

Webster MA, De Valois RL. Relationship between spatial-frequency and orientation tuning of striate-cortex cells. J Opt Soc Am A 2: 1124–1132, 1985.[Web of Science][Medline]

Williams PE, Shapley RM. A dynamic nonlinearity and spatial phase specificity in macaque V1 neurons. J Neurosci 27: 5706–5718, 2007.[Abstract/Free Full Text]

Xing DJ, Shapley RM, Hawken MJ, Ringach DL. Effect of stimulus size on the dynamics of orientation selectivity in Macaque V1. J Neurophysiol 94: 799–812, 2005.[Abstract/Free Full Text]




This article has been cited by other articles:


Home page
J. Neurophysiol.Home page
R. Kimura and I. Ohzawa
Time Course of Cross-Orientation Suppression in the Early Visual Cortex
J Neurophysiol, March 1, 2009; 101(3): 1463 - 1479.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
100/1/239    most recent
90273.2008v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Malone, B. J.
Right arrow Articles by Ringach, D. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Malone, B. J.
Right arrow Articles by Ringach, D. L.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Visit Other APS Journals Online
Copyright © 2008 by the The American Physiological Society.