JN Fuel your research with LabChart
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Neurophysiol 96: 404-419, 2006. First published April 19, 2006; doi:10.1152/jn.00015.2005
0022-3077/06 $8.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
96/1/404    most recent
00015.2005v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (5)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Teich, A. F.
Right arrow Articles by Qian, N.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Teich, A. F.
Right arrow Articles by Qian, N.

Comparison Among Some Models of Orientation Selectivity

Andrew F. Teich and Ning Qian

Center for Neurobiology and Behavior, Mahoney Center for Brain and Behavior Research, and Department of Physiology and Cellular Biophysics, Columbia University, New York, New York

Submitted 7 January 2005; accepted in final form 14 April 2006


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Several models exist for explaining primary visual cortex (V1) orientation tuning. The modified feedforward model (MFM) and the recurrent model (RM) are major examples. We have implemented these two models, at the same level of detail, alongside a few newer variations, and thoroughly compared their receptive-field structures. We found that antiphase inhibition in the MFM enhances both spatial phase information and orientation tuning, producing well-tuned simple cells. This remains true for a newer version of the MFM that incorporates untuned complex-cell inhibition. In contrast, when the recurrent connections in the RM are strong enough to produce typical V1 orientation tuning, they also eliminate spatial phase information, making the cells complex. Introducing phase specificity into the connections of the RM (as done in an original version of the RM) can make the cells phase sensitive, but the cells show an incorrect 90° peak shift of orientation tuning under opposite contrast signs. An inhibition-dominant version of the RM can generate well-tuned cells across the simple–complex spectrum, but it predicts that the net effect of cortical interactions is to suppress feedforward excitation across all orientations in simple cells. Finally, adding antiphase inhibition used in the MFM into the RM produces a most general model. We call this new model the modified recurrent model (MRM) and show that this model can also produce well-tuned cells throughout the simple–complex spectrum. Unlike the inhibition-dominant RM, the MRM is consistent with data from cat V1, suggesting that the net effect of cortical interactions is to boost simple cell responses at the preferred orientation. These results suggest that the MFM is well suited for explaining orientation tuning in simple cells, whereas the standard RM is for complex cells. The assignment of the RM to complex cells also avoids conflicts between the RM and the experiments of cortical inactivation (done on simple cells) and the spatial-frequency dependency of orientation tuning (found in simple cells). Because orientation-tuned V1 cells show a continuum of simple- to complex-cell behavior, the MRM provides the best description of V1 data.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Strong orientation tuning first emerges in primary visual cortex (V1) (Hubel and Wiesel 1962Go). However, despite decades of studies and progress, the exact mechanism underlying the phenomenon remains controversial. Hubel and Wiesel initially proposed that orientation tuning of a V1 cell is established by receiving feedforward inputs from several lateral geniculate nucleus (LGN) cells with properly aligned, center-surround receptive fields. The oriented ON and OFF subregions of a V1 cell are assumed to arise from the ON-center and OFF-center LGN cells, respectively. With a sufficiently large aspect ratio (the length-to-width ratio of a subregion) and a high spiking threshold to remove responses from nonpreferred orientations, the V1 cell can have sharp orientation tuning as observed. Although the proposed LGN-to-V1 connectivity pattern has been supported by experimental evidence (Chapman et al. 1991Go; Reid and Alonso 1995Go; Tanaka 1983Go), this feedforward model cannot be the whole story because it has difficulties with other observations, especially the contrast invariance of orientation tuning (Sclar and Freeman 1982Go; Skottun et al. 1987Go) and the effects of cortical inhibition blockade (Sillito 1975Go; Sillito et al. 1980Go; Tsumoto et al. 1979Go; for thorough reviews also see Ferster and Miller 2000Go; Somers et al. 1995Go, Sompolinsky and Shapley 1997Go). Contrast invariance refers to the fact that the orientation tuning width of a cell does not change much with the stimulus contrast. The feedforward model predicts broadening of tuning with increasing contrast because a cell's membrane potential evoked by nonoptimal orientations should increase with contrast and is therefore more likely to reach the firing threshold at higher contrasts. Anderson et al. (2000a)Go recently found that neuronal noise explains why a contrast-invariant membrane potential leads to a contrast-invariant firing rate. However, this important result does not solve the problem of the feedforward model because the model cannot explain contrast invariance of the membrane potential in the first place. As the authors indicate, a cortical contribution is required to completely explain contrast invariance.

To resolve these problems, several groups proposed the recurrent model (RM), which assumes a weak feedforward orientation bias (i.e., a small aspect ratio) by the mechanism of Hubel and Wiesel and a subsequent sharpening of the tuning by both recurrent excitation and inhibition among the V1 cells (Ben-Yishai et al. 1995Go; Carandini and Ringach 1997Go; Douglas et al. 1995Go; Somers et al. 1995Go). The RM can account for an impressively large amount of experimental data (Somers et al. 1995Go; Sompolinsky and Shapley 1997Go), including the contrast invariance and the effects of inhibition blockade. However, it has difficulties with more recent cortical inactivation experiments (Chung and Ferster 1998Go; Ferster et al. 1996Go). By silencing cortical spiking activities through cooling or electrical shock and thereby greatly reducing intracortical interactions, Ferster and coworkers found that the intracellularly recorded orientation tuning curves of simple cells in the input layer of cat V1 did not become broader than those under the normal condition (Chung and Ferster 1998Go; Ferster et al. 1996Go). Another problem with the RM is that the width of tuning in the model is largely determined by the strong intracortical recurrent interactions. Consequently, the model cannot explain the fact that the orientation tuning width of simple cells decreases with increasing spatial frequency of a grating stimulus (Vidyasagar and Siguenza 1985Go).

These problems of the RM prompted Troyer et al. (1998)Go to propose a new modification of Hubel and Wiesel's feedforward model. According to Troyer et al., V1 orientation tuning mainly results from the feedforward mechanism and the contrast invariance is maintained by cortical inhibition, which is strongest between simple cells with opposite receptive-field polarities. This inhibition is termed "antiphase" inhibition, and there is partial evidence for it from intracellular recordings (Anderson et al. 2000bGo; Ferster 1988Go; Hirsch et al. 1998Go). The full version of the model also incorporates intracortical recurrent interactions for amplifying neuronal responses, but not for sharpening the bandwidth of feedforward tuning. In the rest of this paper, we will follow Ferster and Miller (2000)Go to refer to this model as the modified feedforward model (MFM). Although successful in many ways (Kayser et al. 2001Go; Krukowski and Miller 2001Go; Troyer et al. 1998Go), the MFM has more difficulties than the RM in explaining temporal dynamics (Pei et al. 1994Go; Ringach et al. 1997Go, 2003Go; Shapley et al. 2003Go; Shevelev et al. 1993Go, 1998) and the learning- and adaptation-induced plasticity of orientation tuning (Dragoi et al. 2002Go; Felsen et al. 2002Go; Teich and Qian 2003Go).

There is a new variant of the MFM (Lauritzen and Miller 2003Go) that includes untuned complex-cell inhibition, in addition to the antiphase inhibition, for the maintenance of contrast invariance. There is also a new inhibition-dominant variant of the RM (McLaughlin et al. 2000Go), such that V1 recurrent interactions suppress a cell's response at every stimulus orientation. We have examined these interesting variants together with the original RM and MFM in our study. However, it is more logical for our presentation to first focus on the original RM and MFM before moving on to their variants.

Both the RM and the MFM contain orientation-tuned feedforward input (as suggested by Hubel and Wiesel) and recurrent intracortical interactions. However, there are major differences between them (see METHODS), and these differences lead to divergent properties of the models, as we show in this paper. It is difficult to compare the models fairly based on the published studies because different studies often implement the models with very different degrees of complexity. We therefore implemented the RM and MFM at the same level of detail in our comparison. In particular, we investigated the possibility that the MFM and RM may be better suited for explaining strong orientation tuning in simple and complex cells, respectively. This is an interesting possibility because, if true, it helps alleviate the above-mentioned conflicts between the models and some experimental data (see DISCUSSION). Another difficulty with comparing the models is that a given result can sometimes be described differently. For example, antiphase inhibition in the MFM removes the contrast-dependent DC responses across all orientations (Troyer et al. 1998Go). This result can be described as either the presence or the absence of cortical sharpening of orientation tuning in the MFM depending on whether one includes or excludes the DC responses in the tuning-width calculation. Similarly, one can emphasize either the same-orientation or the orthogonal-orientation inhibition generated by broadly tuned antiphase inhibition. We will be as specific as possible when comparing models herein.

Both the RM and MFM were originally proposed as orientation mechanisms for simple cells (Somers et al. 1995Go; Troyer et al. 1998Go). The MFM is applicable only to simple cells because the connections in the model are based on receptive-field phases (Troyer et al. 1998Go). (Of course, one can assume that complex cells inherit their sharp orientation tuning from the simple cells described by the MFM, but such complex cells are unlikely to explain the observed orientation plasticity induced by learning or adaptation; Felsen et al. 2002Go; Teich and Qian 2003Go; Teich and Qian, unpublished observations.) In contrast, the RM is applicable to complex cells because the connections are independent of receptive-field phases. However, it is not clear whether the RM really works for simple cells as well. On the one hand, Chance et al. (1999)Go showed that strong recurrent interactions lead to complex-cell behavior in a related model on frequency tuning. On the other hand, the V1 cells in the RM of Somers et al. (1995)Go did seem to maintain discrete ON and OFF subregions despite the strong recurrent interactions, suggesting that the recurrent model should work for simple cells. Other implementations of the RM do not bear on this issue because they are more abstract and do not explicitly include receptive fields (Ben-Yishai et al. 1995Go; Carandini and Ringach 1997Go; Douglas et al. 1995Go; Teich and Qian 2003Go).

A close examination of Somers et al.'s model reveals that they made the simplifying assumption that all model cells have the same even receptive-field phase (but different orientations). Real V1 cells, however, cover a broad range of receptive-field phases. If the strong recurrent connections are introduced among V1 cells with different phases (determined by the feedforward inputs) to generate sharp orientation tuning, can the cells in such a multiphase RM maintain their phase sensitivity? A related question is: Do phase-sensitive cells in Somers et al.'s single-phase RM really behave like simple cells? Furthermore, because there is also recurrent excitation in the MFM, do cells in that model maintain their simple-cell identity? We address these questions in this study. We then examined the receptive-field structure of the two variant models mentioned above. Finally, we combined the RM and the MFM into a new model, the modified recurrent model (MRM), and investigated whether the new model provides the best description of orientation tuning in V1 where cells have various degrees of simple to complex behavior. Preliminary results were previously reported in abstract form (Teich and Qian 2004Go).


    METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
We first implemented the RM and the MFM at an equal level of detail and compared the responses of the two models to drifting gratings and flashed stationary bars. These responses would allow us to determine whether a model cell behaves like a simple or complex cell. Our implementation of the MFM was based on Troyer et al. (1998)Go, who simulated both a very simple "conceptual" model and a more detailed "computational" model. We implemented their computational model but replaced the spiking neurons with rate-coding neurons; this simplification greatly improves computational efficiency without altering the essential features of the original model. Their conceptual model is not sufficient for our purpose because it has only two receptive-field phases and does not include recurrent cortical excitation; we wanted to know whether the MFM cells could behave like simple cells in a multiphase implementation with recurrent excitation. After constructing the MFM, we then built the RM at the same level of complexity. Both models start with a field of LGN cells, and the LGN response to a stimulus is then fed to a cortical network of V1 cells. The LGN cells are modeled the same way in both models and will be described first. The main difference between the two models is found in the aspect ratio of the thalamocortical connections and at the connectivity patterns among the V1 cells (detailed below). After implementing the RM and the MFM, we then considered the MFM with complex inhibition, the inhibition-dominant RM, and the MRM. All simulations were performed with Matlab (The MathWorks, Natick, MA) on a Linux computer.

Stimuli

We used drifting sinusoidal gratings and stationary bars to probe whether the cells in a model behave like simple or complex cells. The drifting gratings had the form

Formula 1(1)
where wt represents the temporal frequency and wx and wy represent the spatial frequencies in the x and y dimensions, respectively. We did not explicitly include constant background luminance and contrast in the equation because the former would be filtered out by the LGN receptive fields whose DC responses were near zero, and the latter would be determined by scaling according to contrast response of real LGN cells (see following text). In our simulations, wt/(2{pi}) = 2 Hz, wx = w cos {theta}, and wy = w sin {theta}, where w/(2{pi}) = 0.8 cycles/deg and {theta} is the grating orientation (0 means vertical). The bars had a width of 30 arc min and a length spanning the entire receptive fields, and were turned on for 400 ms and off for 300 ms. Thus for a vertical bar stimulus, S(x, y, t) = 1 for –15 ≤ x ≤ 15 and 0 ≤ t ≤ 400 (where x is in arc minutes and t is in milliseconds), and S(x, y, t) = –1, otherwise. Other bar orientations were obtained by rotation.

LGN responses

The LGN cells in both models closely follow the design of Troyer et al. (1998)Go but have a more convenient temporal response function. We considered 240 ON-center and 240 OFF-center cells. The cells of each type are arranged in a rectangular grid to provide inputs to vertically oriented V1 cells; feedforward inputs to V1 cells of other orientations were computed by proper rotations of the inputs to the vertical cells. Before rotation, centers of LGN cells were spaced 9 min apart along the vertical axis and 6 min apart along the horizontal axis. There was no spatial offset between the ON and OFF grids. Each cell's spatial receptive field is center-surround, modeled with a difference of Gaussians

Formula 2(2)
where r is the distance in any direction from the center of the receptive field, {sigma}center = 15' and {sigma}surround = 60'. The temporal response function of the LGN cells takes the form (Chen et al. 2001Go)

Formula 3(3)
for t ≥ 0 and h(t) = 0, otherwise. Here {tau} determines the duration of the temporal envelope, and the sinusoidal term with frequency {omega}to generates excitatory and inhibitory responses. Because for many real cells the first half-cycle of the temporal response is shorter by various amounts than the second half-cycle, the parameter {phi}t is introduced to reduce the length of the first half-cycle (because of the rapid decay of the exponential, the durations of the third and later half-cycles are not important). In this paper, we let {tau} = 16 ms, {omega}to/(2{pi}) = 4 Hz, and {phi}t = 0.24. The overall duration of this temporal response function is about 140 ms. The resulting temporal response curve was consistent with physiologically reported temporal responses in LGN cells (Gielen et al. 1981Go; Saul and Humphrey 1990Go). We also added a 50-ms delay to V1 cells' responses without explicitly simulating random individual synaptic delays.

The LGN response Rlgn to a given stimulus was first determined through spatiotemporal convolutions of the stimulus S(x, y, t) with the kernel f(r)h(t)

Formula 4(4)
and then scaled according to a contrast response curve. LGN responses to a drifting grating of contrast C were scaled so that the amplitude of the F1 component follows the contrast response curve from Troyer et al. (1998)Go: R = RmaxCn/(C50n + Cn), where Rmax = 53 spikes/s, n = 1.2, and C50 = 13.3% for ON-center cells; Rmax = 48.6 spikes/s, n = 1.29, and C50 = 7.18% for OFF-center cells. These contrast response functions assume stimulation at the optimal spatial frequency for the LGN filter (about 0.55 cycles/deg). LGN responses to nonoptimal frequencies were scaled down proportionally using the ratio of the filtered responses to the nonoptimal and the optimal frequencies. We assumed that the LGN contrast response curve to a stationary bar would have the same shape as the F1 curves for gratings. Because LGN responses to a flashed spot do not vary significantly between ON- and OFF-center cells (Saul and Humphrey 1990Go), we used an average n value of 1.245 and an average C50 value of 10.24 for both types of cells. We scaled the curve's amplitude by letting Rmax = 285 spikes/s so that the initial peak response to a flashed bar of 50% contrast and optimal width was about 250 spikes/s (Tanaka 1983Go). Because the background firing rate is not included in the contrast responses functions, we added a background firing rate of 10 spikes/s for ON-center cells and 15 spikes/s for OFF-center cells after scaling [values taken from Troyer et al. (1998)Go].

Cortical receptive fields

The LGN inputs to a V1 cell were determined by a two-dimensional Gabor connectivity function (a Gaussian multiplied by a sinusoid). For vertically oriented V1 cells, it had the form

Formula 5(5)
where wg/(2{pi}) = 0.8 cycles/deg, {phi} determined the V1 receptive-field phase, and {sigma}x and {sigma}y determined the number of subregions and the aspect ratio (see following text). For V1 cells with orientation {theta}, the Gabor connectivity function g(x, y, {theta}) was obtained from g(x, y) by proper rotation. All LGN-to-V1 connections are excitatory. The positive values of the Gabor function represented excitatory connections from ON-center LGN cells and negative values represented excitatory connections from OFF-center LGN cells; in either case, the strength of the connection was represented by the absolute value of the Gabor at a given point. The Gaussian was elongated in the preferred orientation of the V1 cell under consideration and the sinusoid had the same orientation. The width and length of the Gaussian were defined by the 5% points of the peak, along the short and long axes of the Gaussian, respectively. The number of subregions was determined by the ratio of the Gaussian width to the width of a half-cycle of the sinusoid. The aspect ratio was defined as the ratio of the Gaussian length to the width of a half-cycle of the sinusoid. As noted in Eq. 5, we used a sinusoid with a half-cycle width of 0.625°, corresponding to a spatial frequency of 0.8 cycles/deg [roughly the mean preferred spatial frequency of cat cortical cells at 5° eccentricity (Movshon et al. 1978bGo); all spatial dimensions can be scaled down to represent monkey V1 cells without affecting our conclusions]. The MFM and RM used the high and low values of the reported aspect ratios, respectively. For the MFM, we picked the Gaussian envelope to match the physiological receptive fields reported by Jones and Palmer (1987)Go and used by Troyer et al. (1998)Go: 2.65 subregions and an aspect ratio of 4.54 (Fig. 1A1). These values yielded a feedforward input well tuned for orientation (Fig. 1A2). For the RM, we also had 2.65 subregions but lowered the aspect ratio to 2 (Fig. 1B1), which is in line with the value used by Somers et al. (1995)Go and is close to the value of 1.7 reported by Pei et al. (1994)Go. This yielded a broadly tuned feedforward input in Fig. 1B2.


Figure 1
View larger version (41K):
[in this window]
[in a new window]
 
FIG. 1. Contrast-invariant orientation tuning of our implementations of the modified feedforward model (MFM, top row) and the recurrent model (RM, bottom row). In the MFM, the lateral geniculate nucleus (LGN) to primary visual cortex (V1) connectivity pattern is assumed to have a long orientation axis (aspect ratio = 4.54; A1), which yields a feedforward input well tuned for orientation (A2). In A1 and B1, solid lines represent connections from ON-center LGN cells, whereas dotted lines represent connections from OFF-center LGN cells. DC baseline component in the feedforward input (A2) is eliminated through cortical antiphase inhibition, yielding contrast invariance (A3). Although the V1 output at the lowest contrast (2.5%) is actually narrower than higher contrasts in the MFM, this problem vanishes when noise is introduced to the system (Troyer et al. 2002), and does not affect our results (which are at 50% contrasts) in the rest of the paper. RM has a shorter axis of orientation (aspect ratio = 2; B1), which yields a feedforward input broadly tuned for orientation (B2). Intracortical connections both eliminate the DC baseline component in the feedforward input and sharpen the tuned component, yielding sharp, contrast-invariant tuning (B3).

 
Features common to the MFM and the RM

Both the MFM and the RM have cortical excitatory cells and cortical inhibitory cells at eight evenly spaced spatial phases over 2{pi} and 64 evenly distributed orientations over 180°, for a total of 512 excitatory cells and 512 inhibitory cells. In the real cortex, excitatory cells outnumber inhibitory cells by a factor of three. For computational efficiency, we did not explicitly model this fact; instead, we simply let each excitatory cell in our models represent three cells of the same type. The connections from an excitatory cell to any cell represent the combined strengths of the three cells. The model cells are assumed to represent the same area of visual space and have retinotopically overlapping receptive fields (they draw feedforward input from the same field of LGN cells). Each cortical cell receives feedforward input from its LGN afferents as well as cortical input from other V1 cells. The nature of the intracortical connections depended on which model we used and is detailed below. For a given V1 cell, the connections it received were divided into three sets: the connections from LGN cells, the connections from excitatory V1 cells, and the connections from inhibitory V1 cells. We first normalized each set of the connections such that the sum of all connections within the set was one. We then multiplied all connections within a set by a common weighting factor, and different weighting factors were applied to different sets to manipulate their relative strengths. Because the recipient V1 cell can be either excitatory or inhibitory, there are a total of six weighting factors denoted by e -> e, e -> i, i -> e, i -> i, F -> e, and F -> i, which represent weights for excitatory to excitatory, excitatory to inhibitory, inhibitory to excitatory, inhibitory to inhibitory, feedforward to excitatory, and feedforward to inhibitory connections, respectively.

The membrane potential (V) of each cortical cell was updated according to the differential equation (Carandini and Ringach 1997Go; Teich and Qian 2003Go)

Formula 6(6)
where {tau} is the membrane time constant (15 ms in all simulations), and Vf, Ve, and Vi are the synaptic potentials generated by the feedforward, cortical excitatory, and cortical inhibitory inputs to the cell, respectively. For example, for an excitatory cell with orientation {theta}, these three values were given by

Formula 7(7)

Formula 8(8)

Formula 9(9)
where Rlgn, Re, and Ri are the firing rates of LGN cells, excitatory V1 cells, and inhibitory V1 cells, respectively; Cee and Cie represent the normalized connectivity strengths from excitatory and inhibitory V1 cells to the excitatory V1 cell under consideration, and are determined differently for each model (see details below). Similar equations can be written for each inhibitory V1 cell. The interval ({Delta}t) at which we integrated Eq. 6 was 1 ms in all simulations. Our models directly relate membrane potential to firing rate by the threshold function with a gain factor {alpha} (Carandini and Ringach 1997Go; Teich and Qian 2003Go)

Formula 10(10)
The {alpha} value was 5 for excitatory cells, 8 for inhibitory cells in our MFM [to give inhibitory cells a higher firing rate as in Troyer et al. (1998)Go], and 6.5 for all cortical cells in our RM.

Intracortical circuitry for the MFM

Following Troyer et al. (1998)Go, we determined the connection between any two cortical cells a and b by computing the normalized correlation between their receptive fields

Formula 11(11)
where c'(a, b) is the raw correlation computed by multiplying the two Gabor connectivity functions (feedforward receptive fields) g(x, y, {theta}) of cells a and b point by point and then summing across all points. Two cortical cells with the exact same orientation and spatial phase have a normalized correlation of 1 and thus have the strongest positive (excitatory) connection; two cortical cells with the exact same orientation and opposite spatial phase (antiphase) have a normalized correlation of –1 and have the strongest negative (inhibitory) connection. All other combinations are intermediate, with the nature of the connection (excitatory or inhibitory) being determined by the sign of the normalized correlation, and the strength of the connection being determined by the absolute value of the normalized correlation raised to a power Npow. Mathematically, the connection strength C(a -> b) from cell a to cell b is based on the normalized correlation according to

Formula 12(12)
where sgn (a) = 1 if a is excitatory and –1 if a is inhibitory, and [x]+ = x if x > 0 and 0, otherwise. Npow determines the strength of connectivity as a function of correlation; larger values lead to less connectivity between cells that are weakly correlated. We used an Npow value of 6, which is the default value used in Troyer et al. (1998)Go.

As we mentioned earlier, after normalizing the feedforward, cortical excitatory, and cortical inhibitory connections to a given cell separately, each connection type was then weighted. In the MFM, e -> e was set to 0.13, e -> i was set to 0.15, i -> e was set to 0.22, i -> i was set to 0, and F -> e and F -> i were set to 0.1. The background V1 activity resulting from the LGN input in the absence of a stimulus was 0 spikes/s for excitatory cells and 10 spikes/s for inhibitory cells. Note that for the MFM, i -> i was set to 0 as in Troyer et al. (1998)Go; a nonzero connection among the inhibitory cells will reduce their activities across all orientations and thus make antiphase inhibition to excitatory cells less effective.

Anatomical studies have suggested that thalamic inputs constitute a small percentage of the total inputs to V1 simple cells, and that among cortical inputs, excitatory synapses are more prevalent than inhibitory synapses (Ahmed et al. 1994Go; Gabbott and Somogyi 1986Go; Peters and Payne 1993Go). On the other hand, physiological studies showed that V1 responses to thalamic input are more powerful than to cortical input (Stratford et al. 1996Go). In addition, cortical inhibitory synapses are closer to the soma (Douglas and Martin 2004) and are thus probably more powerful than cortical excitatory synapses. We did not explicitly simulate these facts for the sake of computational efficiency. Instead, we can simply assume that when comparing the feedforward connection strengths with intracortical connection strengths, each intracortical connection should be viewed as the combined effect of many synapses. Likewise, when comparing the intracortical excitatory connections with inhibitory connections, each excitatory connection should be viewed as the combined connections from a few excitatory cells. The same comments also apply to all of the other models discussed in the following text.

The MFM with complex-cell inhibition

In addition to the standard MFM, we also ran a newer version of the MFM with complex-cell inhibition (Lauritzen and Miller 2003Go). This involved applying the same untuned, phase-insensitive inhibition to every cortical cell. All of the parameters for the MFM with complex inhibition are identical to the standard MFM except for the connection strengths between different cell types and Npow. The connection strengths were as follows: e -> e was set to 0.16, e -> i was set to 0.18, i -> e was set to 0.25, i -> i was set to 0.1, and F -> e and F -> i were set to 0.1. Npow was set to 24. These adjustments were made to accommodate added complex inhibition. In addition, ci -> e was set to 1.2, ci -> i was set to 1.2, and F -> ci was set to 0.1, where ci represents untuned, complex inhibition. Because complex inhibition is applied equally to every cell, we had only one inhibitory complex cell in every simulation and simply applied its output to every other cortical cell at each time step.

Intracortical circuitry for the RM

Unlike the MFM, the connections between two cells in the standard RM were independent of the cells' receptive field phase relationship and were determined only by the difference in the cells' preferred orientations. We used Gaussians to describe the connection strengths as a function of the difference in the cells' preferred orientations. The connections from an excitatory cell to any cell (the excitatory profile) are given by a Gaussian with SD of 35°. The connections from an inhibitory cell to any cell (the inhibitory profile) are given by a Gaussian with SD of 52°. When these two profiles are subtracted from each other, the resulting net recurrent interaction as a function of preferred orientation difference has the "Mexican-hat" shape, a prominent feature of the RM. Also unlike the MFM, there are direct connections among the inhibitory cells.

Our previous work with the RM (Teich and Qian 2003Go) followed the study of Ferster (1986)Go in assuming that stimuli orthogonal to a cell's preferred orientation do not evoke either excitatory or inhibitory postsynaptic potentials. However, in this study we discovered that once multiple phases are included, the RM requires at least some net inhibition at orthogonal orientations to avoid spurious peaks at orthogonal orientations. The reason can be understood by considering two cells with the same receptive-field phase that are oriented 90° away from one another. When a light bar is aligned with the ON subregion of one cell and thus generating maximum feedforward input to the cell, the second, orthogonal cell receives minimum feedforward input for this stimulus (see solid curve in Fig. 4A). When a dark bar is aligned with the ON subregion of the first cell, the first cell receives minimum feedforward input, whereas the orthogonal cell receives maximum feedforward input (see dashed curve in Fig. 4A). This orthogonal peak in the feedforward input to the second cell will lead to a spurious peak in its orientation tuning unless there is some stimulus-driven inhibition at the orthogonal orientation. There is also experimental evidence supporting the presence of orthogonal inhibition (Allison et al. 2001Go; Bonds 1989Go; Martinez et al. 2002Go; Monier et al. 2003Go; Shapley et al. 2003Go). In particular, Monier et al. (2003)Go found that in one fifth of their intracellular recordings, inhibition peaked at the orthogonal orientation, and some evidence for this was also found by Martinez et al. (2002)Go in layer 5. Note that the MFM also requires inhibition at orthogonal orientations (see DISCUSSION).


Figure 4
View larger version (13K):
[in this window]
[in a new window]
 
FIG. 4. Comparison between single-phase and multiphase RMs showing that the single-phase RM has the problem of producing orthogonal tuning peaks for light and dark bars. Example cells used here are vertically oriented and even phase as in the previous figures. Bars are centered on the central ON region of the receptive field, and the tuning curves used here are made from the ON responses 400 ms after stimulus onset. A: tuning curves of the feedforward inputs, with the peak and the trough at the cell's preferred orientation for the light bar (solid curve) and dark bar (dashed curve), respectively. They are the same for the single-phase and multiphase RM cells. B: tuning curves of the firing output for the single-phase RM cell. Problem is that they simply follow the input curves, with the tuning peaks differing by 90° for the light and dark bars. C: tuning curves of the firing output for the corresponding mulitiphase RM cell, with the same peak location for the light and dark bars.

 
For the RM, e -> e and e -> i were 1.6, i -> e and i -> i were 1.8, and F -> e and F -> i were 0.07. With these parameters, the background V1 activity in the absence of a stimulus was 2.47 spikes/s for all cortical cells. Figure 1 confirms that our multiphase implementations of the MFM and the RM work well in generating contrast-invariant orientation tuning curves (Fig. 1, A3 and B3) as observed in V1.

In the preceding implementation of the RM, the V1 cells' receptive fields had eight different spatial phases (determined by the feedforward LGN inputs) just as in the MFM. To compare with the original single-phase RM of Somers et al. (1995)Go, we also considered a single-phase version of our RM by letting the eight cells at each orientation all have the same even spatial phase. In this single-phase RM, the values we used for e -> e and e -> i were 1.55, and i -> e, i -> i, F -> e, and F -> i were identical to the values in the multiphase RM. The background V1 activity in the absence of a stimulus was 2 spikes/s for all cortical cells in the single-phase RM.

Differences between the MFM and the RM

The terms recurrent model and modified feedforward model may be a little misleading because both models contain orientation-tuned feedforward input and recurrent intracortical interactions. However, there are major differences between them, as we alluded to above. To make the discussions more precise, note that for both models, the feedforward input to a V1 cell can be viewed as a sum of a DC component (the constant baseline response across all orientations) and a tuned component (the orientation-dependent portion) (Fig. 1, A2 and B2). The DC component equals the response at the orientation orthogonal to the cell's preferred orientation. By subtracting this component from the total feedforward input, one obtains the tuned component. Both components grow with the stimulus contrast (Fig. 1, A2 and B2) because a V1 cell receives inputs from both ON-center and OFF-center LGN cells, and as the stimulus intensity increases relative to the background, the OFF-center cell firing rates quickly drop to zero and cannot go any lower to counter the growing ON-center cell responses before they saturate (Shapley et al. 2003Go; Troyer et al. 1998Go).

There are several crucial differences between the two models. Most prominently, the RM assumes that the tuned component of the feedforward input is broad and is sharpened by intracortical interactions, whereas the MFM assumes that the tuned component in the feedforward input is narrow and is not sharpened further by intracortical connections. These different feedforward tuning curves arise from different assumptions about the aspect ratio of the LGN-to-V1 projections; the RM assumes a small aspect ratio, whereas the MFM assumes a large one.

The intracortical processing is necessary for both models because, without it, the V1 tuning would be similar to the feedforward tuning, with a DC component (across all orientations) and a tuned component. Because these components grow with contrast, a fixed firing threshold cannot make the V1 tuning contrast invariant. In the RM, the intracortical interactions both sharpen the tuned component and remove the DC component of the feedforward input (Fig. 1B3). However, according to the MFM, the intracortical interactions do not sharpen the tuned component further but only remove the DC component (Fig. 1A3).

These different requirements are realized by different intracortical connection patterns used in the two models: 1) The connections in the MFM depend on both the orientation difference and the receptive-field phase difference between two V1 cells. In the RM, the connections are only orientation dependent but not phase dependent. 2) Both models assume that the connection strengths fall off with increasing difference between two cells' preferred orientations. However, the RM assumes that the inhibitory connections cover a wider range of orientation differences than that of the excitatory connections, resulting in a Mexican-hat net interaction profile, whereas the MFM assumes equal width of the excitatory and inhibitory profiles. In the RM, the inhibitory connections among cells tuned to very different orientations suppress responses to the nonpreferred orientations including the orthogonal orientation. In the MFM, the antiphase inhibition among cells with similar receptive-field orientation (but opposite receptive-field phases) does the same job. Although the inhibitory connections in the MFM do not cover a broader range of orientation differences than the excitatory connections, the inhibitory cells' responses are assumed to have a broad, contrast-dependent DC component, which suppresses the excitatory cells' responses at the nonpreferred orientations including the orthogonal orientation (Troyer et al. 1998Go). So strictly speaking, orthogonal inhibition is also present in the MFM. 3) There are direct connections among the inhibitory cells in the RM; such connections are not allowed in the MFM because they would diminish the antiphase inhibition mentioned above. Antiphase inhibition has also been called feedforward inhibition (Ferster and Miller 2000Go), although the inhibitory cells do receive recurrent cortical inputs.

There is also a quantitative difference between the two models: cortical weights are an order of magnitude greater in the RM than in the MFM. This is explained by the fact that cortical excitation and inhibition are occurring at the same time in the RM because both are phase-independent in the RM. Thus they tend to cancel each other, and for the net excitation and inhibition after cancellation to be of significant magnitudes, the original magnitudes must be large. In contrast, the MFM has cortical excitation and cortical inhibition occurring at different times because inhibition is antiphase from excitation. Thus when cortical excitation is large in magnitude, cortical inhibition will be small in magnitude, and vice versa. This relationship allows the cortical weights in the MFM to be much smaller for cortical connections to have a substantial effect on a cell's spiking output.

The modified recurrent model

We constructed a new model, the modified recurrent model (MRM), which essentially is the RM with antiphase inhibition of the MFM added. We started with the RM exactly like the one described above. We will call the excitatory cells E cells and the inhibitory cells I cells. These cells have LGN input and phase-independent recurrent cortical connections exactly as described for the RM above. We then added a third set of V1 cells that are antiphase inhibitory cells. We will call these cells AI cells. The AI cells receive LGN input and cortical excitation from E cells of the same spatial phase, using the correlation-based rule from the MFM with an Npow value of 6. These cells then deliver inhibition to the E and I cells, again using the correlation-based rule and an Npow of 6 so that the maximum inhibition is to cells with opposite spatial phases. Our simulations with the MRM used 512 AI cells and 512 I cells (i.e., for each category of inhibitory cell, there was one cell of each orientation/phase type) as well as 512 E cells. The respective sums of all recurrent excitatory connection strengths, recurrent inhibitory connection strengths, and antiphase inhibitory connection strengths were each individually normalized to one. We set e -> e and e -> i to 3.2, i -> e and i -> i to 3.5, e -> ai to 0.7, ai -> e and ai -> i to 0.2, and F -> e, F -> i, and F -> ai to 0.07. The background V1 activity resulting from LGN input in the absence of a stimulus was 0 spikes/s for E cells and I cells, and 5.7 spikes/s for AI cells.

The inhibition-dominant recurrent model

We also ran some simulations with the inhibition-dominant RM of McLaughlin et al. (2000)Go. This model, like the standard RM, uses phase-insensitive intracortical connections. In the standard RM, the inhibitory profile is broader (i.e., covers a broader range of orientation differences) than the excitatory profile to generate a net Mexican-hat profile (see Intracortical circuitry for the RM above). This is the opposite in the inhibition-dominant RM. However, in the inhibition-dominant RM, inhibitory cells are more broadly tuned than excitatory cells to produce an effective Mexican hat. The most important difference between the standard and inhibition-dominant RMs is that in the latter, inhibition is much stronger than excitation. We have included these main features in our implementation. We first took the inhibitory and excitatory connectivity profiles from our standard RM and switched them so that the inhibitory profile is narrower than the excitatory profile. We then used aspect ratios of 4.54 and 2.0 for the V1 excitatory and inhibitory cells, respectively, to generate sharply tuned excitatory cells and broadly tuned inhibitory cells. Finally, we set e -> e to 0.05, e -> i was set to 0.1, i -> e was set to 0.5, i -> i was set to 0.4, and F -> e and F -> i were set to 0.5. Our implementation was quite simplified compared with McLaughlin et al.'s original model; for example, we neglected the variability in V1 orientation selectivity (Ringach et al. 2002Go). However, the model is sufficient for our purpose of studying the receptive-field structure of well-tuned cells.

Parametric variations

Our conclusions in this paper are robust against the model parameters. For example, Fig. 3 shows that the RM cannot generate sharply tuned simple cells when the intracortical connection strengths vary in small steps from 0 to the full strengths. In addition, we reached the same conclusion when we doubled the sampling density of the LGN inputs, changed the contrast-response curves for our LGN responses, or used different Mexican-hat interaction profiles.


Figure 3
View larger version (9K):
[in this window]
[in a new window]
 
FIG. 3. Dependency of the response modulation ratio (F1/F0) and the tuning width on the intracortical connections in the MFM (dashed curve) and the RM (solid curve). Tuning width is measured by half-width at half-height (HWHH). For both models, we started with all cortical connections set to zero (the rightmost points of the curves), and then gradually increased the cortical connection strengths until they reached their full standard values (leftmost points on the curves). Note that both models have F1/F0 >1 (horizontal thin line) for a purely feedforward input (the rightmost point on each curve). As we increase the strength of the cortical connections, the F1/F0 ratio of the MFM increases, keeping the ratio >1. However, in the RM, the same procedure is accompanied by a decrease in the F1/F0 ratio, making the cell complex. Cortical connections reduce HWHH in both models. However, the feedforward input in the MFM is already well tuned for orientation; subsequent sharpening by the cortical connections mainly arises from elimination of the DC component.

 
In our implementations of the two models, we followed the standard assumptions that the RM and MFM have a low (2.0) and high (4.54) aspect ratio for their LGN input connection pattern, respectively (Fig. 1, A1 and B1). We also ran some simulations of the RM with a 4.54 aspect ratio and some simulations of the MFM with a 2.0 aspect ratio. In the former case, the input was already well sharpened, and recurrent cortical connections simply removed the DC component to maintain robust contrast invariance. Interestingly, even in this case, the cells were complex, with response modulation ratios (F1/F0) <1. In the latter case, the antiphase inhibition still removed the DC components to maintain the contrast invariance and keep cells simple. However, because there is no cortical sharpening of orientation tuning in the MFM, the model V1 cells' output firing was not well tuned for orientation. In either case, the conclusion that the MFM and the RM generate contrast-invariant simple and complex cell receptive-field structure remains the same.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
We start by presenting our comparisons between the RM and MFM because the other models we considered depend on them. We will then show our simulations of the recent variations of the RM and MFM, i.e., the inhibition-dominant RM and the MFM with complex inhibition. Finally, we will present our MRM, which combines the key features of the RM and MFM.

The multiphase RM and the MFM response to drifting gratings

After implementing the multiphase MFM and the multiphase RM at the same level of complexity (see METHODS and Fig. 1), we compared the response properties of the two models. In particular, we wanted to know whether the cells in the two models behaved like simple or complex cells. Because the F1/F0 ratio calculated from a cell's response to a sinusoidal grating is widely used for classifying the cell as either simple or complex (Movshon et al. 1978aGo,cGo; Skottun et al. 1991Go), we first used the ratio to assess our model cells. The F0 component of a response time course is the unmodulated DC component, and the F1 component is the first harmonic (the component at the temporal frequency of the grating stimulus). The amplitude ratio of these two components is the F1/F0 ratio, and a cell is classified as simple or complex according to whether the ratio is >1 or <1 (Skottun et al. 1991Go). Although the simple and complex cells may not form two discrete classes but are idealized cases of a continuum (Chance et al. 1999Go; Mechler and Ringach 2002Go; Tao et al. 2004Go), we can still use the ratio to indicate how simple or complex a cell is.

Each model contained V1 cells of different orientations and phases (as determined by the alignment of the LGN inputs). In the following, we will use the vertically oriented even-phase cell from each model as an example; cells of other orientations or phases have similar behavior. The LGN-to-V1 connectivity patterns for the two example cells are shown in Fig. 1, A1 and B1, respectively. We simulated their responses to a sinusoidal grating at the cells' preferred orientation and preferred spatial frequency, and a temporal frequency of 2 Hz, which is the temporal frequency originally used to define the F1/F0 ratio (Movshon et al. 1978aGo,cGo).

The results are shown in Fig. 2. For each model, we considered the special case with all intracortical connections set to zero (Fig. 2, first column), and the normal case with full-strength intracortical connections (Fig. 2, second column). The MFM and RM cells have similar response time courses when the intracortical connections are turned off (Fig. 2, A1 and B1). Both responses have large temporal modulations and the F1/F0 ratio is 1.37 for both models, indicating that the cells are simple cells with phase sensitivity. However, intracortical connections are essential for both models to generate contrast-invariant orientation tuning. For the MFM, the intracortical connections did not reduce the temporal modulation of the response (Fig. 2A2). Instead, the antiphase inhibition in the MFM significantly suppressed the DC component of the responses, making the cell more simple with an F1/F0 ratio of 1.63. In contrast, the intracortical connections in the RM reduced the amplitude of the temporal modulation and increased the DC component of the response (Fig. 2B2), making the cell complex with an F1/F0 ratio of 0.38. Note that simply establishing a threshold in Fig. 2B2 to eliminate the DC component will not work. Such a threshold will eliminate the entire response to a low-contrast drifting grating.


Figure 2
View larger version (16K):
[in this window]
[in a new window]
 
FIG. 2. Responses of a V1 cell from the MFM (top row) and a V1 cell from the RM (bottom row) to a 2-Hz drifting grating at the cells' preferred orientation (vertical) and preferred spatial frequency (0.8 cycles/deg). All panels show steady-state responses (after the initial transient response). A1 and B1: temporal firing output of the model cells with the cortical connections turned off. A2 and B2: temporal firing output of the model cells with the cortical connections turned on. Antiphase cortical inhibition in the MFM sharpens the cell's temporal modulation (A2), whereas the recurrent cortical connections in the RM increase the DC component (B2).

 
Figure 2 suggests that the recurrent mechanism of the RM comes at a "cost" insofar as it cannot both sharpen orientation tuning and leave spatial phase information intact. To obtain a better sense of this relationship, we measured the relationship between the orientation tuning width and the F1/F0 ratio for both the MFM and the RM when the intracortical connections are varied systematically. The measure of tuning width is the standard half-width at half-height (HWHH) of an orientation tuning curve. To vary the intracortical connections, we scaled the standard values (see METHODS) of all connection strengths by a factor between 0 and 1. The results are shown in Fig. 3, with the dashed and solid curves for the MFM cell and the RM cell, respectively. For both models, we began by setting all cortical connections to zero (as in Fig. 2, A1 and B1). This represents the rightmost point for each curve in Fig. 3. As the scaling factor increases from 0 to 1, the curves move toward the left, indicating sharper tuning (smaller HWHH). The leftmost points on the curves are for the full-strength intracortical connections. Note that, although both curves start at similar F1/F0 ratios, they start at different HWHH values because the MFM has a feedforward input that is much more sharply tuned than the feedforward input in the RM. Indeed, the only reason that the cortical connections in the MFM change the HWHH value is that they eliminate the baseline DC component of the feedforward input (see Fig. 1). The tuned component of the feedforward input is not sharpened by the intracortical connections in the MFM.

In Fig. 3, the RM clearly shows a trade-off between the F1/F0 ratio and tuning width as recurrent cortical connections are strengthened. There is no point on the curve where the F1/F0 ratio is >1 (simple cell) and yet the HWHH is close to the physiological mean of about 20 to 25°. This confirms that the RM cannot both generate sharp orientation tuning and leave spatial phase information intact. By contrast, with increasing intracortical connections, the MFM shows the expected increase in the F1/F0 ratio as the HWHH decreases. Therefore the RM can be a valid orientation tuning mechanism only for complex cells, whereas the MFM is suitable for simple cells. However, because the orientation tuning width of V1 cells varies considerably, it is possible that the RM mechanism with weak recurrent connections is involved in some weakly orientation tuned simple cells, such as those in the input layer of monkey V1.

The multiphase RM and MFM responses to bars

Because Hubel and Wiesel (1962)Go used bar stimuli to define simple and complex cells, we confirmed our above conclusions with bars. Briefly, we simulated the responses of the two example V1 cells to either a vertical light bar or a vertical dark bar aligned with the ON subregion of the receptive field for 400 ms. The time courses of the LGN inputs in the two models were very similar. However, the firing outputs from the two models are quite different. For the MFM cell, the light bar in the ON subregion generated an ON response but not an OFF response, and the dark bar in the ON subregion generated an OFF response but not an ON response, mirroring the LGN input to the cell. These are the characteristics of a simple cell. In contrast, the RM cell had both ON and OFF responses regardless of the type of bar, and thus behaved like a complex cell. In other words, for the RM cell, identities of the ON and OFF subregions determined by the LGN inputs are lost at the level of output firing.

Single-phase RM response to bars

We also examined whether the introduction of some phase specificity into the recurrent connections of the RM would allow the cells in the model to preserve their simple-cell identity. In fact, the single-phase model of Somers et al. (1995)Go can be viewed as restricting the recurrent interactions in the RM to the cells with the same receptive field phase (but different orientations) only and, as we mentioned in the INTRODUCTION, Somers et al.'s model cells do have discrete ON and OFF subregions. We have reproduced Somers et al.'s results by using our implementation of the single-phase RM described in METHODS. We found that indeed, the model cells retain their phase sensitivity (results not shown). However, the single-phase RM has a problem not noted previously: a model V1 cell's preferred orientation can change by 90° depending on whether a light bar or a dark bar is used to measure the tuning. To illustrate this problem, we considered a single-phase RM consisting of even-phase V1 cells only, each with a central ON region flanked by two OFF regions according to the alignment of the LGN inputs. There are Mexican-hat recurrent connections among the cells with the same even phase but different orientations. We considered a vertically oriented example cell from the model and generated tuning curves from the ON responses evoked by light and dark bars centered on the central ON region of the receptive field.

Figure 4 shows the results, together with those from the corresponding multiphase RM cell for comparison. Figure 4A shows the tuning curves of the feedforward inputs for the light and dark bars. They are identical for the single-phase and the multiphase RM cells. Figure 4, B and C shows the tuning curves of the output firing for the single-phase RM cell and the multiphase RM cell, respectively. When a light bar is used to stimulate the single-phase RM, the cell prefers vertical orientation, as expected (Fig. 4B, solid curve). However, when a dark bar is used instead, the peak of tuning shifts by 90° (Fig. 4B, dashed curve). This can be understood from the feedforward inputs in Fig. 4A: a light and a dark bar aligned with the central ON subregion generates a maximum and a minimum response, respectively, leading to a 90° shift between the light and dark bars' feedforward tuning peaks. The single-phase RM simply amplifies around the peak of the feedforward tuning and suppresses other orientations, thus preserving this 90° shift in the output firing. In contrast, the multiphase RM does not have this problem of peak shift (Fig. 4C) because of the strong excitation among cells with the same orientation but difference phases. For example, when the example cell receives minimal feedforward input generated by a dark bar aligned with its central ON region, the cell with the opposite receptive field phase receives maximal feedforward input and sends strong excitation to the example cell to keep it activated as well.

Because real V1 cells do not show a 90° peak shift according to the polarity of the bars (Hubel and Wiesel 1962Go, 1968Go), we conclude that the single-phase RM is not a viable model for simple cells. It should be noted, however, that Somers et al. (1995)Go designed the model to investigate primarily how recurrent cortical interactions shape feedforward inputs; they were not particularly concerned with receptive-field structures. It is thus reasonable for them to make the single-phase simplification. Indeed, every model has to use simplifying assumptions appropriate for the question under investigation that may not be appropriate for other questions.

Alternative models of orientation tuning

The above results show that the MFM and RM are well suited for explaining sharp orientation tuning in simple and complex cells, respectively. For completeness, we also simulated two variations of the MFM and the RM. They are the MFM with complex-cell inhibition (Lauritzen and Miller 2003Go) and the inhibition-dominant RM (McLaughlin et al. 2000Go; Tao et al. 2004Go; Wielaard et al. 2001Go). We constructed these models at the same level of detail as the MFM and the RM above, and found that they replicated simple cell physiology, as detailed below. When the excitation in the inhibition-dominant RM is substantially increased, it becomes similar to the standard RM and produces complex cells (Tao et al. 2004Go).

The MFM with complex inhibition

The MFM has recently been modified to include untuned complex-cell inhibition (Lauritzen and Miller 2003Go); this is in response to the finding that such inhibitory cells exist in layer 4 of the cat V1 (Hirsch et al. 2003Go), although their fraction in layer 4 and connectivity to other cells are not known. The same physiological study also found a couple of poorly tuned inhibitory simple cells in layer 4 that could provide antiphase inhibition in theory. Thus the new MFM has untuned complex inhibition in addition to antiphase inhibition. We found that the new MFM produced well-tuned simple cells just like the original MFM (Fig. 5). The only difference was that the untuned complex inhibition slightly reduced the role for antiphase inhibition in eliminating the orthogonal response; now the two mechanisms shared this responsibility. One might predict that this would mean that the cortical connections would not increase the F1/F0 ratio quite as much because the antiphase effect would be weaker. However, our simulations yielded an F1/F0 ratio of 1.61 when complex inhibition was added (similar to the 1.63 value with pure antiphase inhibition), probably because complex inhibition also serves to reduce the DC component in a cell's temporal response.


Figure 5
View larger version (10K):
[in this window]
[in a new window]
 
FIG. 5. Steady-state responses of a V1 cell from the MFM with complex inhibition to a 2-Hz drifting grating at the cell's preferred orientation (vertical) and preferred spatial frequency (0.8 cycles/deg). Temporal firing outputs of the cell with the cortical connections turned off and on are shown in A and B, respectively.

 
The inhibition-dominant RM

The inhibition-dominant RM of McLaughlin et al. (2000)Go is a version of the RM that includes detailed V1 anatomy. Interestingly, McLaughlin et al. implemented the Mexican-hat intracortical interactions not by a broader inhibitory connectivity profile but by broader tuning of inhibitory cells (see METHODS). Because the complex-cell behavior in any RM stems from strong net recurrent excitation (the peak of the Mexican hat) among cells with different phases, one expects that if the recurrent inhibition is much stronger than recurrent excitation (and thus removes the peak of the Mexican hat), the resulting inhibition-dominant RM should produce simple cells. This is indeed what McLaughlin et al. found (McLaughlin et al. 2000Go; Wielaard et al. 2001Go). We have reproduced this finding in Fig. 6.


Figure 6
View larger version (10K):
[in this window]
[in a new window]
 
FIG. 6. Steady-state responses of a V1 cell from the inhibition-dominant RM to a 2-Hz drifting grating at the cell's preferred orientation (vertical) and preferred spatial frequency (0.8 cycles/deg). Temporal firing outputs of the cell with the cortical connections turned off and on are shown in A and B, respectively.

 
The inhibition-dominant RM makes the strong prediction that the net effect of cortical connections on simple cells is inhibitory. Specifically, the cortex inhibits a simple cell's response to LGN input at every orientation (including the preferred orientation). This is why in Fig. 6, the LGN input to the cell is larger than it is in the other models. In contrast, the RM, the MFM, and the MRM presented below all predict that cortex suppresses the LGN input only away from a V1 cell's preferred orientation, but boosts the LGN input around the cell's preferred orientation. (The MFM boosts responses at the preferred orientation through phase-specific cortical excitation. Because the recurrent excitation in the inhibition-dominant RM is not phase specific, any such boosting leads to complex cell behavior as in the original RM.) Wielaard et al. (2001)Go examined the inhibition-dominant RM further and found that when cortical connections are silenced in the model, a simple cell's response at its preferred orientation increases by threefold. Cortical inactivation experiments in cat V1 (Chung and Ferster 1998Go; Ferster et al. 1996Go) found a decrease, although similar experiments have not been done in monkeys (see DISCUSSION).

The modified recurrent model (MRM)

Because the antiphase connections in the MFM were found to increase the F1/F0 ratio (Fig. 3), we asked whether antiphase inhibition could make a recurrent mechanism relevant for sharp orientation tuning in simple cells. To do this, we started with a multiphase RM with phase-independent intracortical connections. We then introduced an additional set of inhibitory cells that received feedforward input from the LGN and same-phase excitatory connections from cortical excitatory cells in the RM. This extra set of inhibitory cells supplies antiphase inhibition to all cortical cells in the RM (see METHODS). What resulted was a phase-dependent response well tuned for orientation. Both the ON and OFF response to light and dark bars (results not shown) and the modulated response to a drifting grating (Fig. 7) showed spatial phase dependency, with the latter yielding an F1/F0 ratio of 1.53. This demonstrates that phase-independent recurrent connections and antiphase inhibition together can provide a mechanism for orientation tuning in V1 simple cells. In this modified recurrent model (MRM), the phase-independent recurrent connections sharpen orientation tuning and the antiphase inhibition preserves phase dependency. The cell in Fig. 7 had an HWHH value of around 25°.


Figure 7
View larger version (10K):
[in this window]
[in a new window]
 
FIG. 7. Steady-state responses of a V1 cell from the MRM to a 2-Hz drifting grating at the cell's preferred orientation (vertical) and preferred spatial frequency (0.8 cycles/deg). Temporal firing outputs of the cell with the cortical connections turned off and on are shown in A and B, respectively.

 
By adding antiphase inhibition to the RM, we not only produced well-tuned simple cells, but also created a model whereby a cell can have a range of F1/F0 values, from simple to complex, by varying the relative strengths of antiphase and recurrent inhibition. As noted in METHODS, the MRM has three types of cortical cells: excitatory (E), inhibitory (I), and antiphase