|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1Graduate School of Engineering Science and 2Graduate School of Frontier Biosciences, Osaka University, Osaka, Japan
Submitted 16 April 2007; accepted in final form 23 July 2007
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
|
Are subunits pooled spatially to constitute the receptive fields of complex cells? What is the shape of the overall receptive fields of complex cells, and how is it related to those of the subunits? To answer these questions, we have taken advantage of a second-order interaction analysis for dynamic two-dimensional (2D) noise stimuli. Analysis for second-order interactions can reveal the shape and spatial extent of luminance signal integration, which we define as a subunit. Subunits are intimately related but not identical to the underlying individual linear filters shown in Fig. 1, B and C. In addition, comparisons of properties of these subunits with those of simple cells are also made to examine the details of the hierarchical model of complex cell receptive fields.
| METHODS |
|---|
|
|
|---|
Surgical procedure and animal maintenance
After initial preanesthetic doses of hydroxyzine (Atarax, 2.5 mg) and atropine (0.05 mg), each cat was anesthetized with isoflurane (2–3.5% in O2) for the remainder of the surgical preparation. Lidocaine was injected subcutaneously or applied topically at all points of pressure and possible sources of pain. Electrocardiogram (ECG) electrodes were secured, a rectal temperature probe was inserted, and a femoral vein was catheterized. Body temperature was maintained near 38°C with the use of a feedback-controlled heating pad for the remainder of the experiment. Subsequently, a tracheostomy was performed and a glass tracheal tube was inserted for artificial respiration. The animal was then secured in a stereotaxic apparatus with the use of ear and mouth bars and clamps on the orbital rim. Anesthesia was then switched to sodium thiopental (Ravonal, 1.0 mg · kg–1 · h–1). After the stabilization of anesthesia, the animal was paralyzed with a loading dose of gallamine triethiodide (Flaxedil, 10 mg · kg–1 · h–1) to minimize eye movements during single-unit recording, and placed under artificial ventilation with a gas mixture of 70% N2O-30% O2. End-tidal CO2 was maintained at a constant level of 3.5–4.3% throughout the experiment. For the remainder of the experiment, the infusion fluid was delivered, containing sodium thiopental (Ravonal, 1.0 mg · kg–1 · h–1), gallamine triethiodide (Flaxedil, 10 mg · kg–1 · h–1), and glucose (40 mg · kg–1 · h–1) in lactated Ringer solution. In addition to body temperature and end-tidal CO2, heart rate, ECG, and intratracheal pressure were monitored and maintained within a normal range throughout the experiment. Pupils were dilated with atropine (1%) and nictitating membranes were retracted with phenylephrine hydrochloride (Neosynesin, 5%). Contact lenses with 4-mm artificial pupils were then placed on each cornea.
A craniotomy was carried out above the central representation of the visual field in visual area 17 approximately at the Horsley–Clarke coordinate, P4–L2.5. This corresponds to <10° in retinal eccentricity. The dura was removed to permit the insertion of tungsten microelectrodes for single-unit recording. After setting the electrodes close to the cortical surface, agar was applied over the cortex for protection and melted wax was applied over the agar to create a sealed chamber for stabilization. When the electrodes were retracted, electrolytic lesions (5 µA, 5 s) were made at 700- to 1,500-µm intervals for each electrode track. Typically, recordings from an animal lasted 4 days.
At the end of an experiment, the animal was administered an overdose of pentobarbital sodium (Nembutal) and perfused through the heart with formalin (4% in buffered saline). The recorded areas were frozen, sectioned into 40- to 60-µm slices, and stained with thionin. The locations of electrode tracks were identified.
Single-unit recording
Lacquer-coated tungsten microelectrodes (1–5 M
; A-M Systems, Sequim, WA) were used to record single-unit activities. Typically, two electrodes mounted in a single protective guide tube were driven in parallel with a common microelectrode drive to increase the chance of encountering neurons. Signals from the electrodes were amplified (x10,000), band-pass filtered (300–5,000 Hz), and fed into a custom-made data acquisition computer system (Ohzawa et al. 1996
). The data acquisition system consisted of A/D converters and a spike-waveform discriminator that sorted signals from each electrode in real time. The time-stamped spike events (time resolution, 40 µs) were sent from the data acquisition system, along with their waveforms, to a separate computer that controlled the experiment. This second computer saved to files the spike data and other events, including the onset and the offset times of each trial and stimulus presentations, parameters and conditions of the spike sorter, and the entire set of experimental parameters. The stored data were analyzed on a third computer during and after the experiment.
Visual stimulation
Generations of visual stimuli were performed using custom-built software on yet another computer. At the request of the experiment control computer noted earlier, visual stimuli were generated by the fourth computer controlling a graphics card (Millenium G550; Matrox, Dorval, Quebec, Canada) and displayed on a cathode ray tube monitor (76-Hz frame rate, 1,600 x 1,024 pixels; GDM-FW900; Sony, Tokyo, Japan) through the green channel only to avoid color misconvergence across channels. In each experiment, the luminance nonlinearity of the display was measured using a photometer (Minolta CS-100; Konica Minolta, Osaka, Japan) and linearized by gamma-corrected lookup tables. The animal saw the display through a custom-built haploscope, which allows visual stimuli to be presented to the left and right eyes separately (Sanada and Ohzawa 2006
). A black separator was placed between the left and right visual fields to preclude the projection of stimuli to unintended eyes. Distance (total length of light paths) between the screen and the eyes was set to 57 cm, subtending the visual field of 23 x 30° for each eye.
Recording procedure
When a single unit was isolated, preliminary observations were made to determine its optimal orientation, spatial frequency, and both the position and the size of its receptive field. In this "search" procedure, the orientation, spatial frequency, position, and size of a patch of drifting sinusoidal grating could be adjusted with the use of a pointing device (mouse). Having completed the above-cited procedure, tuning in the orientation and spatial frequency domain was measured for the cell by presenting flashed sinusoidal grating stimuli (Nishimoto et al. 2005
) and/or drifting sinusoidal grating stimuli. The degree of response modulation was assessed by presenting sinusoidal grating with a combination of orientation and spatial frequency that elicited the largest number of spikes. The cell was then classified into simple or complex on the basis of the amplitude of the first harmonic component relative to the average firing rate (F1/F0 ratio; Li et al. 2003
; Skottun et al. 1991
).
To evaluate the receptive field structure, we presented dynamic 2D noise stimuli with 51 x 51 small dots. The noise stimuli covered an area typically two- to threefold larger than the receptive field of the neuron in both the horizontal and the vertical directions. Each dot was assigned with dark (
3 cd · m–2), bright (
90 cd · m–2), or gray luminance (
47 cd · m–2) at equal probability. The gray dots had the same luminance value as the mean luminance of the display. The dot size was determined for each cell primarily based on optimal spatial frequency to achieve both sufficient spatial resolution and signal-to-noise ratio (0.12 x 0.12 to 0.67 x 0.67° for individual dots; 5.3 dots per subregion width on average). The noise pattern was refreshed at every 26 or 13 ms, which corresponded to two video frames or one, respectively. Presentation of the dynamic noise stimuli lasted about 30 min for collecting a sufficient number of spikes for the following data analysis.
First-order map and second-order interaction map
Monocular visual stimuli (S) can be fully characterized by two dimensions of space (x, y) and one of time (t), S(x, y, t). The 2D dynamic noise stimuli used in this study consisted of three luminance values, 0, –1, 1, which corresponded to gray, dark, and bright dots, respectively. Spike-triggered stimuli were of particular interest among S(x, y, t).
For simple cells, the first-order maps were calculated to analyze the receptive field structure.
The first-order map, h1(x, y,
), which represents the linear component of neuronal responses, was obtained by the spike-triggered averaging of S(x, y, ti –
)
![]() |
For complex cells, the second-order interaction maps were computed to examine the internal structure of their receptive fields. This procedure is schematically illustrated in Fig. 2.
|
1,
2), was calculated by accumulating the product of two spike-triggered stimuli, i.e., S(x1, y1, ti –
1) and S(x2, y2, ti –
2) for the ith spike
![]() |
corresponds to a temporal offset between them. Thus the second-order interactions are described by the six-dimensional function.
This computation casts a positive vote for an interaction between the stimuli with the same contrast polarity (dark–dark and bright–bright) and a negative vote for an interaction between those with the opposite contrast polarity (dark–bright, bright–dark). When at least one member of a pair of stimuli is a gray dot, no vote is presented. This calculation is thus essentially identical to one used for sparse noise stimuli in previous studies (Gaska et al. 1994
; Livingstone and Conway 2003
).
To obtain a second-order interaction map, one stimulus location was selected as the reference and fixed at a particular spatial coordinate (x1, y1) and a particular correlation delay
1 as shown in Fig. 2B. Then, the second-order interaction map for the reference (x1, y1,
1) was calculated within the local neighborhood of this reference, as indicated by the dashed square in Fig. 2B. The spatial displacements for dx and dy ranged from –10 to 10 stimulus dots. Empirically, this extent for the second-order analysis was sufficiently wide for containing significant interactions. The second-order interaction with itself (i.e., dx = 0, dy = 0, and d
= 0) cannot be measured with ternary dense noise stimuli because overlapping stimuli cannot be represented. For further analysis, the value at zero displacement was filled in by a spline interpolation from neighboring pixels in the interaction map. Because the two-stimulus interaction profiles are strongest and occupy the largest spatial extent between simultaneously presented stimuli in general (Anzai et al. 2001
; Gaska et al. 1994
), the temporal offset d
was always set to 0 ms to evaluate the spatial structure and extent of second-order interaction maps. Accumulation of votes for all spikes produced the second-order interaction map for the reference (x1, y1,
1). By changing the values of x1, y1, and
1, second-order interaction maps were obtained for references at locations of every dot in visual stimuli for correlation delays from 0 to 197 ms in 13-ms (duration of one video frame) steps.
Second-order interaction map and subunit
The term subunit has been used extensively for referring to a functional or virtual unit that operates as a linear spatial (or spatiotemporal) filter internal to a complex cell (Livingstone and Conway 2003
; Movshon et al. 1978b
; Ohzawa and Freeman 1986
; Rust et al. 2005
). In this report, we define a subunit as a functional unit that is described by a second-order interaction profile as explained earlier. A subunit is closely related to, but is not identical to, linear filters depicted in models for complex cells (Fig. 1, B and C). As mentioned in the previous reports, a subunit contains contributions from multiple linear filters, presumably simple cells according to the hierarchical model (Hubel and Wiesel 1962
) and thus does not represent a single anatomically identifiable cell.
Data analysis
First-order receptive fields and subunits in second-order maps are usually modulated along an axis perpendicular to the optimal orientation. We obtained their spatial extents by using a partial Hilbert transform (Hahn 1992
). This method for obtaining an envelope is applicable for any multidimensional data modulated along a single axis because it does not assume a specific functional form. Alternatively, our maps could have been fitted by a model function selected a priori and its envelopes may have been extracted for subsequent analyses. A reasonable candidate for such a model function is a Gabor function, as used by most recent studies dealing with simple cell receptive fields. However, some second-order maps of complex cells may have broader flanking subregions than a central one (Szulborski and Palmer 1990
). This characteristic is not captured well by a Gabor function and thus we sought to avoid fitting second-order maps by it.
Here, the partial Hilbert transform was carried out in the 2D spatial frequency domain along an axis perpendicular to the optimal orientation. First, the 2D Fourier transform of the map was computed. Then, the spatial frequency domain was divided into two mirror-symmetrical halves about the axis perpendicular to a vector pointing to the optimal orientation and spatial frequency from the origin. As a result, each half of the spatial frequency domain had spectral components that are identical in amplitude and are different in the sign of imaginary parts. The spatial frequency components in one half were shifted by 90° in phase; for the other half, they were shifted by –90° in phase. The inverse Fourier transform of the result yielded a complex signal where the real parts retained the original signal (Fig. 2E) and formed a quadrature pair with the imaginary parts (Fig. 2F). The envelope of the map (Fig. 2G) was obtained by calculating the absolute value (amplitude) of the complex signal. Because the second-order interaction maps were obtained for all dot locations in visual stimuli, some of maps were outside the complex cell receptive field and contained no signal. For these maps, the axis for the partial Hilbert transform could not be determined. Likewise, for cells without significant orientation tuning or spatial antagonism, the axis could not be determined either. Therefore for these cases, the absolute values of interaction maps themselves were used as their envelopes.
In previous studies, the second-order interaction maps were often averaged across all reference positions within the receptive field of a complex cell to enhance signal-to-noise ratios for a single final map (Emerson et al. 1987
; Gaska et al. 1994
; Livingstone and Conway 2003
). This means that the six-dimensional interaction maps, h2(x1, x1 + dx, y1, y1 + dy,
1,
1 + d
), were reduced to just three dimensions of dx, dy, and d
. In this approach, the original maps were assumed to be spatially homogeneous. This assumption is justified for some complex cells (Emerson et al. 1987
), but not for others (Szulborski and Palmer 1990
). Therefore we examined second-order interaction maps and their envelopes for all individual reference positions separately. We assume that the strength of a second-order subunit is proportional to the collective responses of underlying linear filters contributing to that subunit. With this assumption, the overall receptive field of a complex cell may be obtained by collecting the envelopes of the second-order interaction maps. A complex cell receptive field at a correlation delay of
was obtained as follows. First, we squared the envelopes of second-order interaction maps at a delay
at all reference locations. The squaring operation was incorporated to approximate the effects of power-law static nonlinearities at the outputs of subunits (Gaska et al. 1994
). Then, the squared envelopes were summed into a larger map (Fig. 2H) at their respective positions. Finally, its square root was computed. For further analysis, an optimal correlation delay was determined at which the amplitude of the receptive field was at maximum. We found that >95% interaction maps exhibited the strongest interactions at the optimal correlation delay. This supports that the receptive field size was estimated appropriately because of constancy in timing across interaction maps.
As described earlier, the receptive field envelopes of a simple cell were obtained in 13-ms intervals. These were then spline-interpolated along the time axis in 1-ms steps. For a correlation delay where the interpolated envelope had the maximum response, the receptive field and its envelope were computed for further analysis.
We sought to evaluate the spatial shape for the second-order interaction envelopes and the receptive fields of complex cells and the receptive field envelopes of simple cells. To extract parameters for characterizing them, each of them was fitted by a 2D Gaussian function having the form
![]() |
![]() |
![]() |
x',
y', and
are free parameters. Specifically, the spatial coordinate (x0, y0) corresponds to the center position and
rotates the translated coordinate system to align x and y axes with the major and minor axes of the Gaussian function.
x' and
y' are measures of the spatial extent, B is a baseline parameter, and K is simply a scaling factor. The Gaussian fit accounted for 70% of variance in data on average. Spatial properties for the subunits and receptive fields were quantified on the basis of the parameters of the best-fitted Gaussian function. To evaluate size parameters such as length, width, and area, a bound was drawn at a criterion level that is 5% of the peak amplitude of the fitted Gaussian function. For this boundary, length and width were measured along the parallel and perpendicular axes to the optimal orientation, respectively. Figure 3 schematically shows the length and the width for the subunits and the receptive fields of complex cells. The length and the width for the receptive fields of complex cells were defined with respect to the optimal orientation of the subunits with the highest amplitude. For the receptive fields of simple cells, the length and the width were obtained for the optimal orientation. Optimal orientation, obtained by Fourier analysis, is a parameter independent of the major axis of the fitted Gaussian function.
|
| RESULTS |
|---|
|
|
|---|
One of the major purposes of this study was to examine spatial pooling of subunits that collectively constitute the receptive fields of single complex cells. To achieve this goal, spatial characteristics were compared between the subunits and the overall receptive fields for individual complex cells. If the hierarchical model of Hubel and Wiesel (1962)
is correct (Alonso and Martinez 1998
; Martinez and Alonso 2001
; but see also Toyama et al. 1981
), subunits of a complex cell should closely reflect properties of simple cells contributing inputs to it. Although simple cells in our population were recorded separately from complex cells and do not have direct connection to the complex cells, we were able to compare the two cell types as groups. Specifically, it is worth comparing corresponding parameters between the subunits of complex cells and the receptive fields of simple cells. The simple and complex cells were recorded from the same set of electrode tracks in the same animals using 2D dynamic noise stimuli, although the proportions of cell types varied from one track to another.
Second-order interaction maps
Figure 4 shows a representative example of the second-order interaction analysis for a complex cell. In Fig. 4A, interaction maps are tiled to reflect the spatial locations of their references. Although the references may be at every dot location of the stimuli, the results are displayed here for reference positions at every two stimulus dots. The interaction maps at the top left and center in Fig. 4A are partially overlapping in the actual stimulus domain, as indicated on a noise stimulus frame (Fig. 4B), and outlined by dashed and thick lines, respectively. The interaction maps vary in shape depending on reference positions (Szulborski and Palmer 1990
). The central interaction map in A is also shown magnified in Fig. 4C. To determine the spatial extent of a subunit in the map, the partial Hilbert transform (Hahn 1992
) was performed as shown in Fig. 4D and then the envelope was obtained as shown in Fig. 4E. Note that Fig. 4, C and D differ in spatial phase by 90° and thus form a quadrature pair. To illustrate the bounds of the interaction map, the dashed line is drawn in Fig. 4, C and E at a criterion level that is 5% of the peak height of the 2D Gaussian function fitted to the envelope of the interaction map. By taking the square root of the sum of the squared envelope maps for all reference positions, the receptive field of the complex cell was obtained as shown in Fig. 4F.
|
The degree of spatial pooling of subunits may be quantified by the difference in spatial extent between the subunits and the overall receptive field. When the outputs of many subunits are pooled across different spatial positions to generate an overall receptive field, its size should occupy a much larger spatial extent than that of individual subunit envelopes. However, when pooling is absent, the size of a receptive field is expected to be identical to that of subunit envelopes. Therefore the complex cell depicted in Fig. 4 exhibited hardly any spatial pooling. The absence of spatial pooling is in fact what is predicted by the minimalist energy model; therefore the energy model is an accurate representation of the cell in Fig. 4.
Figure 5 shows data from another complex cell in the same format as that for Fig. 4. For this cell, the size of the overall receptive field (solid ellipse in Fig. 5, C, E, and F) was significantly larger than that of the subunit envelope (dashed ellipse; 169% in terms of area; P < 0.05, resampling). Therefore spatial pooling of subunits did take place for this neuron.
|
|
|
Length and width
Having examined interaction maps and the extent of pooling for representative complex cells, we now analyze these properties for the population of cells we have recorded. In the population analyses, complex cell subunits were sampled at the center of the receptive fields. First, what relationships are found between the length and the width for complex cell subunits, receptive fields, and simple cell receptive fields? Figure 8 A shows a relationship between the length and the width for the subunits of complex cells. The length and the width were defined with respect to the optimal orientation (obtained by Fourier analysis) for the subunits (Fig. 3). To make a cell-by-cell comparison of the two values possible, subunit length is plotted against subunit width. Each circle in the scatterplot represents a datum from one complex cell. Note that the majority (63%) of circles lie above the identity line. The histograms show the distributions of the length (right) and the width (top) for subunits, with median values of 4.05 and 3.63°, respectively, as indicated by arrows. The subunit length was generally longer than the subunit width (P = 0.0042, Wilcoxon signed-rank test).
|
Figure 8C illustrates a relationship between the length and the width for the receptive fields of simple cells. Each circle in the scatterplot denotes a datum from a simple cell. The histograms show the distributions of the length (right) and the width (top) for the receptive fields of simple cells, respectively. The majority (65%) of symbols are above the identity line. The median length was 4.40° and the median width was 4.00°. This difference was statistically significant (P < 0.001, Wilcoxon signed-rank test). This was exactly the same trend as observed for the subunits of complex cells as shown in Fig. 8A. Are the distributions of subunit length and width similar to those of receptive fields of simple cells? The answer is yes, and there was not a significant difference in either the length or the width (P > 0.05, Mann–Whitney U test).
Number of subregions
A chief functional characteristic of the early visual cortical neurons is selectivity for orientation and spatial frequency (Hubel and Wiesel 1959
, 1962
; Movshon et al. 1978a
,b
). Among various mechanisms contributing to the determination of tuning parameters (Benevento et al. 1972
; Bredfeldt and Ringach 2002
; Ringach et al. 2002
), the number of subregions in complex cell subunits is inversely related to tuning bandwidths for orientation and spatial frequency (Gaska et al. 1994
; Movshon et al. 1978b
). The number of subregions for a subunit was calculated as follows
![]() |
|
Length-to-width ratios and aspect ratios
Figure 10 A compares length-to-width ratios between subunits and receptive fields. Note that length-to-width ratios are aspect ratios defined with respect to the preferred orientation. However, we reserve the term aspect ratios to refer only to those related to the elliptic elongation of subunit envelopes and complex cell receptive fields (see following text). There was a strong correlation between the two ratios (r = 0.81). The two distributions also had indistinguishable medians (1.12 and 1.04 for the length-to-width ratios of subunits and receptive fields, respectively, as indicated by arrows; P > 0.05, Wilcoxon signed-rank test). These results appear to indicate that shapes of subunits are reflected directly to those of the overall receptive fields of complex cells.
|
The direction of receptive field elongation, if any, can be different and may be independent of the preferred orientation of a neuron in general (Fig. 3). A similar distinction also applies for properties of subunits. Are complex cell receptive fields more elongated in a particular direction than their subunits? Is there any relationship between the direction and degree of subunit elongation and pooling? To address these questions, we quantified the degree of elongation by aspect ratios, which were defined by the length ratios of major and minor axes for subunit envelopes or complex cell receptive fields.
Figure 11 A compares aspect ratios between the subunit envelopes and the receptive fields of complex cells. The two ratios were correlated (r = 0.68). The median value of the distribution of aspect ratios for receptive fields, 1.28, was significantly smaller than that for subunits, 1.32 (P < 0.001, Wilcoxon signed-rank test). This indicates that the receptive fields of complex cells are more circular in shape than the subunits. The reduction of aspect ratios may be caused by the spatial pooling of subunits, which will be subsequently examined in Pooling of subunits.
|
Orientation of elongation
There may be a bias in the elongation of the subunit envelopes and the receptive fields of complex cells with respect to the absolute horizontal and vertical. Scatterplots in Fig. 12 show aspect ratios against the orientation of the major axis for subunit envelopes (left) and receptive fields (right). These two quantities show no systematic relationship for respective panels. The horizontal solid line is drawn at an arbitrary criterion level (aspect ratio = 1.2) to determine whether subunit envelopes and receptive fields are sufficiently elliptic for the subsequent analysis. For subunit envelopes and receptive fields with small aspect ratios (
1.2; open symbols), their major axis cannot be determined reliably. On the other hand, those with large values (>1.2; filled symbols) were judged as sufficiently elliptic, and were counted in histograms to examine distributions of the orientation of the major axis (n = 66 for subunit envelopes, n = 56 for receptive fields). These distributions were not different from uniform distributions (P > 0.05, Kolmogorov–Smirnov test). Therefore the results show no special orientation bias in the aspect ratios for both subunits and overall receptive fields of complex cells.
|
1.2; open symbols; n = 38). Simple cells that prefer horizontal orientation (0 and 180°) appear less likely to have vertically elongated receptive field envelopes (90°). To investigate the biased elongation of receptive field envelopes regardless of optimal orientation, the number of simple cells was counted in the top histogram (for cells with sufficiently elongated envelopes). Its striking U shape demonstrates a general tendency in which there were more simple cells with horizontally elongated envelopes than those elongated vertically (P < 0.05, Kolmogorov–Smirnov test for uniform distribution).
|
|
Pooling of subunits
To evaluate the degree of the spatial pooling of subunits that make up complex cells, a pooling ratio in terms of area was calculated for each neuron
![]() |
1. On the other hand, much larger values for the areal pooling ratio are expected when subunits are pooled spatially. The histogram of Fig. 15 A shows the distribution of the areal pooling ratios observed for 86 complex cells. The median of areal pooling ratios was 1.21. In length or width terms, this amounts to a 10% difference on average. These values indicate that, if anything, the spatial extents of receptive fields of complex cells were only slightly larger than those of the subunits. For each complex cell, a bootstrap test was performed to assess whether the areal pooling ratio was significantly different from one. For 32 of 86 cells (37%), the areal pooling ratio was significantly >1 (P < 0.05, resampling; black bars).
|
It is possible that our analyses might underestimate the receptive field sizes of complex cells, if weak subregions of subunits were to be missed in noisy maps. This could lead to incorrect estimation of the degree of spatial pooling. If this were the case, quality of data would have influenced the number of subregions in subunits and the areal pooling ratios. Thus a signal-to-noise ratio was calculated for each subunit. Signal can be considered as an amplitude parameter obtained by the Gaussian function fitted to the subunit envelope. Noise level was estimated as the SD of amplitudes of envelopes that were obtained by correlating spike sequence we actually recorded to stimulus noise patterns we did not present but with otherwise identical statistical properties. Examination of the signal-to-noise ratios of subunits revealed no systematic relationships with the number of subregions in subunits or the areal pooling ratios.
Figure 15, B and C compares the sizes of complex cell subunits, receptive fields, and simple cell receptive fields. By using hand-plotting mapping protocol for the minimum response fields, Hubel and Wiesel (1962)
reported that the receptive fields of complex cells were, on average, threefold larger than those of simple cells. The median of the receptive field areas of simple cells we recorded was 2.42 deg2 and that of the subunit areas and the receptive field areas of complex cells were 1.93 and 2.89 deg2, respectively. We found that the receptive fields of simple cells have spatial extents equal to those of subunits and the receptive fields of complex cells (P > 0.05, Mann–Whitney U test).
Is there any particular axis along which pooling of subunits tends to occur? As shown in Fig. 11A, aspect ratios are smaller for the receptive fields of complex cells than for the subunits. Decrease in the aspect ratios can be readily explained, if subunits are pooled more extensively along the minor axis of their envelopes. To examine the possible radial anisotropy of spatial pooling, the degree of pooling was evaluated directionally along radial axes that were at various angles with the major axis of subunit envelope, as illustrated in Fig. 16 C. Specifically, the sizes were measured along each radial axis for the subunit envelope and the receptive field. The directional pooling ratio was defined as the receptive field size divided by the subunit envelope size. Directions were then searched for which the directional pooling ratios are maximal and minimal.
|
Figure 16B depicts the average ± SE values of the directional pooling ratios as a function of the angle from the subunit major axis. The difference in the pooling ratios across different directional angles was statistically significant (P < 0.001, Friedman test).
Subunit organizations of complex cells underlying contrast-sign invariance
How do complex cells acquire response invariance to the sign or polarity of contrast of visual stimuli? The minimalist scheme, often used in modeling, for response invariance to contrast polarity is a full-wave rectification of the outputs of two linear filters that differ in spatial phase by 90° (Adelson and Bergen 1985
; Fleet et al. 1996
). In reality, biological substrates for the linear filters feeding into a complex cell are selective for the contrast polarity and their outputs are half-wave rectified. Functionally, linear filters must thus be organized as a push–pull pair (Ohzawa et al. 1990
; Pollen et al. 1989
; Troyer et al. 1998
). In theory, members of such push–pull pairs should have receptive fields that are inverted versions of each other, as assumed by various energy model schemes (Adelson and Bergen 1985
; Emerson et al. 1992
; Ohzawa et al. 1990
). However, there is no guarantee that this is actually the case for real complex cells. Therefore we will examine positive and negative halves of subunit organizations by sorting separately the second-order response maps according to the contrast polarity of reference stimuli (Emerson et al. 1987
; Livingstone and Conway 2003
).
Together with second-order interaction maps, Fig. 17 depicts bright-minus-dark maps for dark- and bright-reference stimuli separately for four complex cells. These maps in a given row were obtained at the identical location at which the strongest second-order interactions were observed for each cell. Figure 17, left shows a bright-minus-dark map when a dark stimulus was presented as the reference and Fig. 17, center shows a bright-minus-dark map when a bright stimulus was presented as the reference. The gray scale is matched between the two maps. As described in METHODS, subtraction of these bright-minus-dark maps yielded a second-order interaction map as shown in Fig. 17, right. [Multiplication by a negative (dark) reference corresponds to subtraction.] The reference location is denoted by the crosshair in each map. Different biological substrates are recruited by visual stimuli having opposite contrast polarities as the reference and, in ideal complex cells, the two bright-minus-dark maps should be an inverted version of each other. This appears to be the case for cells illustrated in Fig. 17, A and B.
|
All four complex cells presented here exhibit even-symmetric second-order interaction maps (Fig. 17, right column) as reported previously (Emerson et al. 1987
; Gaska et al. 1994
; Livingstone and Conway 2003
). However, a distinction must be made between our second-order maps and those of previous studies. For previous results, due to simple methodological reasons, the interaction maps were guaranteed to be even-symmetric because the maps were averaged for all reference positions. Because each of our maps is for a single reference position, there is no such methodological basis for the symmetry. Therefore the even symmetry in our maps represents the true nature of the two-stimulus interaction sampled near the center of the complex cell receptive fields. In fact, examinations of interaction maps near the fringes of complex cell receptive fields often revealed asymmetric interaction maps (see maps in the fifth row in Fig. 4A).
In contrast to the second-order interaction maps, however, internal organizations of subunits exhibited various response patterns to dark- and bright-reference stimuli. The bright-minus-dark maps can show different center positions and unequal strength between dark and bright references.
The minimalist energy model predicts that, when the reference is located at the center of a complex cell receptive field, the bright-minus-dark maps for dark and bright references have 1) centers at the reference location and 2) identical amplitude. These predictions are obviously violated in some complex cells as shown in Fig. 17, C and D. Thus we sought to evaluate deviations from the model predictions for a population of complex cells actually recorded. First, the envelope for a bright-minus-dark map was computed by partial Hilbert transform. Then, a Gaussian function was fitted to the envelope, and the parameters for center position and amplitude were examined. The center position was measured from the reference location and converted into spatial phase angle with respect to the optimal spatial frequency when the center was projected onto the width axis (the axis orthogonal to the optimal orientation for the subunit).
Figure 18 illustrates the distributions of center position and amplitude for the envelopes of bright-minus-dark maps for dark (left) and bright (right) references. Neurons shown in Fig. 17 are indicated by the corresponding labels. The center position expressed as a phase angle represents a spatial offset—thus it was not artificially wrapped into a range from –180 to 180°. However, the actual data points were scattered within the range. The amplitude was normalized by the SD of the amplitude values on the edges of the bright-minus-dark map envelope, which is considered to represent the noise level. Therefore the normalized amplitude may be thought of as a signal-to-noise ratio. Both of the distributions show an inverted-V shape centered about zero, indicating that any map with high amplitude tended to have its center at the reference location. This is consistent with the minimalist energy model. However, the center position could be distant from the reference location as the amplitude decreased.
|
|
| DISCUSSION |
|---|
|
|
|---|
|
A notion often held regarding complex cells is that their receptive fields tend to be large compared with simple cells, occupying an area threefold as large on average (Hubel and Wiesel 1962
). A prediction based on this idea is that a large number of simple cells must be combined hierarchically for constructing a complex cell receptive field. To our surprise, in general this initial expectation was not fulfilled. When examined quantitatively, the sizes of receptive fields of complex cells on average were only slightly larger (1.21-fold in area) than those of their subunit envelopes (Fig. 15A). The sizes of receptive fields for simple cells were also comparable to those of subunits for complex cells. Therefore the casually held notion is not correct for typical complex cells.
What could be the sources of the discrepancy between the Hubel and Wiesel's result and ours? Sampling biases between the cell types are unlikely to be the cause because our simple and complex cells were from the same set of electrode penetrations. Although individual tracks varied in their eccentricity and ratio of cell types, these factors should not influence the overall distributions for the entire population of cells. Furthermore, we have obtained about 10 pairs of simultaneously recorded simple and complex cells. The receptive field sizes of simple and complex cells within a given pair were always closely similar. Essentially the same results, obtained for the relationships between the complex cells and their subunits, and those between the simple and complex cells, also strengthen our results.
It is possible that the "minimum response field" plotting (Bishop and Henry 1972
) used in typical hand mapping of receptive fields may miss weak subregions of simple cells, thereby underestimating the receptive field size. This problem would be more acute if only a bright bar is used in mapping (which is true for nearly all early studies including those by Hubel and Wiesel), because a dark-excitatory subregion of a simple cell may not generate any OFF response if the temporal response is monophasic (Fig. 5B of DeAngelis et al. 1995
). In such cases, even a very strong dark-excitatory subregion would be missed, leading to an underestimation of receptive field area by a factor of
2. On the other hand, however, complex cells are not likely to be affected by this problem because, by definition, ON and OFF subregions are overlapping for these cells and the use of a bright bar only is not an obstacle for correctly estimating the receptive field size.
Spatial pooling of subunits
In theory, if there is a substantial degree of spatial pooling, the second-order interaction for a particular reference location can be limited to a portion of the receptive field and does not necessarily cover the entire spatial extent of the receptive field of a complex cell. However, in the cat early visual cortex, we have demonstrated that subunits obtained by the second-order interaction analysis are pooled spatially to only a small degree for most cells. In other cortical areas, such a pooling does take place. For example, Pack et al. (2006)
measured second-order interaction maps for middle temporal (MT) neurons in the spatiotemporal domain, to study the direction selectivity, and found that the spatial extent of two-stimulus interaction is limited to a small portion in the receptive field. Because neurons projecting from the primary visual cortex to MT are predominantly direction-selective complex cells (Movshon and Newsome 1996
), complex cells having the receptive fields in various positions might serve as subunits that are pooled spatially to constitute the receptive field of MT cells. Note that our results do not prove or disprove pooling of simple cells in making up spatially larger subunits. This is because summation of multiple linear receptive fields cannot be distinguished from a single large linear receptive field.
Subunit organizations of complex cells underlying contrast-sign invariance
The seminal characteristic of complex cells is that ON and OFF responses may be elicited everywhere within the receptive field (Hubel and Wiesel 1959
, 1962
). Such insensitivity to the contrast polarity is thought to be achieved by combining the outputs of multiple simple cells that are sensitive to the contrast sign. Internal structures of these complex cells have been studied with various nonlinear systems analysis techniques, most of which have assumed symmetrical structures for the contrast sign (e.g., Marmarelis and Marmarelis 1978
). However, actual complex cells do not necessarily respond identically to dark and bright stimuli (e.g., Fig. 1B of Ohzawa et al. 1990
). Therefore we have examined second-order interaction maps separately for the bright- and dark-reference stimuli. This analysis demonstrated that, contrary to the energy model assumption, the two maps could not be related to each other simply by inverting signs. The unmatched positions of bright-minus-dark maps could account for the fact that complex cell subunits have more subregions than simple cells. Together with the position displacement, the imbalanced amplitudes between these maps might contribute to produce "imperfect" complex cells, which exhibit a minor degree of response modulation (e.g., 0.2 < F1/F0 ratio < 0.8) when a drifting sinusoidal grating is presented.
Relationship to other studies on internal organizations of complex cell receptive fields
Second-order interaction maps represent how two visual stimuli interact to elicit the responses of complex cells. Thus they may be interpreted as one possible set of linear filters for explaining complex cell responses, although they do not necessarily correspond to biological entities such as the receptive fields of simple cells that feed into complex cells.
Several spike-triggered techniques are also suited for revealing the filter structure of complex cells. Recently developed spike-triggered covariance (STC) techniques discover multidimensional axes, or filters, relevant for explaining the variance structure of spike-triggered stimuli in the multidimensional space (Rust et al. 2005
; Touryan et al. 2002
). These methods rely on principal component analysis to minimize the number of filters by imposing mutually orthogonal relationships on them. Therefore a set of recovered filters can economically span the entire subspace in which neuronal responses are increased (excitatory filters) and decreased (suppressive filters). This has an advantage over second-order interaction analysis, which reveals only a single net excitatory filter for each reference position. However, filters recovered by the STC can be too large in spatial extent, in the sense that visual stimuli may not interact in the entire spatial extents of recovered filters to elicit neuronal responses and shape tuning for stimulus features (Rust et al. 2005
). Therefore even though neither the map obtained from the second-order interaction analysis nor that from the STC technique represents responses of a single biological entity, the second-order interaction maps are closer to the biological reality, in that they accurately reflect the spatial limits within which linear summation of stimuli takes place.
Other promising methods examine the net excitatory and inhibitory responses of complex cells in the spatial frequency domain. With these techniques, spike-triggered stimuli are first converted to their spectra in the spatial frequency domain by the Fourier transform, and then they are averaged in the frequency domain separately for different phases (David et al. 2004
) or without regard to phase (Nishimoto et al. 2006
; Ringach et al. 1997
). Spectral representation potentially enables suppressive responses, such as suppression at low spatial frequency and cross-orientation suppression, to be dissociated from excitatory responses (Bredfeldt and Ringach 2002
; Nishimoto et al. 2006
; Ringach et al. 2002
). These suppressive responses are masked by excitatory responses in second-order interaction analysis because they are usually overlapped in the space domain.
Comparison of spatial properties between complex cell subunits and simple cell receptive fields
As a population, the receptive fields of simple cells were comparable to complex cell subunits in width, length, width-to-length ratios, and aspect ratios. However, complex cell subunits contain more subregions than simple cell receptive fields. This is consistent with our findings that bright-minus-dark maps for dark- and bright-reference stimuli are often displaced and therefore underlying linear filters for complex cells do not necessarily form a push–pull pair. Another interesting fact is that the receptive field envelopes of simple cells tended to be elongated horizontally rather than vertically. Complex cells, on the other hand, exhibited no such bias either for the subunits or for the receptive fields. Therefore multiple simple cells could feed into a complex cell in an anisotropic manner such that the bias toward the horizontal elongation of their receptive fields is weakened.
Herein we have examined spatial relationships between the subunits and receptive fields for individual complex cells. Our strategy is a logical practical step toward understanding how the receptive fields of complex cells are built up from simple cells, as modeled by Hubel and Wiesel (1962)
. To obtain an unequivocal answer, future studies must compare subunit properties of complex cells to those of antecedent simple cells that are directly connected to them (Alonso and Martinez 1998
).
| GRANTS |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: I. Ohzawa, Graduate School of Frontier Biosciences and School of Engineering Science, Osaka University, 1-3 Machikaneyama, Toyonaka, Osaka 560-8531, Japan (E-mail: ohzawa{at}fbs.osaka-u.ac.jp)
| REFERENCES |
|---|
|
|
|---|
Alonso JM, Martinez LM. Functional connectivity between simple cells and complex cells in cat striate cortex. Nat Neurosci 1: 395–403, 1998.[CrossRef][Web of Science][Medline]
Anzai A, Ohzawa I, Freeman RD. Joint-encoding of motion and depth by visual cortical neurons: neural basis of the Pulfrich effect. Nat Neurosci 4: 513–518, 2001.[Web of Science][Medline]
Benevento LA, Creutzfeldt OD, Kuhnt U. Significance of intracortical inhibition in the visual cortex. Nat New Biol 238: 124–126, 1972.[CrossRef][Web of Science][Medline]
Bishop PO, Henry GH. Striate neurons: receptive field concepts. Invest Ophthalmol 11: 346–354, 1972.
Bredfeldt CE, Ringach DL. Dynamics of spatial frequency tuning in macaque V1. J Neurosci 22: 1976–1984, 2002.
David SV, Vinje WE, Gallant JL. Natural stimulus statistics alter the receptive field structure of V1 neurons. J Neurosci 24: 6991–7006, 2004.
DeAngelis GC, Ohzawa I, Freeman RD. Spatiotemporal organization of simple-cell receptive fields in the cat's striate cortex. I. General characteristics and postnatal development. J Neurophysiol 69: 1091–1117, 1993.
DeAngelis GC, Ohzawa I, Freeman RD. Receptive-field dynamics in the central visual pathways. Trends Neurosci 18: 451–458, 1995.[CrossRef][Web of Science][Medline]
Emerson RC, Bergen JR, Adelson EH. Directionally selective complex cells and the computation of motion energy in cat visual cortex. Vision Res 32: 203–218, 1992.[CrossRef][Web of Science][Medline]
Emerson RC, Citron MC, Vaughn WJ, Klein SA. Nonlinear directionally selective subunits in complex cells of cat striate cortex. J Neurophysiol 58: 33–65, 1987.
Fleet DJ, Wagner H, Heeger DJ. Neural encoding of binocular disparity: energy models, position shifts and phase shifts. Vision Res 36: 1839–1857, 1996.[CrossRef][Web of Science][Medline]
Gaska JP, Jacobson LD, Chen HW, Pollen DA. Space-time spectra of complex cell filters in the macaque monkey: a comparison of results obtained with pseudowhite noise and grating stimuli. Vis Neurosci 11: 805–821, 1994.[Web of Science][Medline]
Gilbert CD. Laminar differences in receptive field properties of cells in cat primary visual cortex. J Physiol 268: 391–421, 1977.
Hahn SL. Multidimensional complex signals with single-orthant spectra. Proc IEEE 80: 1287–1300, 1992.[CrossRef]
Hubel DH, Wiesel TN. Receptive fields of single neurones in the cat's striate cortex. J Physiol 148: 574–591, 1959.
Hubel DH, Wiesel TN. Receptive fields, binocular interaction and functional architecture in the cat's visual cortex. J Physiol 160: 106–154, 1962.
Jones JP, Palmer LA. The two-dimensional spatial structure of simple receptive fields in cat striate cortex. J Neurophysiol 58: 1187–1211, 1987.
Li B, Peterson MR, Freeman RD. Oblique effect: a neural basis in the visual cortex. J Neurophysiol 90: 204–217, 2003.
Livingstone MS, Conway BR. Substructure of direction-selective receptive fields in macaque V1. J Neurophysiol 89: 2743–2759, 2003.
Marmarelis PZ, Marmarelis VA. Analysis of Physiological Systems. New York: Plenum, 1978.
Martinez LM, Alonso JM. Construction of complex receptive fields in cat primary visual cortex. Neuron 32: 515–525, 2001.[CrossRef][Web of Science][Medline]
Movshon JA, Newsome WT. Visual response properties of striate cortical neurons projecting to area MT in macaque monkeys. J Neurosci 16: 7733–7741, 1996.
Movshon JA, Thompson ID, Tolhurst DJ. Spatial summation in the receptive fields of simple cells in the cat's striate cortex. J Physiol 283: 53–77, 1978a.
Movshon JA, Thompson ID, Tolhurst DJ. Receptive field organization of complex cells in the cat's striate cortex. J Physiol 283: 79–99, 1978b.
Nishimoto S, Arai M, Ohzawa I. Accuracy of subspace mapping of spatiotemporal frequency domain visual receptive fields. J Neurophysiol 93: 3524–3536, 2005.
Nishimoto S, Ishida T, Ohzawa I. Receptive field properties of neurons in the early visual cortex revealed by local spectral reverse correlation. J Neurosci 26: 3269–3280, 2006.
Ohzawa I, DeAngelis GC, Freeman RD. Stereoscopic depth discrimination in the visual cortex: neurons ideally suited as disparity detectors. Science 249: 1037–1041, 1990.
Ohzawa I, DeAngelis GC, Freeman RD. Encoding of binocular disparity by simple cells in the cat's visual cortex. J Neurophysiol 75: 1779–1805, 1996.
Ohzawa I, Freeman RD. The binocular organization of complex cells in the cat's visual cortex. J Neurophysiol 56: 243–259, 1986.
Pack CC, Conway BR, Born RT, Livingstone MS. Spatiotemporal structure of nonlinear subunits in macaque visual cortex. J Neurosci 26: 893–907, 2006.
Pollen DA, Gaska JP, Jacobson LD. Physiological constraints on models of visual cortical function. In: Models of Brain Function, edited by Cotterill RMJ. Cambridge, UK: Cambridge Univ. Press, 1989.
Ringach DL, Bredfeldt CE, Shapley RM, Hawken MJ. Suppression of neural responses to nonoptimal stimuli correlates with tuning selectivity in macaque V1. J Neurophysiol 87: 1018–1027, 2002.
Ringach DL, Sapiro G, Shapley R. A subspace reverse-correlation technique for the study of visual neurons. Vision Res 37: 2455–2464, 1997.[CrossRef][Web of Science][Medline]
Rust NC, Schwartz O, Movshon JA, Simoncelli EP. Spatiotemporal elements of macaque V1 receptive fields. Neuron 46: 945–956, 2005.[CrossRef][Web of Science][Medline]
Sanada TM, Ohzawa I. Encoding of three-dimensional surface slant in cat visual areas 17 and 18. J Neurophysiol 95: 2768–2786, 2006.
Skottun BC, De Valois RL, Grosof DH, Movshon JA, Albrecht DG, Bonds AB. Classifying simple and complex cells on the basis of response modulation. Vision Res 31: 1079–1086, 1991.[CrossRef][Web of Science][Medline]
Szulborski RG, Palmer LA. The two-dimensional spatial structure of nonlinear subunits in the receptive fields of complex cells. Vision Res 30: 249–254, 1990.[CrossRef][Web of Science][Medline]
Touryan J, Lau B, Dan Y. Isolation of relevant visual features from random stimuli for cortical complex cells. J Neurosci 22: 10811–10818, 2002.
Toyama K, Kimura M, Tanaka K. Organization of cat visual cortex as investigated by cross-correlation technique. J Neurophysiol 46: 202–214, 1981.
Troyer TW, Krukowski AE, Priebe NJ, Miller KD. Contrast-invariant orientation tuning in cat visual cortex: thalamocortical input tuning and correlation-based intracortical connectivity. J Neurosci 18: 5908–5927, 1998.
This article has been cited by other articles:
![]() |
R. Kimura and I. Ohzawa Time Course of Cross-Orientation Suppression in the Early Visual Cortex J Neurophysiol, March 1, 2009; 101(3): 1463 - 1479. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Tanaka and I. Ohzawa Surround Suppression of V1 Neurons Mediates Orientation-Based Representation of High-Order Visual Features J Neurophysiol, March 1, 2009; 101(3): 1444 - 1462. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |