JN Fuel your research with LabChart
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Neurophysiol 93: 3615-3634, 2005. First published January 19, 2005; doi:10.1152/jn.00882.2004
0022-3077/05 $8.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
93/6/3615    most recent
00882.2004v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (13)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Recio-Spinoso, A.
Right arrow Articles by Ruggero, M. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Recio-Spinoso, A.
Right arrow Articles by Ruggero, M. A.

Wiener-Kernel Analysis of Responses to Noise of Chinchilla Auditory-Nerve Fibers

Alberto Recio-Spinoso3, Andrei N. Temchin1, Pim van Dijk2, Yun-Hui Fan1 and Mario A. Ruggero1

1Department of Communication Sciences and Disorders, Institute for Neuroscience and Hugh Knowles Center, Northwestern University, Evanston, Illinois; 2Department of Otorhinolaryngology, University Hospital Groningen, School of Behavioral and Cognitive Neurosciences, University of Groningen, Groningen, The Netherlands; and 3Department of Physiology, University of Wisconsin, Madison, Wisconsin

Submitted 25 August 2004; accepted in final form 13 January 2005


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Responses to broadband Gaussian white noise were recorded in auditory-nerve fibers of deeply anesthetized chinchillas and analyzed by computation of zeroth-, first-, and second-order Wiener kernels. The first-order kernels (similar to reverse correlations or "revcors") of fibers with characteristic frequency (CF) <2 kHz consisted of lightly damped transient oscillations with frequency equal to CF. Because of the decay of phase locking strength as a function of frequency, the signal-to-noise ratio of first-order kernels of fibers with CFs >2 kHz decreased with increasing CF at a rate of about –18 dB per octave. However, residual first-order kernels could be detected in fibers with CF as high as 12 kHz. Second-order kernels, 2-dimensional matrices, reveal prominent periodicity at the CF frequency, regardless of CF. Thus onset delays, frequency glides, and near-CF group delays could be estimated for auditory-nerve fibers innervating the entire length of the chinchilla cochlea.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Nonlinear systems cannot be described by transfer functions or their time-domain equivalents, unit-impulse responses, and thus are difficult to analyze. Therefore the prominent nonlinearities of the cochlea have typically been described as perturbations of linear-like relations, using ad hoc stimuli such as tone pairs, for example, to study 2-tone suppression and combination-tone generation. More general and systematic approaches for the study of nonlinear systems, analogous to the determination of transfer functions of linear systems, were first sketched by Volterra and Wiener (Schetzen 1989Go; Volterra 1959Go; Wiener 1958Go) and fleshed out by Lee and Schetzen (Lee and Schetzen 1965Go; Schetzen 1989Go). Wiener analysis describes systems as sums of functionals (functions of functions). (For discussion of the applicability of Wiener-kernel analysis to the auditory system, see Eggermont 1993Go; Eggermont et al. 1983Go; Johnson 1980aGo; see also Kim and Young 1994Go for a comparison of Wiener-kernel and spectrotemporal analysis.) The 1st-order Wiener functional is given by the convolution of a Gaussian white-noise input with the 1st-order Wiener kernel (h1), the 2nd-order Wiener functional is given by a convolution of the noise with the 2nd-order Wiener kernel (h2), and so on for the higher-order functionals; h1 and h2, respectively, are obtained by performing 1st- and 2nd-order cross-correlations between a white-noise stimulus and the system's response (Lee and Schetzen 1965Go; Marmarelis and Marmarelis 1978Go). For linear systems, h1 is identical to the system's impulse response (i.e., the response to an intense brief click). For a system consisting of a linear filter followed by a quadratic nonlinearity, h1 is zero and h2 provides information on the quadratic nonlinearity (Marmarelis and Marmarelis 1978Go).

Wiener-kernel analysis was introduced to auditory physiology by Egbert de Boer (de Boer 1967Go) and colleagues. They used the reverse correlation (or revcor; Eq. 8 in METHODS), the average noise-stimulus waveform preceding each spike (which is proportional to h1; Eq. 7 in METHODS), to describe the responses to noise of cat auditory-nerve fibers (ANFs) (de Boer 1969Go, 1973Go; de Boer and de Jongh 1978Go; de Boer and Jongkees 1968Go). Revcors have also been applied to the analysis of cochlear nucleus neurons and ANFs in several species (Carney and Yin 1988Go; Carney et al. 1999Go; Evans 1977Go; Joeken et al. 1997Go; Kim and Young 1994Go; Lewis et al. 2002Go; Møller 1977aGo, 1978Go; van Dijk et al. 1993Go, 1994Go, 1997Go; Wickesberg et al. 1984Go). The revcors of ANFs with low characteristic frequency (CF) have frequency tuning consistent with frequency–threshold curves for responses to tones. In the case of high-CF ANFs, revcors are small or insignificant in magnitude, reflecting the weak phase locking to high-frequency tones (Johnson 1980bGo; Palmer and Russell 1986Go). This deficiency of h1s can be made up by the computation of h2s, first used for the analysis of auditory neurons by Wickesberg et al. (1984)Go, which can yield information on temporal coding even in the absence of phase locking to near-CF stimuli (Lewis et al. 2002Go; van Dijk et al. 1993Go, 1994Go, 1997Go; Yamada and Lewis 1999Go; Yamada et al. 1997Go).

We undertook a 2nd-order Wiener-kernel analysis of ANFs in chinchilla as a complement of work in our laboratory seeking to relate the responses of ANFs to the underlying vibrations of the basilar membrane (e.g., Narayan et al. 1998Go; Ruggero and Rich 1987Go; Ruggero et al. 1996Go, 2000Go). We especially wanted to obtain timing information for near-CF responses of ANFs innervating basal regions of the chinchilla cochlea: in chinchilla, high-quality basilar membrane data are available for the cochlear base (e.g., Recio et al. 1998Go; Rhode and Recio 2000Go; Ruggero et al. 1997Go) but comparable timing data do not exist for high-CF ANFs. We describe here h1s and h2s of chinchilla ANFs as a function of stimulus level and CF. A companion paper (Temchin et al. 2005Go) provides quantitative estimates of the ability of the Wiener kernels to predict ANF responses to tones, clicks, and frozen noise and relates the Wiener kernels to basilar membrane vibrations in the chinchilla cochlea. Preliminary results of this investigation were published in abstract form (Temchin et al. 1995Go).


    METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Animal preparation

Most of the techniques used here for animal preparation were published previously (Ruggero and Rich 1983Go, 1987Go). Adult male chinchillas were anesthetized with an initial injection of ketamine (100 mg/kg, subcutaneous) and with sodium pentobarbital (65 mg/kg, intraperitoneal), supplemented with additional doses of pentobarbital to maintain a complete absence of limb-withdrawal reflexes. Rectal temperature of the animals was maintained near 38°C with a servocontrolled electrical heating pad. Tracheotomy and tracheal intubation allowed for forced respiration, which was used only as necessitated by apnea or labored breathing. The pinna was resected and part of the bony external ear canal was chipped away to permit visualization of the umbo of the tympanic membrane and insertion of the earphone-coupling speculum. After opening the bulla widely, the tendon of the tensor tympani muscle was severed and the stapedius muscle was detached from its bony anchoring to prevent possible effects of muscle contraction evoked by high-level acoustic stimuli. A silver-ball electrode was placed on the round window to monitor cochlear health by measuring compound action potential thresholds.

The auditory nerve was approached superiorly after craniotomy and aspiration of part of the cerebellum. Capillary-glass microelectrodes (filled with 3 M NaCl or KCl solutions, impedance 20–70 M{Omega}) were positioned under visual control through an operation microscope and were advanced into the nerve by means of a remotely controlled hydraulic micropositioner.

ANF frequency–threshold tuning curves were obtained for responses to tone pips using an automated procedure (Liberman 1978Go; Ruggero and Rich 1983Go). Spontaneous rate was measured over a 10-s interval. CF and CF threshold were estimated from 3rd-order polynomial functions fitted to the tuning-curve tips.

Acoustic stimulation

Acoustic stimuli were presented using a Beyer DT-48 earphone. Electrical tone pips were produced by a custom-built digital waveform generator under computer control (Ruggero and Rich 1983Go): a sinusoidal waveform, stored as 8,192 16-bit words of read-only memory, was sampled at selectable rates and converted into analog electrical signals. The sound pressure level (SPL) and phase of the acoustic tones were controlled by attenuating the electrical signals and by adjusting their starting phases with reference to a calibration table (SPL and phase as a function of constant-level and constant-phase electrical tones) generated in situ at the beginning of the experiment using a miniature Knowles microphone with its opening near the eardrum. On average, calibration SPLs were flat within ±3.4 dB at all frequencies <15 kHz.

Gaussian white-noise electrical waveforms were usually produced with an analog device (General Radio 1381). Less often, a hardware digital generator (Tucker-Davis Technologies TDT-WG1) or, exceptionally, a software-generated 8,192-word digital table, was also used. In all cases, the noise stimulus was low-pass filtered (15-kHz corner frequency, 48 dB/octave). Acoustic-noise spectral levels are expressed throughout this and the companion paper using units of dB SPL/Hz (i.e., re 20 µPa/Hz). [The unattenuated spectral level of the (electrical) noise stimulus was –48 dBVrms/Hz. The root-mean-square (rms) voltage of the unattenuated tones exceeded the total rms voltage of the noise by 9 dB. Therefore for an attenuation of x dB, the spectral level at CF of an acoustic noise stimulus (SPL/Hz) = (SPL of unattenuated tones with frequency {approx} CF) – 57 dB – x dB.]. An equivalent rectangular bandwidth (ERB) was computed for each ANF, in accordance with its frequency tuning (see Fig. 16, inset). Thus for any given ERB and noise spectrum level, an ERB total pressure (expressed in dB SPL) could be determined. ERB SPL is more directly comparable to tone SPL than to noise spectral level (see Fig. 16).



View larger version (23K):
[in this window]
[in a new window]
 
FIG. 16. Levels of noise stimuli. Open circles in the main panel represent total noise pressure computed in ERBs re CF threshold as a function of h2 FSV BF. On average noise ERB pressure exceeded CF threshold (bracketed closed circle and dotted lines) by 8.6 ± 9.7 dB (n = 122). Inset: equivalent rectangular bandwidths (ERBs) computed from h2 FSVs as a function of h2 FSV BF.

 
The noise was almost always delivered continuously (duration: 42–1,070 s). Both the stimulus and the amplified microelectrode output were recorded on a stereo digital audiotape recorder (Sony DTC-690; 16 bits, sampling rate 44.1 or 48 kHz) for later analysis. Exceptionally (to obtain the 4 highest-level responses of Fig. 13), the noise was gated (20-ms duration, 80-ms repetition period) with an electronic switch (TDT SW2), digitized with 16-bit resolution (TDT AD1) and stored in the computer. When using the latter paradigm, neural responses were stored on-line as a series of spike times with accuracy of 1 µs (measured at their initial edge with a TDT ET1). With either paradigm, spike times were defined as the instants when their rising edges just exceeded the baseline noise level.



View larger version (28K):
[in this window]
[in a new window]
 
FIG. 13. Level dependency of Wiener kernels for a representative low-CF ANF. Fiber CF = 1.35 kHz. A: time-domain representations of the h1s obtained for stimulus levels indicated next to each trace in units of spectral level, dB SPL/Hz, or total pressure in an ERB (between parentheses). Magnitudes (B) and phases (C) of the Fourier transforms of the h1s. To permit better visualization of the slope changes as a function of stimulus level, the phase curves have been corrected for a 1.225-ms delay. (This delay is the sum of synaptic/neural processes, sound propagation time between the earphone and the eardrum, and equipment delays. Therefore the curves represent the phases of inner hair cell responses relative to pressure at the eardrum.) Th = 2 dB SPL; SR = 55.5/s; h0 = 84.2/s for noise stimulus presented at –17 dB SPL/Hz (8 dB SPL in ERB).

 
The noise produced by the TDT-WG1 digital generator contained spurious periodicities, evident in its autocorrelation function [{phi}xx({tau}) in Eq. 11]. Such periodicities sometimes contaminated the 2nd-order kernels (Figs. 14) with 2 artifactual stripes, parallel to the diagonals (not shown) and unrelated to the CF period. In every other respect, the analog and digital noises produced identical results and were therefore pooled for analysis.



View larger version (53K):
[in this window]
[in a new window]
 
FIG. 1. Wiener-kernel analysis for a representative low–characteristic frequency (CF) auditory-nerve fiber (ANF). ANF had a CF (determined from frequency–threshold curves of responses to tones) of 784 Hz. All panels represent the same train of 27,683 spikes evoked by a 180-s white-noise stimulus presented at –6 dB SPL/Hz, with equivalent rectangular bandwidth (ERB) pressure of 18 dB SPL. A: 1st-order kernel (h1, blue) and normalized 2nd-order kernel (h2) first-rank singular vector (FSV) (red). B and E: h2. E: h2 as a 3-dimensional (3-D) object. B: projection of h2 onto a 2-dimensional (2-D) plane, with the 3rd (amplitude) dimension coded by hue; the magenta and black lines, respectively, indicate the diagonal {tau}1 = {tau}2 and the row with {tau}1 = 3.27 ms. C: amplitudes of h2 at the diagonal [i.e., h2({tau}, {tau}), magenta] and a slice through the maximum of h2 (at a fixed {tau} = 3.27 ms, black). Note different timescales for row (bottom) and diagonal (top). D: Fourier magnitudes of h1 (blue) and of the h2 FSV (red). Arrow indicates the CF. F: Fourier magnitudes of h2, represented as its projection onto a 2-D plane, with the 3rd dimension coded by hue. Lines mark CF and –CF. Threshold for responses to CF tone pips (Th) = 9 dB SPL. Spontaneous rate (SR) = 125.4/s. Mean response rate [zeroth-order kernel (h0)] = 153.8/s.

 


View larger version (58K):
[in this window]
[in a new window]
 
FIG. 4. Wiener-kernel analysis for a representative high-CF ANF. ANF had a CF of 8.26 kHz. All panels represent the same train of 18,656 spikes evoked by a 330-s white-noise stimulus presented at –5 dB SPL/Hz (ERB pressure: 25 dB SPL). A: h1 (blue) and normalized h2 FSV (red). B and E: h2. E: h2 as a 3-D object. B: projection of the h2 onto a 2-D plane, with the 3rd (amplitude) dimension coded by hue; the magenta and black lines, respectively, indicate the diagonal, {tau}1 = {tau}2, and a slice at the row {tau}2 = 1.94 ms, through the kernel maximum. C: values of the h2 at the diagonal {tau}1 = {tau}2 (magenta) and at fixed {tau} = 1.94 ms (thin line). Note different timescales for row (bottom) and diagonal (top). D: Fourier magnitudes of the windowed h1 (blue; see Fig. 8B below) and of the h2 FSV (red). Arrow indicates the CF. F: Fourier magnitudes of the h2, represented as its projection onto a 2-D plane, with the 3rd dimension coded by hue. Lines mark CF and –CF. Th = 18 dB SPL; SR = 5.6/s; h0 = 56.5/s.

 
Computation and processing of the Wiener kernels

Zeroth-order kernels (h0s), h1s, and h2s were computed from the spike and the digitized noise waveforms. Let x(t) represent the Gaussian white-noise waveform and y(t) be the spike train measured in the auditory nerve. On the assumption that all of the information carried by the spike train is in the time of occurrence of each individual spike, y(t) can be expressed as

(1)
where {delta}(t) is the Dirac delta function (infinite height and area = 1), N is the total number of spikes evoked by the noise stimulus, and ti represents the spike times.

The zeroth-order Wiener kernel represents the average output of the system (Schetzen 1989Go)

(2)
where <y(t)> is the time average of y(t)

(3)
where T is the stimulus duration

(4)
that is, h0 represents the average firing rate N0 = N/T.

The 1st-order Wiener kernel h1 was obtained by cross-correlating (Schetzen 1989Go) the input x(t) to the output y(t)

(5)

(6)

(7)
where A is the power spectral density of the noise stimulus and

(8)
is the reverse-correlation function (de Boer 1967Go).

Using the previous equations, one can interpret the 1st-order Wiener kernel as the average value of the stimulus x(t) at a time {tau}1 before the occurrence of a spike, normalized to the stimulus power spectral density. For linear systems, h1(t) is identical to the impulse response. For nonlinear systems, h1 is only a component of the impulse response, which contains contributions from higher kernels. In general, h1 for a nonlinear system differs from the linear part of the system and may contain some of the system's nonlinearities (Marmarelis and Marmarelis 1978Go).

The 2nd-order Wiener kernel h2 is obtained by 2nd-order cross-correlation between x(t) and y(t) – h0 (Schetzen 1989Go)

(9)

(10)

(11)
where {phi}xx({tau}) is the autocorrelation function of the input signal x(t), and

(12)
is the 2nd-order reverse-correlation function.

All h2s presented in this paper are m x m square matrices, where m is the sample length of h1. The value of m was chosen in each case so that it substantially exceeded the duration of the h1 and h2 signals. Each element of the matrix, h2({tau}i, {tau}j) (with {tau}i != {tau}j), is proportional to the firing rate and the 2nd-order reverse correlation, which can be interpreted as the mean of the product of the value of the stimulus x(t) at 2 times, {tau}i and {tau}j, before the occurrence of a spike. Thus the h2s give a measure of the nonlinear interaction, or "cross talk," between the responses to 2 impulses (Marmarelis and Marmarelis 1978Go). In other words, h2s measure the deviation from superposition arising from the nonlinearity of the system.

Cross-correlations (according to Eqs. 8 and 12) were carried out in the time domain, using either MATLAB functions or ad hoc programs coded in the C programming language. h1s were computed with sampling period 22.68 or 20.83 µs and length (m) = 256, 512, 1,024, or 2,048 (depending on CF), chosen to fully encompass the response duration.

All h1s and h2s presented in this paper were zero-phase filtered with a low-pass function with corner frequency 1.25 octave higher than the frequency at which the magnitude of the first-rank singular vector of the 2nd-order kernel (h2 FSV; see following text) exceeded the noise floor by 6 dB (see Fig. 2F). h1s were filtered using the MATLAB function filtfilt. h2s were filtered using a MATLAB implementation of a 2-dimensional (2-D) zero-phase filter. The h2s were subjected to singular value decomposition (SVD), using the MATLAB function svd

(13)
where U, S, and V are square matrices of the same size as h2, and T is the transpose operator. The columns of U and rows of V are known as the left and right singular vectors, respectively, and are dimensionless. The nonzero diagonal of S represents the weights of the corresponding (same-rank) singular vectors. The weights have the same dimensions as the h2s. For symmetric matrices with real elements, such as the h2s in the present work, singular vectors are the same as eigenvectors. Thus h2 can be decomposed as follows

(14)
where the eigenvector ui is a column element of U (i.e., U = [u1 u2 ... uN]) and di represents the corresponding eigenvalue. The eigenvector ui associated with the largest eigenvalue (i.e., the first singular vector or h2 FSV) is used extensively in this and the companion paper (Temchin et al. 2005Go). The weighted FSVs [FSV x S(1, 1); e.g., Fig. 14B] have the same units as the h2s. In attempting to give a physical meaning to the h2 FSV it is important to note that Eqs. 12 and 14 imply that its sign (polarity) is ambiguous.



View larger version (75K):
[in this window]
[in a new window]
 
FIG. 2. Alternative views of the 2nd-order kernel of a low-CF ANF. A: 3-D rendition of the h2 of a low-CF ANF. B: same h2, displayed as a 2-D projection, with hue coding the kernel amplitude. C: h2 FSVs of low-pass–filtered (red) and unfiltered (blue) versions of the same h2. D: same h2 after it was subjected to zero-phase low-pass filtering. E: low-pass–filtered h2 (D) displayed as a 2-D projection. F: magnitude spectra of the h2 FSVs of the filtered (red) and unfiltered (blue) versions of the same h2.

 


View larger version (39K):
[in this window]
[in a new window]
 
FIG. 14. Level dependency of Wiener kernels for a representative high-CF ANF. Fiber CF = 12.1 kHz. A: time-domain representations of the h1s (black traces) and h2 FSVs (color traces) for responses to noise presented at 5 levels. h2 FSVs are displayed individually normalized to their peak amplitudes. Stimulus levels indicated in each subpanel in units of dB SPL/Hz and total pressure in ERB (the latter between parentheses). B: magnitudes of the Fourier transform of the weighted h2 FSVs (i.e., 1st column of U multiplied by first element of S in Eq. 13). C: phases of the Fourier transforms of the h2 FSVs. To permit better visualization of the slope changes as a function of stimulus level, the phase curves have been corrected for a 1.225-ms delay. (This delay is the sum of synaptic/neural processes, sound propagation time between the earphone and the eardrum, and equipment delays. Therefore the curves represent the phases of inner hair cell responses relative to pressure at the eardrum.) Th = 33 dB SPL; SR = 72.5/s; h0 = 93.5/s for noise stimulus presented at 2 dB SPL/Hz (33 dB SPL in ERB).

 
The instantaneous frequency of the h2 FSV was estimated by means of the analytic signal representation, as described in Recio et al. (1998)Go, within the time interval in which the h2 FSV envelope magnitude exceeded the noise floor by ≥12 dB. The magnitude and phase spectra of h1s and h2 FSVs (padded with zeroes to a length of 4,096 samples, from their original lengths, m) were obtained by Fourier transformation with MATLABfunctions fft. h2s (size: m x m) were Fourier transformed using MATLAB function fft2. The statistical significance of phase locking was tested by computation of the number 2nVS2, where n is the number of spikes and VS is the vector strength (Goldberg and Brown 1969Go). Phase locking was considered statistically significant when 2nVS2 > 10.6, in which case P < 0.01 (Mardia and Jupp 2000Go).


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Responses to noise were obtained from 137 ANFs, with CFs between 100 Hz and 14 kHz, recorded in 16 chinchillas. After isolation of each ANF, a frequency–threshold tuning curve was measured from its responses to tone pips and the level of gated white noise, which elicited a just-audible increase of the rate of firing above the spontaneous rate, was determined. Long-duration (42–1,070 s) white noise was then presented at that level and also at higher levels if recording time allowed. h0s, h1s, and h2s were computed off-line from 193 responses of 137 ANFs. Additionally, h0s and revcors (R1, Eq. 8) were computed from 44 responses of 40 other ANFs from 4 other chinchillas.

General features of 1st- and 2nd-order Wiener kernels of ANFs

Figure 1 illustrates the 1st- and 2nd-order Wiener kernels (h1s and h2s, respectively) for responses to noise of a representative low-CF ANF. The time-domain waveform of the h1 (blue trace, Fig. 1A) is a transient but relatively undamped oscillation, indicative of a well-tuned band-pass system. Fourier transformation of h1 (blue, Fig. 1D) reveals a close match between its best frequency (BF) and the CF of responses to tones (arrow). [We distinguish between CF, the frequency that yields the most sensitive responses at threshold levels in normal adult cochleae, and BF, the frequency of peak sensitivity. Each ANF has a unique CF but has various BFs that depend on cochlear maturity (Overstreet et al. 2003Go) and health, as well as stimulus level.]

Figure 1, B and E present h2({tau}1, {tau}2), computed from the same responses to noise represented in Fig. 1, A and D. The h2 is depicted in Fig. 1E as a 3-dimensional (3-D) object and in Fig. 1B as its color-coded projection onto the 2-D plane. Hues indicate magnitudes: red for extreme positive values, blue for extreme negative values, and green hues indicating the featureless, near-zero, background. Over a well-localized region, the h2 consists of the intersection of 2 waves moving in directions parallel and perpendicular to the diagonal, thus creating a checkerboard pattern of peaks and troughs.

"Slices" through h2 are shown in Fig. 1C. The thin black line represents the values of the kernel at a fixed {tau}1 (3.27 ms), when this kernel reaches its maximum. The magenta line represents the kernel diagonal, h2({tau}, {tau}), which is the contribution of h2 to the impulse response (Marmarelis and Marmarelis 1978Go). The oscillation of this diagonal is "stretched out" in time (by a factor of {surd}2) relative to h2 (3.27 ms, {tau}2) and its apparent periodicity is 1/{surd}2CF [instead of 1/CF, as for h1 or h2 (3.27 ms, {tau}2)]. [Both the "stretched out" duration and the 1/{surd}2CF periodicity represent geometrical artifacts of the time-domain h2, without counterpart in the frequency domain (panel F); see p. 154–155 of Marmarelis and Marmarelis 1978Go.] The nonzero diagonal indicates the presence of amplitude-dependent nonlinearities. Specifically, it indicates that the summation of 2 simultaneous identical impulses did not produce a response twice as large as the response to a single impulse. The pattern of the h2s of low-CF ANFs is consistent with that of a linear system followed by a square-law device (Marmarelis and Marmarelis 1978Go).

Figure 1F presents the magnitudes of the Fourier transform of the 2nd-order kernel. (Note that magnitudes for negative frequencies in Fig. 1D and quadrants III and IV in Fig. 1F are not shown because they are redundant.) The spectral magnitudes of h2 contain peaks around (CF, CF) in quadrant I, and around (–CF, CF) in quadrant II. The peak in quadrant I corresponds to the wave in the direction of the diagonal (magenta line in Fig. 1B), and indicates even-order distortion components phase locked to CF. The peak in quadrant II corresponds to the wave in a direction perpendicular to the diagonal and indicates responses that follow the envelopes of the even-order nonlinearities. In low-CF ANFs, the spectral peaks in quadrants I and II had similar amplitudes.

As illustrated in Fig. 2, h2s were routinely low-pass filtered to attenuate the masking effects of high-frequency noise. Additionally, Fig. 2 suggests that the features of h2s are more effectively displayed as 2-D projections with color-coded amplitudes than as 3-D objects (compare Fig. 2, A and B or D and E).

Figure 3A illustrates kernels for a mid-CF ANF. Whereas h1 (blue trace in Fig. 3A) resembles its counterparts for lower-CF ANFs (e.g., Fig. 1A), h2 (Fig. 3B) does not. In particular, the checkerboard pattern prominent in Fig. 1B is only faintly discernible (in the early part of h2) in Fig. 3B. Later parts consist of a ridge at the diagonal, flanked by parallel ridges and troughs spaced at regular intervals about equal to the CF period (measured along lines parallel to either time axis). The diagonals of h2s of low-CF and mid-CF ANFs are also different. For low-CF ANFs, the diagonal (magenta trace in Fig. 1C) consists of a prominent AC component that rides on a DC shift. For mid-CF ANFs (magenta trace, Fig. 3C), the DC shift remains but the AC component is substantially attenuated. The h2s of low- and mid-CF ANFs also differ in the frequency domain: for low-CF ANFs (Fig. 1F) the magnitude peaks in quadrants I and II are of comparable size, whereas the quadrant II peak is much larger than the quadrant I peak for mid-CF ANFs (Fig. 3F). The small size of the quadrant I peak indicates that AC responses are poorly phase locked to CF. The large peak in quadrant II indicates that the response consists principally of a transient DC shift (or envelope) synchronized to the occurrence of CF components in the stimulus.



View larger version (56K):
[in this window]
[in a new window]
 
FIG. 3. Wiener-kernel analysis for a representative mid-CF ANF. ANF had a CF of 2.94 kHz. All panels represent the same train of 28,298 spikes evoked by a 192-s white-noise stimulus presented at 15 dB SPL/Hz (ERB pressure: 44 dB SPL). A: h1 (blue) and normalized h2 FSV (red). B and E: h2. E: h2 as a 3-D object. B: projection of the h2 onto a 2-D plane, with the 3rd (amplitude) dimension coded by hue; the magenta and black lines, respectively, indicate the diagonal {tau}1 = {tau}2 and the row with {tau}1 = 2.29 ms. C: values of the h2 at the diagonal {tau}1 = {tau}2 (magenta) and a slice through the h2 maximum at fixed {tau} = 2.29 ms (black line). Note different timescales for row (bottom) and diagonal (top). D: Fourier magnitudes of h1 (blue) and of the h2 FSV (red). Arrow indicates the CF. F: Fourier magnitudes of the h2, represented as its projection onto a 2-D plane, with the 3rd dimension coded by hue. Lines mark CF and –CF. Th = 11 dB SPL; SR =57.3/s; h0 = 135.4/s.

 
Figure 4 presents the h1s and h2s of a representative high-CF ANF. The h1 (blue in Fig. 4A) is nearly insignificant and barely discernible within the baseline noise. This is consistent with the poor phase locking of ANF responses to high-frequency tones (Johnson 1980bGo; Palmer and Russell 1986Go; Woolf et al. 1981Go). However, the frequency tuning of high-CF ANFs is clearly revealed in their h2s, which consist solely of ridges and troughs at, and parallel to, the diagonal. This striped pattern contrasts with the checkerboard pattern of h2s of low-CF ANFs (e.g., Fig. 1B).

Slices through the h2 are shown in Fig. 4C. The diagonal of h2({tau}, {tau}) resembles the envelope function of h2. At a fixed {tau}1, h2 [e.g., h2(1.94 ms, {tau}2)] approximates 1/CF and its shape is that of a band-pass system. The spectrum of h2 (Fig. 4F) contains only a single peak, around (–CF, CF) in quadrant II. This contrasts with the h2s of low- and mid-CF ANFs (Figs. 1F and 3F) and indicates that near-CF spectral stimulus components are almost exclusively signaled by envelope (i.e., DC) responses.

All kernels presented in the figures of this paper (except those in Fig. 2, A and B and the blue trace in 2C) were subjected to low-pass filtering. However, we also closely studied unfiltered frequency–domain versions of h2s (similar to those of Figs. 1F, 3F, and 4F, but with greater resolution), searching for correlates of simple summation (f1 + f2) and difference (f2 – f1) tones. If present, distortion at summation and difference frequencies would appear as ridges parallel to the (CF/CF and CF/–CF) diagonals in quadrants I and II. At first glance, one might expect to find distortion at (f2 f1) in the 2nd-order kernels because such simple difference tones are often associated with square-law nonlinearity and are prominent features of ANF responses to tone pairs (Siegel et al. 1982Go). In fact, energy at distortion summation or difference frequencies was never evident. We are uncertain of whether this indicates that the poor signal-to-noise ratio of the Wiener kernels caused summation and/or difference tones to be buried in the baseline noise of the kernels or, alternatively, that a band-pass filter centered at CF is interposed between the site of origin of the even-order nonlinearities and the site of spike generation.

A simple "sandwich" model system (Fig. 5) helps to explain the results described above, in particular the relation between phase locking and the striped patterns of 2nd-order kernels in the time domain (as well as the corresponding spectral features). The system consists of a band-pass linear filter, a zero-memory nonlinearity (ZNL), and a low-pass linear filter. The band-pass (gammatone) filter is tuned to "CF." The ZNL is a half-wave rectifier. The low-pass filter has a cutoff frequency of 2.5 kHz. The input to the system is a Gaussian white noise sampled at 48 kHz. The middle column shows 40-ms segments of the input and the outputs at various stages when the CF is 500 Hz (i.e., a "low" CF). Because the CF is far below the cutoff frequency of the low-pass filter, its output ("{ddagger}" in the block diagram) is virtually identical to that of the ZNL ("{dagger}"). The right column shows 4-ms segments of the input and the outputs when the CF is 7 kHz (i.e., a "high" CF). Because the CF is far above the cutoff frequency of the low-pass filter, its output consists of a slow-varying envelope with frequency components lower than the cutoff frequency.



View larger version (39K):
[in this window]
[in a new window]
 
FIG. 5. Simple "sandwich" model system. Model system consists of a band-pass linear filter, a zero-memory nonlinearity (ZNL), and a low-pass linear filter arranged in cascade. Band-pass filters are tuned to "CFs" of 500 Hz or 7 kHz. ZNL is a half-wave rectifier. Low-pass filter has a cutoff frequency of 2.5 kHz. Middle column: 40-ms segments of the input and the outputs at various stages when the CF is 500 Hz. Because 500 Hz is far below the cutoff frequency of the low-pass filter, the output of the low-pass filter ({ddagger}) is virtually identical to that of the ZNL ({dagger}). Right column: 4-ms segments of the input and the outputs when the CF is 7 kHz. Because 7 kHz is much higher than the cutoff frequency of the low-pass filter, the latter's output lacks a 7-kHz (CF) component and consists largely of a slow-varying wave (i.e., the envelope of the input to the low-pass filter).

 
Figure 6 presents 2nd-order Wiener kernels of the model system and their corresponding 2-D Fourier transforms. The left column shows the time-domain kernels and the right column shows their Fourier transforms (actually, only sections of the 1st and the 2nd quadrants of the complete 2-D Fourier transforms). The 1st row shows the 2nd-order Wiener kernel of the system when the CF is 500 Hz. For this CF the input to the low-pass filter and the output of the low-pass filter are virtually identical and thus the 2nd-order Wiener kernels computed pre–and post–low-pass filtering are the same. Note that the Fourier transform (right-hand side) contains CF components in both spectral quadrants, corresponding to the orthogonally intersecting waves in the time domain (which yield a checkerboard pattern). The 2nd and 3rd rows show a similar analysis for the case of CF = 7 kHz. The 2nd-order Wiener kernel computed before low-pass filtering also exhibits a checkerboard pattern in the time domain and, as for the low-CF case (top row), its frequency–domain counterpart contains components in both quadrants at the CF. In contrast, the 2nd-order Wiener kernel computed after low-pass filtering has a striped pattern in the time domain and only a quadrant II component in the frequency domain.



View larger version (61K):
[in this window]
[in a new window]
 
FIG. 6. Second-order Wiener kernels of the "sandwich" model system and their corresponding 2-D Fourier transforms. Left column: time-domain kernels. Right column: their Fourier transforms. [Note that only sections of the 1st and the 2nd quadrants of the complete 2-D Fourier transforms are shown here.]. In the case of the 500-Hz ("low-CF") system, the 2nd-order Wiener kernels computed pre–and post–low-pass filtering are the same and the corresponding Fourier transforms are also identical (top row). In the case of the 7-kHz ("high-CF") system, the 2nd-order Wiener kernel computed before low-pass filtering has a checkerboard pattern and the corresponding Fourier transform shows 7-kHz components in both quadrants. For the 7-kHz system, the 2nd-order Wiener kernel computed after low-pass filtering has a striped pattern and the corresponding Fourier transform shows a 7-kHz component only in quadrant II.

 
Singular value decomposition of the 2nd-order kernels

Useful representations of matrices can be obtained by singular value decomposition (Lewis et al. 2002Go; Yamada and Lewis 1999Go; Yamada et al. 1997Go; see METHODS). For low-CF ANFs, the Fourier magnitudes of the h1s and the 1st-rank singular vectors of the h2s (h2 FSVs) were essentially identical (compare blue and red traces in Figs. 1D and Fig. 8). However, the polarity of the h2 FSVs was ambiguous because of the inherent ambiguousness of polarity of the 2nd-order kernels. Thus the h2 FSV waveforms were either identical to those of the h1s or, alternatively, had opposite polarities. Throughout this paper, h2 FSVs are plotted with polarities corresponding to those of the h1s in all figures (procedures are illustrated in Fig. 10).



View larger version (54K):
[in this window]
[in a new window]
 
FIG. 8. First- and 2nd-order Wiener kernels of low-CF ANFs. Time-domain representations of the normalized h1s (blue) and h2 FSVs (red) are shown to the left. Unit number on the left of each trace corresponds to the same ANF in Figs. 12, 15, and 17C. CF is indicated next to each tracing. Corresponding h2s, presented as color-coded projections are shown to the right. Note that timescales vary. Number of spikes (N) used for computation of kernels, stimulus duration (t), CF tone threshold, noise spectral level, and ERB pressure (between parentheses): ANF No. 1: n = 90,467 (t = 759 s), 35 dB SPL, 18 dB SPL/Hz (37 dB SPL); 2: n = 37,664 (t = 540 s), 28 dB SPL, 18 dB SPL/Hz (41 dB SPL); 3: n = 27,683 (t = 180 s), 9 dB SPL, –6 dB SPL/Hz (18 dB SPL); 4: n = 20,196 (t = 240 s), 2 dB SPL, –17 dB SPL/Hz (9 dB SPL); 5: n = 14,446 (t = 170 s), 10 dB SPL, –18 dB SPL/Hz (9 dB SPL).

 


View larger version (42K):
[in this window]
[in a new window]
 
FIG. 10. Windowing of 1st-order kernels. A: normalized raw h1 (dashed line) and envelope of h2 FSV (thin continuous line). Squares indicate the time interval within which the h2-FSV envelope was ≥6 dB higher than the h2-FSV root mean square (rms) computed for the 1st millisecond (horizontal dashed line). Circles indicate half the amplitude of the time window (thick continuous line). Duration of the flat part of the window was equal to (and varied with) the h2-FSV envelope width. Rise and fall ramps (0.5 period of a 1-kHz sinusoid) always had 0.5-ms durations. B: normalized h2 FSV (thin line) and windowed h1 (thick line). Note that the timescales are different in A and B. C: Fourier magnitudes of the raw h1 (dashed line), windowed h1 (thick line), and of the h2 FSV (thin continuous line). D: phases of raw h1 (dashed line), windowed h1 (thick line), and of the h2 FSV (thin continuous line). Phases at best frequencies (BFs) of h1 and h2 FSV, respectively, are indicated by circles. CF Th = 33 dB SPL. Noise level: 17 dB SPL/Hz (47 dB SPL in ERB).

 
In the case of mid-CF ANFs, the match between the h1s and the h2 FSVs (Fig. 3, A and D) was poorer than that for ANFs with lower CF (Fig. 1, A and D). This resulted from the deterioration of phase locking, which generated h1s with poor signal-to-noise ratio. This is evident by comparing the magnitude spectra: in the case of the low-CF ANF (Fig. 1D), the peak-to-noise ratio of the h1 exceeds 40 dB at low frequencies; in the case of the mid-CF ANF (Fig. 3D), the peak-to-noise ratio of the h1 is only 20 dB at the same frequencies. With even weaker phase locking in high-CF ANFs, the h1 was nearly indistinguishable from the noise baseline (Fig. 4A), whereas the h2 FSVs retained sharp frequency tuning. [Note that the blue trace of Fig. 4D, where a peak is detectable at BF, is the magnitude of a windowed version of h1 (see Fig. 10); the time window, derived from the h2 envelope, extended in this case from about 1.23 to 2.35 ms.]

For some matrices, SVD sometimes yields very efficient compression of information, so that, for example, a recognizable version of an image with size n2 can be reconstructed from merely a few vectors of length n. Figure 7 shows that this is also the case for the h2s of chinchilla ANFs. Figure 7A and its inset show that the weights of the h2 FSVs of chinchilla low-CF ANFs were much larger than the weights of all other vectors: the weights of the rank-2 vector amounted to <30% of the h2 FSV weight, on average (filled circles in Fig. 7A and inset) and the other vectors (ranks 3, 4, etc.) were of course smaller (inset). Thus the h2 FSVs sufficed to describe most features of the h2s of low-CF ANFs. In the case of high-CF ANFs, the h2 FSVs and the 2nd-rank vectors had the same (positive) sign and similar weights (filled squares in Fig. 7A; see also upward triangles in inset) and differed by only a 90 ° shift. Thus for high-CF ANFs, the 2 singular vectors of highest rank (1 and 2) jointly contained most of the features of the h2s. The abrupt change at 2–3 kHz in the relative weights of the 2nd-rank singular vectors (trend line in Fig. 7A) coincides with the transition in the appearance of h2s, from the checkerboard pattern (low CFs) to the stripe pattern (high CFs).



View larger version (27K):
[in this window]
[in a new window]
 
FIG. 7. Singular value decomposition of the 2nd-order kernels. Weights (diagonal elements of S in Eq. 13) of the 2nd- and 3rd-rank singular vectors are expressed as a fraction of the FSV weights. A: relative weights of the 2nd-rank singular vector plotted against BF. Inset: averaged weights of the singular vectors with ranks 2–26, expressed as a fraction of the FSV weights for low-CF (CFs <2.7 kHz, downward triangles) and high-CF ANFs (CFs >2.7 kHz, upward triangles). Relative weights indicate that the FSVs almost fully characterize the h2s of low-CF ANFs (2nd-rank singular vector weights of low-CF ANFs are only about 0.26 of FSV weight). In the case of high-CF ANFs, the FSVs and 2nd-rank singular vectors have similar weights and jointly account for most properties of the 2nd-order kernels. Dashed line, a smoothed step function, fits very well the weights of the 2nd-rank singular vector relative to the weights of the h2 FSVs (r2 = 0.95). B: relative weights of the 2nd- and 3rd-rank singular vectors plotted against the signal-to-noise ratios of the 2nd-order kernels. There is a strong negative correlation between the signal-to-noise ratios of the 2nd-order kernels (abscissa) and the weights of the 3rd-rank singular vectors for all CFs (open symbols) and the 2nd-rank singular vectors of low-CF ANFs (closed circles). Signal-to-noise ratio was computed as 2nd-order kernel peak-to-peak amplitude divided by baseline-noise amplitude. Exponential decay closely fits the variation of 3rd-rank singular vectors as a function of h2 signal-to-noise ratio (r2 = 0.81).

 
The scatter diagram of Fig. 7B reveals a negative correlation between the relative weights of the 3rd-rank singular vectors (open symbols) and the signal-to-noise ratio of the 2nd-order kernel waveforms. The negative correlation suggests that singular vectors other than those with ranks 1 and 2 (i.e., vectors of rank 3, 4, and so on) did not represent bona fide cochlear response properties but merely "noisiness" attributed to insufficient sampling time. This idea is supported by the fact that singular vectors with ranks other than 1 and 2 were generally untuned. In the case of low-CF ANFs, the h2 2nd-rank singular vectors (filled circles) also were always untuned and had small relative weights. In the case of high-CF ANFs (filled squares), however, the weights of the 2nd-rank singular vectors were prominent (i.e., similar to the FSV weights) and did not correlate with signal-to-noise ratio.

The Wiener kernels as a function of CF

Figure 8 shows the h1s and h2s of several ANFs with low CFs (109 Hz to 2.5 kHz). The left column of Fig. 8 shows that the h1s (blue trace) and the h2 FSVs (red trace) were nearly identical. The right-hand column illustrates the corresponding h2s (as color-coded projections), exhibiting the checkerboard pattern. Figure 9 shows the h1s (blue trace, left column) and the h2 FSVs (red trace, left column) for several high-CF ANFs. In the case of the ANF with CF 3.65 kHz (top), the h2 FSVs closely resembled the corresponding h1s. In the case of ANFs with higher CFs, however, the signal-to-noise ratios of the h1s diminished systematically, eventually becoming nearly, but not completely, buried in the baseline noise (Fig. 9, left column; see also blue trace in Fig. 4A). The h2s of high-CF ANFs (right column) did not exhibit checkerboard patterns but rather consisted of ridges and troughs parallel to the diagonal.



View larger version (70K):
[in this window]
[in a new window]
 
FIG. 9. First- and 2nd-order Wiener kernels of high-CF ANFs. Time-domain representations of the normalized h1s (blue) and h2 FSVs (red) on the left. CF is indicated next to each tracing. Unit numbers (left) correspond to the ANFs in Figs. 12, 15, and 17C. Corresponding h2s, presented as color-coded projections, are on the right. Number of spikes (N) used for computation of kernels, stimulus duration (t), CF threshold, noise spectral level, and ERB pressure (between parentheses): ANF No. 6: n = 13,587 (t = 82 s), 18 dB SPL, 7 dB SPL/Hz (35dB SPL); 7: n = 28,966 (t = 180 s), 15 dB SPL, –5 dB SPL/Hz (24 dB SPL); 8: n = 65,836 (t = 630 s), 15 dB SPL, –16 dB SPL/Hz (14 dB SPL); 9: n = 27,600 (t = 182 s), 2 dB SPL, –17 dB SPL/Hz (13 dB SPL); 10: n = 42,732 (t = 242 s), 33 dB SPL, 37 dB SPL/Hz (68 dB SPL).

 
For mammalian ANFs, phase locking to tones decays rapidly as a function of frequency and is often undetectable in responses to high-frequency tones (Johnson 1980bGo; Palmer and Russell 1986Go; Ruggero 1992Go; for a contrary view, see Teich et al. 1993Go). Nevertheless, the h1s of ANFs with CFs as high as 12 kHz contained small but measurable near-CF oscillations at delays matching those of the corresponding h2s (e.g., compare the blue and red traces for the ANFs with CFs of 9.3 and 12.1 kHz in Fig. 9). To obtain spectral information from such residual oscillations, each raw h1 (dashed line in Fig. 10 A) was windowed in the time domain (thick solid line in Fig. 10A) according to the normalized h2 FSV. [Note that the long duration of the time windows guaranteed that the h1 waveforms were not altered (Fig. 10A).] In ANFs with BF as high as 12.1 kHz, the windowed h1s (e.g., thick line in Fig. 10B) often exhibited clear periodicity similar to that of the h2 FSVs (thin line in Fig. 10B) and the frequencies of the peak magnitudes of their Fourier transforms closely matched the BFs of the h2 FSVs (Fig. 10C; see also Fig. 11). Except for the above-noted {pi} ambiguity, the phases of the windowed h1s also closely matched those of the h2 FSVs (Fig. 10D).



View larger version (16K):
[in this window]
[in a new window]
 
FIG. 11. Correspondence between the BFs of the 1st-order kernels (abscissa) and the FSVs of the 2nd-order kernels (ordinate). BFs of the h1s were obtained from the magnitude spectra of their windowed versions (see Fig. 10B). Only responses with reliable phase-locking (2nVS2 ≥ 10, where n is the number of spikes and VS is the vector strength) were included. Highest h1 BF is 12,375 Hz. See next figure and text below for the vector strength definition.

 
To study the dependency of phase locking on CF, we devised a measure analogous to the vector strength used in quantifying phase locking in responses to tones (Goldberg and Brown 1969Go), that is, the ratio between the amplitudes at BF of the 1st-order cross-correlations (revcors; see Eq. 8) and the (unattenuated) stimulus-noise waveforms

(15)
Vrevcor is the Fourier magnitude of the windowed revcor at BF (thick line in Fig. 10C). Vnoise is the average of the Fourier magnitudes of the noise stimulus at BF, measured in consecutive time intervals (identical to revcor times), which together spanned the full duration of the stimulus. The same time functions (e.g., thick solid line in Fig. 10A) were used to window the revcors and the noise stimuli.

VSnoise is a dimensionless quantity that varies from zero to one. It equals one when all spikes are preceded by effective waveforms with the same, fixed latency. VSnoise is plotted in Fig. 12 as a function of h1 BF. VSnoise is relatively constant, 0.68 on average, in the frequency range 100 Hz to 2 kHz, and decays precipitously at higher frequencies, at a rate of about –18 dB/octave. Because VSnoise is based on 1st-order cross-correlation, its decay with BF reflects the extent to which frequency-tuned auditory signals (such as basilar membrane vibrations) are low-pass filtered by more central cochlear processes (e.g., the generation of receptor potentials at the inner hair cells).



View larger version (20K):
[in this window]
[in a new window]
 
FIG. 12. BF dependency of the magnitudes of reverse correlations (revcors). Ordinate indicates VSnoise (Eq. 15), a dimensionless quantity between zero and one analogous to the vector strength of period histograms for responses to tones (Goldberg and Brown 1969). Abscissa indicates h1 BF. Circles: data points with 2nVSnoise2 >10; squares 2nVSnoise2 <10. Trend line equals mean VSnoise below 2 kHz and has a slope of –18 dB/oct at higher frequencies. Stimulus noise spectral levels averaged –0.9 ± 16.0 dB SPL/Hz (27.3 ± 15.7 dB SPL in ERB) in 178 responses with available frequency threshold curves (FTCs). ANFs of Figs. 8 and 9 are identified by numbers; the ANF represented in Fig. 10 is identified by crosses; the ANFs represented in Figs. 1, 3, and 4 are identified by letter F. See text for details.

 
Wiener kernels as a function of stimulus level

Figure 13 depicts h1s for responses of a representative low-CF ANF to noise stimuli presented at 7 different levels. With increasing stimulus intensity, the number of detectable oscillations in the h1s decreased (indicating deterioration of frequency selectivity) and the "center of gravity" (group delay) shifted to earlier times (Fig. 13A). The onset time of the h1s, however, remained independent of the intensity of the stimulus.

Figure 13, B and C display the amplitude- and phase-frequency spectra of the h1s of Fig. 13A. As the level of stimulation increased, response sensitivity decreased and the sharpness of frequency tuning (as reflected by Q10dB, the ratio of CF to bandwidth at 10 dB re peak value; not shown) was also reduced: e.g., Q10dB (43 dB SPL/Hz) = 1.02; Q10dB (–17 dB SPL/Hz) = 2.25. Similar changes were observed in other low-CF ANFs.

Plots of phases as a function of frequency (e.g., Fig. 13C) often showed a dependency on stimulus intensity. For frequencies below CF, phase lags increased with intensity. Near CF, phases did not change; above CF, phase lags decreased with increases of stimulus intensity. The phase of the kernel obtained using the most intense noise (43 dB SPL/Hz) typically lagged responses to lower-level stimuli at all frequencies.

Figure 14 shows h2 FSVs of a representative high-CF ANF computed from responses to noise stimuli presented at several intensities. In the time domain, the h2 FSVs shifted to earlier times systematically as a function of increasing stimulus intensity (solid color traces in Fig. 14A). In the frequency domain, these time shifts corresponded to decreasing slopes of the phase-versus-frequency curves with increasing stimulus level (Fig. 14C), concomitant with reductions in near-BF group delay. These changes were typically accompanied by decreases in BF and sharpness of frequency tuning (Fig. 14B): e.g., Q10dB (37 dB SPL/Hz) = 4.73; Q10dB (2 dB SPL/Hz) = 6.46.

Timing features of 2nd-order Wiener-kernels

The h2 FSVs were well described by a function of BF and three additional parameters

(16)
where A is related to the latency of the h2 FSV envelope peak, B is related to the width of the h2 FSV envelope, and {phi} is the response phase at BF.

The onset times of the h2 FSVs, defined as the times when the (envelope) fit curves first surpassed 5% of their peak amplitudes, are plotted in Fig. 15 A. For BF <2.7 kHz, the onset times varied roughly as a linear function of log BF. At a BF of 2.7 kHz, there was a discontinuity in the dependency of onset time on BF. For BF >2.7 kHz, the onset times also varied roughly linearly as a function of log BF but with a shallower slope than that for lower BFs. We attribute the 2.7-kHz discontinuity to the inability of the h2s of high-CF ANFs to detect the earliest responses evoked by low-level noise. The response onset corresponds to the (linear) tail components of the basilar-membrane frequency response, which is typically undetectable in basilar-membrane responses to low-level clicks (Recio et al. 1998Go) or in 1st-order Wiener kernels of basilar-membrane responses to low-level noise (Recio et al. 1997Go). Presumably, the response onsets are even further buried in the noise in the case of ANF Wiener kernels, which have very restricted (20–30 dB) dynamic range. Therefore the onset times of Fig. 15A cannot be equated with "signal-front delay" as defined elsewhere (Goldstein et al. 1971Go; Ruggero 1980Go). The discontinuity occurs at a BF around 3 kHz, coinciding with (but not necessarily causally related to) the BF at which the low-frequency flanks of tuning curves undergo a drastic slope transition in chinchilla (Temchin et al. 1997Go). [Such slope transitions have also been described for ANF tuning curves in gerbil (Ohlemiller and Echteler 1990Go; Schmiedt 1989Go) and cat (Liberman 1978Go).]



View larger version (23K):
[in this window]
[in a new window]
 
FIG. 15. Onset latencies and near-BF group delays of ANF responses to noise. A: onset latencies of h2 FSVs (measured in 193 ANFs; circles) and revcors (from 44 other ANFs; triangles). Noise levels are according to the legend of Fig. 12A. Lines represent linear/log fits computed for BFs below and above 2.7 kHz separately. B: group delays (negative of the slope) near the BF (symbols), computed from phase-vs.-frequency curves. Each symbol represents responses of a single ANF to the noise stimulus presented at the lowest available intensity. Noise level was –4.6 ± 16.0 dB SPL/Hz (23.4 ± 15.4 dB SPL in ERB) on average. Delays were computed from h2 FSVs (open circles) in 122 ANFs and from revcors in 24 other ANFs for which h2 FSVs were unavailable (triangles; unpublished data of M. A. Ruggero and N. C. Rich). [ANFs 8 and 10 of Fig. 9 are represented in Fig. 15B by (unidentified) responses to lower-level noise stimuli than those represented in Fig. 9.] Filled circles are group delays for 8 cat ANFs computed from Fig. 4, B and C of van der Heijden and Joris (2003) and corrected by 1.225 ms (to make them comparable to the chinchilla data). Fit function for the chinchilla near-BF group delays (solid thick line) has the following equation: {tau}BFGD = 1.721 + 1.863BF–0.771 (r2 = 0.95; n = 161). Both the group delays and the onset latencies include an acoustic/equipment delay of 0.225 ms. ANFs depicted in Figs. 8 and 9 are marked by numbers; the ANFs represented in Figs. 1, 3, and 4 are identified by letter F.

 
For most ANFs, the value of parameter A as a function of BF (expressed in kHz) is well described (r2 = 0.95; n = 237) by the following equation

(17)
The relationship between parameters B and BF can be roughly specified (r2 = 0.69; n = 237) by the following equation

(18)

The near-BF group delays of the h2 FSVs, computed from phase-versus-frequency curves, are plotted in Fig. 15B as a function of BF. The group delays lie along a locus well described (r2 = 0.95; n = 161) by the following equation

(19)
where BF is expressed in kHz.

Because both the detectable onset latencies and the near-BF group delays varied with stimulus level (e.g., Fig. 14), it is of interest to establish the relative levels of the stimuli that evoked the responses represented in Fig. 15. As indicated above, the lowest level of noise stimulation was chosen to just exceed thresholds. Such stimulus levels evoked a mean discharge rate (h0) exceeding the spontaneous rate by 29.3 ± 21.5 spikes/s on average (n = 123). To put that number in perspective, recall that tone thresholds correspond to levels that elicited rates of 20 spikes/s higher than the spontaneous rates. To gain further insight into the levels of the noise stimuli relative to tone thresholds, we roughly estimated the effective SPLs of the noise stimuli on the basis of their spectral levels and the bandwidths of the ANF responses. Equivalent rectangular bandwidths (ERBs) were computed from the magnitude-versus-frequency curves of the h2 FSVs (inset of Fig. 16). The main part of Fig. 16 indicates the total noise pressures in the ERBs relative to tone CF threshold as a function of h2 FSV BF. On average, total noise pressure in the ERBs exceeded CF thresholds by 8.6 ± 9.7 dB (n = 122). Such relative levels, combined with the fact that the evoked discharge rates exceeded spontaneous activity by only 29.3 spikes/s, suggest that the data represented in Fig. 15 were obtained at levels that exceeded noise thresholds by not much more than 10 dB.

Frequency glides in Wiener kernels

In one important respect, Eq. 16 does not adequately describe the h2 FSVs: their instantaneous frequencies were not constant, but rather increased or decreased monotonically immediately after response onset, depending on BF. These "frequency glides" [first described by Møller (Møller 1977bGo; Møller and Nilsson 1979Go) for the revcors of low-BF ANFs] were quantified by Hilbert transformation of the h2 FSVs. The magnitudes and phases of the Hilbert transforms give, respectively, the envelope and instantaneous frequencies of the oscillations. Figure 17, A and B, respectively, show the h2 FSVs of representative low- and high-BF ANFs (continuous traces), together with their instantaneous frequencies (dashed lines). The dotted lines indicate regression lines computed over the ranges where the frequency glides were largest (as estimated by visual inspection). Figure 17C summarizes the variation as a function of BF of the magnitudes and directions of the frequency glides of h1s and h2s. The frequency glides are expressed in dimensionless units obtained by dividing the regression slope (kHz/ms) by CF2 (Shera 2001