1Department of Integrative Physiology, National Institute for Physiological Sciences, Aichi; 2Research Institute of Science and Technology for Society, Japan Science and Technology Agency, Tokyo; and 3Department of Physiological Sciences, School of Life Sciences, The Graduate University for Advanced Studies, Kanagawa, Japan
Submitted 4 January 2007; accepted in final form 2 March 2007
| ABSTRACT |
|---|
| INTRODUCTION |
|---|
There are three potential mechanisms of within-modal spatial attention. One is a gain-regulation mechanism (Hillyard and Mangun 1987; Hillyard et al. 1999; Kanwisher and Wojciulik 2000; Posner and Dehaene 1994), which is derived from animal studies indicating attentional influences on evoked electrical responses in the sensory pathway (Hernández-Peón et al. 1956, 1966; Oatman and Anderson 1977). These early studies revealed an enhancement of the response when the animal's attention was directed to a stimulus, but a reduction when attention was directed elsewhere (Hernández-Peón et al. 1956, 1966; Oatman and Anderson 1977). There is abundant evidence supporting the gain-regulation mechanism (Corbetta et al. 1991; Hillyard and Mangun 1987; Luck et al. 1997; Moran and Desimone 1985; Reynolds et al. 2000).
A second mechanism is an attention-induced activation of a separate neuronal population that is not activated by unattended stimuli (Näätänen 1992; Näätänen et al. 1978). This was well investigated in EEG studies as the so-called endogenous component of event-related brain potentials (ERPs), whereas the classical gain-regulation mechanism corresponds to the enhancement of an exogenous component (Hillyard et al. 1999). If a response is enhanced by attention together with changes in the scalp distribution and waveform of the response, the attentional enhancement is interpreted as a result of the overlapped activation of a separate neuronal population. In their EEG studies, by contrast, Hillyard et al. (1999) regarded the attentional enhancement without changes in the spatial distribution or waveform as evidence of the enhanced activation of the same neuronal population as that for unattended stimuli, i.e., the gain-regulation mechanism.
The third mechanism is a tonic bias effect, indexed by a sustained baseline increase in activity in single-neuron recordings (Luck et al. 1997) or PET (Rees et al. 1997). Such a sustained baseline increase will produce sustained attentional modulation, such as slow deflections in EEG/MEG recordings. However, because of the baseline corrections commonly used in EEG/MEG recordings, these methods cannot investigate the possibility that sustained baseline changes contribute to modulating neuronal responses to attended versus unattended stimuli. Therefore, if the attentional modulation is observed as a phasic rather than a sustained effect in EEG/MEG studies, the gain-regulation mechanism explains the modulation better than does the tonic bias effect. Accordingly, the attentional enhancement of an evoked response without changes in waveform and spatial distribution predicts a gain-regulation mechanism (Hillyard et al. 1999). However, the mechanisms underlying cross-modal links in spatial attention are still unknown. EEG/MEG measures that record a neural response with high temporal resolution are a good way to examine these possible mechanisms, which can act dynamically within the order of several milliseconds. In addition, MEG, which records a neural response with a higher spatial resolution than EEG, may provide important information, although, of course, it should be considered that there is neural activity that cannot be detected by MEG.
Here, we used whole-head MEG to reveal time-varying cortical processes underlying the cross-modal links in visual–tactile spatial-selective attention. To this end, we analyzed neural responses to electrocutaneous stimuli in a selective-attention task where a visual or electrocutaneous stimulus was presented in a random order on the right or left side (hand or hemifield) when vision or touch was task relevant. Unimodal attention studies in the somatosensory modality found little or no effect on the neural response in the primary somatosensory cortex (SI) to a somatosensory stimulus, but did find an enhancement of responses with a latency of about 80 ms in the secondary somatosensory cortex (SII) in humans (Fujiwara et al. 2000; Hamada et al. 2004; Mima et al. 1998). Higher attentional sensitivity in SII is also reported in monkeys (Chapman and Meftah 2005; Meftah et al. 2002). Previous EEG studies failed to find links from vision to touch in spatial attention when touch was completely task irrelevant (Eimer and Driver 2000), but demonstrated clear evidence of links when vision was primary and touch was of secondary relevance to the task or when a cueing task was used (Eimer and Driver 2000; Eimer et al. 2001, 2002). MEG has the advantage of recording magnetic fields induced by tangential currents such as the activity in the upper wall of the sylvian fissure (SII), and therefore can reveal the cortical locus of cross-modal links, to which EEG is blind. Therefore we hypothesized that directing attention to visual stimuli enhances the response in the somatosensory cortex, especially in SII, to tactile stimuli that are presented near the visually attended locations, as does directing attention to the tactile stimuli.
| METHODS |
|---|
Recordings were obtained from 11 healthy right-handed subjects (two women and nine men), aged 23 to 40 yr. All subjects gave written informed consent before the study, which was first approved by the Ethics Committee at the National Institute for Physiological Sciences.
Stimulation
The subjects, seated in a magnetically shielded and semidarkened room, put their hands and forearms comfortably on an obliquely oriented board in front of them. An electrocutaneous or a visual stimulus was presented in a random order on the left or right side to elicit neural responses (Fig. 1). A double-pulse visual stimulus was infrequently embedded in a sequence of single-pulse visual stimuli, thus providing four types of visual stimuli differing in spatial location and number of pulses. The duration of the single-pulse stimulus was 300 ms, whereas the double-pulse stimulus was illuminated for 120 ms, turned off for 60 ms, and illuminated again for 120 ms. The visual stimuli were presented through two plastic optical fibers (POFs: diameter 1 cm, length 3 m; Multi-core Luminous POF, Asahi Kasei, Yokohama, Japan), one each on the left and right sides. A red light-emitting diode (LED, luminance 35 cd/m2, measured at the subject's nasion) was attached to one end of each fiber outside of the shielded room. The other end was located in the shielded room (Fig. 1). The fiber was covered with black resin so that subjects could see the light at only one section. The apparatus induced no magnetic artifacts and enabled one to present a visual stimulus at preferred locations, such as near the subject's hands, as in this study.
Task
In the present study, we used a classic selective-attention task to test attentional modulation (Eimer and Driver 2000; Hillyard et al. 1973; Kida et al. 2004a). This is one of the general experimental paradigms used to test sustained attention. Subjects executed the stimulus-discrimination task under four conditions in which the direction of attention varied (left/tactile, right/tactile, left/visual, and right/visual attention conditions). In each condition, a total of eight kinds of stimuli differing in spatial location (left and right), sensory modality (vision and touch), and number of pulses (single and double) were presented in a random order at a random interstimulus interval (ISI, 800–1,200 ms). Double-pulse stimuli were also presented for the irrelevant modality. The subjects were instructed to fixate on a point at a viewing distance of 40 cm and to put the second digit of each hand near the visual stimuli. In the touch-relevant/attend-right condition, subjects were instructed to attend covertly to electrocutaneous stimuli presented to the right second digit and to count silently the number of electrocutaneous double-pulse stimuli presented to the digit. Therefore in this condition, the visual single-pulse stimulus presented in the right hemifield was regarded as an external input from the task-irrelevant modality but was spatially attended (cross-modal spatial attention). By contrast, the electrocutaneous single-pulse stimulus presented to the right second digit was regarded as a spatially attended input from the relevant modality (within-modal spatial attention). The electrocutaneous and visual single-pulse stimuli on the left side were regarded as spatially unattended inputs from the task-relevant and -irrelevant modalities, respectively. In each of the other three conditions, subjects were asked to attend to the designated side and to count only the number of double-pulse stimuli presented on the designated side in the relevant modality. They reported the number after the termination of each block. Each of the four attentional conditions was run in a randomized order.
Each condition contained about 600 trials (about 250 single-pulse electrocutaneous, 250 single-pulse visual, 50 double-pulse electrocutaneous, and 50 double-pulse visual, presented approximately equally on the left and right sides), divided into two blocks. Each block lasted about 5 min. The number of double-pulse targets varied between blocks (11–14 per block, 24–26 per condition) to ensure subjects counted the target stimuli. Two blocks were run for each condition. The experiment lasted about 50–60 min. The interstimulus interval (ISI) between stimuli in each sensory modality (including single- and double-pulse stimuli) was 1,600–2,400 ms, between double-pulse stimuli (including visual and tactile stimuli) was about 6 s, and between double-pulse target stimuli was about 24 s. Before each condition, subjects were given verbal instructions about the direction of attention to be directed and the type of stimuli to be counted.
Recordings and analysis
Data acquisition started a few seconds after verbal instructions were issued by an experimenter at the beginning of each condition and then, after a few more seconds, stimulation started. The second block was also started with verbal instructions, a few minutes after the termination of the first block. The MEG was recorded with a helmet-shaped 306-channel detector array (Vectorview, Elekta Neuromag Oy, Helsinki, Finland), which consisted of 102 identical triple-sensor elements. Each sensor element consisted of two orthogonal planar gradiometers and one magnetometer coupled to a multi-SQUID (superconducting quantum interference device) and thus provided three independent measurements of the magnetic fields. We analyzed the MEG recorded from 204-channel planar-type gradiometers. They were filtered with a band-pass filter of 0.03–150 Hz and digitized at a sampling rate of 1,024 Hz. The analysis period was from 100 ms before to 250 ms after the stimulus. Only neural responses to the electrocutaneous single-pulse stimulus (frequently presented standard stimulus) were analyzed, for the following reasons. 1) We aimed to examine pure selective-attention effects, not "target" effects involved in the modulation of the target-evoked response (García-Larrea et al. 1995; Kida et al. 2004a). 2) We could not analyze the MEG waveform in response to the target double-pulse stimuli because of the low signal-to-noise ratio resulting from the small number of trials averaged. 3) The MEG waveform evoked by the target double-pulse stimuli was difficult to obtain because of the momentary electrical artifact caused by the second pulse of the double-pulse stimulus. The MEG data were filtered off-line with a low-pass filter of 50 Hz. The positions of the four head-position indicator (HPI) coils attached to the subject's head were measured with respect to the three anatomical landmarks (nasion and bilateral preauricular points) using a 3D digitizer before the main experiment outside the shielded room to allow alignment of the MEG and magnetic resonance image (MRI) coordinate systems (3.0-T Siemens Allegra). Before the MEG recording, after the fixation of the subject's head to the helmet-shaped sensor, a current was fed to the four HPI coils placed at known sites and the resulting magnetic fields were measured with the magnetometer, to obtain the exact location of the head with respect to the sensors. The x-axis was fixed with the preauricular points, the positive direction being to the right. The positive y-axis passed through the nasion and the z-axis thus pointed upward.
Eye movements were monitored by an eye-movement monitor camera (ISCAN, Burlington, MA). Trials with eye movements >0.5° from the fixation point and with eye blinks were also excluded from the analysis. Trials with horizontal and vertical eye movements of >1° or with MEG signals >3,000 fT/cm were rejected from the averaging of MEG data.
We first calculated vector sums from the longitudinal and latitudinal derivatives of the response recorded on the planar-type gradiometers at each of the 102 sensor locations. This was obtained by squaring the MEG signals for each of the two planar-type gradiometers at a sensor location, summing the squared signals together, and then calculating the root of the sum [$\sqrt{(\partial B_z/\partial x)^2 + (\partial B_z/\partial y)^2}$; here we call this the "root sum square" (RSS)] (Kida et al. 2006a). The calculation was carried out for all 102 sensor locations. Next, we used the obtained RSS waveforms and isocontour map of the RSS amplitude to look for a peak channel showing the greatest amplitude for each prominent response, because those waveforms had several responses with a different spatial distribution of amplitude. Then, the peak amplitude and latency of prominent responses in the RSS waveform were measured at the peak channel.
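The RSS calculation and peak measurement described above can be sketched as follows. This is a minimal illustration using plain Python lists, not the authors' analysis code; the function names `root_sum_square` and `peak_amplitude_and_latency` and the sample values are hypothetical.

```python
import math


def root_sum_square(grad_a, grad_b):
    """Combine the two planar-gradiometer waveforms recorded at one
    sensor location into a single RSS waveform, sample by sample."""
    if len(grad_a) != len(grad_b):
        raise ValueError("both gradiometer waveforms need the same length")
    return [math.sqrt(a * a + b * b) for a, b in zip(grad_a, grad_b)]


def peak_amplitude_and_latency(rss, times_ms):
    """Return (peak amplitude, latency) of an RSS waveform.
    times_ms holds the latency of each sample, in the same order."""
    idx = max(range(len(rss)), key=rss.__getitem__)
    return rss[idx], times_ms[idx]


# Hypothetical three-sample waveforms (fT/cm) from one sensor location.
rss = root_sum_square([3.0, 6.0, 0.0], [4.0, 8.0, 5.0])
amplitude, latency = peak_amplitude_and_latency(rss, [84, 85, 86])
```

Because the squared signals are summed, the RSS is always nonnegative and does not depend on the orientation of the two gradiometers at a sensor location, which is why a single peak channel can be chosen per response.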
Because we did not deal with MEG responses to all kinds of double-pulse stimuli and of visual stimuli, we obtained eight kinds of waveform in response to the tactile single-pulse stimulus in a total of four conditions, and then arranged them into a 2 × 2 × 2 array according to spatial attention (attended or unattended), relevant modality (touch or vision), and stimulus side (left hand or right hand).
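The arrangement of the eight waveforms into this three-factor design can be illustrated with a short sketch. It assumes only what the task description states, namely that a stimulus counts as spatially attended when it is presented on the side the subject was told to attend; the function name `classify` and the string labels are hypothetical.

```python
from itertools import product


def classify(relevant_modality, attended_side, stimulus_side):
    """Map one attention condition plus the side of the tactile
    single-pulse stimulus onto a cell of the three-factor design."""
    attention = "attended" if attended_side == stimulus_side else "unattended"
    return (attention, relevant_modality, stimulus_side)


# Four conditions (2 relevant modalities x 2 attended sides), each with
# tactile stimuli on both hands, fill the eight cells exactly once.
cells = [
    classify(modality, attended, stimulus)
    for modality, attended, stimulus in product(
        ("touch", "vision"), ("left", "right"), ("left", "right")
    )
]
```

For example, a right-hand stimulus in the vision-relevant/attend-right condition falls in the cell `("attended", "vision", "right")`, which is the cross-modal case of interest.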
To examine whether the spatial distribution of magnetic responses was different or similar between attended and unattended conditions, we calculated Pearson's correlation coefficient r between conditions for the RSS amplitudes at the peak latency across sensor locations. The higher the correlation coefficient is, the more similar the spatial distributions of the two MEG responses are (if the correlation coefficient is 1.0, the two MEG responses tested have an identical spatial distribution). The coefficient was calculated for all possible pairs of two conditions.
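As a rough sketch of this comparison, the correlation between two sensor maps and the enumeration of condition pairs could look like the following. The function name `pearson_r`, the condition labels, and the short example maps are hypothetical; real maps would hold one RSS amplitude per sensor location.

```python
import math
from itertools import combinations


def pearson_r(map_a, map_b):
    """Pearson correlation between two spatial maps, where each map
    lists the RSS amplitude at every sensor location in the same order."""
    n = len(map_a)
    mean_a = sum(map_a) / n
    mean_b = sum(map_b) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(map_a, map_b))
    var_a = sum((a - mean_a) ** 2 for a in map_a)
    var_b = sum((b - mean_b) ** 2 for b in map_b)
    return cov / math.sqrt(var_a * var_b)


# For one response and one stimulus side there are four combinations of
# spatial attention and relevant modality, hence six condition pairs.
conditions = [
    ("attended", "touch"), ("unattended", "touch"),
    ("attended", "vision"), ("unattended", "vision"),
]
pairs = list(combinations(conditions, 2))

# A map that is a scaled copy of another has an identical spatial
# distribution, so r is 1 (up to rounding) despite the amplitude gain.
r_scaled = pearson_r([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0])
```

Note that r is insensitive to a uniform amplitude scaling, so a high correlation between attended and unattended maps is compatible with an attentional change in response strength.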
To identify the source of the equivalent current dipoles (ECDs), sources of measured responses to somatosensory stimulation were modeled with time-varying current dipoles (Sarvas 1987; Hämäläinen et al. 1993). The MEG signals were evaluated at successive time points, best describing the most dominant source of the response, by a least-squares search in a spherical volume conductor model for the head by using 18–24 sensors around a sensor that had been used to measure the peak amplitude of RSS waveforms. This analysis resulted in the three-dimensional (3D) location, moment, and direction of each ECD in a spherical conductor model. The goodness-of-fit (GOF) value of an ECD was calculated to indicate in percentage terms how much the dipole accounts for the measured field variance. ECDs which account for >90% of the GOF value in the sensor subset were accepted for further analysis. Finally, all sensors were used to compute the time-varying multi-dipole model allowing the strength of the previously found ECDs to change over the entire period of the analysis while the source locations and orientations were kept fixed.
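The text does not give the GOF formula. The sketch below assumes the commonly used definition, in which GOF is the percentage of the measured field variance explained by the dipole's predicted field; the function names, the example values, and the use of a plain list per sensor subset are all illustrative assumptions rather than the authors' implementation.

```python
def goodness_of_fit(measured, predicted):
    """Percentage of measured field variance accounted for by the model.
    ASSUMED definition: 100 * (1 - sum((b - b_hat)^2) / sum(b^2)),
    with b the measured and b_hat the dipole-predicted signal per sensor."""
    residual = sum((b - b_hat) ** 2 for b, b_hat in zip(measured, predicted))
    total = sum(b ** 2 for b in measured)
    return 100.0 * (1.0 - residual / total)


def accept_dipole(measured, predicted, threshold=90.0):
    """Apply the acceptance criterion from the text (GOF > 90%)."""
    return goodness_of_fit(measured, predicted) > threshold


# Hypothetical signals over a small sensor subset.
gof = goodness_of_fit([2.0, 0.0], [1.0, 0.0])
```

Under this definition a perfect prediction gives 100%, and the hypothetical example above gives 75%, which would fall below the 90% acceptance threshold.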
For the peak amplitude and latency of the RSS waveform, and the peak moment and 3D location of ECDs, a three-way repeated-measures ANOVA was performed with spatial attention (attended and unattended), relevant modality (touch and vision), and stimulus side (left and right) as factors. As we describe in RESULTS, five responses were analyzed in this study. Because these responses were obtained from different numbers of subjects, the three-way ANOVAs were performed separately for each of the five responses. Furthermore, to examine different modulation of the responses, a four-way ANOVA with a factor of response was performed. If the sphericity assumption was violated in Mauchly's sphericity test, the Greenhouse–Geisser (G-G) correction coefficient epsilon was used to correct the degrees of freedom, and then F- and P-values were recalculated. When the G-G correction was applied, the epsilon and corrected results were reported. A two-tailed paired t-test was used for the post hoc analysis. Statistical significance was set at P < 0.05.
Count accuracy was assessed by calculating the absolute deviation of the subject's target count from the correct target count in each of two blocks (absolute error) and then converting the total number of absolute errors to a percentage of the total correct count (a higher error rate represents more frequent failures to count the stimulus: 0% means that the subject's count equals the correct count and 100% means no counting).
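The error-rate definition above can be written out as a short sketch. The function name `count_error_rate` and the counts in the example are hypothetical and are not data from the study.

```python
def count_error_rate(reported_counts, correct_counts):
    """Count error as a percentage of the total correct count.
    Each list holds one value per block (two blocks per condition)."""
    total_abs_error = sum(
        abs(reported - correct)
        for reported, correct in zip(reported_counts, correct_counts)
    )
    return 100.0 * total_abs_error / sum(correct_counts)


# Hypothetical condition: 12 and 14 targets in the two blocks, with the
# subject missing one target in the second block.
rate = count_error_rate([12, 13], [12, 14])
```

With these hypothetical counts the total absolute error is 1 out of 26 targets, about 3.8%. Reporting zero in both blocks gives 100%, matching the "no counting" end of the scale.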
| RESULTS |
|---|
The rate of count error was very low (0.96% [SE, 0.64] in the touch-relevant/attend-right condition, 1.41% [0.73] in the touch-relevant/attend-left condition, 0.96% [0.64] in the vision-relevant/attend-right condition, and 1.41% [0.73] in the vision-relevant/attend-left condition), indicating that participants almost perfectly executed the counting task in all conditions. A two-factor ANOVA (direction of attention [left vs. right] × relevant sensory modality [vision vs. touch]) also showed that the error rates were not significantly affected by direction of attention [F(1,10) = 0.94, P = 0.36, n.s.] or relevant sensory modality [F(1,10) = 0.002, P = 0.96, n.s.] and that there was no interaction [F(1,10) = 0.002, P = 0.96, n.s.].
Somatosensory-evoked magnetic responses
Figure 2 shows the procedure used in this study (waveforms of magnetic fields recorded from gradiometers, the RSS waveform, isocontour maps of magnetic fields, and isocontour maps of RSS signals; data obtained for the electrocutaneous single-pulse stimulus to the left second digit in the touch-relevant/attend-left condition from a representative subject). The absence or presence of the response was carefully determined by visual inspection using superimposed waveforms in several conditions and the isocontour map. These data include peaks around SI in the hemisphere contralateral to the stimulus side (latency of about 50 ms, M50c), two peaks around the sylvian fissure of the contralateral hemisphere (about 85 and 150 ms for M85c and M150c, respectively), and two peaks around the sylvian fissure in the ipsilateral hemisphere (about 100 and 150 ms for M100i and M150i, respectively). The M50c isocontour map had a strongly focused activation around SI contralateral to the stimulation. The M85c and M100i maps had bilateral activations over the sylvian fissures in both hemispheres. The 150-ms map also had bilateral activations around the sylvian fissures in both hemispheres. Earlier components, the so-called N20m and P35m generated in SI, were clearly found in some subjects, but not in others because of a low signal-to-noise ratio. Therefore they were not analyzed in this study. M50c, M85c, and M100i were found for all 11 subjects, M150c for 10 subjects, and M150i for nine subjects.
The magnetic field distribution seemed to be very similar among conditions (Fig. 4, right). To examine the similarity, we compared magnetic field distributions for each pair of spatial attention (attended and unattended) and relevant sensory modality (touch and vision) for each stimulus side (left- and right-hand stimulus) by calculating Pearson's correlation coefficient r with the peak amplitudes of RSS at all 102 sensors for each pair (six pairs were analyzed for each response for each stimulus side). The data from the subject displayed in Fig. 4, left showed a highly positive correlation in most conditions, as plotted below the isocontour map. The mean value of correlation coefficients across subjects was also calculated for each response for each stimulus side. Correlations for all pairs showed a highly positive correlation (R > 0.7, P < 0.001), indicating that the magnetic field distribution was similar among all conditions.
ECD analysis
Figure 5 shows the location of ECDs superimposed on 2D and 3D MRI scans for somatosensory responses. The ECD of M50c was located around the posterior bank of the central sulcus, corresponding to area 3b in SI. The ECDs of M85c and M100i were located in the upper wall of the sylvian fissure in both hemispheres, corresponding to SII. The ECDs of bilateral M150 were also located around the upper wall of the sylvian fissure in both hemispheres. ANOVA indicated that the 3D coordinates of the ECD location for each response were not significantly different among conditions. Figure 5 (middle) shows the time course of ECD moment revealed by the multidipole analysis. Table 1 shows the peak moment of ECD. As a result of the three-way ANOVA for each response, a significant main effect of attention was found for the peak moment of SII M85c [F(1,10) = 26.9, P < 0.001], SII M100i [F(1,10) = 45.9, P < 0.001], SII M150c [F(1,9) = 11.8, P < 0.001], and SII M150i [F(1,8) = 11.8, P < 0.001], with the peak moment increased by spatial attention. A main effect of relevant sensory modality for the peak moment of SII M100i did not reach a significant level [F(1,9) = 4.2, P = 0.073]. A stimulus side × spatial attention interaction was found for SII M150c [F(1,9) = 8.9, P < 0.05], such that spatial attention increased the peak moment of this response for the left-hand but not the right-hand stimulus. The peak latency of the ECD moments did not significantly change with spatial attention or relevant modality. The interhemispheric difference of the peak latency showed the same results as the analysis of RSS. These results of the ECD analysis were consistent with the analysis of RSS with respect to spatial-attention effects.
| DISCUSSION |
|---|
Methodological considerations
In this study, we applied a classical selective-attention task to examine cross-modal attentional modulation. This task has long been used to examine selective-attention effects on the so-called EEG N1 response in a unimodal situation (Hillyard et al. 1973; Näätänen et al. 1978; for review also see Hillyard et al. 1999; Näätänen 1992). There are a number of advantages to this design. For example: 1) the difference in nonspecific arousal level between to-be-attended and to-be-unattended stimuli can be removed by randomizing the interstimulus interval (ISI) and the order of stimulus presentation; 2) the target effect or detection-related neural responses can be removed by presenting both target and nontarget (standard) stimuli and then analyzing neural responses to nontarget stimuli; and 3) the response can be recorded with a better signal-to-noise ratio because a relatively short ISI is possible and then a larger number of stimuli can be presented in one experiment. This also helps to minimize fatigue and head and body movements, which are undesirable in an MEG recording. A recent EEG study applied this kind of experimental design to the examination of cross-modal links in spatial-selective attention from vision to touch (Eimer and Driver 2000), but failed to find such links. Using this task, we first demonstrated cross-modal links from vision to touch in spatial-selective attention, indexed by neural responses in SII at a latency of about 80 ms.
Some previous studies investigating tactile–visual attention used a vibrotactile or tactile-pressure stimulus (Eimer and Driver 2000; Eimer et al. 2000, 2001), whereas we used an electrical stimulus as in other studies (Desmedt and Robertson 1977; García-Larrea et al. 1995; Kida et al. 2004a,b,c, 2006b; Mima et al. 1998), for the following reasons. 1) Responses can be more easily and reliably recorded for an electrical stimulation than for a natural tactile stimulation, possibly because of a high signal-to-noise ratio for the former. 2) There are numerous long-accumulated findings about cortical responses to electrical stimulation in EEG and MEG studies.
The reason that single- and double-pulse stimuli were used was to better demonstrate spatial-attention effects. Spatial-attention experiments generally require the discrimination of a target stimulus embedded in a sequence of nontarget (standard) stimuli. We considered that nontarget and target stimuli must be presented at the same location to keep attention directed to that location. Thus single- and double-pulse stimuli were used, respectively, as a nontarget stimulus and a target stimulus because, in this case, subjects must direct spatial attention to the first pulse of the double-pulse stimuli to detect it and inevitably direct attention to the single-pulse stimulus. Then, only MEG responses to single-pulse tactile stimuli were analyzed to avoid the contamination of the target effect or detection-related activity, which is not a pure selective-attention effect. We also used the same stimulus onset asynchrony (SOA) from the first to the second pulse of the double-pulse stimulus in both sensory modalities. The duration of electrocutaneous stimulation is commonly 0.1–0.5 ms and a long duration (e.g., 100 ms) is unusual. Therefore we used different durations for visual and tactile stimuli. Because the performance of the counting task was almost the same between the detection of visual and tactile target stimuli, we do not consider that the use of different durations undermines the reliability of the present findings. A similar experimental design using single-pulse and double-pulse stimuli was also used by Eimer and Driver (2000).
In this study, because all the participants performed all the tasks (a within-subject experiment), one may imagine subjects forgetting which modality was task relevant during one block or being otherwise influenced by the previous block of trials. The good performance indicated by the low error rates in counting in all the conditions shows that subjects did not forget which modality was task relevant. In addition, we randomized the order of conditions and can therefore rule out a contribution of order effects.
Within-modal spatial attention effect
The M85c and M100i responses were enhanced in magnitude by spatial attention when touch was task relevant, indicating a within-modal spatial-attention effect. These responses were located in the upper wall of the sylvian fissure in the contralateral and ipsilateral hemispheres, respectively, corresponding to SII. This localization is consistent with previous MEG studies (Akatsuka et al. 2006; Hari et al. 1993; Inui et al. 2003a,b; Kakigi et al. 2000; Kida et al. 2006a; Nakata et al. 2005; Wasaka et al. 2005) and intracranial recordings in humans (Frot and Mauguière 2000). An enhancement of the response by spatial attention was observed in both the RSS amplitude and ECD moment. The within-modal attentional enhancement in SII was consistently observed in somatosensory attention studies using MEG (Fujiwara et al. 2000; Hamada et al. 2004; Hoechstetter et al. 1998; Mima et al. 1998), although these neglected the spatial aspect of attention. For instance, Hamada et al. (2004), Hoechstetter et al. (1998), and Mima et al. (1998) compared MEG responses to stimulation of the left median or digital nerve between active and passive attention tasks. Fujiwara et al. (2000) compared responses to median nerve stimulation between somatosensory attention (attend somatosensory) and auditory attention (ignore somatosensory) tasks. Thus we first demonstrated that within-modal spatial attention enhances neural responses at a latency of about 80 ms in human SII. The absence of attentional modulation in SI is consistent with the above-mentioned attention studies that did not manipulate the spatial aspect of attention (Fujiwara et al. 2000; Hamada et al. 2004; Hoechstetter et al. 1998; Mima et al. 1998). Animal studies also reported a higher sensitivity of neurons in SII than in SI to attentional manipulation (Chapman and Meftah 2005; Meftah et al. 2002).
Some EEG studies reported that the N120 (García-Larrea et al. 1995) or N140 response (Kida et al. 2004a) recorded over the temporal area, which was assumed to originate from SII (García-Larrea et al. 1995), was not modulated by spatial attention, inconsistent with the present study showing the enhancement of the SII response. MEG has the advantage of picking up synchronized neural activities in the wall of the sulcus or fissure such as SII because, theoretically, it records magnetic fields produced by currents oriented tangentially to the surface of a spherical symmetric conductor (e.g., brain surface). In contrast, EEG records summated electric fields resulting from tangentially and radially oriented currents. This may explain the difference between results obtained with MEG and EEG studies.
On the other hand, previous studies including ours reported that the N140 recorded from frontocentral electrodes was enhanced in amplitude by spatial attention (Desmedt and Robertson 1977; García-Larrea et al. 1995; Kida et al. 2004a; Michie et al. 1987), as opposed to the absence of attentional enhancement of temporal N120 or N140. Therefore some of these studies assumed that the frontocentral N140 (a negative peak of the so-called vertex response) has a function different from that of the temporal N120 or N140 (García-Larrea et al. 1995; Kida et al. 2004a). We also found an attentional enhancement of the responses around this latency (M150c), especially for the left-hand stimulation, located around SII in most subjects. The activity of the estimated source was also enhanced by spatial attention. However, MEG studies reported that the activity in the anterior cingulate cortex or supplementary motor area (SMA) largely contributes to the frontocentral N140 (Allison et al. 1992; Waberski et al. 2002) and MEG studies also found activity in the SMA (Forss et al. 1996) and posterior parietal cortex (Forss et al. 1994; Hoshiyama et al. 1997) at this latency. Therefore we cannot conclude whether the enhancement of responses around the sylvian fissure at a latency of about 150 ms corresponds to that of the frontocentral N140. The present MEG analysis focused on the activity in SI and SII, although the relationship between spatial attention and the temporal dynamics of activity in other areas should be investigated in future studies.
Cross-modal spatial attention effect
A similar enhancement by spatial attention was found even when vision was task relevant, with an enhanced amplitude of the M85c and M100i responses in SII to tactile stimuli presented near where attention was directed to visual stimuli, thereby demonstrating a cross-modal link from vision to touch. The M50c response was not significantly modulated by spatial attention. Thus the cross-modal link in selective spatial attention from vision to touch is associated with somatosensory cortical processing at a latency of about 85 ms in SII. Eimer and colleagues extensively studied cross-modal links in spatial attention using EEG (Eimer and Driver 2000; Eimer et al. 2001, 2002). In their early study, Eimer and Driver (2000) used a similar selective-attention task to that used in the present study and reported that there was no evidence of cross-modal links from vision to touch when touch was completely task irrelevant (i.e., when vision was completely task relevant), whereas cross-modal links were found when vision was primary and touch was of secondary relevance to the task. These results led them to conclude that their findings were consistent with a behavioral study by Spence et al. (2000), who applied Posner's expectancy attention paradigm to the examination of cross-modal links (Spence et al. 1996). The behavioral study of visual–tactile cross-modal links by Spence et al. (2000) reported speeded and more accurate responses to both vibrotactile and visual target stimuli, when the target modality was not cued but the likely target side was cued before the target stimulus, compared with when the unlikely target side was cued or a neutral cue was presented. In their later studies using a cueing task different from that of the earlier one, Eimer et al. (2001, 2002) reported the existence of cross-modal links from vision to touch. We used MEG to find new cross-modal spatial-attention effects even when touch was completely task irrelevant in a selective-attention task similar to the early study by Eimer and Driver (2000). In addition, the cross-modal spatial-attention effect was observed at an earlier latency than reported in the EEG studies using a cueing task. As mentioned earlier, MEG has an advantage over EEG in being able to record the activity in SII, which explains the difference between our results and Eimer's results. The present study thus provides neurophysiological evidence of the involvement of neural activity at about 80 ms in cross-modal links from vision to touch even when touch is completely task irrelevant. In addition, we indicated the contribution of not SI but SII to the cross-modal links. The EEG studies mentioned earlier excellently demonstrated temporal loci of cross-modal links, although the spatial loci were obscured as the result of inherent weaknesses such as the volume conduction of the electric field and the problem caused by the reference electrode.
The organization of endogenous spatial attention can be explained in three ways. First, there may be quite separate modality-specific systems that operate independently on their respective representations of visual and tactile (and auditory) space. Second, there may be a single supramodal attentional system that allocates attention to locations in space regardless of the modality of the target being attended, modulating perception and neural activity as a function of location across sensory modalities. Third, there may be separable modality-specific attentional systems, but with links such that visual spatial attention tends to result in tactile attention to the corresponding location in tactile space and vice versa (Driver and Spence 2004; Spence and Driver 1996). Because we found cross-modal attentional modulation, fully separate modality-specific systems cannot account for the present results. If there is a supramodal attentional system, the size of the spatial-attention effect should be the same between tactile-evoked and visual-evoked responses (or between tactile and visual judgments). The present study did not analyze neural responses to visual stimuli and thus cannot directly address this mechanism. Previous studies provided behavioral or neurophysiological evidence of a separable-but-linked system (Chambers et al. 2004; Spence and Driver 1996) and of a supramodal system (Eimer and van Velzen 2002).
A more general point regarding cross-modal links is whether the observed effects of attention, irrespective of whether vision or touch was relevant, truly demonstrate the existence of meaningful cross-modal links in the behavioral context. Because we recorded neural responses only in situations where touch was either completely relevant or completely irrelevant, it is unclear whether the observed cross-modal modulation is truly essential for human behavior or is an inevitable by-product of the within-modal attention effects. The possible behavioral interpretations of the present results are 1) that attention in the secondary modality always and inevitably shifts in the same direction as in the primary modality (i.e., the cross-modal effect is an inevitable and nonmeaningful phenomenon); 2) that attention shifts in the same fashion as that delineated in the previous point, but it might just be that the subject cannot be bothered in some sense to shift one but not the other (i.e., this can be said to be an effective behavioral strategy); and 3) that attention in the secondary modality shifts in the same direction as in the primary modality when attention is not needed in the secondary modality, but can also shift in a different direction from the primary modality when attention is needed for the secondary modality at a different location from the primary modality (i.e., the cross-modal effect can be flexibly changed depending on the behavioral significance). Spence and Driver (2000; experiment 4) reported that when participants had a very strong spatial expectancy regarding the likely target location in just one modality (the most common primary modality), their covert spatial attention tended to shift there not only in that modality, but also in the other (secondary) modality. Thus Spence and colleagues argued that the only way to show the obligatory nature of such links was to make it disadvantageous for attention in the secondary modality to shift in the same direction as attention in the primary modality (Lloyd et al. 2003; Spence and Driver 1996; Spence et al. 1996). Therefore the next step in unveiling the neural system underlying cross-modal links is to examine changes in cross-modal attentional modulation as a function of the priority of modality and the degree of directional expectancy in a cueing task (e.g., in certain kinds of dual-task situations or with the manipulation of the likely target side in just one modality). On the other hand, the ability to record neural responses to sensory stimuli in a completely task-irrelevant modality is specific to neuroimaging studies and cannot be achieved by behavioral studies, which require a behavioral response or report from the subject; this is where the novelty of the present study lies.
Possible mechanisms for attentional modulation
In early EEG studies, the enhancement of a response by attention without changes in its spatial distribution and waveform was considered the enhancement of an exogenous component of ERPs (Hillyard and Mangun 1987; Hillyard et al. 1973). Later, this was called a gain-regulation mechanism (Hillyard et al. 1999). Several previous studies also suggested that attention can modulate the gain of neural responses to visual stimuli (Corbetta et al. 1990, 1991; Hawkins et al. 1990; Luck et al. 1997; Martinez et al. 1999; Moran and Desimone 1985; Reynolds et al. 2000). This concept seems to be based on the idea that the same neuronal population as that synchronously activated by unattended stimuli is facilitated by a top-down attentional signal. The present study revealed an attentional enhancement of evoked magnetic responses in SII without changes in the magnetic field distribution, waveform, or ECD location of the responses. In addition, the response to attended stimuli was likely to have the same phasic waveform as that to unattended stimuli. The combination of these results favors the classical gain-regulation mechanism, in which the neuronal population activated by unattended stimuli is more strongly activated by attended stimuli, rather than either a tonic bias effect, as indexed by a sustained baseline increase in activity while attention is directed (Kastner et al. 1999; Luck et al. 1997), or the attention-induced activation of a separate neuronal population (Näätänen et al. 1978). Therefore the present findings extend this gain-regulation mechanism to cross-modal spatial selective attention. Such a gain-regulation mechanism would presumably give an improved signal-to-noise ratio to inputs from attended locations, so that more information can be extracted from relevant portions of the extra- or intrapersonal space (Hawkins et al. 1990; Hillyard et al. 1999).
However, considering the original meaning of "gain," which represents an input-output relationship, the present findings are not sufficient to demonstrate a gain-regulation mechanism. In simple mathematical terms, gain regulation and baseline shifts are defined as multiplicative and additive attentional effects, respectively (Kanwisher and Wojciulik 2000). In the case of multiplicative gain regulation, the magnitude of the response to a given stimulus when attended (A) should equal the product of an attentional gain multiplier (g) and the magnitude of the response to the same stimulus when unattended (U): U × g = A. In the case of additive baseline shifts, the magnitude of the response to a given stimulus when attended (A) should be higher by a constant (K) than the magnitude of the response to the same stimulus when unattended (U): U + K = A. Under the multiplicative account, the gain-regulation mechanism produces a stronger attentional enhancement when the strength of a given stimulus is higher. On this point, the classical definition of the gain-regulation mechanism seems to be vague. To test the gain-regulation hypothesis more rigorously, it would be useful to examine how MEG responses vary as a function of stimulus strength and the direction of attention.
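The distinction between the two accounts can be illustrated numerically. The following is a minimal sketch, not an analysis from the present study: the response magnitudes, the gain multiplier g = 1.5, and the constant K = 5 are hypothetical values chosen only to show that the attention effect (A - U) scales with U under multiplicative gain regulation but stays constant under an additive baseline shift.

```python
def attended_multiplicative(u, g):
    """Gain regulation: attended response A = U * g."""
    return u * g


def attended_additive(u, k):
    """Baseline shift: attended response A = U + K."""
    return u + k


# Hypothetical unattended response magnitudes (arbitrary units) for
# weak, medium, and strong stimuli. These are illustrative, not measured.
unattended = [10.0, 20.0, 40.0]
g = 1.5  # assumed attentional gain multiplier
k = 5.0  # assumed additive baseline constant

# Attention effect (A - U) under each account
gain_effects = [attended_multiplicative(u, g) - u for u in unattended]
shift_effects = [attended_additive(u, k) - u for u in unattended]

print("multiplicative:", gain_effects)   # effect grows with U
print("additive:      ", shift_effects)  # effect is the same for every U
```

With these assumed values, the two accounts predict the same enhancement for the weakest stimulus and diverge only as stimulus strength increases, which is why varying stimulus strength is needed to tell them apart.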
EEG can record activity deep in the brain, whereas MEG is less sensitive to deep activity. Accordingly, the attentional modulation we observed may be regarded as a cortical phenomenon. Of course, the present study did not completely exclude the possibility that the neural systems underlying cross-modal spatial attention activate a separate neuronal population, related to other processes, to which MEG may be blind (e.g., in area 1 or 2, or in deep brain tissue), nor did it exclude the possibility of tonic bias effects. In fact, some previous EEG studies demonstrated the coexistence of the gain-regulation mechanism and the attention-induced activation of a separate neuronal population (Johannes et al. 1995; Teder et al. 1993). Somatosensory EEG studies also found both the former (Josiassen et al. 1982) and latter effects (Garcia-Larrea et al. 1995; Kida et al. 2004a; Michie et al. 1987; Valeriani et al. 2003). PET (Rees et al. 1997) and single-neuron (Luck et al. 1997) studies also reported both a baseline increase in activity and gain regulation. Possibly, there are multiple mechanisms underlying attentional modulation of neural and behavioral responses, although this was not directly tested in the present study.
A recent study using a cross-modal temporal order judgment task indicated that the peak latency of the visual evoked potentials (P1 and N1) was earlier when attention was directed to vision than when it was directed to touch, providing electrophysiological support for the existence of prior entry (Vibell et al. 2007). We could not find differences in the latency of the tactile-evoked responses in terms of attention and modality relevance.
With regard to the somatosensory cortical hierarchy, SII forms a ventral stream (Krubitzer et al. 1995; Pons et al. 1992) that projects to the premotor (Cavada and Goldman-Rakic 1989; Rizzolatti and Luppino 2001) and prefrontal cortices (Carmichael and Price 1995) and may be associated with the fine discrimination of somatosensory inputs (Binkofski et al. 1999; Romo et al. 2002). This functional notion about SII seems consistent with the above-mentioned function of the gain-regulation mechanism. Top-down attentional signals to enhance sensory-evoked neuronal responses may come from frontal and parietal areas (Corbetta and Shulman 2002; Fuggetta et al. 2006; Kanwisher and Wojciulik 2000), especially from multimodal areas activated during the performance of attention tasks (Macaluso et al. 2001, 2003). Considering the attentional hierarchy, a related but distinct physiological possibility is a feedback mechanism from higher- to lower-order areas, accompanied by a reduction of stimulus-evoked refractoriness or inhibition in cortical ensembles, as recently suggested on the basis of monkey studies in the visual system (Mehta et al. 2000a,b; Schroeder et al. 2001). This hypothesis seems interesting and plausible, in that it may integrate findings of human and monkey studies at several levels of physiology. It is therefore essential to examine experimentally and in detail whether this hypothesis can be applied to the response modulation by within-modal or cross-modal spatial attention in humans.
In conclusion, the present study used MEG to reveal the time course of neural responses related to visual-tactile cross-modal links in spatial-selective attention, focusing on somatosensory cortical activities. The high temporal and spatial resolution analysis of the attentional modulation revealed that the cross-modal link is represented by cortical processes around SII at a latency of about 80 ms. The mechanism underlying this cross-modal attentional modulation remains to be determined, but it is speculated that there may be an increase in gain and/or other mechanisms.
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: T. Kida, Department of Integrative Physiology, National Institute for Physiological Sciences, Myodaiji, Okazaki, 444-8585, Japan (E-mail: nikita{at}nips.ac.jp)
| REFERENCES |
|---|
Allison T, McCarthy G, Wood CC. The relationship between human long-latency somatosensory evoked potentials recorded from the cortical surface and from the scalp. Electroencephalogr Clin Neurophysiol 84: 301-314, 1992.
Beauchamp MS, Cox RW, DeYoe EA. Graded effects of spatial and featural attention on human area MT and associated motion processing areas. J Neurophysiol 78: 516-520, 1997.
Binkofski F, Buccino G, Posse S, Seitz RJ, Rizzolatti G, Freund HJ. A fronto-parietal circuit for object manipulation in man: evidence from an fMRI study. Eur J Neurosci 11: 3276-3286, 1999.
Carmichael ST, Price JL. Sensory and premotor connections of the orbital and medial prefrontal cortex of macaque monkeys. J Comp Neurol 363: 642-664, 1995.
Cavada C, Goldman-Rakic PS. Posterior parietal cortex in rhesus monkey: I. Parcellation of areas based on distinctive limbic and sensory corticocortical connections. J Comp Neurol 287: 393-421, 1989.
Chambers CD, Stokes MG, Mattingley JB. Modality-specific control of strategic spatial attention in parietal cortex. Neuron 44: 925-930, 2004.
Chapman CE, Meftah EM. Independent controls of attentional influences in primary and secondary somatosensory cortex. J Neurophysiol 94: 4094-4107, 2005.
Corbetta M, Kincade JM, Ollinger JM, McAvoy MP, Shulman GL. Voluntary orienting is dissociated from target detection in human posterior parietal cortex. Nat Neurosci 3: 292-297, 2000.
Corbetta M, Miezin FM, Dobmeyer S, Shulman GL, Petersen SE. Attentional modulation of neural processing of shape, color, and velocity in humans. Science 248: 1556-1559, 1990.
Corbetta M, Miezin FM, Dobmeyer S, Shulman GL, Petersen SE. Selective and divided attention during visual discriminations of shape, color, and speed: function