JN AJP citation statistics
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Neurophysiol 94: 2251-2254, 2005. First published May 31, 2005; doi:10.1152/jn.00329.2005
0022-3077/05 $8.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
94/3/2251    most recent
00329.2005v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Web of Science (10)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Fecteau, S.
Right arrow Articles by Belin, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Fecteau, S.
Right arrow Articles by Belin, P.

REPORT

Sensitivity to Voice in Human Prefrontal Cortex

Shirley Fecteau1,2, Jorge L. Armony3, Yves Joanette1,2 and Pascal Belin1,2

1Département de Psychologie, Université de Montréal, Montreal; 2Institut Universitaire de Gériatrie de Montréal, Montreal; and 3Douglas Hospital Research Centre, McGill University, Montreal, Canada

Submitted 30 March 2005; accepted in final form 28 May 2005


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
We report two functional MRI (fMRI) experiments showing sensitivity to human voice in a region of human left inferior prefrontal cortex, pars orbitalis. The voice-enhanced response was observed for speech as well as nonlinguistic vocalizations and was stronger for emotional than neutral vocalizations. This region could constitute a human prefrontal auditory domain similar to the one recently identified in the macaque brain.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
In the macaque brain, extracellular recording experiments have shown the existence of a bilateral auditory domain in ventrolateral prefrontal cortex. Neurons in this region show robust responses to complex sounds and respond more strongly to species-specific vocalizations and human vocalizations compared with nonvocal auditory stimuli (Romanski and Goldman-Rakic 2002Go; Romanski et al. 2005Go). These neurons associated with auditory activity were located in area 47/12 that is, according to the anatomical boundaries identified by Petrides and Pandya (2002)Go, outside of the human homologue of Broca's area.

In the human brain, various parts of the left prefrontal cortex have been shown to be activated during several tasks such as auditory working memory, semantic, and phonological processing. One important prefrontal region involved in speech processing is Broca's area (Bookheimer 2002Go; Friederici 2002Go; Gelfand and Bookheimer 2003Go), traditionally defined as pars opercularis and pars triangularis of the left inferior prefrontal gyrus [Brodmann areas (BAs) 44 and 45]. However, it is still unclear whether Broca's area constitutes an auditory field homologous to the one identified in the macaque brain (Romanski and Goldman-Rakic 2002Go; Romanski et al. 2005Go). We measured auditory-related activity in prefrontal cortex of normal human volunteers during listening to several categories of natural sounds. Our objective was to study prefrontal region's involvement in voice processing.


    METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Fifteen healthy adults (6 women; 22.6 ± 2.7 yr) took part in two fMRI experiments within the same scanning session. The ethical committee from Université de Montréal approved the study, and all volunteers provided written informed consent before participating.

In the first fMRI experiment, participants were scanned while passively listening to several categories of natural sounds that included human vocalizations and nonhuman sounds from the environment (see Fig. 1A for the average frequency distribution and the spectrogram of a representative sample for each category). Instructions were to carefully listen to the stimuli. Human vocalizations consisted of 48 speech stimuli (e.g., phonemes, sentences of 1st and foreign languages) and nonlinguistic vocalizations (e.g., coughs, clearing throat). Nonhuman sounds category was composed of 48 nonhuman animal vocalizations (24 cat vocalizations and 24 mixed-animal vocalizations: alligator, antelope, bird, camel, chimpanzee, chinchilla, cow, dog, donkey, eagle, elephant, frog, hippopotamus, horse, kangaroo, koala, moose, pig, raccoon, rooster, sea lion, sheep, turtle, and whale) and 48 nonvocal sounds (e.g., musical instruments, environmental sounds such as toilet flushing, car horn). Null-events were included as a baseline (n = 48). Mean duration of human vocal stimuli was 1.2 ± 0.2 (SD) s and nonhuman stimuli was 1.45 ± 0.3 s.



View larger version (68K):
[in this window]
[in a new window]
 
FIG. 1. Stimulus categories from (A) experiment 1 and (B) experiment 2. Left: average frequency distribution. Right: spectrogram of a representative sample in each category. Color indicates amplitude.

 
In the second fMRI experiment, participants were scanned while performing a gender identification task by pressing the right or the left mouse button on emotional and neutral nonlinguistic vocalizations. Stimuli were matched for emotional valence (described elsewhere, see Fecteau et al. 2005Go). Stimuli were 96 emotional nonlinguistic vocalizations (24 stimuli for each category: laugh, sexual vocalization, cry, and fearful scream) and emotionally neutral vocalizations (e.g., cough, clearing throat). Figure 1B shows the average frequency distribution and the spectrogram of a representative sample for each stimulus category. Null-events were also included as a baseline (n = 48). Mean durations were as follows: laughs, 1.6 ± 0.4 s; sexual vocalizations, 1.6 ± 0.4 s; cries, 1.7 ± 0.4 s; fearful screams, 1.7 ± 0.4 s; neutral vocalizations, 1.5 ± 0.4 s. Each human vocalization was produced by a different adult speaker (24 male and 24 female speakers). An independent group of individuals (n = 60) had previously judged all stimuli in terms of emotional valence (whether the speaker was expressing a positive, negative, or a neutral emotion) using visual analog scales (1–100) (Fecteau et al. 2005Go). The order of stimulus presentation was counterbalanced across participants. Participants in the fMRI experiment also rated the vocalizations on valence after their scanning session. As found with the independent group of participants who previously rated the stimuli, significant differences in valence ratings between stimuli of positive valence, negative valence, and neutral stimuli were observed (pairwise comparisons between laughs and neutral vocalizations, sexual and neutral vocalizations, cries and neutral vocalizations, fearful screams and neutral vocalizations; P < 0.0001).

In both fMRI experiments, stimuli were presented in a pseudo-randomized order with an average stimulus onset asynchrony of 4.3 s. They were presented binaurally through pneumatic headphones, acting as a low-pass filter with a cut-off frequency around 2 kHz, sealed by foam ear inserts, and further shielded by plastic ear defenders that provided an attenuation of fMRI scanning noise of about 30 dB. Stimuli were delivered using MCF (DigiVox, Montreal, Canada), with a sound-pressure level of 85–90 dB.

We used a 1.5-T MRI system (Magnetom Vision, Siemens Electric, Erlangen, Germany). Functional scans were acquired with a single-shot echoplanar gradient-echo pulse sequence (TR = 2.6, TE = 40 ms, flip angle = 90°, field of view = 215 mm, matrix = 128 x 128). The 28 axial slices (resolution 3.75 x 3.75 mm in plane, 5 mm between planes) in each volume were aligned with the AC-PC line, covering the whole brain. A total of 320 volumes were acquired in each session. In addition, T1-weighted anatomical images were obtained for each participant, (1 mm x 1 mm x 1 mm resolution). Scanner noise was continuous throughout the experiment providing a constant auditory background. Image processing and statistical analysis were performed using SPM99 (Wellcome Department of Cognitive Neurology). Data analysis was performed in a two-stage mixed-effects analysis (equivalent to a random effects analysis) in which BOLD responses for each participant were first modeled using a synthetic hemodynamic function in the context of the fixed-effects general linear model. Subject-specific linear contrasts on the parameter estimates were entered into a second-level analysis to perform between-subjects analyses, resulting in a t-statistic for each voxel. These t-statistics (transformed to Z-statistics) constitute a statistical parametric map (SPM). SPMs were thresholded at P = 0.001. All statistical comparisons involved comparisons between all experimental conditions and null-events and between some of the experimental conditions: experiment 1, human voice (speech and nonlinguistic vocalizations) versus nonhuman sounds (cat vocalizations, mixed-animal vocalizations, nonvocal sounds); experiment 2, emotional nonlinguistic vocalizations (laughs, sexual vocalizations, cries, fearful screams) versus neutral nonlinguistic vocalizations.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
In the first experiment, only one region of prefrontal cortex, located in the left inferior frontal gyrus, pars orbitalis (BA 47, Talairach coordinates –42, 26, –10; 304 mm3; t = 5.28; z-score = 3.85; P < 0.0001 uncorrected), showed greater activity for sounds of human voice compared with nonhuman sounds (cat vocalizations, mixed-animal vocalizations, nonvocal sounds; Fig. 2, A–C). This region responded to speech sounds but also, to a similar extent, to human vocal sounds without linguistic content such as coughs and laughs (Fig. 2C). Interestingly, the homologous region in the right hemisphere, which showed a smaller activation, responded only to nonlinguistic vocalizations (Fig. 2D).



View larger version (76K):
[in this window]
[in a new window]
 
FIG. 2. Brain responses when human voice stimuli were contrasted with nonhuman sounds (P < 0.001) in (A) coronal and (B) sagittal views. Effect size (arbitrary units; averaged between 0 and 14 s; ±SE) associated with stimulus categories compared with baseline (null-events) in left Brodmann area (BA) 47 (–42, 26, –10) and its right hemisphere homologue are presented. Effect size associated with human voice and nonhuman sounds in (C) left and (D) right BA 47 (experiment 1). Effect size associated with nonlinguistic vocalizations in (E) left and (F) right BA 47 (left: t = 2.32; z-score = 2.61; P = 0.01; right: t = 1.45; z-score = 1.54; P = 0.07 uncorrected; experiment 2). Effect size of each stimulus category of (G) experiment 1 and (H) experiment 2 for each participant in left BA 47. LH, left hemisphere.

 
In the second experiment, significant enhanced activity was observed in the same left prefrontal region when the emotional conditions were compared with the neutral one (distance between peaks of the 2 experiments <3 mm; –44, 26, –14; 96 mm3; t = 2.80; z-score = 3.33; P = 0.002 uncorrected; Fig. 2E). The sexual vocalizations were the ones that yielded the strongest activity when contrasted with the neutral ones.


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
These results show that a restricted part of BA 47 in the left human prefrontal cortex is particularly involved in processing human voice. This preferential response was observed whether vocal sounds contained linguistic information or not (experiment 1), but was sensitive to their emotional content (experiment 2). Previous studies focusing on the temporal lobes have outlined such voice-selective responses along the superior temporal sulci (STS), but have not studied the frontal lobe (Belin et al. 2000Go). Using prosody, a similar finding has recently been reported in the temporal lobe, showing greater activation for emotional compared with neutral prosody within a voice-selective area along the STS (Grandjean et al. 2005Go). A converging result has also been obtained by Wildgruber et al. (2005)Go, showing enhanced activity in bilateral BA 47 associated with emotional prosody compared with rest.

Importantly, the voice-sensitive prefrontal region found in this study is distinct from Broca's area. When the speech stimuli were contrasted with the nonlinguistic vocalizations, greater responses in Broca's area (BA 44; –50, 18, 26; 184 mm3; t = 5.03; z-score = 3.74; P < 0.0001 uncorrected) were observed. Activations within Broca's area are associated with naming, phonological, syntactic, semantic processes (Bookheimer 2002Go; Friederici 2002Go), verbal working memory (Prabhakaran et al. 2000Go), and action understanding and imitation (Nishitani et al. 2005Go). In contrast, BA 47 has been linked to semantic processes (Dapretto and Bookheimer 1999Go; Petersen et al. 1990Go) and voiced speech production (Schulz et al. 2005Go). A recent report by Rama and Courtney (2005)Go showed enhanced activity in BA 45/47 associated with voice delays processing, whereas face delays processing elicited greater activity in BA 44/45, suggesting an across-modality dissociation (voice vs. face) in working memory within the ventral prefrontal cortex. Recent studies also suggested a role of BA 47 in visual spatial selectivity in the human brain (Rizzuto et al. 2005Go), as well as the involvement of the ventral prefrontal cortex in auditory spatial selectivity and monkey-call selectivity in the macaque brain (Cohen et al. 2004Go). Thus more work is needed to fully characterize the functional role of BA 47 (Fiez 1997Go; Price 1998Go).

An important issue is the possible effect of participants' experience with the sounds categories: studies in the visual domain have shown the importance of expertise with object categories in the activation of the face-selective fusiform cortex (see Tarr and Gauthier 2000Go for review). Therefore we cannot rule out the possibility that left BA 47 responses to human voice that we observed may have been modulated by stimulus familiarity or expertise. Also, it will be important to better characterize the role of acoustic structure on the voice-related activation. Indeed, Romanski et al. (2005)Go observed similar neuronal responses to functionally different, but acoustically similar vocalizations. The low-pass filtering by the pneumatic auditory stimulation system, attenuating frequencies above 2 kHz and thus particularly influencing higher-pitched animal vocalizations, could have influenced the present pattern of results.

Our findings suggest that the region in left BA 47 outlined in this study is involved in the processing of human voices. It may therefore be part of a human counterpart of the auditory domain in the prefrontal monkey brain extending over areas 12 and 45 described by Romanski and colleagues (Romanski and Goldman-Rakic 2002Go; Romanski et al. 2005Go). Indeed, area 12 in the monkey brain is similar in topography and cytoarchitecture to part of the human BA 47 (Petrides and Pandya 2002Go) and projections from auditory superior temporal region in the human and the monkey brain target BA 47/12 and BA 45 (Petrides and Pandya 1988Go). In summary, this study showed that a region in the left BA 47 is involved in human voice processing, suggesting the existence of an auditory domain in the left human prefrontal cortex distinct from Broca's area.


    GRANTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
This study was supported by National Sciences and Engineering Research Council, Université de Montréal, Fonds Québecois de Recherche sur la Nature et les Technologies to P. Belin, and Canadian Institutes of Health Research to J. L. Armony and P. Belin.


    ACKNOWLEDGMENTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
We thank anonymous reviewers for constructive comments on a previous version.


    FOOTNOTES
 
The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

Address for reprint requests and other correspondence: S. Fecteau, Dept. de Psychologie, Univ. de Montréal, C.P. 6128, Succ. Centre-ville, Montreal, Quebec H3C 3J7, Canada (E-mail: shirley.fecteau{at}umontreal.ca)


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 GRANTS
 ACKNOWLEDGMENTS
 REFERENCES
 
Belin P, Zatorre R, Lafaille P, Ahad P, and Pike B. Voice-selective areas in human auditory cortex. Nature 403: 309–312, 2000.[CrossRef][Medline]

Bookheimer S. Functional MRI of language: new approaches to understanding the cortical organization of semantic processing. Annu Rev Neurosci 25: 151–188, 2002.[CrossRef][Web of Science][Medline]

Cohen YE, Russ BE, Gifford GW III, Kiringoda R, and MacLean KA. Selectivity for the spatial and nonspatial attributes of auditory stimuli in the ventrolateral prefrontal cortex. J Neurosci 24: 11307–11316, 2004.[Abstract/Free Full Text]

Dapretto M and Bookheimer SY. Form and content: dissociating syntax and semantics in sentence comprehension. Neuron 24: 427–432, 1999.[CrossRef][Web of Science][Medline]

Fecteau S, Armony JL, Joanette Y, and Belin P. Judgment of emotional nonlinguistic vocalizations: age-related differences. Appl Neuropsychol 12: 40–48, 2005.[CrossRef][Web of Science][Medline]

Fiez JA. Phonology, semantics, and the role of the left inferior prefrontal cortex. Hum Brain Mapp 5: 79–83, 1997.[CrossRef][Web of Science][Medline]

Friederici AD. Towards a neural basis of auditory sentence processing. Trends Cogn Sci 6: 78–84, 2002.[CrossRef][Web of Science][Medline]

Gelfand JR and Bookheimer SY. Dissociating neural mechanisms of temporal sequencing and processing phonemes. Neuron 38: 831–842, 2003.[CrossRef][Web of Science][Medline]

Grandjean D, Sander D, Pourtois G, Schwartz S, Seghier ML, Scherer KL, and Vuilleuimer P. The voices of wrath: brain responses to angry prosody in meaningless speech. Nat Neurosci 8: 145–146, 2005.[CrossRef][Web of Science][Medline]

Nishitani N, Schurmann M, Amunts K, and Hari R. Broca's region: from action to language. Physiology 20: 60–95, 2005.[Abstract/Free Full Text]

Petersen SE, Fox PT, Snyder AZ, and Raichle ME. Activation of extrastriate and frontal cortical areas by words and word-like stimuli. Science 249: 1041–1044, 1990.[Abstract/Free Full Text]

Petrides M and Pandya DN. Association fiber pathways to the frontal cortex from the superior temporal region in the rhesus monkey. J Comp Neurol 273: 52–66, 1988.[CrossRef][Web of Science][Medline]

Petrides M and Pandya DN. Comparative cytoarchitectonic analysis of the human and the macaque ventrolateral prefrontal cortex and corticocortical connection patterns in the monkey. Eur J Neurosci 16: 291–310, 2002.[CrossRef][Web of Science][Medline]

Prabhakaran V, Narayanan K, Zhao Z, and Gabrieli JDE. Integration of diverse information in working memory within the frontal lobe. Nat Neurosci 3: 85–90, 2000.[CrossRef][Web of Science][Medline]

Price CJ. The functional anatomy of word comprehension and production. Trends Cogn Sci 2: 281–288, 1998.

Rama P and Courtney SM. Functional topography of working memory for face or voice identity. Neuroimage 24: 224–234, 2005.[CrossRef][Web of Science][Medline]

Rizzuto DS, Mamelak AN, Sutherling WW, Fineman I, and Andersen RA. Spatial selectivity in human ventrolateral prefrontal cortex. Nat Neurosci 8: 415–417, 2005.[Web of Science][Medline]

Romanski LM, Averbeck BB, and Diltz M. Neural representation of vocalizations in the primate ventrolateral prefrontal cortex. J Neurophysiol 93: 734–747, 2005.[Abstract/Free Full Text]

Romanski LM and Goldman-Rakic PS. An auditory domain in primate prefrontal cortex. Nat Neurosci 5: 15–16, 2002.[CrossRef][Web of Science][Medline]

Schulz GM, Varga M, Jeffires K, Ludlow CL, and Braun AR. Functional neuroanatomy of human vocalization: an H215O PET study. Cereb Cortex 2005.

Tarr MJ and Gauthier I. FFA: a flexible fusiform area for subordinate-level visual processing automatized by expertise. Nat Neurosci 3: 764–769, 2000.[CrossRef][Web of Science][Medline]

Wildgruber D, Riecker A, Hertrich I, Erb M, Grodd W, Ethofer T, and Ackermann H. Identification of emotional intonation evaluated by fMRI. Neuroimage 24: 1233–1241, 2005.[CrossRef][Web of Science][Medline]




This article has been cited by other articles:


Home page
Cereb CortexHome page
C. F. Altmann, H. Nakata, Y. Noguchi, K. Inui, M. Hoshiyama, Y. Kaneoke, and R. Kakigi
Temporal Dynamics of Adaptation to Natural Sounds in the Human Auditory Cortex
Cereb Cortex, June 1, 2008; 18(6): 1350 - 1360.
[Abstract] [Full Text] [PDF]


Home page
Proc R Soc BHome page
P. Belin, S. Fecteau, I. Charest, N. Nicastro, M. D Hauser, and J. L Armony
Human cerebral response to animal affective vocalizations
Proc R Soc B, March 7, 2008; 275(1634): 473 - 481.
[Abstract] [Full Text] [PDF]


Home page
Cereb CortexHome page
C. F. Altmann, O. Doehrmann, and J. Kaiser
Selectivity for Animal Vocalizations in the Human Auditory Cortex
Cereb Cortex, November 1, 2007; 17(11): 2601 - 2608.
[Abstract] [Full Text] [PDF]


Home page
Cereb CortexHome page
L. M. Romanski
Representation and Integration of Auditory and Visual Stimuli in the Primate Ventral Lateral Prefrontal Cortex
Cereb Cortex, September 1, 2007; 17(suppl_1): i61 - i69.
[Abstract] [Full Text] [PDF]


Home page
Phil Trans R Soc BHome page
P. Belin
Voice processing in human and non-human primates
Phil Trans R Soc B, December 29, 2006; 361(1476): 2091 - 2107.
[Abstract] [Full Text] [PDF]


Home page
J. Neurosci.Home page
T. Sugihara, M. D. Diltz, B. B. Averbeck, and L. M. Romanski
Integration of Auditory and Visual Communication Information in the Primate Ventrolateral Prefrontal Cortex
J. Neurosci., October 25, 2006; 26(43): 11138 - 11147.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
94/3/2251    most recent
00329.2005v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Web of Science (10)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Fecteau, S.
Right arrow Articles by Belin, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Fecteau, S.
Right arrow Articles by Belin, P.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Visit Other APS Journals Online
Copyright © 2005 by the The American Physiological Society.