Temporal encoding of the voice onset time phonetic parameter by field potentials recorded directly from human auditory cortex

Citation
M. Steinschneider et al., Temporal encoding of the voice onset time phonetic parameter by field potentials recorded directly from human auditory cortex, J NEUROPHYS, 82(5), 1999, pp. 2346-2357
Citations number
83
Categorie Soggetti
Neurosciences & Behavoir
Journal title
JOURNAL OF NEUROPHYSIOLOGY
ISSN journal
00223077 → ACNP
Volume
82
Issue
5
Year of publication
1999
Pages
2346 - 2357
Database
ISI
SICI code
0022-3077(199911)82:5<2346:TEOTVO>2.0.ZU;2-1
Abstract
Temporal encoding of the voice onset time phonetic parameter by field poten tials recorded directly from human auditory cortex. J. Neurophysiol. 82: 23 46-2357, 1999. Voice onset time NOT) is an important parameter of speech th at denotes the time interval between consonant onset and the onset of low-f requency periodicity generated by rhythmic vocal cord vibration. Voiced sto p consonants (/b/, /g/, and /d/) in syllable initial position are character ized by short VOTs, whereas unvoiced stop consonants (/p/, /k/, and t/) con tain prolonged VOTs. As the VOT is increased in incremental steps, percepti on rapidly changes from a voiced stop consonant to an unvoiced consonant at an interval of 20-40 ms. This abrupt change in consonant identification is an example of categorical speech perception and is a central feature of ph onetic discrimination. This study tested the hypothesis that VOT is represe nted within auditory cortex by transient responses time-locked to consonant and voicing onset. Auditory evoked potentials (AEPs) elicited by stop cons onant-vowel (CV) syllables were recorded directly from Heschl's gyrus, the planum temporale, and the superior temporal gyrus in three patients undergo ing evaluation for surgical remediation of medically intractable epilepsy. Voiced CV syllables elicited a triphasic sequence of field potentials withi n Heschl's gyrus. AEPs evoked by unvoiced CV syllables contained additional response components time-locked to voicing onset. Syllables with a VOT of 40, 60, or 80 ms evoked components time-locked to consonant release and voi cing onset. In contrast, the syllable with a VOT of 20 ms evoked a markedly diminished response to voicing onset and elicited an AEP very similar in m orphology to that evoked by the syllable with a 0-ms VOT. Similar response features were observed in the AEPs evoked by click trains. In this case, th ere was a marked decrease in amplitude of the transient response to the sec ond click in trains with interpulse intervals of 20-25 ms. Speech-evoked AE Ps recorded from the posterior superior temporal gyrus lateral to Heschl's gyrus displayed comparable response features, whereas field potentials reco rded from three locations in the planum temporale did not contain component s time-locked to voicing onset. This study demonstrates that VOT at least p artially is represented in primary and specific secondary auditory cortical fields by synchronized activity time-locked to consonant release and voici ng onset. Furthermore, AEPs exhibit features that may facilitate categorica l perception of stop consonants, and these response patterns appear to be b ased on temporal processing limitations within auditory cortex. Demonstrati ons of similar speech-evoked response patterns in animals support a role fo r these experimental models in clarifying selected features of speech encod ing.