SPEECH RECOGNITION WITH PRIMARILY TEMPORAL CUES

Citation
Rv. Shannon et al., SPEECH RECOGNITION WITH PRIMARILY TEMPORAL CUES, Science, 270(5234), 1995, pp. 303-304
Citations number
26
Categorie Soggetti
Multidisciplinary Sciences
Journal title
ISSN journal
00368075
Volume
270
Issue
5234
Year of publication
1995
Pages
303 - 304
Database
ISI
SICI code
0036-8075(1995)270:5234<303:SRWPTC>2.0.ZU;2-H
Abstract
Nearly perfect speech recognition was observed under conditions of gre atly reduced spectral information. Temporal envelopes of speech were e xtracted from broad frequency bands and were used to modulate noises o f the same bandwidths. This manipulation preserved temporal envelope c ues in each band but restricted the listener to severely degraded info rmation on the distribution of spectral energy. The identification of consonants, vowels, and words in simple sentences improved markedly as the number of bands increased; high speech recognition performance wa s obtained with only three bands of modulated noise. Thus, the present ation of a dynamic temporal pattern in only a few broad spectral regio ns is sufficient for the recognition of speech.