ITA
ENG

SPEECH RECOGNITION WITH PRIMARILY TEMPORAL CUES

Authors

SHANNON RV ZENG FG KAMATH V WYGONSKI J EKELID M

Citation

Rv. Shannon et al., SPEECH RECOGNITION WITH PRIMARILY TEMPORAL CUES, Science, 270(5234), 1995, pp. 303-304

Citations number

Categorie Soggetti

Multidisciplinary Sciences

Journal title

Science → ACNP

ISSN journal

00368075

Volume

270

Issue

5234

Year of publication

1995

Pages

303 - 304

Database

ISI

SICI code

0036-8075(1995)270:5234<303:SRWPTC>2.0.ZU;2-H

Abstract

Nearly perfect speech recognition was observed under conditions of gre atly reduced spectral information. Temporal envelopes of speech were e xtracted from broad frequency bands and were used to modulate noises o f the same bandwidths. This manipulation preserved temporal envelope c ues in each band but restricted the listener to severely degraded info rmation on the distribution of spectral energy. The identification of consonants, vowels, and words in simple sentences improved markedly as the number of bands increased; high speech recognition performance wa s obtained with only three bands of modulated noise. Thus, the present ation of a dynamic temporal pattern in only a few broad spectral regio ns is sufficient for the recognition of speech.