AUTOMATIC PHONE SEGMENTATION AND LABELING OF CONTINUOUS SPEECH

Authors
Citation
Cg. Jeong et H. Jeong, AUTOMATIC PHONE SEGMENTATION AND LABELING OF CONTINUOUS SPEECH, Speech communication, 20(3-4), 1996, pp. 291-311
Citations number
28
Categorie Soggetti
Communication,"Language & Linguistics
Journal title
ISSN journal
01676393
Volume
20
Issue
3-4
Year of publication
1996
Pages
291 - 311
Database
ISI
SICI code
0167-6393(1996)20:3-4<291:APSALO>2.0.ZU;2-N
Abstract
To obtain an accurate phone sequence from a continuous speech signal, we suggest a novel approach consisting of tightly coupled bottom-up an d top-down processing. The bottom-up path consists of segmentation, re cognition and labeling. Also the top-down path consists of labeling, s peech generation and segmentation. In this manner, the four processes form a closed feedback loop achieving an optimal interpretation effici ently for a given noisy observation of speech signal and a priori know ledge. The major goal of this paper is to identify the system model us ing both the stochastic estimation theory and the mean field theory. E xperimental results are obtained in terms of the TIMIT database. It is shown that introducing the top-down path to the traditional bottom-up path can improve the recognition rate by 19.7%, and reduce the error (substitution, deletion and insertion) rate by 16.1%. As a result, the overall system can transform the incoming continuous signal into one of the 61 phone classes at the rate of 73.7%.