AUTOMATIC SEGMENTATION AND LABELING OF MULTILINGUAL SPEECH DATA

Citation
A. Vorstermans et al., AUTOMATIC SEGMENTATION AND LABELING OF MULTILINGUAL SPEECH DATA, Speech communication, 19(4), 1996, pp. 271-293
Citations number
35
Categorie Soggetti
Communication,"Language & Linguistics
Journal title
ISSN journal
01676393
Volume
19
Issue
4
Year of publication
1996
Pages
271 - 293
Database
ISI
SICI code
0167-6393(1996)19:4<271:ASALOM>2.0.ZU;2-2
Abstract
A new system for the automatic segmentation and labelling of speech is presented. The system is capable of labelling speech originating from different languages without requiring extensive linguistic knowledge or large (manually segmented and labeled) training databases of that l anguage. The system comprises small neural networks for the segmentati on and the broad phonetic classification of the speech. These networks were originally trained on one task (Flemish continuous speech), and are automatically adapted to a new task. Due to the limited size of th e neural networks, the segmentation and labelling strategy requires bu t a limited amount of computations, and the adaptation to a new task c an be accomplished very quickly. The system was first evaluated on fiv e isolated word corpora designed for the development of Dutch, French, American English, Spanish and Korean text-to-speech systems. The resu lts show that the accuracy of the obtained automatic segmentation and labelling is comparable to that of human experts. In order to provide segmentation and labelling results which can be compared to data repor ted in the literature, additional tests were run on TIMIT and on the E nglish, Danish and Italian portions of the EUROMO continuous speech ut terances. The performance of our system appears to compare favourably to that of other systems.