ITA
ENG

LEARNING TO SEGMENT SPEECH USING MULTIPLE CUES - A CONNECTIONIST MODEL

Authors

CHRISTIANSEN MH ALLEN J SEIDENBERG MS

Citation

Mh. Christiansen et al., LEARNING TO SEGMENT SPEECH USING MULTIPLE CUES - A CONNECTIONIST MODEL, Language and cognitive processes, 13(2-3), 1998, pp. 221-268

Citations number

Categorie Soggetti

Language & Linguistics","Psychology, Experimental

Journal title

Language and cognitive processes → ACNP

ISSN journal

01690965

Volume

Issue

2-3

Year of publication

1998

Pages

221 - 268

Database

ISI

SICI code

0169-0965(1998)13:2-3<221:LTSSUM>2.0.ZU;2-A

Abstract

Considerable research in language acquisition has addressed the extent to which basic aspects of linguistic structure might be identified on the basis of probabilistic cues in caregiver speech to children. This type of learning mechanism presents classic learnability issues: ther e are aspects of language for which the input is thought to provide no evidence, and the evidence that does exist tends to be unreliable. We address these issues in the context of the specific problem of learni ng to identify lexical units in speech. A simple recurrent network was trained on a phoneme prediction task. The model was explicitly provid ed with information about phonemes, relative lexical stress, and bound aries between utterances. Individually these sources of information pr ovide relatively unreliable cues to word boundaries and no direct evid ence about actual word boundaries. After training on a large corpus of child-directed speech, the model was able to use these cues to reliab ly identify word boundaries. The model shows that aspects of linguisti c structure that are not overtly marked in the input can be derived by efficiently combining multiple probabilistic cues. Connectionist netw orks provide a plausible mechanism for acquiring, representing, and co mbining such probabilistic information.