Mh. Christiansen et al., LEARNING TO SEGMENT SPEECH USING MULTIPLE CUES - A CONNECTIONIST MODEL, Language and cognitive processes, 13(2-3), 1998, pp. 221-268
Considerable research in language acquisition has addressed the extent
to which basic aspects of linguistic structure might be identified on
the basis of probabilistic cues in caregiver speech to children. This
type of learning mechanism presents classic learnability issues: ther
e are aspects of language for which the input is thought to provide no
evidence, and the evidence that does exist tends to be unreliable. We
address these issues in the context of the specific problem of learni
ng to identify lexical units in speech. A simple recurrent network was
trained on a phoneme prediction task. The model was explicitly provid
ed with information about phonemes, relative lexical stress, and bound
aries between utterances. Individually these sources of information pr
ovide relatively unreliable cues to word boundaries and no direct evid
ence about actual word boundaries. After training on a large corpus of
child-directed speech, the model was able to use these cues to reliab
ly identify word boundaries. The model shows that aspects of linguisti
c structure that are not overtly marked in the input can be derived by
efficiently combining multiple probabilistic cues. Connectionist netw
orks provide a plausible mechanism for acquiring, representing, and co
mbining such probabilistic information.