A FREQUENCY WARPING APPROACH TO SPEAKER NORMALIZATION

Authors
Citation
L. Lee et R. Rose, A FREQUENCY WARPING APPROACH TO SPEAKER NORMALIZATION, IEEE transactions on speech and audio processing, 6(1), 1998, pp. 49-60
Citations number
14
Categorie Soggetti
Engineering, Eletrical & Electronic",Acoustics
ISSN journal
10636676
Volume
6
Issue
1
Year of publication
1998
Pages
49 - 60
Database
ISI
SICI code
1063-6676(1998)6:1<49:AFWATS>2.0.ZU;2-V
Abstract
In an effort to reduce the degradation in speech recognition performan ce caused by variation in vocal tract shape among speakers, a frequenc y warping approach to speaker normalization is investigated, A set of low complexity, maximum likelihood based frequency warping procedures have been applied to speaker normalization for a telephone based conne cted digit recognition task. This paper presents an efficient means fo r estimating a linear frequency warping factor and a simple mechanism for implementing frequency warping by modifying the filterbank in mel- frequency cepstrum feature analysis, An experimental study comparing t hese techniques to other well-known techniques for reducing variabilit y is described, The results have shown that frequency warping is consi stently able to reduce word error rate by 20% even for very short utte rances.