A pseudo glottal excitation model for the linear prediction vocoder with speech signals coded at 1.6 kbps

Citation
Ht. Hu et al., A pseudo glottal excitation model for the linear prediction vocoder with speech signals coded at 1.6 kbps, IEICE T INF, E83D(8), 2000, pp. 1654-1661
Number of citations
28
Subject categories
Information Technology & Communication Systems
Journal title
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS
ISSN journal
0916-8532
Volume
E83D
Issue
8
Year of publication
2000
Pages
1654 - 1661
Database
ISI
SICI code
0916-8532(200008)E83D:8<1654:APGEMF>2.0.ZU;2-R
Abstract
This paper presents a pseudo glottal excitation model for the type of linear prediction vocoders with speech being coded at 1.6 kbps. While unvoiced speech and silence intervals are processed with a stochastic codebook of 512 entries, a glottal codebook with 32 entries for voiced excitation is used to describe the glottal phase characteristics. Steps of formulating the pseudo glottal excitation for one pitch period consist of 1) applying a polynomial model to simulate the low-frequency constituent of the residual, 2) inserting a magnitude-adjustable pulse sequence to characterize the main excitation, and 3) introducing turbulent noise in series with the resulting excitation. Procedures are described for codebook construction in addition to analysis and synthesis of the pseudo glottal excitation. Results in a mean opinion score (MOS) test show that the quality produced by the proposed coder is almost as good as that by 4.8 kbps CELP coder for male utterances, but the quality for female utterances is yet somewhat inferior.
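
The three steps listed in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: the function names, the polynomial order, the pulse positions and gains, and the choice to add Gaussian noise to the excitation are all assumptions made for illustration, since the record gives no such details. The only figures taken from the abstract are the codebook sizes (512 and 32 entries), which correspond to 9-bit and 5-bit indices.

```python
import random


def codebook_index_bits(entries):
    """Bits needed to address a codebook with the given number of entries."""
    return (entries - 1).bit_length()


def pseudo_glottal_period(period_len, poly_coeffs, pulse_positions,
                          pulse_gains, noise_gain, seed=0):
    """Build one pitch period of a toy pseudo glottal excitation.

    All parameters are hypothetical; the abstract does not specify them.
    """
    rng = random.Random(seed)

    # Step 1: polynomial model of the low-frequency part of the residual,
    # evaluated on normalized time t in [0, 1).
    excitation = []
    for n in range(period_len):
        t = n / period_len
        excitation.append(sum(c * t ** k for k, c in enumerate(poly_coeffs)))

    # Step 2: insert a magnitude-adjustable pulse sequence (main excitation).
    for pos, gain in zip(pulse_positions, pulse_gains):
        excitation[pos] += gain

    # Step 3: introduce turbulent noise (assumed additive Gaussian here).
    return [x + noise_gain * rng.gauss(0.0, 1.0) for x in excitation]


# Codebook sizes quoted in the abstract.
stochastic_bits = codebook_index_bits(512)
glottal_bits = codebook_index_bits(32)

# Example: an 80-sample period with one main pulse and light noise.
period = pseudo_glottal_period(
    period_len=80,
    poly_coeffs=[0.0, 0.5, -0.5],
    pulse_positions=[10],
    pulse_gains=[1.0],
    noise_gain=0.01,
)
```

In a full vocoder the resulting excitation would drive the linear prediction synthesis filter; that stage is omitted here because the record does not describe it.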