A CORRECTING METHOD FOR PITCH EXTRACTION USING NEURAL NETWORKS

Citation
A. Ogihara et K. Fukunaga, A CORRECTING METHOD FOR PITCH EXTRACTION USING NEURAL NETWORKS, IEICE transactions on fundamentals of electronics, communications and computer science, E77A(6), 1994, pp. 1015-1022
Citations number
NO
Categorie Soggetti
Engineering, Eletrical & Electronic","Computer Science Hardware & Architecture","Computer Science Information Systems
ISSN journal
09168508
Volume
E77A
Issue
6
Year of publication
1994
Pages
1015 - 1022
Database
ISI
SICI code
0916-8508(1994)E77A:6<1015:ACMFPE>2.0.ZU;2-7
Abstract
Pitch frequency is a basic characteristic of human voice, and pitch ex traction is one of the most important studies for speech recognition. This paper describes a simple but effective technique to obtain correc t pitch frequency from candidates (pitch candidates) extracted by the short-range autocorrelation function. The correction is performed by a neural network in consideration of the time continuation that is real ized by referring to pitch candidates at previous frames. Since the ne ural network is trained by the back-propagation algorithm with trainin g data, it adapts to any speaker and obtains good correction without s ensitive adjustment and tuning. The pitch extraction was performed for 3 male and 3 female announcers, and the proposed method improves the percentage of correct pitch from 58.65% to 89.19%.