ITA
ENG

SPEECH ENHANCEMENT USING SPECTRAL SUBTRACTION WITH WAVELET TRANSFORM

Authors

NISHIMURA R ASANO F SUZUKI Y SONE T

Citation

R. Nishimura et al., SPEECH ENHANCEMENT USING SPECTRAL SUBTRACTION WITH WAVELET TRANSFORM, Electronics and communications in Japan. Part 3, Fundamental electronic science, 81(1), 1998, pp. 24-31

Citations number

Categorie Soggetti

Engineering, Eletrical & Electronic

Journal title

Electronics and communications in Japan. Part 3, Fundamental electronic science → ACNP

ISSN journal

10420967

Volume

Issue

Year of publication

1998

Pages

24 - 31

Database

ISI

SICI code

1042-0967(1998)81:1<24:SEUSSW>2.0.ZU;2-K

Abstract

For speech enhancement based on spectral estimation/analysis, an analy tic technique by which speech signals can he easily distinguished from noise is desired. The wavelet transform (WT) is an analysis tool for which various types of basis functions can be used. By selecting a pro per fundamental wavelet. speech energy can be effectively localized in the space transformed by the WT. In this article, we apply the WT to the spectral subtraction technique, originally defined as using the sh ort-time Fourier transform (STFT), and evaluate the effectiveness of i ts outcome. Considering the structure of the human voice, we use Gabor and Daubechies wavelets as well as a decaying sinusoid as the fundame ntal wavelet. The results of computer simulations show that the S/N ra tio was improved by the proposed method employing the decaying sinusoi d as compared with conventional spectral subtraction. In articulation tests with Japanese nonsense monosyllables, however, no significant di fference could be observed. (C) 1998 Scripta Technica.