ITA
ENG

A comparison of speaker identification results using features based on cepstrum and Fourier-Bessel expansion

Authors

Gopalan, K Anderson, TR Cupples, EJ

Citation

K. Gopalan et al., A comparison of speaker identification results using features based on cepstrum and Fourier-Bessel expansion, IEEE SPEECH, 7(3), 1999, pp. 289-294

Citations number

Categorie Soggetti

Eletrical & Eletronics Engineeing

Journal title

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING

ISSN journal

10636676 → ACNP

Volume

Issue

Year of publication

1999

Pages

289 - 294

Database

ISI

SICI code

1063-6676(199905)7:3<289:ACOSIR>2.0.ZU;2-D

Abstract

A compact representation of speech is possible using Bessel functions becau se of the similarity between voiced speech and the Bessel functions, Both v oiced speech and the Bessel functions exhibit quasiperiodicity and decaying amplitude with time. This paper presents the results of speaker identifica tion experiments using features obtained from 1) the Fourier-Bessel expansi on and 2) the cepstral representation of speech frames. Identification scor es of 65% and 76% were achieved using features based on J(1)(t) expansion o f air-to-ground speech transmission databases of 143 and 1054 test utteranc es, respectively. The corresponding scores for the two databases using ceps tral coefficients. of a comparable size were 80% and 88%, A comparison of t he two sets of features indicates that J(1)(t) can be used to model the hea ring perception much like the mel cepstral coefficients.