ITA
ENG

Relevance of time-frequency features for phonetic and speaker-channel classification

Authors

Yang, HH Van Vuuren, S Sharma, S Hermansky, H

Citation

Hh. Yang et al., Relevance of time-frequency features for phonetic and speaker-channel classification, SPEECH COMM, 31(1), 2000, pp. 35-50

Citations number

Categorie Soggetti

Computer Science & Engineering

Journal title

SPEECH COMMUNICATION

ISSN journal

01676393 → ACNP

Volume

Issue

Year of publication

2000

Pages

35 - 50

Database

ISI

SICI code

0167-6393(200005)31:1<35:ROTFFP>2.0.ZU;2-3

Abstract

The mutual information concept is used to study the distribution of speech information in frequency and in time. The main focus is on the information that is relevant for phonetic classification. A large database of hand-labe led fluent speech is used to (a) compute the mutual information (MI) betwee n a phonetic classification variable and one spectral feature variable in t he time-frequency plane, and (b) compute the joint mutual information (JMI) between the phonetic classification variable and two feature variables in the time-frequency plane. The MI and the JMI of the feature variables are u sed as relevance measures to select inputs for phonetic classifiers. Multi- layer perceptron (MLP) classifiers with one or two inputs are trained to re cognize phonemes to examine the effectiveness of the input selection method based on the MI and the JMI, To analyze the non-linguistic sources of vari ability, we use speaker-channel labels to represent different speakers and different telephone channels and estimate the MI between the speaker-channe l variable and one or two feature variables. (C) 2000 Elsevier Science B.V. All rights reserved.