ITA
ENG

ASR AND TTS TELECOMMUNICATIONS APPLICATIONS IN JAPAN

Authors

KITAI M HAKODA K SAGAYAMA S YAMADA T TSUKADA H TAKAHASHI S NODA Y TAKAHASHI J YOSHIDA Y ARAI K IMOTO T HIROKAWA T

Citation

M. Kitai et al., ASR AND TTS TELECOMMUNICATIONS APPLICATIONS IN JAPAN, Speech communication, 23(1-2), 1997, pp. 17-30

Citations number

Journal title

Speech communication → ACNP

ISSN journal

01676393

Volume

Issue

1-2

Year of publication

1997

Pages

17 - 30

Database

ISI

SICI code

0167-6393(1997)23:1-2<17:AATTAI>2.0.ZU;2-5

Abstract

This paper first describes recent trends of ASR and TTS telecommunicat ions applications in Japan. ASR applications focus on public services such as operator automation, operator assistance, voice-activated info rmation retrieval, and voice dialing. Major TTS applications include i nformation service by voice and e-mail reading. The usage of ASR and T TS functions is expected to dramatically increase in the near future w ith the penetration of handy and mobile telephone terminals; hot topic s are text broadcasting and digital communication. Secondly this paper describes NTT's experimental interactive system featuring (1) highly accurate speaker independent and large vocabulary speech recognition b ased on context-dependent accurate acoustic phoneme HMM models trained with speech data from more than 10,000 speakers collected over teleph one network, (2) high quality text-to-speech synthesis that generates speech by concatenating triphone-context-dependent waveform segments, (3) software-based configuration that requires no special hardware exc ept a PC equipped with a sound board and a voice modem, and (4) easy a nd rapid prototyping which enables the developer to build a system by writing some types of service scenarios. (C) 1997 Elsevier Science B.V .