R. Zhang et al., THE INTEGRATION OF SPEECH AND CAMERA CONTROL IN MESSAGE TRANSFER TV CONFERENCING, International journal of human-computer interaction, 10(4), 1998, pp. 327-341
In this study, we examined participants' integration of speech and cam
era control during message-transfer television conferencing. We observ
ed participants' free use of speech and camera-control modality when t
hey transmitted 3 types of objects-formed text (FT), mathematical expr
ession (ME), and electronic circuit (EC)-each ranging in size from 1 t
o 7 elements. Modality use was classified into main-modality use and a
uxiliary-modality use. Five integration models were proposed based on
main-auxiliary combinations. Results indicated that (a) speech was use
d more frequently for transmitting ME objects, and camera control was
used more often for FT and EC objects; (b) speech was gradually replac
ed by camera control as size increased; (c) when camera control was ch
osen as main modality, speech was usually also used, as auxiliary moda
lity; and (d) compared to beginners, experienced participants used mor
e speech. A postsession questionnaire was conducted to collect partici
pants' assessment of the correctness, speed, and ease of use of speech
and camera control. Camera control had a higher score in total, but s
peech was thought faster for transmitting FT and ME objects.