This paper presents some developments in query expansion and document repre
sentation of our spoken document retrieval system and shows how various ret
rieval techniques affect performance for different sets of transcriptions d
erived from a common speech source. Modifications of the document represent
ation are used, which combine several techniques for query expansion, knowl
edge-based on one hand and statistics-based on the other. Taken together, t
hese techniques can improve Average Precision by over 19% relative to a sys
tem similar to that which we presented at TREC-7. These new experiments hav
e also confirmed that the degradation of Average Precision due to a word er
ror rate (WER) of 25% is quite small (3.7% relative) and can be reduced to
almost zero (0.2% relative). The overall improvement of the retrieval syste
m can also be observed for seven different sets of transcriptions from diff
erent recognition engines with a WER ranging from 24.8% to 61.5%. We hope t
o repeat these experiments when larger document collections become availabl
e, in order to evaluate the scalability of these techniques. (C) 2000 Elsev
ier Science B.V. All rights reserved.