APPLICATION OF PROBABILISTIC METHODS TO CHINESE TEXT RETRIEVAL

Citation
Xj. Huang and Se. Robertson, APPLICATION OF PROBABILISTIC METHODS TO CHINESE TEXT RETRIEVAL, Journal of Documentation, 53(1), 1997, pp. 74-79
Number of citations
6
Subject Categories
Information Science & Library Science; Computer Science, Information Systems
Journal title
Journal of Documentation
Journal ISSN
0022-0418
Volume
53
Issue
1
Year of publication
1997
Pages
74 - 79
Database
ISI
SICI code
0022-0418(1997)53:1<74:AOPMTC>2.0.ZU;2-P
Abstract
The use of text retrieval methods based on the probabilistic model with Chinese language material is discussed. Since Chinese text has no natural word boundaries, we must either apply a dictionary-based word segmentation method to the text, or index and search in terms of single Chinese characters. In either case, it becomes important to have a good way of dealing with phrases or contiguous strings of characters; the probabilistic model does not at present have such a facility. Some ad hoc modifications of the probabilistic weighting function and matching method are proposed for this purpose.
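
The two indexing options named in the abstract can be illustrated with a minimal Python sketch. The abstract does not say which segmentation algorithm the authors used, so the greedy forward maximum matching below is an assumption chosen only because it is a common dictionary-based method. The function names, the toy dictionary, and the `max_len` parameter are likewise hypothetical.

```python
def char_tokens(text):
    """Character-based indexing: every non-space character is its own term."""
    return [ch for ch in text if not ch.isspace()]


def forward_max_match(text, dictionary, max_len=4):
    """Dictionary-based segmentation by greedy longest match (an assumed
    method, not necessarily the one used in the paper)."""
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking until a dictionary
        # word matches; a single character is always accepted as a fallback.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                i += size
                break
    return tokens


# Toy word list and sample string, invented for illustration only.
toy_dictionary = {"中文", "文本", "检索", "信息"}
sample = "中文文本检索"

print(char_tokens(sample))
print(forward_max_match(sample, toy_dictionary))
```

With character tokens, a multi-character word such as 检索 can be matched only as two adjacent single-character terms, which is one way to see why the abstract stresses the handling of contiguous strings of characters.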