Probability-based Chinese text processing and retrieval

Citation
Xj. Huang et al., Probability-based Chinese text processing and retrieval, COMPUT INTE, 16(4), 2000, pp. 552-569
Number of citations
15
Subject categories
AI Robotics and Automatic Control
Journal title
COMPUTATIONAL INTELLIGENCE
Journal ISSN
0824-7935
Volume
16
Issue
4
Year of publication
2000
Pages
552 - 569
Database
ISI
SICI code
0824-7935(200011)16:4<552:PCTPAR>2.0.ZU;2-J
Abstract
We discuss the use of probability-based natural language processing for Chinese text retrieval. We focus on comparing different text extraction methods and probabilistic weighting methods. Several document processing methods and probabilistic weighting functions are presented. A number of experiments have been conducted on large standard text collections. We present the experimental results that compare a word-based text processing method with a character-based method. The experimental results also compare a number of term-weighting functions including both single-unit weighting and compound-unit weighting functions.