ITA
ENG

Probability-based Chinese text processing and retrieval

Authors

Huang, XJ Robertson, S Cercone, N An, AJ

Citation

Xj. Huang et al., Probability-based Chinese text processing and retrieval, COMPUT INTE, 16(4), 2000, pp. 552-569

Citations number

Categorie Soggetti

AI Robotics and Automatic Control

Journal title

COMPUTATIONAL INTELLIGENCE

ISSN journal

08247935 → ACNP

Volume

Issue

Year of publication

2000

Pages

552 - 569

Database

ISI

SICI code

0824-7935(200011)16:4<552:PCTPAR>2.0.ZU;2-J

Abstract

We discuss the use of probability-based natural language processing for Chi nese text retrieval. We focus on comparing different text extraction method s and probabilistic weighting methods. Several document processing methods and probabilistic weighting functions are presented. A number of experiment s have been conducted on large standard text collections. We present the ex perimental results that compare a word-based text processing method with a character-based method. The experimental results also compare a number of t erm-weighting functions including both single-unit weighting and compound-u nit weighting functions.