LEARNING SYNTACTIC RULES AND TAGS WITH GENETIC ALGORITHMS FOR INFORMATION-RETRIEVAL AND FILTERING - AN EMPIRICAL-BASIS FOR GRAMMATICAL RULES

Authors
Citation
Rm. Losee, LEARNING SYNTACTIC RULES AND TAGS WITH GENETIC ALGORITHMS FOR INFORMATION-RETRIEVAL AND FILTERING - AN EMPIRICAL-BASIS FOR GRAMMATICAL RULES, Information processing & management, 32(2), 1996, pp. 185-197
Citations number
31
Categorie Soggetti
Information Science & Library Science","Information Science & Library Science","Computer Science Information Systems
ISSN journal
03064573
Volume
32
Issue
2
Year of publication
1996
Pages
185 - 197
Database
ISI
SICI code
0306-4573(1996)32:2<185:LSRATW>2.0.ZU;2-J
Abstract
The grammars of natural languages may be learned by using genetic algo rithms that reproduce and mutate grammatical rules and part-of-speech tags, improving the quality of later generations of grammatical compon ents. Syntactic rules are randomly generated and then evolve; those ru les resulting in improved parsing and occasionally improved retrieval and filtering performance are allowed to further propagate. The LUST s ystem learns the characteristics of the language or sublanguage used i n document abstracts by learning from the document rankings obtained f rom the parsed abstracts. Unlike the application of traditional lingui stic rules to retrieval and filtering applications, LUST develops gram matical structures and tags without the prior imposition of some commo n grammatical assumptions (e.g. part-of-speech assumptions), producing grammars that are empirically based and are optimized for this partic ular application.