Jw. Grzymalabusse et S. Than, DATA-COMPRESSION IN MACHINE LEARNING APPLIED TO NATURAL-LANGUAGE, Behavior research methods, instruments, & computers, 25(2), 1993, pp. 318-321
In this paper, we investigate the possibility of applying machine lear
ning methods to data derived from the area of natural language and sho
w how rules, induced by machine learning, are changed after the origin
al data are compressed by grouping together entries, attributes, and a
ttribute values. Also shown is how excessive compression of input data
may affect the accuracy of induced rules.