PARASITE - MINING STRUCTURAL INFORMATION ON THE WEB

Authors
E. Spertus
Citation
E. Spertus, PARASITE - MINING STRUCTURAL INFORMATION ON THE WEB, Computer Networks and ISDN Systems, 29(8-13), 1997, pp. 1205-1215
Number of citations
23
Subject Categories
Computer Sciences; System Science; Telecommunications; Engineering, Electrical & Electronic; Computer Science Information Systems
ISSN journal
0169-7552
Volume
29
Issue
8-13
Year of publication
1997
Pages
1205 - 1215
Database
ISI
SICI code
0169-7552(1997)29:8-13<1205:P-MSIO>2.0.ZU;2-1
Abstract
Web information retrieval tools typically make use of only the text on pages, ignoring valuable information implicitly contained in links. At the other extreme, viewing the Web as a traditional hypertext system would also be a mistake, because heterogeneity, cross-domain links, and the dynamic nature of the Web mean that many assumptions of typical hypertext systems do not apply. The novelty of the Web leads to new problems in information access, and it is necessary to make use of the new kinds of information available, such as multiple independent categorization, naming, and indexing of pages. This paper discusses the varieties of link information (not just hyperlinks) on the Web, how the Web differs from conventional hypertext, and how the links can be exploited to build useful applications. Specific applications presented as part of the ParaSite system find individuals' homepages, new locations of moved pages, and unindexed information. (C) 1997 Published by Elsevier Science B.V.