ITA
ENG

PARASITE - MINING STRUCTURAL INFORMATION ON THE WEB

Authors

SPERTUS E

Citation

E. Spertus, PARASITE - MINING STRUCTURAL INFORMATION ON THE WEB, Computer networks and ISDN systems, 29(8-13), 1997, pp. 1205-1215

Citations number

Categorie Soggetti

Computer Sciences","System Science",Telecommunications,"Engineering, Eletrical & Electronic","Computer Science Information Systems

Journal title

Computer networks and ISDN systems → ACNP

ISSN journal

01697552

Volume

Issue

8-13

Year of publication

1997

Pages

1205 - 1215

Database

ISI

SICI code

0169-7552(1997)29:8-13<1205:P-MSIO>2.0.ZU;2-1

Abstract

Web information retrieval tools typically make use of only the text on pages, ignoring valuable information implicitly contained in links. A t the ether extreme, viewing the Web as a traditional hypertext system would also be mistake, because heterogeneity, cross-domain links, and the dynamic nature of the Web mean that many assumptions of typical h ypertext systems do not apply. The novelty of the Web leads to new pro blems in information access, and it is necessary to make use of the ne w kinds of information available, such as multiple independent categor ization, naming, and indexing of pages. This paper discusses the varie ties of link information (not just hyperlinks) on the Web, how the Web differs from conventional hypertext, and how the links can be exploit ed to build useful applications. Specific applications presented as pa rt of the ParaSite system find individuals' homepages, new locations o f moved pages, and unindexed information. (C) 1997 Published by Elsevi er Science B.V.