Gv. Gkoutos et al., A robot-based resource discovery tool for adding chemical meta-information and value to web-based documents, NEW J CHEM, 25(4), 2001, pp. 635-638
We report a set of tools, to be used in conjunction with a robot-based
Internet indexing engine, that convert non-conforming HTML collections to
well-formed and valid XHTML documents. The tools can, inter alia, correct
invalid syntax that can occur in embedded RasMol scripts and extract
chemical meta-information from normally inaccessible document components,
including transcluded chemical files. The index built from the transformed
documents can be used to improve the quality of searches carried out in a
chemical context.
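
As a rough illustration of the extraction step described above, the following minimal sketch collects references to embedded or linked chemical files from a possibly non-conforming HTML page. It is not the authors' tool: the names `ChemicalLinkCollector`, `extract_chemical_links`, and `CHEMICAL_EXTENSIONS`, the choice of file extensions, and the reliance on a `chemical/` MIME type prefix are all assumptions made for this example. It uses only Python's standard `html.parser`, which tolerates unclosed tags.

```python
from html.parser import HTMLParser

# Assumed set of chemical file extensions; not taken from the paper.
CHEMICAL_EXTENSIONS = (".pdb", ".mol", ".cml", ".xyz")


class ChemicalLinkCollector(HTMLParser):
    """Hypothetical helper: collects references to chemical files in HTML."""

    def __init__(self):
        super().__init__()
        self.found = []  # list of (tag, url, mime_type) tuples

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        # Different tags carry the file reference in different attributes.
        url = (attributes.get("src")
               or attributes.get("data")
               or attributes.get("href"))
        mime_type = attributes.get("type") or ""
        if url is None:
            return
        # Treat a reference as chemical if either its declared MIME type
        # or its file extension suggests a chemical file (an assumption).
        is_chemical = (
            mime_type.lower().startswith("chemical/")
            or url.lower().endswith(CHEMICAL_EXTENSIONS)
        )
        if is_chemical:
            self.found.append((tag, url, mime_type))


def extract_chemical_links(html_text):
    """Return (tag, url, mime_type) tuples for chemical file references."""
    collector = ChemicalLinkCollector()
    collector.feed(html_text)
    collector.close()
    return collector.found
```

A robot could run a function like `extract_chemical_links` on each fetched page and pass the resulting URLs to a later step that retrieves the files and derives index terms from them; that later step is outside this sketch.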