Gv. Gkoutos et al., JChemTidy: A tool for converting chemical Web document collections to an XHTML representation, J CHEM INF, 41(2), 2001, pp. 253-258
Citations number
13
Categorie Soggetti
Chemistry
Journal title
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES
A robot-based procedure is described for traversing a collection of hyperli
nked documents written in HTML and converting these to the XML-compliant an
d well-formed XHTML representation. Transcluded chemical content invoked us
ing <embed> or <applet> HTML calls are converted to the XHTML recommended <
object> form. Additional attributes such as title or derived chemical attri
butes such as a SMILES descriptor are added to improve the indexing of the
resulting document collection. Conformance tests for the popular Web browse
rs are reported.