JChemTidy: A tool for converting chemical Web document collections to an XHTML representation

Citation
Gv. Gkoutos et al., JChemTidy: A tool for converting chemical Web document collections to an XHTML representation, J CHEM INF, 41(2), 2001, pp. 253-258
Number of citations
13
Subject categories
Chemistry
Journal title
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES
ISSN journal
0095-2338
Volume
41
Issue
2
Year of publication
2001
Pages
253 - 258
Database
ISI
SICI code
0095-2338(200103/04)41:2<253:JATFCC>2.0.ZU;2-T
Abstract
A robot-based procedure is described for traversing a collection of hyperlinked documents written in HTML and converting these to the XML-compliant and well-formed XHTML representation. Transcluded chemical content invoked using <embed> or <applet> HTML calls are converted to the XHTML recommended <object> form. Additional attributes such as title or derived chemical attributes such as a SMILES descriptor are added to improve the indexing of the resulting document collection. Conformance tests for the popular Web browsers are reported.
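
The <embed> to <object> step described in the abstract can be illustrated with a minimal Python sketch. It is not the JChemTidy implementation, which this record does not show. The class name EmbedToObject, the function convert, the set of attributes carried over, and the file name caffeine.mol are all assumptions made for illustration. The sketch handles only <embed> (not <applet>), does no link traversal, and does not derive chemical attributes such as SMILES.

```python
from html.parser import HTMLParser
from xml.sax.saxutils import escape, quoteattr

# Elements that have no closing tag in HTML; XHTML writes them as <br />.
VOID_TAGS = {"br", "hr", "img", "meta", "link", "input"}


class EmbedToObject(HTMLParser):
    """Hypothetical helper: re-emit HTML with lowercase tags and quoted
    attributes, rewriting each <embed> as an <object> element."""

    def __init__(self, title=None):
        super().__init__(convert_charrefs=True)
        self.title = title  # optional title attribute to add (assumption)
        self.out = []

    @staticmethod
    def _attrs(attrs):
        # XHTML needs every attribute quoted and given a value;
        # a bare attribute such as "hidden" becomes hidden="hidden".
        return "".join(
            f" {name}={quoteattr(value if value is not None else name)}"
            for name, value in attrs
        )

    def handle_starttag(self, tag, attrs):
        if tag == "embed":
            old = dict(attrs)
            # src on <embed> corresponds to data on <object>.
            new = [("data", old.get("src") or "")]
            for name in ("type", "width", "height"):
                if old.get(name) is not None:
                    new.append((name, old[name]))
            if self.title:
                new.append(("title", self.title))
            self.out.append(f"<object{self._attrs(new)}></object>")
        elif tag in VOID_TAGS:
            self.out.append(f"<{tag}{self._attrs(attrs)} />")
        else:
            self.out.append(f"<{tag}{self._attrs(attrs)}>")

    def handle_endtag(self, tag):
        # <embed> and void elements were already closed when opened.
        if tag == "embed" or tag in VOID_TAGS:
            return
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        # Re-escape &, < and > so the text stays well-formed XML.
        self.out.append(escape(data))


def convert(html, title=None):
    """Return the fragment with <embed> rewritten as <object>."""
    parser = EmbedToObject(title=title)
    parser.feed(html)
    parser.close()  # flush any buffered text
    return "".join(parser.out)


if __name__ == "__main__":
    src = '<P>Caffeine: <EMBED SRC="caffeine.mol" width=200 height=200><BR></P>'
    print(convert(src, title="caffeine"))
```

The sketch does not repair unbalanced tags, so its output is well-formed only if the input elements are already properly nested. Script and style content, comments, and doctype declarations are also not handled.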