High-performance XML storage/retrieval system

Citation
Y. Yamane et al., High-performance XML storage/retrieval system, FUJITSU SCI, 36(2), 2000, pp. 185-192
Citations number
8
Categorie Soggetti
Eletrical & Eletronics Engineeing
Journal title
FUJITSU SCIENTIFIC & TECHNICAL JOURNAL
ISSN journal
00162523 → ACNP
Volume
36
Issue
2
Year of publication
2000
Pages
185 - 192
Database
ISI
SICI code
0016-2523(2000)36:2<185:HXSS>2.0.ZU;2-8
Abstract
This paper describes a system that integrates full-text searching and datab ase technologies for storing XML (eXtensible Markup Language) documents and retrieving information from them while providing a uniform interface. Our main goal with this system is to achieve high-performance, because there wi ll be a large amount of XML documents in the near future if XML becomes a s tandard for structured documents and data exchange. We have therefore devel oped techniques for achieving highperformance storage and retrieval of XML documents. For full-text searches, we improved the Structure Index + Text I ndex model, which references both indexes alternately at retrieval. In our improved method, a hieriarchical structure query is converted into a flat s tructure query by referencing just the structure index, then the optimized query can be quickly processed using only the text index. For storage, we d eveloped an offset space, which is an address space in secondary memory tha t can compactly store any structure, for example, a tree. We use the offset space to solve the problem that occurs in other methods which store the an alyzed result of XML documents as multiple relations in an nos. In our meth od, the analyzed result can be stored in a single page in the best case. Th is makes it superior to other methods which store the analysis results in m ultiple relations so that storage of N relations needs at least N pages. As a result, generally, our method greatly reduces I/O costs.