Preservation of digital data with self-validating, self-instantiating knowledge-based archives

Citation
B. Ludascher et al., Preservation of digital data with self-validating, self-instantiating knowledge-based archives, SIG RECORD, 30(3), 2001, pp. 54-63
Citations number
18
Categorie Soggetti
Computer Science & Engineering
Journal title
SIGMOD RECORD
ISSN journal
01635808 → ACNP
Volume
30
Issue
3
Year of publication
2001
Pages
54 - 63
Database
ISI
SICI code
0163-5808(200109)30:3<54:PODDWS>2.0.ZU;2-9
Abstract
Digital archives are dedicated to the long-term preservation of electronic information and have the mandate to enable sustained access despite rapid t echnology changes. Persistent archives are confronted with heterogeneous da ta formats, helper applications, and platforms being used over the lifetime of the archive. This is not unlike the interoperability challenges, for wh ich mediators are devised. To prevent technological obsolescence over time and across platforms, a migration approach for persistent at-chives is prop osed based on an XML infrastructure. We extend current archival approaches that build upon standardized data for mats and simple metadata mechanisms for collection management, by involving high-level conceptual models and knowledge representations as an integral part of the archive and the ingestion/migration processes. Infrastructure i ndependence is maximized by archiving generic, executable specifications of (i) archival constraints (i.e., "model validators"), and (ii) archival tra nsformations that are part of the ingestion process. The proposed architect ure facilitates construction of self-validating and self-instantiating know ledge-based archives. We illustrate our overall approach and report on firs t experiences using a sample collection from a collaboration with the Natio nal Archives and Records Administration (NARA).