Revealing hidden interval graph structure in STS-content data

Citation
E. Harley et al., Revealing hidden interval graph structure in STS-content data, BIOINFORMAT, 15(4), 1999, pp. 278-285
Number of citations
30
Subject categories
Multidisciplinary
Journal title
BIOINFORMATICS
ISSN journal
1367-4803
Volume
15
Issue
4
Year of publication
1999
Pages
278 - 285
Database
ISI
SICI code
1367-4803(199904)15:4<278:RHIGSI>2.0.ZU;2-U
Abstract
Motivation: STS-content data for genomic mapping contain numerous errors and anomalies resulting in cross-links among distant regions of the genome. Identification of contigs within the data is an important and difficult problem. Results: This paper introduces a graph algorithm which creates a simplified view of STS-content data. The shape of the resulting structure graph provides a quality check - coherent data produce a straight line, while anomalous data produce branches and loops. In the latter case, it is sometimes possible to disentangle the various paths into subsets of the data covering contiguous regions of the genome, i.e. contigs. These straight subgraphs can then be analyzed in standard ways to construct a physical map. A theoretical basis for the method is presented along with examples of its application to current STS data from human genome centers.
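
The quality check described in the abstract (a straight line for coherent data, branches and loops for anomalous data) can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes a structure graph has already been built by some other means and is supplied as a list of undirected edges, and the function name `classify_shape` and its three labels are hypothetical names chosen for this illustration.

```python
from collections import defaultdict


def classify_shape(edges):
    """Classify a connected undirected graph as 'line', 'branched', or 'looped'.

    Hypothetical helper for illustration only; `edges` is an iterable of
    (node, node) pairs describing an already-built structure graph.
    """
    # Build an adjacency map, ignoring self-loops and duplicate edges.
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:
            adj[u].add(v)
            adj[v].add(u)
    if not adj:
        raise ValueError("graph has no edges")

    n_nodes = len(adj)
    n_edges = sum(len(nbrs) for nbrs in adj.values()) // 2

    # Depth-first search to confirm the graph is a single connected piece.
    start = next(iter(adj))
    seen = {start}
    stack = [start]
    while stack:
        node = stack.pop()
        for nbr in adj[node]:
            if nbr not in seen:
                seen.add(nbr)
                stack.append(nbr)
    if len(seen) != n_nodes:
        raise ValueError("graph is not connected; classify each component separately")

    # A connected graph with at least as many edges as nodes contains a cycle.
    if n_edges >= n_nodes:
        return "looped"
    # A tree with a node of degree above 2 has a branch point.
    if max(len(nbrs) for nbrs in adj.values()) > 2:
        return "branched"
    # Otherwise the graph is a simple path, i.e. a straight line.
    return "line"
```

Under these assumptions, a chain such as a-b-c-d is reported as a line, adding a side edge b-e makes it branched, and closing a cycle makes it looped. Only the line case corresponds to data that the abstract describes as coherent.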