Detecting the impact of sequencing errors on SAGE data

Citation
J. Colinge et G. Feger, Detecting the impact of sequencing errors on SAGE data, BIOINFORMAT, 17(9), 2001, pp. 840-842
Citations number
9
Categorie Soggetti
Multidisciplinary
Journal title
BIOINFORMATICS
ISSN journal
13674803 → ACNP
Volume
17
Issue
9
Year of publication
2001
Pages
840 - 842
Database
ISI
SICI code
1367-4803(200109)17:9<840:DTIOSE>2.0.ZU;2-7
Abstract
SAGE data are obtained by sequencing short DNA tags. Due to the mistakes in DNA sequencing, SAGE data contain errors. We propose a new approach to ide ntify tags whose abundance is biased by sequencing errors. This approach is based on a concept of neighbourhood: abundant tags can contaminate tags wh ose sequence is very close. The application of our approach reveals that mo derately abundant tags can be generated by sequencing errors uniquely. It a lso allows for detecting correct rare tags.