ITA
ENG

Optimal structure for automatic processing of DNA sequences

Authors

Davies, SW Eizenman, M Pasupathy, S

Citation

Sw. Davies et al., Optimal structure for automatic processing of DNA sequences, IEEE BIOMED, 46(9), 1999, pp. 1044-1056

Citations number

Categorie Soggetti

Multidisciplinary,"Instrumentation & Measurement

Journal title

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING

ISSN journal

00189294 → ACNP

Volume

Issue

Year of publication

1999

Pages

1044 - 1056

Database

ISI

SICI code

0018-9294(199909)46:9<1044:OSFAPO>2.0.ZU;2-B

Abstract

The faithful recovery of the base sequence in automatic DeoxyriboNucleic Ac id (DNA) sequencing fundamentally depends on the underlying statistics of t he DNA electrophoresis time series, Current DNA sequencing algorithms are h euristic in, nature and modest in their use of statistical information, In this paper, a Formal statistical model of the DNA time series is presented and then used to construct the optimal maximum-likelihood (MZ) processor. The DNA-ML algorithm that is derived in this paper features Kalman predicti on of peak locations, peak parameter estimation, whitened waveform comparis on and multiple hypothesis processing using the M-algorithm, Properties of the algorithm are examined using both simulated and real data. Model parame ters of critical importance and their impact on different types of error me chanisms, such as insertions and deletions, are pointed out, The statistica l model of the DNA time-series and the structure of the DNA-ML algorithm pr ovides a basis for future investigation and refinement of DNA sequencing te chniques.