Insertion and deletion (indel) sequencing errors in DNA coding regions
disrupt DNA-to-protein translation frames, and hence make most frame-
sensitive coding recognition approaches fail, This paper extends the a
uthors' previous work on indel detection and ''correction'' algorithms
, and presents a more effective algorithm for localizing indels that a
ppear in DNA coding regions and ''correcting'' the located indels by i
nserting or deleting DNA bases. The algorithm localizes indels by disc
overing changes of the preferred translation frames within presumed co
ding regions, and then ''corrects'' them to restore a consistent trans
lation frame within each coding region, An iterative strategy is explo
ited to repeatedly localize and ''correct'' indels until no more indel
s can be found, Test results have shown that this improved algorithm c
an detect and ''correct'' more indels while not worsening the rate of
introduction of false indels when compared to the authors' previous wo
rk.