A number of current technologies allow for the determination of interatomic
distance information in structures such as proteins and RNA. Thus, the rec
onstruction of a three-dimensional set of points using information about it
s interpoint distances has become a task of basic importance in determining
molecular structure. The distance measurements one obtains from techniques
such as NMR are typically sparse and error-prone, greatly complicating the
reconstruction task. Many of these errors result in distance measurements
that can be safely assumed to lie within certain fixed tolerances. But a nu
mber of sources of systematic error in these experiments lead to inaccuraci
es in the data that are very hard to quantify; in effect, one must treat ce
rtain entries of the measured distance matrix as being arbitrarily "corrupt
ed."
The existence of arbitrary errors leads to an interesting sort of error-cor
rection problem-how many corrupted entries in a distance matrix can be effi
ciently corrected to produce a consistent three-dimensional structure? For
the case of an n x n matrix in which every entry is specified, we provide a
randomized algorithm running in time O(n log n) that enumerates all struct
ures consistent with at most (1/2 - epsilon)n errors per row, with high pro
bability. In the case of randomly located errors, we can correct errors of
the same density in a sparse matrix-one in which only a beta fraction of th
e entries in each row are given, for any constant beta > 0.