C. Landes et Jl. Risler, FAST DATA-BANK SEARCHING WITH A REDUCED AMINO-ACID ALPHABET, Computer applications in the biosciences, 10(4), 1994, pp. 453-454
Fast sequence databanks search algorithms generally make use of hash t
ables and look for exactly matching words. An increased sensitivity-at
the expense of a decreased selectivity-can be attained in the case of
proteins by using a reduced amino acid alphabet. We propose here an a
lphabet reduced to 10 symbols, that we used in modified versions of th
e FASTP and SCAN programs. An application to the aminoacyl-tRNA synthe
tases shows that this technique may be useful in detecting distant rel
ationships between proteins.