Byte structure variable length coding (BS-VLC): A new specific algorithm applied in the compression of trajectories generated by molecular dynamics

Citation
A. Melo et al., Byte structure variable length coding (BS-VLC): A new specific algorithm applied in the compression of trajectories generated by molecular dynamics, J CHEM INF, 40(3), 2000, pp. 559-566
Number of citations
11
Subject Categories
Chemistry
Journal title
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES
ISSN journal
0095-2338
Volume
40
Issue
3
Year of publication
2000
Pages
559 - 566
Database
ISI
SICI code
0095-2338(200005/06)40:3<559:BSVLC(>2.0.ZU;2-N
Abstract
Molecular dynamics is a well-known technique widely used in the study of biomolecular systems. The trajectory files produced by molecular dynamics simulations are large, and the classical lossless algorithms compress them with poor efficiency. In this work, a new specific algorithm, named byte structure variable length coding (BS-VLC), is introduced. Trajectory files, obtained by molecular dynamics applied to trypsin and a trypsin:pancreatic trypsin inhibitor complex, were compressed using four classical lossless algorithms (Huffman, adaptive Huffman, LZW, and LZ77) as well as the BS-VLC algorithm. The results show that BS-VLC nearly triples the compression efficiency of the best classical lossless algorithm while preserving near-lossless behavior. Compression efficiencies close to 50% can be obtained with a high degree of precision, and the maximum efficiency possible within this algorithm (75%) can be achieved with good precision.