A strategy for analysis of (molecular) equilibrium simulations: Configuration space density estimation, clustering, and visualization

Citation
Fa. Hamprecht et al., A strategy for analysis of (molecular) equilibrium simulations: Configuration space density estimation, clustering, and visualization, J CHEM PHYS, 114(5), 2001, pp. 2079-2089
Citations number
57
Categorie Soggetti
Physical Chemistry/Chemical Physics
Journal title
JOURNAL OF CHEMICAL PHYSICS
ISSN journal
00219606 → ACNP
Volume
114
Issue
5
Year of publication
2001
Pages
2079 - 2089
Database
ISI
SICI code
0021-9606(20010201)114:5<2079:ASFAO(>2.0.ZU;2-Z
Abstract
We propose an approach for summarizing the output of long simulations of co mplex systems, affording a rapid overview and interpretation. First, multid imensional scaling techniques are used in conjunction with dimension reduct ion methods to obtain a low-dimensional representation of the configuration space explored by the system. A nonparametric estimate of the density of s tates in this subspace is then obtained using kernel methods. The free ener gy surface is calculated from that density, and the configurations produced in the simulation are then clustered according to the topography of that s urface, such that all configurations belonging to one local free energy min imum form one class. This topographical cluster analysis is performed using basin spanning trees which we introduce as subgraphs of Delaunay triangula tions. Free energy surfaces obtained in dimensions lower than four can be v isualized directly using iso-contours and -surfaces. Basin spanning trees a lso afford a glimpse of higher-dimensional topographies. The procedure is i llustrated using molecular dynamics simulations on the reversible folding o f peptide analoga. Finally, we emphasize the intimate relation of density e stimation techniques to modern enhanced sampling algorithms. (C) 2001 Ameri can Institute of Physics.