In this study, we aim at developing efficient computational tools to accelerate the sampling and interpretation of molecular dynamics simulations. To address the absence of structural information, we developed a machine learning based algorithm (FingerprintContacts) to quickly predict multiple protein structures by combining agglomerative clustering and co-evolutionary information. We have demonstrated the capabilities of FingerprintContacts on eight proteins with varying conformational motions. To enhance the sampling efficiency, we proposed that evolutionary couplings can be used as reaction coordinates to efficiently guide the sampling of complex conformational free energy landscapes. To interpret the resulting high-dimensional simulation data, we developed a genetic algorithm based method to automatically select features for dimensionality reduction. The integration of the developed algorithms and all-atom molecular dynamics simulations has allowed us to characterize long timescale conformational transitions and the complete substrate translocation cycle of two nitrogen transporters. This work would establish efficient computational frameworks for understanding long timescale biophysical processes.
See more of this Group/Topical: Computational Molecular Science and Engineering Forum