A scalable parallel framework for analyzing terascale molecular dynamics simulation trajectories

  • Authors:
  • Tiankai Tu;Charles A. Rendleman;David W. Borhani;Ron O. Dror;Justin Gullingsrud;Morten Ø. Jensen;John L. Klepeis;Paul Maragakis;Patrick Miller;Kate A. Stafford;David E. Shaw

  • Affiliations:
  • D. E. Shaw Research, New York, NY;D. E. Shaw Research, New York, NY;D. E. Shaw Research, New York, NY;D. E. Shaw Research, New York, NY;D. E. Shaw Research, New York, NY;D. E. Shaw Research, New York, NY;D. E. Shaw Research, New York, NY;D. E. Shaw Research, New York, NY;D. E. Shaw Research, New York, NY;D. E. Shaw Research, New York, NY;D. E. Shaw Research, New York, NY

  • Venue:
  • Proceedings of the 2008 ACM/IEEE conference on Supercomputing
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

As parallel algorithms and architectures drive the longest molecular dynamics (MD) simulations towards the millisecond scale, traditional sequential post-simulation data analysis methods are becoming increasingly untenable. Inspired by the programming interface of Google's MapReduce, we have built a new parallel analysis framework called HiMach, which allows users to write trajectory analysis programs sequentially, and carries out the parallel execution of the programs automatically. We introduce (1) a new MD trajectory data analysis model that is amenable to parallel processing, (2) a new interface for defining trajectories to be analyzed, (3) a novel method to make use of an existing sequential analysis tool called VMD, and (4) an extension to the original MapReduce model to support multiple rounds of analysis. Performance evaluations on up to 512 cores demonstrate the efficiency and scalability of the HiMach framework on a Linux cluster.