A novel force matrix transformation with optimal load-balance for 3-body potential based parallel molecular dynamics using atom-decomposition in a heterogeneous cluster environment

Authors:
J. V. Sumanth; David Swanson; Hong Jiang
Affiliations:
Department of Computer Science and Engineering, University of Nebraska-Lincoln, Lincoln, NE (all authors)
Venue:
HiPC'07 Proceedings of the 14th international conference on High performance computing
Year:
2007

Abstract

Evaluating the force matrix is the most computationally intensive part of a Molecular Dynamics (MD) simulation. In three-body MD simulations, the total energy of the system is determined by the energy of every unique triple of atoms, and the force matrix is three-dimensional. The execution time of a three-body MD algorithm is thus proportional to the cube of the number of atoms in the system. Fortunately, the force matrix contains symmetries that can be exploited to reduce the running time of the algorithm. While this optimization is straightforward to implement in sequential code, it has proven nontrivial for parallel code, even in a homogeneous environment. In this paper, we present a force matrix transformation that exploits the symmetries in the force matrix in both homogeneous and heterogeneous environments while balancing the load among all participating processors. The proposed transformation distributes the interactions to be computed uniformly among the slices of the force matrix along any of its axes. The transformed matrix can be scheduled using any well-known heterogeneous slice-level scheduling technique. We also derive theoretical bounds on efficiency and load balance for prior work in the literature. We then prove several useful properties of our transformation and evaluate its advantages and disadvantages. A loop reordering optimization for the symmetric transformation is described. The performance of an MPI implementation of the transformation is studied in terms of the Step Time Variation Ratio (STVR) in both homogeneous and heterogeneous environments.
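The load-balance problem described in the abstract can be illustrated with a minimal sketch. It is not the transformation proposed in the paper. It only assumes the common sequential convention of visiting each unique triple once as i < j < k, and then counts how many triples fall into each slice along the first axis. The names `unique_triples` and `slice_loads` are hypothetical helpers introduced for this illustration.

```python
from itertools import combinations
from math import comb


def unique_triples(n):
    """Yield each unordered triple (i, j, k) with i < j < k exactly once."""
    return combinations(range(n), 3)


def slice_loads(n):
    """Count the unique triples in each slice of the first axis.

    Slice i holds every triple whose smallest index is i. This is an
    assumed, naive assignment used only to show the imbalance; it is
    not the scheme from the paper.
    """
    loads = [0] * n
    for i, _, _ in unique_triples(n):
        loads[i] += 1
    return loads


if __name__ == "__main__":
    n = 8
    loads = slice_loads(n)
    # Symmetry reduces the work from n**3 entries to C(n, 3) unique triples.
    print(sum(loads), "unique triples versus", n ** 3, "matrix entries")
    # The per-slice counts shrink quickly, so equal-sized slices are unbalanced.
    print(loads)
    assert sum(loads) == comb(n, 3)
```

With this naive assignment the first slice carries far more triples than the last ones, which is why handing one slice to each processor wastes the symmetry savings in parallel code. A transformation that equalizes the per-slice counts, as the abstract describes, removes that imbalance.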