Demonstrating the Scalability of a Molecular Dynamics Application on a Petaflops Computer

  • Authors:
  • George S. Almasi;Călin Caşcaval;José G. Castaños;Monty Denneau;Wilm Donath;Maria Eleftheriou;Mark Giampapa;Howard Ho;Derek Lieber;José E. Moreira;Dennis Newns;Marc Snir;Henry S. Warren, Jr.

  • Affiliations:
  • -;-;-;IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598-0218. denneau@us.ibm.com;IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598-0218;IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598-0218;IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598-0218;IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598-0218;IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598-0218;IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598-0218. jmoreira@us.ibm.com;IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598-0218;IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598-0218. snir@cs.uiuc.edu;IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598-0218

  • Venue:
  • International Journal of Parallel Programming
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

The IBM Blue Gene/C parallel computer aims to demonstrate the feasibility of a cellular architecture computer with millions of concurrent threads of execution. One of the major challenges in this project is showing that applications can successfully scale to this massive amount of parallelism. In this paper we demonstrate that the simulation of protein folding using classical molecular dynamics falls in this category. Starting from the sequential version of a well known molecular dynamics code, we developed a new parallel implementation that exploited the multiple levels of parallelism present in the Blue Gene/C cellular architecture. We performed both analytical and simulation studies of the behavior of this application when executed on a very large number of threads. As a result, we demonstrate that this class of applications can execute efficiently on a large cellular machine.