Computer simulations for organic light-emitting diodes
IBM Journal of Research and Development
An Evaluation of Marenostrum Performance
International Journal of High Performance Computing Applications
A Microbenchmark Suite for Mixed-Mode OpenMP/MPI
IWOMP '09 Proceedings of the 5th International Workshop on OpenMP: Evolving OpenMP in an Age of Extreme Parallelism
International Journal of High Performance Computing Applications
Parallel discrete element simulation of poly-dispersed granular material
Advances in Engineering Software
Early experience with scientific applications on the blue gene/l supercomputer
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Hybrid parallelization of a large-scale heart model
Facing the Multicore-Challenge II
Journal of Parallel and Distributed Computing
Hi-index | 0.00 |
We show teraflop performance of the fully featured ab initio molecular dynamics code CPMD on an IBM pSeries 690 cluster. A mixed distributed-memory, coarse-grained parallel approach using the MPI library and shared-memory, fine-grained parallelism using OpenMP directives is used to optimally map the algorithms on the available hardware. The top performance achieved is ~20% of the peak performance and an estimated parallel efficiency of ~45% on 1024 processors for a system of 1000 atoms. The main limiting factor of parallel efficiency was found to be the latency of the interconnect.