Practical performance portability in the Parallel Ocean Program (POP): Research Articles

  • Authors:
  • P. W. Jones;P. H. Worley;Y. Yoshida;J. B. White, III;J. Levesque

  • Affiliations:
  • Theoretical Division, Los Alamos National Laboratory, T-3, MS B216, Los Alamos, NM 87545-1663, U.S.A.;Computer Science and Mathematics Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831-6367, U.S.A.;Central Research Institute of Electric Power Industry, 1646 Abiko Abiko-shi Chiba, 270-1194, Japan;Computer Science and Mathematics Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831-6367, U.S.A.;Cray Inc., 411 First Avenue South, Suite 600, Seattle, WA 98104-2860, U.S.A.

  • Venue:
  • Concurrency and Computation: Practice & Experience - The High Performance Architectural Challenge: Mass Market versus Proprietary Components?
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

The design of the Parallel Ocean Program (POP) is described with an emphasis on portability. Performance of POP is presented on a wide variety of computational architectures, including vector architectures and commodity clusters. Analysis of POP performance across machines is used to characterize performance and identify improvements while maintaining portability. A new design of the POP model, including a cache blocking and land point elimination scheme, is described with some preliminary performance results. Published in 2005 by John Wiley & Sons, Ltd.This article is a U.S. Government work and is in the public domain in the U.S.A.