Unstructured mesh partition improvement for implicit finite element at extreme scale

  • Authors:
  • Min Zhou;Onkar Sahni;Ting Xie;Mark S. Shephard;Kenneth E. Jansen

  • Affiliations:
  • Scientific Computational Research Center, Rensselaer Polytechnic Institute, Troy, USA 12180;Scientific Computational Research Center, Rensselaer Polytechnic Institute, Troy, USA 12180;Scientific Computational Research Center, Rensselaer Polytechnic Institute, Troy, USA 12180;Scientific Computational Research Center, Rensselaer Polytechnic Institute, Troy, USA 12180;Aerospace Engineering Sciences, University of Colorado at Boulder, Boulder, USA 80309

  • Venue:
  • The Journal of Supercomputing
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Parallel simulations at extreme scale require that the mesh is distributed across a large number of processors with equal work load and minimum inter-part communications. A number of algorithms have been developed to meet these goals and graph/hypergraph-based methods are by far the most powerful ones. However, the global implementation of current approaches can fail on very large core counts and the vertex imbalance is not optimal where individual cores are lightly loaded. Those issues are resolved by combination of global and local partitioning and an iterative improvement algorithm, LIIPBMod, developed in the previous study (Zhou et al. in SIAM J. Sci. Comput. 32:3201---3227, 2010). In the current work, this combined partition strategy is applied to the simulations at extreme scale with up to O(1010) elements and up to O(300K) cores. Strong scaling studies on IBM BlueGene/P and Cray XT5 systems demonstrate the effectiveness of this combined partition algorithm.