Fast PGAS connected components algorithms

  • Authors:
  • Guojing Cong;Gheorghe Almasi;Vijay Saraswat

  • Affiliations:
  • IBM Research, Yorktown Heights, NY;IBM Research, Yorktown Heights, NY;IBM Research, Hawthorne, NY

  • Venue:
  • Proceedings of the Third Conference on Partitioned Global Address Space Programing Models
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Irregular graph algorithms for distributed-memory systems are hard to implement and optimize. Recent developments in PGAS languages make the implementation of irregular algorithms easier. In this paper we present our study of PRAM-based parallel connected components algorithm implemented in UPC for distributed-memory systems, and discuss optimization techniques for such settings. Our optimized version achieved more than 100 times speedup over the straight-forward implementation. Remarkable speedups are also achieved over the best SMP implementation for the same input. As the memory access patterns of these algorithms are representative of those of many other PRAM algorithms, we expect our techniques applicable to optimizing a wide range of PRAM graph algorithms on distributed-memory machines.