To 4,000 compute nodes and beyond: network-aware vertex placement in large-scale graph processing systems

Authors:
Karim Awara;Hani Jamjoom;Panos Kanlis
Affiliations:
King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia;IBM T. J. Watson Research Center, Yorktown Heights, NY, USA;King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
Venue:
Proceedings of the ACM SIGCOMM 2013 conference on SIGCOMM
Year:
2013

Citing 4
Cited 0

Pregel: a system for large-scale graph processing

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Generic topology mapping strategies for large-scale parallel architectures

Proceedings of the international conference on Supercomputing
PowerGraph: distributed graph-parallel computation on natural graphs

OSDI'12 Proceedings of the 10th USENIX conference on Operating Systems Design and Implementation
Mizan: a system for dynamic load balancing in large-scale graph processing

Proceedings of the 8th ACM European Conference on Computer Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

The explosive growth of "big data" is giving rise to a new breed of large scale graph systems, such as Pregel. This poster describes our ongoing work in characterizing and minimizing the communication cost of Bulk Synchronous Parallel (BSP) graph mining systems, like Pregel, when scaling to 4,096 compute nodes. Existing implementations generally assume a fixed communication cost. This is sufficient in small deployments as the BSP programming model (i.e., overlapping computation and communication) masks small variations in the underlying network. In large scale deployments, such variations can dominate the overall runtime characteristics. In this poster, we first quantify the impact of network communication on the total compute time of a Pregel system. We then propose an efficient vertex placement strategy that subsamples highly connected vertices and applies the Reverse Cuthill-McKee (RCM) algorithm to efficiently partition the input graph and place partitions closer to each other based on their expected communication patterns. We finally describe a vertex replication strategy to further reduce communication overhead.