Evaluating the performance and scalability of mapreduce applications on X10

  • Authors:
  • Chao Zhang;Chenning Xie;Zhiwei Xiao;Haibo Chen

  • Affiliations:
  • Parallel Processing Institute, Fudan University;Parallel Processing Institute, Fudan University;Parallel Processing Institute, Fudan University;Parallel Processing Institute, Fudan University

  • Venue:
  • APPT'11 Proceedings of the 9th international conference on Advanced parallel processing technologies
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

MapReduce has been shown to be a simple and efficient way to harness the massive resources of clusters. Recently, researchers propose using partitioned global address space (PGAS) based language and runtime to ease the programming of large-scale clusters. In this paper, we present an empirical study on the effectiveness of running MapReduce applications on a typical PGAS language runtime called X10. By tuning the performance of two applications on X10 platforms, we successfully eliminate several performance bottlenecks related to I/O processing. We also identify several remaining problems and propose several approaches to remedying them. Our final performance evaluation on a small-scale multicore cluster shows that the MapReduce applications written with X10 notably outperform those in Hadoop in most cases. Detailed analysis reveals that the major performance advantages come from a simplified task management and data storage scheme.