Resource-aware adaptive scheduling for mapreduce clusters

  • Authors:
  • Jordà Polo;Claris Castillo;David Carrera;Yolanda Becerra;Ian Whalley;Malgorzata Steinder;Jordi Torres;Eduard Ayguadé

  • Affiliations:
  • Barcelona Supercomputing Center (BSC) and Technical University of Catalonia (UPC), Spain;IBM T.J. Watson Research Center;Barcelona Supercomputing Center (BSC) and Technical University of Catalonia (UPC), Spain;Barcelona Supercomputing Center (BSC) and Technical University of Catalonia (UPC), Spain;IBM T.J. Watson Research Center;IBM T.J. Watson Research Center;Barcelona Supercomputing Center (BSC) and Technical University of Catalonia (UPC), Spain;Barcelona Supercomputing Center (BSC) and Technical University of Catalonia (UPC), Spain

  • Venue:
  • Middleware'11 Proceedings of the 12th ACM/IFIP/USENIX international conference on Middleware
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a resource-aware scheduling technique for MapReduce multi-job workloads that aims at improving resource utilization across machines while observing completion time goals. Existing MapReduce schedulers define a static number of slots to represent the capacity of a cluster, creating a fixed number of execution slots per machine. This abstraction works for homogeneous workloads, but fails to capture the different resource requirements of individual jobs in multi-user environments. Our technique leverages job profiling information to dynamically adjust the number of slots on each machine, as well as workload placement across them, to maximize the resource utilization of the cluster. In addition, our technique is guided by user-provided completion time goals for each job. Source code of our prototype is available at [1].