Resource-aware adaptive scheduling for MapReduce clusters

  • Authors:
  • Jordà Polo;Claris Castillo;David Carrera;Yolanda Becerra;Ian Whalley;Malgorzata Steinder;Jordi Torres;Eduard Ayguadé

  • Affiliations:
  • Technical University of Catalonia (UPC);IBM T. J. Watson Research Center;Technical University of Catalonia (UPC);Technical University of Catalonia (UPC);IBM T. J. Watson Research Center;IBM T. J. Watson Research Center;Technical University of Catalonia (UPC);Technical University of Catalonia (UPC)

  • Venue:
  • Proceedings of the 12th International Middleware Conference
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a resource-aware scheduling technique for MapReduce multi-job workloads that aims at improving resource utilization across machines while observing completion time goals. Existing MapReduce schedulers define a static number of slots to represent the capacity of a cluster, creating a fixed number of execution slots per machine. This abstraction works for homogeneous workloads, but fails to capture the different resource requirements of individual jobs in multi-user environments. Our technique leverages job profiling information to dynamically adjust the number of slots on each machine, as well as workload placement across them, to maximize the resource utilization of the cluster. In addition, our technique is guided by user-provided completion time goals for each job. Source code of our prototype is available at [1].