FLEX: a slot allocation scheduling optimizer for MapReduce workloads

  • Authors:
  • Joel Wolf;Deepak Rajan;Kirsten Hildrum;Rohit Khandekar;Vibhore Kumar;Sujay Parekh;Kun-Lung Wu;Andrey balmin

  • Affiliations:
  • IBM Watson Research Center, Hawthorne, NY;IBM Watson Research Center, Hawthorne, NY;IBM Watson Research Center, Hawthorne, NY;IBM Watson Research Center, Hawthorne, NY;IBM Watson Research Center, Hawthorne, NY;IBM Watson Research Center, Hawthorne, NY;IBM Watson Research Center, Hawthorne, NY;IBM Almaden Research Center, San Jose CA

  • Venue:
  • Proceedings of the ACM/IFIP/USENIX 11th International Conference on Middleware
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Originally, MapReduce implementations such as Hadoop employed First In First Out (fifo) scheduling, but such simple schemes cause job starvation. The Hadoop Fair Scheduler (hfs) is a slot-based MapReduce scheme designed to ensure a degree of fairness among the jobs, by guaranteeing each job at least some minimum number of allocated slots. Our prime contribution in this paper is a different, flexible scheduling allocation scheme, known as flex. Our goal is to optimize any of a variety of standard scheduling theory metrics (response time, stretch, makespan and Service Level Agreements (slas), among others) while ensuring the same minimum job slot guarantees as in hfs, and maximum job slot guarantees as well. The flex allocation scheduler can be regarded as an add-on module that works synergistically with hfs. We describe the mathematical basis for flex, and compare it with fifo and hfs in a variety of experiments.