Mitigating the negative impact of preemption on heterogeneous MapReduce workloads

  • Authors:
  • Lu Cheng;Qi Zhang;Raouf Boutaba

  • Affiliations:
  • University of Waterloo, Ontario Canada;University of Waterloo, Ontario Canada;University of Waterloo, Ontario Canada, and Division of IT Convergence Engineering, POSTECH, Pohang, KB, Korea

  • Venue:
  • Proceedings of the 7th International Conference on Network and Services Management
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Modern production clusters are often shared by multiple types of jobs with different priorities in order to improve resource utilization. Preemption is a common technique employed by MapReduce schedulers to avoid delaying production jobs while allowing the cluster to be shared by other non-production jobs. In addition, it also prevents a large job from occupying too many resources and starving others. Recent literature shows that jobs in production MapReduce clusters have a mixture of lengths and sizes spanning many orders of magnitude. In this type of environments, the current preemption policy used by MapReduce schedulers can significantly delay the completion time of long running tasks, resulting in waste of resources. This paper firstly discusses the heterogeneous nature of MapReduce jobs and their arrival rates in several production clusters. Secondly, we characterize the situations where the current preemption policy causes significant preemption penalty. We then propose a simple mechanism that works in conjunction with existing job schedulers to address this problem. Finally, we evaluate our solution under various types of workloads in Amazon EC2. Experiments show our method can improve system normalized performance by 15% during busy periods by effectively avoiding unnecessary preemption while preserving fairness.