Natjam: design and evaluation of eviction policies for supporting priorities and deadlines in mapreduce clusters

  • Authors:
  • Brian Cho;Muntasir Rahman;Tej Chajed;Indranil Gupta;Cristina Abad;Nathan Roberts;Philbert Lin

  • Affiliations:
  • University of Illinois (Urbana-Champaign) and Samsung;University of Illinois (Urbana-Champaign);University of Illinois (Urbana-Champaign);University of Illinois (Urbana-Champaign);University of Illinois (Urbana-Champaign) and Yahoo! Inc.;Yahoo! Inc.;University of Illinois (Urbana-Champaign)

  • Venue:
  • Proceedings of the 4th annual Symposium on Cloud Computing
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents Natjam, a system that supports arbitrary job priorities, hard real-time scheduling, and efficient preemption for Mapreduce clusters that are resource-constrained. Our contributions include: i) exploration and evaluation of smart eviction policies for jobs and for tasks, based on resource usage, task runtime, and job deadlines; and ii) a work-conserving task preemption mechanism for Mapreduce. We incorporated Natjam into the Hadoop YARN scheduler framework (in Hadoop 0.23). We present experiments from deployments on a test cluster, Emulab and a Yahoo! Inc. commercial cluster, using both synthetic workloads as well as Hadoop cluster traces from Yahoo!. Our results reveal that Natjam incurs overheads as low as 7%, and is preferable to existing approaches.