The Yahoo!: cloud datastore load balancer

  • Authors:
  • Markus Klems;Adam Silberstein;Jianjun Chen;Masood Mortazavi;Sahaya Andrews Albert;P.P.S. Narayan;Adwait Tumbde;Brian Cooper

  • Affiliations:
  • Karlsruhe Institute of Technology, Karlsruhe, Germany;LinkedIn, Mountain View, CA, USA;Google, Moutain View, CA, USA;Yahoo! Inc., Sunnyvale, CA, USA;Yahoo! Inc., Sunnyvale, CA, USA;Yahoo! Inc., Sunnyvale, CA, USA;Instart Logic;Google

  • Venue:
  • Proceedings of the fourth international workshop on Cloud data management
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Sherpa is a large-scale distributed and globally replicated multi-tenant cloud data storage system. Sherpa scales by horizontally partitioning data into tablets and distributing these tablets across multiple servers. While Sherpa scales for increasing workload sizes by adding servers, it is vulnerable to load imbalance among tablets that cause hotspots to develop on just a few servers. In this paper we describe Yak, the Sherpa load balancer. Yak detects hotspots and then automatically balances load by migrating tablets from the overloaded servers, and also by splitting data into new tablets. We describe Yak's design principles, algorithms and architecture. We then evaluate Yak on workloads based on Sherpa production scenarios.