Automated diagnosis without predictability is a recipe for failure

  • Authors:
  • Raja R. Sambasivan;Gregory R. Ganger

  • Affiliations:
  • Carnegie Mellon University;Carnegie Mellon University

  • Venue:
  • HotCloud'12 Proceedings of the 4th USENIX conference on Hot Topics in Cloud Ccomputing
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Automated management is critical to the success of cloud computing, given its scale and complexity. But, most systems do not satisfy one of the key properties required for automation: predictability, which in turn relies upon low variance. Most automation tools are not effective when variance is consistently high. Using automated performance diagnosis as a concrete example, this position paper argues that for automation to become a reality, system builders must treat variance as an important metric and make conscious decisions about where to reduce it. To help with this task, we describe a framework for reasoning about sources of variance in distributed systems and describe an example tool for helping identify them.