Performance debugging in data centers: doing more with less

  • Authors:
  • Emmanuel Cecchet;Maitreya Natu;Vaishali Sadaphal;Prashant Shenoy;Harrick Vin

  • Affiliations:
  • Computer Science Department, University of Massachusetts at Amherst;Tata Research Development and Design Centre, Tata Consultancy Services, Pune, India;Tata Research Development and Design Centre, Tata Consultancy Services, Pune, India;Computer Science Department, University of Massachusetts at Amherst;Tata Research Development and Design Centre, Tata Consultancy Services, Pune, India

  • Venue:
  • COMSNETS'09 Proceedings of the First international conference on COMmunication Systems And NETworks
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

With the increasing scale and complexity of data centers, detecting and localizing performance faults in real-time has become both a pressing need and a challenge. While several approaches for performance debugging in data centers have been proposed, these techniques do not assume any constraints on the availability of operational data needed to detect and localize faults. We argue that collecting such operational data often requires significant instrumentation or intrusiveness, which is difficult to realize in production data centers. Such constraints complicate the deployment of existing techniques or limit their effectiveness in practice. In this paper, we argue that for performance debugging to become practical and effective in real-world systems, one needs to develop techniques that are "more effective" with "less instrumentation and intrusiveness". We raise several issues and challenges in realizing this vision and present some initial ideas on addressing these challenges.