Performance Issue Diagnosis for Online Service Systems

  • Authors:
  • Qiang Fu;Jian-Guang Lou;Qing-Wei Lin;Rui Ding;Dongmei Zhang;Zihao Ye;Tao Xie

  • Affiliations:
  • -;-;-;-;-;-;-

  • Venue:
  • SRDS '12 Proceedings of the 2012 IEEE 31st Symposium on Reliable Distributed Systems
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Monitoring and diagnosing performance issues of an online service system are critical to assure satisfactory performance of the system. Given a detected performance issue and collected system metrics for an online service system, engineers usually need to make great efforts to conduct diagnosis by first identifying performance issue beacons, which are metrics that pinpoint to the root causes. In order to reduce the manual efforts, in this paper, we propose a new approach to effectively detecting performance issue beacons to help with performance issue diagnosis. Our approach includes techniques for mining system metric data to address limitations when applying previous classification-based approaches. Our evaluations on both a controlled environment and a real production environment show that our approach can more effectively identify performance issue beacons from system metric data than previous approaches.