Lightweight problem determination in DBMSs using data stream analysis techniques

  • Authors:
  • Jing Huang;Patrick Martin;Wendy Powley;Paul Bird;Dmitri Abrashkevich

  • Affiliations:
  • Queen's University, Kingston, ON;Queen's University, Kingston, ON;Queen's University, Kingston, ON;IBM, Toronto, ON;IBM, Toronto, ON

  • Venue:
  • Proceedings of the 2010 Conference of the Center for Advanced Studies on Collaborative Research
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Problem determination in a database management system can be a difficult task given the complexity of the system and the large amount of data that must be collected and analyzed. Monitoring the system for this data incurs overhead and has a detrimental effect on application performance. As an alternative to the standard practice of storing the performance data and performing offline analysis, we examine an approach where monitoring data is produced as a continuous data stream and data stream mining techniques are applied. We implement this approach as a prototype system called Tempo on IBM DB2®. Tempo implements Top-K analysis, which is a common task performed by database administrators for problem determination. Top-K analysis typically identifies the set of most frequently occurring events, or the highest consumers of system resources. Our experimental evaluation indicates that Tempo is time and space efficient, incurs low overhead, and produces accurate results.