A comparative study of pairwise regression techniques for problem determination

  • Authors:
  • Mohammad A. Munawar;Paul A. S. Ward

  • Affiliations:
  • University of Waterloo, Ontario, Canada;University of Waterloo, Ontario, Canada

  • Venue:
  • CASCON '07 Proceedings of the 2007 conference of the center for advanced studies on Collaborative research
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

Many runtime metrics can be collected from modern software systems. Stable statistical relationships exist among these metrics. Deviation from these stable relationships indicates potential problems, allowing diagnosis of failures. There exist many modeling techniques to represent these relationships. However, which one to use is a question that has yet to be studied. In this paper we compare the use of simple linear regression (SLR) to some of its more complex variants, including autoregressive regression and locally weighted regression. We consider the component coverage, model robustness, accuracy of diagnosis, and computation cost. Our study finds that while more flexible models can improve diagnosis accuracy, they achieve it at the cost of reduced robust-ness. In particular, we found the autoregressive regression model with exogenous input (ARX) to provide the most accurate diagnosis; however, it is the least robust of the techniques considered and the second most expensive. This study also finds that smoothing and other data transformations can noticeably improve results of SLR, thus providing an efficient alternative to ARX.