Efficiently extracting operational profiles from execution logs using suffix arrays

  • Authors:
  • Meiyappan Nagappan;Kesheng Wu;Mladen A. Vouk

  • Affiliations:
  • North Carolina State University, Dept. of Computer Science, Raleigh, NC;Lawrence Berkeley National Laboratory, Computational Research Division, Berkeley, CA;North Carolina State University, Dept. of Computer Science, Raleigh, NC

  • Venue:
  • ISSRE'09 Proceedings of the 20th IEEE international conference on software reliability engineering
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

An important software reliability engineering tool is operational profiles. In this paper we propose a cost effective automated approach for creating second generation operational profiles using execution logs of a software product. Our algorithm parses the execution logs into sequences of events and produces an ordered list of all possible subsequences by constructing a suffix array of the events. The difficulty in using execution logs is that the amount of data that needs to be analyzed is often extremely large (more than a million records per day in many applications). Our approach is very efficient. We show that our approach requires O(N) in space and time to discover all possible patterns in N events. We discuss a practical implementation of the algorithm in the context of the logs from a large cloud computing system.