Event log mining tool for large scale HPC systems

  • Authors:
  • Ana Gainaru;Franck Cappello;Stefan Trausan-Matu;Bill Kramer

  • Affiliations:
  • University of Illinois at Urbana-Champaign, IL and University Politehnica of Bucharest, Romania;University of Illinois at Urbana-Champaign, IL and INRIA, France;University Politehnica of Bucharest, Romania;University of Illinois at Urbana-Champaign, IL

  • Venue:
  • Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part I
  • Year:
  • 2011

Quantified Score

Hi-index 0.01

Visualization

Abstract

Event log files are the most common source of information for the characterization of events in large scale systems. However the large size of these files makes the task of manual analysing log messages to be difficult and error prone. This is the reason why recent research has been focusing on creating algorithms for automatically analysing these log files. In this paper we present a novel methodology for extracting templates that describe event formats from large datasets presenting an intuitive and user-friendly output to system administrators. Our algorithm is able to keep up with the rapidly changing environments by adapting the clusters to the incoming stream of events. For testing our tool, we have chosen 5 log files that have different formats and that challenge different aspects in the clustering task. The experiments show that our tool outperforms all other algorithms in all tested scenarios achieving an average precision and recall of 0.9, increasing the correct number of groups by a factor of 1.5 and decreasing the number of false positives and negatives by an average factor of 4.