System State Discovery Via Information Content Clustering of System Logs

  • Authors:
  • Adetokunbo Makanju;A. Nur Zincir-Heywood;Evangelos E. Milios

  • Affiliations:
  • -;-;-

  • Venue:
  • ARES '11 Proceedings of the 2011 Sixth International Conference on Availability, Reliability and Security
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Self-awareness is an important attribute for any system to have before it is capable of self-management. A system needs to have a continuous stream of real-time data to analyze to allow it be aware of its internal state. To this end, previous approaches have utilized system performance metrics and system log data to characterize system internal state. In using system logs to characterize system internal state, the computation of strongly correlated message types is necessary. In this work, we show that strongly correlated message types can be easily discovered without much computation. Our work explores a natural behaviour of system logs where system log data partitioned using source and time information contain correlated message types. We demonstrate how the groups of partitions, which contain correlated message types, can be found by clustering the partitions based on their entropy-based information content. We evaluate our method using cluster cohesion, cluster separation and cluster conceptual purity as metrics. The results show that our proposed method not only produces well-formed clusters but also clusters that can be mapped to different alert states with a high degree of confidence.