Mining invariants from console logs for system problem detection

  • Authors:
  • Jian-Guang Lou;Qiang Fu;Shengqi Yang;Ye Xu;Jiang Li

  • Affiliations:
  • Microsoft Research Asia, Beijing, P. R. China;Microsoft Research Asia, Beijing, P. R. China;Dept. of Computer Science, Beijing Univ. of Posts and Telecom;Dept. of Computer Science, Nanjing University, P.R. China;Microsoft Research Asia, Beijing, P. R. China

  • Venue:
  • USENIXATC'10 Proceedings of the 2010 USENIX conference on USENIX annual technical conference
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Detecting execution anomalies is very important to the maintenance and monitoring of large-scale distributed systems. People often use console logs that are produced by distributed systems for troubleshooting and problem diagnosis. However, manually inspecting console logs for the detection of anomalies is unfeasible due to the increasing scale and complexity of distributed systems. Therefore, there is great demand for automatic anomaly detection techniques based on log analysis. In this paper, we propose an unstructured log analysis technique for anomaly detection, with a novel algorithm to automatically discover program invariants in logs. At first, a log parser is used to convert the unstructured logs to structured logs. Then, the structured log messages are further grouped to log message groups according to the relationship among log parameters. After that, the program invariants are automatically mined from the log message groups. The mined invariants can reveal the inherent linear characteristics of program work flows. With these learned invariants, our technique can automatically detect anomalies in logs. Experiments on Hadoop show that the technique can effectively detect execution anomalies. Compared with the state of art, our approach can not only detect numerous real problems with high accuracy but also provide intuitive insight into the problems.