Execution Anomaly Detection in Distributed Systems through Unstructured Log Analysis

Authors:
Qiang Fu;Jian-Guang Lou;Yi Wang;Jiang Li
Affiliations:
-;-;-;-
Venue:
ICDM '09 Proceedings of the 2009 Ninth IEEE International Conference on Data Mining
Year:
2009

Citing 0
Cited 13

Mining dependency in distributed systems through unstructured logs analysis

ACM SIGOPS Operating Systems Review
Mining program workflow from interleaved traces

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Mining invariants from console logs for system problem detection

USENIXATC'10 Proceedings of the 2010 USENIX conference on USENIX annual technical conference
Symptom-based problem determination using log data abstraction

Proceedings of the 2010 Conference of the Center for Advanced Studies on Collaborative Research
A graphical representation for identifier structure in logs

SLAML'10 Proceedings of the 2010 workshop on Managing systems via log analysis and machine learning techniques
Experience mining Google's production console logs

SLAML'10 Proceedings of the 2010 workshop on Managing systems via log analysis and machine learning techniques
Creating operational profiles of software systems by transforming their log files to directed cyclic graphs

Proceedings of the 6th International Workshop on Traceability in Emerging Forms of Software Engineering
Event log mining tool for large scale HPC systems

Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part I
Software analytics as a learning case in practice: approaches and experiences

Proceedings of the International Workshop on Machine Learning Technologies in Software Engineering
Structured comparative analysis of systems logs to diagnose performance problems

NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
Spatio-temporal decomposition, clustering and identification for alert detection in system logs

Proceedings of the 27th Annual ACM Symposium on Applied Computing
Contextual analysis of program logs for understanding system behaviors

Proceedings of the 10th Working Conference on Mining Software Repositories
CAPRI: a tool for mining complex line patterns in large log data

Proceedings of the 2nd International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Detection of execution anomalies is very important for the maintenance, development, and performance refinement of large scale distributed systems. Execution anomalies include both work flow errors and low performance problems. People often use system logs produced by distributed systems for troubleshooting and problem diagnosis. However, manually inspecting system logs to detect anomalies is unfeasible due to the increasing scale and complexity of distributed systems. Therefore, there is a great demand for automatic anomalies detection techniques based on log analysis. In this paper, we propose an unstructured log analysis technique for anomalies detection. In the technique, we propose a novel algorithm to convert free form text messages in log files to log keys without heavily relying on application specific knowledge. The log keys correspond to the log-print statements in the source code which can provide cues of system execution behavior. After converting log messages to log keys, we learn a Finite State Automaton (FSA) from training log sequences to present the normal work flow for each system component. At the same time, a performance measurement model is learned to characterize the normal execution performance based on the log mes-sages’ timing information. With these learned models, we can automatically detect anomalies in newly input log files. Experiments on Hadoop and SILK show that the technique can effectively detect running anomalies.