Distributional clustering of words for text classification
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Diagnosing Rediscovered Software Problems Using Symptoms
IEEE Transactions on Software Engineering
Finding failures by cluster analysis of execution profiles
ICSE '01 Proceedings of the 23rd International Conference on Software Engineering
Software Engineer's Reference Book
Software Engineer's Reference Book
Software Dependability in the Tandem GUARDIAN System
IEEE Transactions on Software Engineering
Pinpoint: Problem Determination in Large, Dynamic Internet Services
DSN '02 Proceedings of the 2002 International Conference on Dependable Systems and Networks
Automated known problem diagnosis with event traces
Proceedings of the 1st ACM SIGOPS/EuroSys European Conference on Computer Systems 2006
PDA: a tool for automated problem determination
LISA'07 Proceedings of the 21st conference on Large Installation System Administration Conference
Automated Identification of Failure Causes in System Logs
ISSRE '08 Proceedings of the 2008 19th International Symposium on Software Reliability Engineering
Automatic software fault diagnosis by exploiting application signatures
LISA'08 Proceedings of the 22nd conference on Large installation system administration conference
DRACA: decision support for root cause analysis and change impact analysis for CMDBs
CASCON '09 Proceedings of the 2009 Conference of the Center for Advanced Studies on Collaborative Research
Diagnosis of recurrent faults using log files
CASCON '09 Proceedings of the 2009 Conference of the Center for Advanced Studies on Collaborative Research
Finding similar failures using callstack similarity
SysML'08 Proceedings of the Third conference on Tackling computer systems problems with machine learning techniques
Cost drivers of software corrective maintenance: An empirical study in two companies
ICSM '10 Proceedings of the 2010 IEEE International Conference on Software Maintenance
Symptom-based problem determination using log data abstraction
Proceedings of the 2010 Conference of the Center for Advanced Studies on Collaborative Research
Capturing Expert Knowledge for Automated Configuration Fault Diagnosis
ICPC '11 Proceedings of the 2011 IEEE 19th International Conference on Program Comprehension
Analysis of a service degradation model with preventive rejuvenation
ISAS'06 Proceedings of the Third international conference on Service Availability
Hi-index | 0.00 |
With increasing software size and complexity, corrective software maintenance has become a challenging process. When a failure is reported, it takes time and expertise for human operators to collect the right information and pinpoint the root cause. Typically, the operators are overloaded with information generated from many system components, and need assistance. In practice, however, failures are often recurrent. If they can be identified accurately, the appropriate fix may already be known from prior collected experience about the system. Our approach to diagnose failures is to look at differences in the state of the filesystem and how files are accessed under normal and abnormal situations. In this research, we monitor the behavior of the system through its file-related calls on an instrumented filesystem. When a failure occurs, these calls are abstracted and classified to identify the likely cause. A diagnostic tool is implemented based on this approach. Through an experiment involving one J2EE Web application, we present the effectiveness of our approach in terms of precision and recall.