Improved error reporting for software that uses black-box components

Authors:
Jungwoo Ha;Christopher J. Rossbach;Jason V. Davis;Indrajit Roy;Hany E. Ramadan;Donald E. Porter;David L. Chen;Emmett Witchel
Affiliations:
The University of Texas at Austin, Austin, TX;The University of Texas at Austin, Austin, TX;The University of Texas at Austin, Austin, TX;The University of Texas at Austin, Austin, TX;The University of Texas at Austin, Austin, TX;The University of Texas at Austin, Austin, TX;The University of Texas at Austin, Austin, TX;The University of Texas at Austin, Austin, TX
Venue:
Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation
Year:
2007

Citing 27
Cited 27

C4.5: programs for machine learning

C4.5: programs for machine learning
Efficient path profiling

Proceedings of the 29th annual ACM/IEEE international symposium on Microarchitecture
Experiments of the effectiveness of dataflow- and controlflow-based test adequacy criteria

ICSE '94 Proceedings of the 16th international conference on Software engineering
Exploiting hardware performance counters with flow and context sensitive profiling

Proceedings of the ACM SIGPLAN 1997 conference on Programming language design and implementation
The use of program profiling for software maintenance with applications to the year 2000 problem

ESEC '97/FSE-5 Proceedings of the 6th European SOFTWARE ENGINEERING conference held jointly with the 5th ACM SIGSOFT international symposium on Foundations of software engineering
Fine-grained dynamic instrumentation of commodity operating system kernels

OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
Data mining: practical machine learning tools and techniques with Java implementations

Data mining: practical machine learning tools and techniques with Java implementations
Dynamo: a transparent dynamic optimization system

PLDI '00 Proceedings of the ACM SIGPLAN 2000 conference on Programming language design and implementation
Tracking down software bugs using automatic anomaly detection

Proceedings of the 24th International Conference on Software Engineering
Automatically characterizing large scale program behavior

Proceedings of the 10th international conference on Architectural support for programming languages and operating systems
An infrastructure for adaptive dynamic optimization

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
Automated support for classifying software failure reports

Proceedings of the 25th International Conference on Software Engineering
Bug isolation via remote program sampling

PLDI '03 Proceedings of the ACM SIGPLAN 2003 conference on Programming language design and implementation
Performance debugging for distributed systems of black boxes

SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
Finding Latent Code Errors via Machine Learning over Program Executions

Proceedings of the 26th International Conference on Software Engineering
Active learning for automatic classification of software behavior

ISSTA '04 Proceedings of the 2004 ACM SIGSOFT international symposium on Software testing and analysis
Field studies of computer system administrators: analysis of system management tools and practices

CSCW '04 Proceedings of the 2004 ACM conference on Computer supported cooperative work
Combining High Level Symptom Descriptions and Low Level State Information for Configuration Fault Diagnosis

LISA '04 Proceedings of the 18th USENIX conference on System administration
Scalable statistical bug isolation

Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation
TraceBack: first fault diagnosis by reconstruction of distributed control flow

Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation
Quickly Finding Known Software Problems via Automated Symptom Matching

ICAC '05 Proceedings of the Second International Conference on Automatic Computing
Accurate, efficient, and adaptive calling context profiling

Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation
HeapMD: identifying heap-based bugs using anomaly detection

Proceedings of the 12th international conference on Architectural support for programming languages and operating systems
Automated known problem diagnosis with event traces

Proceedings of the 1st ACM SIGOPS/EuroSys European Conference on Computer Systems 2006
Automatic misconfiguration troubleshooting with peerpressure

OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Automated response using system-call delays

SSYM'00 Proceedings of the 9th conference on USENIX Security Symposium - Volume 9
Cost-Sensitive decision tree learning for forensic classification

ECML'06 Proceedings of the 17th European conference on Machine Learning

Information-theoretic metric learning

Proceedings of the 24th international conference on Machine learning
AutoBash: improving configuration management with operating system causality analysis

Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles
Probabilistic calling context

Proceedings of the 22nd annual ACM SIGPLAN conference on Object-oriented programming systems and applications
Privacy-preserving remote diagnostics

Proceedings of the 14th ACM conference on Computer and communications security
Structured metric learning for high dimensional problems

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Using causality to diagnose configuration bugs

ATC'08 USENIX 2008 Annual Technical Conference on Annual Technical Conference
Automatic software fault diagnosis by exploiting application signatures

LISA'08 Proceedings of the 22nd conference on Large installation system administration conference
Panalyst: privacy-aware remote error analysis on commodity software

SS'08 Proceedings of the 17th conference on Security symposium
Dynamic shape analysis via degree metrics

Proceedings of the 2009 international symposium on Memory management
A user-extensible and adaptable parser architecture

Knowledge-Based Systems
A concurrent dynamic analysis framework for multicore hardware

Proceedings of the 24th ACM SIGPLAN conference on Object oriented programming systems languages and applications
Towards versatile performance models for complex, popular applications

ACM SIGMETRICS Performance Evaluation Review
Opportunities for concurrent dynamic analysis with explicit inter-core communication

Proceedings of the 9th ACM SIGPLAN-SIGSOFT workshop on Program analysis for software tools and engineering
Practical performance models for complex, popular applications

Proceedings of the ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Secure evaluation of private linear branching programs with medical applications

ESORICS'09 Proceedings of the 14th European conference on Research in computer security
Visualizing and exploring profiles with calling context ring charts

Software—Practice & Experience
Automatically generating predicates and solutions for configuration troubleshooting

USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
Automating configuration troubleshooting with dynamic information flow analysis

OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Improving software diagnosability via log enhancement

Proceedings of the sixteenth international conference on Architectural support for programming languages and operating systems
Context-based online configuration-error detection

USENIXATC'11 Proceedings of the 2011 USENIX conference on USENIX annual technical conference
An empirical study on configuration errors in commercial and open source systems

SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Deja vu: fingerprinting network problems

Proceedings of the Seventh COnference on emerging Networking EXperiments and Technologies
Improving Software Diagnosability via Log Enhancement

ACM Transactions on Computer Systems (TOCS) - Special Issue APLOS 2011
Metric and kernel learning using a linear transformation

The Journal of Machine Learning Research
Dynamically validating static memory leak warnings

Proceedings of the 2013 International Symposium on Software Testing and Analysis
DeltaPath: Precise and Scalable Calling Context Encoding

Proceedings of Annual IEEE/ACM International Symposium on Code Generation and Optimization
Dynamic and Adaptive Calling Context Encoding

Proceedings of Annual IEEE/ACM International Symposium on Code Generation and Optimization

Quantified Score

Hi-index	0.00

Visualization

Abstract

An error occurs when software cannot complete a requested action as a result of some problem with its input, configuration, or environment. A high-quality error report allows a user to understand and correct the problem. Unfortunately, the quality of error reports has been decreasing as software becomes more complex and layered. End-users take the cryptic error messages given to them by programsand struggle to fix their problems using search engines and support websites. Developers cannot improve their error messages when they receive an ambiguous or otherwise insufficient error indicator from a black-box software component. We introduce Clarify, a system that improves error reporting by classifying application behavior. Clarify uses minimally invasive monitoring to generate a behavior profile, which is a summary of the program's execution history. A machine learning classifier uses the behavior profile to classify the application's behavior, thereby enabling a more precise error report than the output of the application itself. We evaluate a prototype Clarify system on ambiguous error messages generated by large, modern applications like gcc, La-TeX, and the Linux kernel. For a performance cost of less than 1% on user applications and 4.7% on the Linux kernel, the proto type correctly disambiguates at least 85% of application behaviors that result in ambiguous error reports. This accuracy does not degrade significantly with more behaviors: a Clarify classifier for 81 La-TeX error messages is at most 2.5% less accurate than a classifier for 27 LaTeX error messages. Finally, we show that without any human effort to build a classifier, Clarify can provide nearest-neighbor software support, where users who experience a problem are told about 5 other users who might have had the same problem. On average 2.3 of the 5 users that Clarify identifies have experienced the same problem.