Learning universal probabilistic models for fault localization

Authors:
Min Feng;Rajiv Gupta
Affiliations:
University of California at Riverside, Riverside, CA, USA;University of California at Riverside, Riverside, CA, USA
Venue:
Proceedings of the 9th ACM SIGPLAN-SIGSOFT workshop on Program analysis for software tools and engineering
Year:
2010

Citing 22
Cited 3

Dynamic program slicing

Information Processing Letters
Dynamic program slicing

PLDI '90 Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation
Isolating cause-effect chains from computer programs

Proceedings of the 10th ACM SIGSOFT symposium on Foundations of software engineering
Whole Execution Traces

Proceedings of the 37th annual IEEE/ACM International Symposium on Microarchitecture
Locating causes of program failures

Proceedings of the 27th international conference on Software engineering
Scalable statistical bug isolation

Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation
SOBER: statistical model-based bug localization

Proceedings of the 10th European software engineering conference held jointly with 13th ACM SIGSOFT international symposium on Foundations of software engineering
Error explanation and fault localization with distance metrics

Error explanation and fault localization with distance metrics
Rx: treating bugs as allergies---a safe method to survive software failures

Proceedings of the twentieth ACM symposium on Operating systems principles
Locating faulty code using failure-inducing chops

Proceedings of the 20th IEEE/ACM international Conference on Automated software engineering
Empirical evaluation of the tarantula automatic fault-localization technique

Proceedings of the 20th IEEE/ACM international Conference on Automated software engineering
Automated path generation for software fault localization

Proceedings of the 20th IEEE/ACM international Conference on Automated software engineering
DieHard: probabilistic memory safety for unsafe languages

Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation
Pruning dynamic slices with confidence

Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation
Locating faults through automated predicate switching

Proceedings of the 28th international conference on Software engineering
Exterminator: automatically correcting memory errors with high probability

Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation
Context-aware statistical debugging: from bug predictors to faulty control flow paths

Proceedings of the twenty-second IEEE/ACM international conference on Automated software engineering
Archipelago: trading address space for reliability and security

Proceedings of the 13th international conference on Architectural support for programming languages and operating systems
Fault localization using value replacement

ISSTA '08 Proceedings of the 2008 international symposium on Software testing and analysis
The probabilistic program dependence graph and its application to fault diagnosis

ISSTA '08 Proceedings of the 2008 international symposium on Software testing and analysis
HOLMES: Effective statistical debugging via efficient path profiling

ICSE '09 Proceedings of the 31st International Conference on Software Engineering
An importance sampling algorithm based on evidence pre-propagation

UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence

Bayesian reasoning for software testing

Proceedings of the FSE/SDP workshop on Future of software engineering research
A general noise-reduction framework for fault localization of Java programs

Information and Software Technology
A test-suite reduction approach to improving fault-localization effectiveness

Computer Languages, Systems and Structures

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recently there has been significant interest in employing probabilistic techniques for fault localization. Using dynamic dependence information for multiple passing runs, learning techniques are used to construct a probabilistic graph model for a given program. Then, given a failing run, the probabilistic model is used to rank the executed statements according to the likelihood of them being faulty. In this paper we present a novel probabilistic approach in which universal probabilistic models are learned to characterize the behaviors of various instruction types used by all programs. The universal probabilistic model for an instruction type is in form of a probability distribution that represents how errors in the input (operand) values are propagated as errors in the output (result) of a given instruction type. Once these models have been constructed, they can be used in the analysis of any program as follows. Given a set of runs for any program, including at least one passing and one failing run, a Bayesian network called the Error Flow Graph (EFG) is then constructed from the dynamic dependence graphs of the program runs and the universal probabilistic models. Standard inference algorithms are employed to compute the probability of each executed statement being faulty. We also present optimizations to reduce the runtime cost of inference using the EFG. Our experiments demonstrate that our approach is highly effective in fault localization even when very few passing runs are available. It also performs well in the presence of multiple faults.