Software fault localization using feature selection

Authors:
Shounak Roychowdhury;Sarfraz Khurshid
Affiliations:
University of Texas at Austin, Austin, Texas;University of Texas at Austin, Austin, Texas
Venue:
Proceedings of the International Workshop on Machine Learning Technologies in Software Engineering
Year:
2011

Citing 22
Cited 0

A practical approach to feature selection

ML92 Proceedings of the ninth international workshop on Machine learning
Estimating attributes: analysis and extensions of RELIEF

ECML-94 Proceedings of the European conference on machine learning on Machine Learning
Experiments of the effectiveness of dataflow- and controlflow-based test adequacy criteria

ICSE '94 Proceedings of the 16th international conference on Software engineering
Selection of relevant features and examples in machine learning

Artificial Intelligence - Special issue on relevance
Algorithmic Program DeBugging

Algorithmic Program DeBugging
Isolating cause-effect chains from computer programs

Proceedings of the 10th ACM SIGSOFT symposium on Foundations of software engineering
An adaptation of Relief for attribute estimation in regression

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Theoretical and Empirical Analysis of ReliefF and RReliefF

Machine Learning
An introduction to variable and feature selection

The Journal of Machine Learning Research
Pattern Classification (2nd Edition)

Pattern Classification (2nd Edition)
Feature Selection via Supervised Model Construction

ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
Locating causes of program failures

Proceedings of the 27th international conference on Software engineering
Scalable statistical bug isolation

Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation
Feature Selection Based on Mutual Information: Criteria of Max-Dependency, Max-Relevance, and Min-Redundancy

IEEE Transactions on Pattern Analysis and Machine Intelligence
Empirical evaluation of the tarantula automatic fault-localization technique

Proceedings of the 20th IEEE/ACM international Conference on Automated software engineering
The Dissimilarity Representation for Pattern Recognition: Foundations And Applications (Machine Perception and Artificial Intelligence)

The Dissimilarity Representation for Pattern Recognition: Foundations And Applications (Machine Perception and Artificial Intelligence)
Statistical Debugging: A Hypothesis Testing-Based Approach

IEEE Transactions on Software Engineering
Context-aware statistical debugging: from bug predictors to faulty control flow paths

Proceedings of the twenty-second IEEE/ACM international conference on Automated software engineering
A practical evaluation of spectrum-based fault localization

Journal of Systems and Software
A family of code coverage-based heuristics for effective fault localization

Journal of Systems and Software
Comprehensive evaluation of association measures for fault localization

ICSM '10 Proceedings of the 2010 IEEE International Conference on Software Maintenance
A novel framework for locating software faults using latent divergences

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III

Quantified Score

Hi-index	0.00

Visualization

Abstract

Manually locating and fixing faults can be tedious and hard. Recent years have seen much progress in automated techniques for fault localization. A particularly promising approach is to analyze passing and failing runs to compute how likely each statement is to be faulty. Techniques based on this approach have so far largely focused on either using statistical analysis or similarity based algorithms, which have a natural application in evaluating such runs. We present a novel approach to fault localization using feature selection techniques from machine learning. Our insight is that each additional failing or passing run can provide significantly diverse amount of information, which can help localize faults in code -- the statements with maximum feature diversity information can point to most suspicious lines of code. Experimental results show that our approach outperforms state-of-the-art approaches for localizing faults in most subject programs of the Siemens suite, which have previously been used to evaluate several fault localization techniques.