Iterative identification of fault-prone binaries using in-process metrics

Authors:
Lucas Layman;Gunnar Kudrjavets;Nachiappan Nagappan
Affiliations:
North Carolina State University, Raleigh, NC, USA;Microsoft Corporation, Redmond, WA, USA;Microsoft Corporation, Redmond, WA, USA
Venue:
Proceedings of the Second ACM-IEEE international symposium on Empirical software engineering and measurement
Year:
2008

Citing 14
Cited 4

Compilers: principles, techniques, and tools

Compilers: principles, techniques, and tools
A Validation of Object-Oriented Design Metrics as Quality Indicators

IEEE Transactions on Software Engineering
Investigating quality factors in object-oriented designs: an industrial case study

Proceedings of the 21st international conference on Software engineering
Exploring the relationship between design measures and software quality in object-oriented systems

Journal of Systems and Software
Predicting Fault Incidence Using Software Change History

IEEE Transactions on Software Engineering
The Confounding Effect of Class Size on the Validity of Object-Oriented Metrics

IEEE Transactions on Software Engineering
A Metrics Suite for Object Oriented Design

IEEE Transactions on Software Engineering
Empirical Analysis of CK Metrics for Object-Oriented Design Complexity: Implications for Software Defects

IEEE Transactions on Software Engineering
An Empirical Study on Object-Oriented Metrics

METRICS '99 Proceedings of the 6th International Symposium on Software Metrics
Some issues in multi-phase software reliability modeling

CASCON '93 Proceedings of the 1993 conference of the Centre for Advanced Studies on Collaborative research: software engineering - Volume 1
Where the bugs are

ISSTA '04 Proceedings of the 2004 ACM SIGSOFT international symposium on Software testing and analysis
Predictors of customer perceived software quality

Proceedings of the 27th international conference on Software engineering
Use of relative code churn measures to predict system defect density

Proceedings of the 27th international conference on Software engineering
Using Historical In-Process and Product Metrics for Early Estimation of Software Failures

ISSRE '06 Proceedings of the 17th International Symposium on Software Reliability Engineering

Fault-prone module detection using large-scale text features based on spam filtering

Empirical Software Engineering
Exploring the relationship of a file's history and its fault-proneness: An empirical method and its application to open source programs

Information and Software Technology
Keeping up with your friends: function Foo, library Bar.DLL, and work item 24

Proceedings of the 1st Workshop on Web 2.0 for Software Engineering
An integrated approach to detect fault-prone modules using complexity and text feature metrics

AST/UCMA/ISA/ACN'10 Proceedings of the 2010 international conference on Advances in computer science and information technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

Code churn, the amount of code change taking place within a software unit over time, has been correlated with fault-proneness in software systems. We investigate the use of code churn and static metrics collected at regular time intervals during the development cycle to predict faults in an iterative, in-process manner. We collected 159 churn and structure metrics from six, four-month snapshots of a 1 million LOC Microsoft product. The number of software faults fixed during each period is recorded per binary module. Using stepwise logistic regression, we create a prediction model to identify fault-prone binaries using three parameters: code churn (the number of new and changed blocks); class Fan In and class Fan Out (normalized by lines of code). The iteratively-built model is 80.0% accurate at predicting fault-prone and non-fault-prone binaries. These fault-prediction models have the advantage of allowing the engineers to observe how their fault-prediction profile evolves over time.