Where the bugs are

Authors:
Thomas J. Ostrand;Elaine J. Weyuker;Robert M. Bell
Affiliations:
AT&T Labs - Research, Florham Park, NJ;AT&T Labs - Research, Florham Park, NJ;AT&T Labs - Research, Florham Park, NJ
Venue:
ISSTA '04 Proceedings of the 2004 ACM SIGSOFT international symposium on Software testing and analysis
Year:
2004

Citing 8
Cited 51

Software errors and complexity: an empirical investigation0

Communications of the ACM
The Detection of Fault-Prone Programs

IEEE Transactions on Software Engineering
Predicting Fault Incidence Using Software Change History

IEEE Transactions on Software Engineering
The distribution of faults in a large industrial software system

ISSTA '02 Proceedings of the 2002 ACM SIGSOFT international symposium on Software testing and analysis
Early Quality Prediction: A Case Study in Telecommunications

IEEE Software
Reexamining the Fault Density-Component Size Connection

IEEE Software
Quantitative Analysis of Faults and Failures in a Complex Software System

IEEE Transactions on Software Engineering
An Empirical Analysis of Fault Persistence Through Software Releases

ISESE '03 Proceedings of the 2003 International Symposium on Empirical Software Engineering

The (Im)maturity level of software testing

ACM SIGSOFT Software Engineering Notes
Correlation exploitation in error ranking

Proceedings of the 12th ACM SIGSOFT twelfth international symposium on Foundations of software engineering
Use of relative code churn measures to predict system defect density

Proceedings of the 27th international conference on Software engineering
Automatic Mining of Source Code Repositories to Improve Bug Finding Techniques

IEEE Transactions on Software Engineering
HATARI: raising risk awareness

Proceedings of the 10th European software engineering conference held jointly with 13th ACM SIGSOFT international symposium on Foundations of software engineering
Improving evolvability through refactoring

MSR '05 Proceedings of the 2005 international workshop on Mining software repositories
Experiences and results from initiating field defect prediction and product test prioritization efforts at ABB Inc.

Proceedings of the 28th international conference on Software engineering
Revisiting the problem of using problem reports for quality assessment

Proceedings of the 2006 international workshop on Software quality
Extracting refactoring trends from open-source software and a possible solution to the 'related refactoring' conundrum

Proceedings of the 2006 ACM symposium on Applied computing
Looking for bugs in all the right places

Proceedings of the 2006 international symposium on Software testing and analysis
MINIX 3: a highly reliable, self-repairing operating system

ACM SIGOPS Operating Systems Review
Memories of bug fixes

Proceedings of the 14th ACM SIGSOFT international symposium on Foundations of software engineering
Using Developer Information as a Factor for Fault Prediction

PROMISE '07 Proceedings of the Third International Workshop on Predictor Models in Software Engineering
Automating algorithms for the identification of fault-prone files

Proceedings of the 2007 international symposium on Software testing and analysis
A critique of the GNU hurd multi-server operating system

ACM SIGOPS Operating Systems Review
Which warnings should I fix first?

Proceedings of the the 6th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
Software engineering research: from cradle to grave

Proceedings of the the 6th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
An empirical investigation of software reuse benefits in a large telecom product

ACM Transactions on Software Engineering and Methodology (TOSEM)
Predicting accurate and actionable static analysis warnings: an experimental approach

Proceedings of the 30th international conference on Software engineering
The influence of organizational structure on software quality: an empirical case study

Proceedings of the 30th international conference on Software engineering
Exploring the relationship of history characteristics and defect count: an empirical study

DEFECTS '08 Proceedings of the 2008 workshop on Defects in large software systems
Iterative identification of fault-prone binaries using in-process metrics

Proceedings of the Second ACM-IEEE international symposium on Empirical software engineering and measurement
Failure-prone components are also attack-prone components

Companion to the 23rd ACM SIGPLAN conference on Object-oriented programming systems languages and applications
Prioritizing software security fortification throughcode-level metrics

Proceedings of the 4th ACM workshop on Quality of protection
Data mining source code for locating software bugs: A case study in telecommunication industry

Expert Systems with Applications: An International Journal
Toward Non-security Failures as a Predictor of Security Faults and Failures

ESSoS '09 Proceedings of the 1st International Symposium on Engineering Secure Software and Systems
Cross-project defect prediction: a large scale experiment on data vs. domain vs. process

Proceedings of the the 7th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
Cooperative bug isolation: winning thesis of the 2005 ACM doctoral dissertation competition

Cooperative bug isolation: winning thesis of the 2005 ACM doctoral dissertation competition
Approximating deployment metrics to predict field defects and plan corrective maintenance activities

ISSRE'09 Proceedings of the 20th IEEE international conference on software reliability engineering
Modeling consumer-perceived web application fault severities for testing

Proceedings of the 19th international symposium on Software testing and analysis
Defect prediction from static code features: current results, limitations, new approaches

Automated Software Engineering
Better, faster, and cheaper: what is better software?

Proceedings of the 6th International Conference on Predictive Models in Software Engineering
A different view of fault prediction

COMPSAC-W'05 Proceedings of the 29th annual international conference on Computer software and applications conference
Predicting software bugs using ARIMA model

Proceedings of the 48th Annual Southeast Regional Conference
Requirements attributes to predict requirements related defects

Proceedings of the 2010 Conference of the Center for Advanced Studies on Collaborative Research
A systematic literature review of actionable alert identification techniques for automated static code analysis

Information and Software Technology
Defect prediction using social network analysis on issue repositories

Proceedings of the 2011 International Conference on Software and Systems Process
Don't touch my code!: examining the effects of ownership on software quality

Proceedings of the 19th ACM SIGSOFT symposium and the 13th European conference on Foundations of software engineering
High-impact defects: a study of breakage and surprise defects

Proceedings of the 19th ACM SIGSOFT symposium and the 13th European conference on Foundations of software engineering
Are popular classes more defect prone?

FASE'10 Proceedings of the 13th international conference on Fundamental Approaches to Software Engineering
Controversy Corner: Improving test efficiency through system test prioritization

Journal of Systems and Software
Software defect analysis of a multi-release telecommunications system

PROFES'05 Proceedings of the 6th international conference on Product Focused Software Process Improvement
Regularities in learning defect predictors

PROFES'10 Proceedings of the 11th international conference on Product-Focused Software Process Improvement
On the improvement of a fault classification scheme with implications for white-box testing

Proceedings of the 27th Annual ACM Symposium on Applied Computing
Evaluating defect prediction approaches: a benchmark and an extensive comparison

Empirical Software Engineering
A learning-to-rank algorithm for constructing defect prediction models

IDEAL'12 Proceedings of the 13th international conference on Intelligent Data Engineering and Automated Learning
Characterizing the roles of classes and their fault-proneness through change metrics

Proceedings of the ACM-IEEE international symposium on Empirical software engineering and measurement
Open Source Software Systems: Understanding Bug Prediction and Software Developer Roles

International Journal of Open Source Software and Processes
Influence of confirmation biases of developers on software quality: an empirical study

Software Quality Control
Beyond data mining; towards "idea engineering"

Proceedings of the 9th International Conference on Predictive Models in Software Engineering
DConfusion: a technique to allow cross study performance evaluation of fault prediction studies

Automated Software Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

The ability to predict which files in a large software system are most likely to contain the largest numbers of faults in the next release can be a very valuable asset. To accomplish this, a negative binomial regression model using information from previous releases has been developed and used to predict the numbers of faults for a large industrial inventory system. The files of each release were sorted in descending order based on the predicted number of faults and then the first 20% of the files were selected. This was done for each of fifteen consecutive releases, representing more than four years of field usage. The predictions were extremely accurate, correctly selecting files that contained between 71% and 92% of the faults, with the overall average being 83%. In addition, the same model was used on data for the same system's releases, but with all fault data prior to integration testing removed. The prediction was again very accurate, ranging from 71% to 93%, with the average being 84%. Predictions were made for a second system, and again the first 20% of files accounted for 83% of the identified faults. Finally, a highly simplified predictor was considered which correctly predicted 73% and 74% of the faults for the two systems.