Empirical Analysis of Software Fault Content and Fault Proneness Using Bayesian Methods

Authors:
Ganesh J. Pai;Joanne Bechta Dugan
Affiliations:
IEEE;IEEE
Venue:
IEEE Transactions on Software Engineering
Year:
2007

Abstract

We present a methodology for Bayesian analysis of software quality. We cast our research in the broader context of constructing a causal framework that can include process, product, and other diverse sources of information regarding fault introduction during the software development process. In this paper, we discuss how internal product metrics relate to external quality metrics. Specifically, we build a Bayesian network (BN) model to relate object-oriented software metrics to software fault content and fault proneness. Assuming that the relationship can be described as a generalized linear model, we derive parametric functional forms for the conditional distributions of the target nodes in the BN. These functional forms are shown to be able to represent linear, Poisson, and binomial logistic regression. The models are empirically evaluated using a public domain data set from a software subsystem. The results show that our approach produces statistically significant estimations and that our overall modelling method performs no worse than existing techniques.
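
The three regression forms named in the abstract share one structure: a linear predictor built from the metrics, passed through a link-specific mean function. The sketch below illustrates only that general idea and is not the paper's model. The metric names (`WMC`, `CBO`), the coefficient values, and all function names are hypothetical, chosen for illustration.

```python
import math


def linear_predictor(metrics, coefficients, intercept):
    """Return eta = intercept + sum_i beta_i * x_i for one software class."""
    return intercept + sum(coefficients[name] * value
                           for name, value in metrics.items())


def mean_linear(eta):
    """Identity link: expected fault content under linear regression."""
    return eta


def mean_poisson(eta):
    """Log link: expected fault count under Poisson regression."""
    return math.exp(eta)


def mean_logistic(eta):
    """Logit link: probability that the class is fault prone."""
    return 1.0 / (1.0 + math.exp(-eta))


# Hypothetical metric values and coefficients; not taken from the paper.
metrics = {"WMC": 10, "CBO": 5}
coefficients = {"WMC": 0.05, "CBO": 0.1}
eta = linear_predictor(metrics, coefficients, intercept=-2.0)

print("linear predictor:", eta)
print("Poisson mean fault count:", mean_poisson(eta))
print("logistic fault proneness:", mean_logistic(eta))
```

In a BN setting, each of these mean functions would parameterize the conditional distribution of a target node given its parent metric nodes. How the paper actually derives and fits those distributions is not stated in the abstract.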