Fault Prediction Modeling for Software Quality Estimation: Comparing Commonly Used Techniques

Authors:
Taghi M. Khoshgoftaar;Naeem Seliya
Affiliations:
Florida Atlantic University, Boca Raton, Florida taghi@cse.fau.edu;Florida Atlantic University, Boca Raton, Florida nseliya@cse.fau.edu
Venue:
Empirical Software Engineering
Year:
2003

Abstract

High-assurance and complex mission-critical systems depend heavily on the reliability of their underlying software applications. Early software fault prediction is a proven technique for achieving high software reliability. Prediction models based on software metrics can predict the number of faults in software modules. Timely predictions from such models can be used to direct cost-effective quality enhancement efforts to modules that are likely to have a high number of faults. We evaluate the predictive performance of six commonly used fault prediction techniques: CART-LS (least squares), CART-LAD (least absolute deviation), S-PLUS, multiple linear regression, artificial neural networks, and case-based reasoning. The case study consists of software metrics collected over four releases of a very large telecommunications system. Two performance metrics, average absolute error and average relative error, are used to gauge the accuracy of the different prediction models. Models were built using both the original software metrics (RAW) and their principal components (PCA). Two-way ANOVA randomized complete block design models with two blocking variables are constructed, with average absolute and average relative errors as the response variables. System release and model type (RAW or PCA) form the blocking variables, and the prediction technique is treated as the factor. Using multiple pairwise comparisons, the performance order of the prediction models is determined. We observe that for both average absolute and average relative errors, the CART-LAD model performs best, while the S-PLUS model is ranked sixth.
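
The two accuracy measures named in the abstract can be illustrated with a minimal sketch. The abstract does not give the formulas, so the definitions below are assumptions: average absolute error is taken as the mean of |actual - predicted| over all modules, and average relative error divides each absolute error by (actual + 1) so that fault-free modules do not cause a division by zero. The function names and sample data are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def average_absolute_error(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean of |actual - predicted| over all modules (assumed definition)."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def average_relative_error(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean of |actual - predicted| / (actual + 1) over all modules.

    The +1 in the denominator is an assumption made here so that modules
    with zero observed faults remain well defined.
    """
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(a - p) / (a + 1) for a, p in zip(actual, predicted)) / len(actual)


if __name__ == "__main__":
    # Hypothetical fault counts for three modules, and a model's predictions.
    actual_faults = [0, 2, 4]
    predicted_faults = [1, 2, 2]
    print("AAE:", average_absolute_error(actual_faults, predicted_faults))
    print("ARE:", average_relative_error(actual_faults, predicted_faults))
```

Under these assumed definitions, a lower value of either measure indicates a more accurate model, which is the sense in which the techniques are ranked.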