Review: A systematic review of software fault prediction studies

Authors:
Cagatay Catal;Banu Diri
Affiliations:
The Scientific and Technological Research Council of Turkey, Marmara Research Center, Information Technologies Institute, Kocaeli, Turkey;Yildiz Technical University, Department of Computer Engineering, Istanbul, Turkey
Venue:
Expert Systems with Applications: An International Journal
Year:
2009

Citing 27
Cited 34

Object-oriented software metrics: a practical guide

Object-oriented software metrics: a practical guide
Comparing case-based reasoning classifiers for predicting high risk software components

Journal of Systems and Software
A Hierarchical Model for Object-Oriented Design Quality Assessment

IEEE Transactions on Software Engineering
Software Engineering Metrics for COTS-Based Systems

Computer
A Metrics Suite for Object Oriented Design

IEEE Transactions on Software Engineering
Metrics-Guided Quality Management for Component-Based Software Systems

COMPSAC '01 Proceedings of the 25th International Computer Software and Applications Conference on Invigorating Software Development
Machine Learning Method for Software Quality Model Building

ISMIS '99 Proceedings of the 11th International Symposium on Foundations of Intelligent Systems
An Application of Fuzzy Clustering to Software Quality Prediction

ASSET '00 Proceedings of the 3rd IEEE Symposium on Application-Specific Systems and Software Engineering Technology (ASSET'00)
Evaluating the Impact of Object-Oriented Design on Software Quality

METRICS '96 Proceedings of the 3rd International Symposium on Software Metrics: From Measurement to Empirical Results
Investigation of Logistic Regression as a Discriminant of Software Quality

METRICS '01 Proceedings of the 7th International Symposium on Software Metrics
Software Quality Classification Modeling Using The SPRINT Decision Tree Algorithm

ICTAI '02 Proceedings of the 14th IEEE International Conference on Tools with Artificial Intelligence
An Application of Zero-Inflated Poisson Regression for Software Fault Prediction

ISSRE '01 Proceedings of the 12th International Symposium on Software Reliability Engineering
Application of Neural Networks for Software Quality Prediction Using Object-Oriented Metrics

ICSM '03 Proceedings of the International Conference on Software Maintenance
Component-based measurement: few useful guidelines

ACM SIGSOFT Software Engineering Notes
Few important considerations for deriving interface complexity metric for component-based systems

ACM SIGSOFT Software Engineering Notes
Predicting the Location and Number of Faults in Large Software Systems

IEEE Transactions on Software Engineering
An investigation of the effect of module size on defect prediction using static measures

PROMISE '05 Proceedings of the 2005 workshop on Predictor models in software engineering
Data Mining Static Code Attributes to Learn Defect Predictors

IEEE Transactions on Software Engineering
A Systematic Review of Software Development Cost Estimation Studies

IEEE Transactions on Software Engineering
Empirical Analysis of Object-Oriented Design Metrics for Predicting High and Low Severity Faults

IEEE Transactions on Software Engineering
Empirical Validation of Three Software Metrics Suites to Predict Fault-Proneness of Object-Oriented Classes Developed Using Highly Iterative or Agile Software Development Processes

IEEE Transactions on Software Engineering
Software Defect Prediction Using Regression via Classification

AICCSA '06 Proceedings of the IEEE International Conference on Computer Systems and Applications
Software defect prediction using artificial immune recognition system

SE'07 Proceedings of the 25th conference on IASTED International Multi-Conference: Software Engineering
Fault Prediction using Early Lifecycle Data

ISSRE '07 Proceedings of the The 18th IEEE International Symposium on Software Reliability
A Fault Prediction Model with Limited Fault Data to Improve Test Process

PROFES '08 Proceedings of the 9th international conference on Product-Focused Software Process Improvement
A survey of component based system quality assurance and assessment

Information and Software Technology
Software fault prediction with object-oriented metrics based artificial immune recognition system

PROFES'07 Proceedings of the 8th international conference on Product-Focused Software Process Improvement

A tree-based approach to preserve the privacy of software engineering data and predictive models

PROMISE '09 Proceedings of the 5th International Conference on Predictor Models in Software Engineering
A systematic review of domain analysis solutions for product lines

Journal of Systems and Software
What's up with software metrics? - A preliminary mapping study

Journal of Systems and Software
Predicting defects with program dependencies

ESEM '09 Proceedings of the 2009 3rd International Symposium on Empirical Software Engineering and Measurement
Transparent combination of expert and measurement data for defect prediction: an industrial case study

Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - Volume 2
Predicting vulnerable software components with dependency graphs

Proceedings of the 6th International Workshop on Security Measurements and Metrics
Which is the right source for vulnerability studies?: an empirical analysis on Mozilla Firefox

Proceedings of the 6th International Workshop on Security Measurements and Metrics
An integrated approach to detect fault-prone modules using complexity and text feature metrics

AST/UCMA/ISA/ACN'10 Proceedings of the 2010 international conference on Advances in computer science and information technology
Software metrics reduction for fault-proneness prediction of software modules

NPC'10 Proceedings of the 2010 IFIP international conference on Network and parallel computing
Thresholds based outlier detection approach for mining class outliers: An empirical case study on software measurement datasets

Expert Systems with Applications: An International Journal
Using mapping studies as the basis for further research - A participant-observer case study

Information and Software Technology
Review: On the application of genetic programming for software engineering predictive modeling: A systematic review

Expert Systems with Applications: An International Journal
Empirical evaluation of reliability improvement in an evolving software product line

Proceedings of the 8th Working Conference on Mining Software Repositories
Six years of systematic literature reviews in software engineering: An updated tertiary study

Information and Software Technology
Are change metrics good predictors for an evolving software product line?

Proceedings of the 7th International Conference on Predictive Models in Software Engineering
Customization support for CBR-based defect prediction

Proceedings of the 7th International Conference on Predictive Models in Software Engineering
BugCache for inspections: hit or miss?

Proceedings of the 19th ACM SIGSOFT symposium and the 13th European conference on Foundations of software engineering
An investigation on the feasibility of cross-project defect prediction

Automated Software Engineering
Applying heuristic approaches for predicting defect-prone software components

EUROCAST'11 Proceedings of the 13th international conference on Computer Aided Systems Theory - Volume Part I
State coverage: software validation metrics beyond code coverage

SOFSEM'12 Proceedings of the 38th international conference on Current Trends in Theory and Practice of Computer Science
The value of mapping studies: a participantobserver case study

EASE'10 Proceedings of the 14th international conference on Evaluation and Assessment in Software Engineering
Reducing test effort: A systematic mapping study on existing approaches

Information and Software Technology
Can faulty modules be predicted by warning messages of static code analyzer?

Advances in Software Engineering - Special issue on Software Quality Assurance Methodologies and Techniques
The scientific basis for prediction research

Proceedings of the 8th International Conference on Predictive Models in Software Engineering
A mapping study to investigate component-based software system metrics

Journal of Systems and Software
Reuse: reducing test effort

ACM SIGSOFT Software Engineering Notes
Predicting method crashes with bytecode operations

Proceedings of the 6th India Software Engineering Conference
Data quality in empirical software engineering: a targeted review

Proceedings of the 17th International Conference on Evaluation and Assessment in Software Engineering
A learning-based method for combining testing techniques

Proceedings of the 2013 International Conference on Software Engineering
A study of subgroup discovery approaches for defect prediction

Information and Software Technology
Test case prioritization: a systematic mapping study

Software Quality Control
An empirical study of vulnerability rewards programs

SEC'13 Proceedings of the 22nd USENIX conference on Security
A survey of computational intelligence approaches for software reliability prediction

ACM SIGSOFT Software Engineering Notes
Prediction of faults-slip-through in large software projects: an empirical evaluation

Software Quality Control

Quantified Score

Hi-index	12.05

Visualization

Abstract

This paper provides a systematic review of previous software fault prediction studies with a specific focus on metrics, methods, and datasets. The review uses 74 software fault prediction papers in 11 journals and several conference proceedings. According to the review results, the usage percentage of public datasets increased significantly and the usage percentage of machine learning algorithms increased slightly since 2005. In addition, method-level metrics are still the most dominant metrics in fault prediction research area and machine learning algorithms are still the most popular methods for fault prediction. Researchers working on software fault prediction area should continue to use public datasets and machine learning algorithms to build better fault predictors. The usage percentage of class-level is beyond acceptable levels and they should be used much more than they are now in order to predict the faults earlier in design phase of software life cycle.