Software Quality Classification Modeling Using The SPRINT Decision Tree Algorithm

Authors:
Taghi M. Khoshgoftaar;Naeem Seliya
Affiliations:
-;-
Venue:
ICTAI '02 Proceedings of the 14th IEEE International Conference on Tools with Artificial Intelligence
Year:
2002

Citing 0
Cited 11

Comparative Assessment of Software Quality Classification Techniques: An Empirical Case Study

Empirical Software Engineering
Enhancing software quality estimation using ensemble-classifier based noise filtering

Intelligent Data Analysis
Training on errors experiment to detect fault-prone software modules by spam filter

Proceedings of the the 6th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
A Fault Prediction Model with Limited Fault Data to Improve Test Process

PROFES '08 Proceedings of the 9th international conference on Product-Focused Software Process Improvement
Investigating the effect of dataset size, metrics sets, and feature selection techniques on software fault prediction problem

Information Sciences: an International Journal
Review: A systematic review of software fault prediction studies

Expert Systems with Applications: An International Journal
Prediction of Fault-Prone Software Modules Using a Generic Text Discriminator

IEICE - Transactions on Information and Systems
Practical development of an Eclipse-based software fault prediction tool using Naive Bayes algorithm

Expert Systems with Applications: An International Journal
Review: Software fault prediction: A literature review and current trends

Expert Systems with Applications: An International Journal
Application of K-Medoids with Kd-Tree for Software Fault Prediction

ACM SIGSOFT Software Engineering Notes
Feature selection and clustering in software quality prediction

EASE'07 Proceedings of the 11th international conference on Evaluation and Assessment in Software Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Predicting the quality of system modules prior to software testing and operations can benefit the software development team. Such a timely reliability estimation can be used to direct cost-effective quality improvement efforts to the high-risk modules. Tree-based softwarequality classification models based on software metrics are used to predict whether a software module is fauIt-prone or not fault-prone. They are white box quality estimation models with good accuracy, and are simpIe and easy to interpret.This paper presents an in-depth study of calibrating classification trees for software quality estimation using the SPRINT decision tree algorithm. Many classification algorithms have memory limitations including the requirement that data sets be memory resident. SPRINT removes all of these limitations and provides a fast and scalable analysis. It is an extension of a commonly used decision tree algorithm, CART, and provides a unique tree-pruning technique based on the Minimum Description Length (MDL) principle. Combining the MDL pruning technique and the modified classification algorithm, SPRINT yields classification trees with useful prediction accuracy. The case study used comprises of software metrics and fault data collected over four releases from a very large telecommunications system. It is observed that classification trees built by SPRINT are more balanced and demonstrate better stability incomparison to those built by CART.