Software Quality Classification Modeling Using The SPRINT Decision Tree Algorithm

  • Authors:
  • Taghi M. Khoshgoftaar; Naeem Seliya

  • Venue:
  • ICTAI '02 Proceedings of the 14th IEEE International Conference on Tools with Artificial Intelligence
  • Year:
  • 2002

Abstract

Predicting the quality of system modules prior to software testing and operations can benefit the software development team. Such timely reliability estimates can be used to direct cost-effective quality improvement efforts to the high-risk modules. Tree-based software quality classification models based on software metrics are used to predict whether a software module is fault-prone or not fault-prone. They are white-box quality estimation models with good accuracy, and are simple and easy to interpret. This paper presents an in-depth study of calibrating classification trees for software quality estimation using the SPRINT decision tree algorithm. Many classification algorithms have memory limitations, including the requirement that data sets be memory resident. SPRINT removes all of these limitations and provides a fast and scalable analysis. It is an extension of a commonly used decision tree algorithm, CART, and provides a unique tree-pruning technique based on the Minimum Description Length (MDL) principle. Combining the MDL pruning technique and the modified classification algorithm, SPRINT yields classification trees with useful prediction accuracy. The case study comprises software metrics and fault data collected over four releases of a very large telecommunications system. It is observed that classification trees built by SPRINT are more balanced and demonstrate better stability in comparison to those built by CART.
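
To make the tree-building idea concrete, here is a minimal sketch of how a single split on one software metric might be chosen. It is an illustration only, not the authors' implementation: it assumes the Gini index as the split criterion and a single scan over a sorted attribute list, and the function names, metric values, and the labels "fp" (fault-prone) and "nfp" (not fault-prone) are all hypothetical. MDL-based pruning and disk-resident attribute lists are not shown.

```python
from collections import Counter

def gini(counts, total):
    """Gini impurity of a class histogram (0.0 for an empty partition)."""
    if total == 0:
        return 0.0
    return 1.0 - sum((c / total) ** 2 for c in counts.values())

def best_numeric_split(values, labels):
    """Scan one sorted attribute list and return (threshold, weighted_gini).

    Returns (None, inf) when no split between distinct values exists.
    """
    records = sorted(zip(values, labels))  # sort the attribute list once
    n = len(records)
    below = Counter()  # class histogram for records at or below the candidate
    above = Counter(label for _, label in records)  # histogram for the rest
    best = (None, float("inf"))
    for i in range(n - 1):
        value, label = records[i]
        below[label] += 1
        above[label] -= 1
        next_value = records[i + 1][0]
        if value == next_value:
            continue  # cannot split between two equal attribute values
        n_below = i + 1
        n_above = n - n_below
        score = (n_below * gini(below, n_below)
                 + n_above * gini(above, n_above)) / n
        if score < best[1]:
            best = ((value + next_value) / 2, score)  # midpoint threshold
    return best

# Hypothetical module sizes (lines of code) and fault labels.
loc = [10, 20, 30, 200, 250, 300]
quality = ["nfp", "nfp", "nfp", "fp", "fp", "fp"]
threshold, impurity = best_numeric_split(loc, quality)
```

In this toy data the classes separate cleanly, so the scan settles on the midpoint between 30 and 200 with zero impurity. A full tree builder would repeat this scan for every metric, split on the best one, and recurse on the two partitions.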