Preparing Measurements of Legacy Software for Predicting Operational Faults

Authors:
Taghi M. Khoshgoftaar;Edward B. Allen;Xiaojing Yuan;Wendell D. Jones;John P. Hudepohl
Affiliations:
-;-;-;-;-
Venue:
ICSM '99 Proceedings of the IEEE International Conference on Software Maintenance
Year:
1999

Citing 0
Cited 5

Controlling Overfitting in Classification-Tree Models ofSoftware Quality

Empirical Software Engineering
Uncertain Classification of Fault-Prone Software Modules

Empirical Software Engineering
Balancing Misclassification Rates in Classification-TreeModels of Software Quality

Empirical Software Engineering
Classification Tree Models of Software Quality Over Multiple Releases

ISSRE '99 Proceedings of the 10th International Symposium on Software Reliability Engineering
Improving Tree-Based Models of Software Quality with Principal Components Analysis

ISSRE '00 Proceedings of the 11th International Symposium on Software Reliability Engineering

Quantified Score

Hi-index	0.01

Visualization

Abstract

Software quality modeling can be used by a software maintenance project to identify a limited set of software modules that probably need improvement. A model's goal is to recommend a set of modules to receive special treatment. The purpose of this paper is to report our experiences modeling software quality with classification trees, including necessary preprocessing of data.We conducted a case study on two releases of a very large legacy telecommunications system. A module was considered fault-prone if any faults were discovered by customers, and not fault-prone otherwise. Software product, process, and execution metrics were the basis for predictors.The treedisc algorithm for building classification trees was investigated, because it emphasizes statistical significance. Numeric data, such as software metrics, are not suitable for treedisc . Consequently, we transformed measurements into discrete ordinal predictors by grouping. This case study investigated the sensitivity of modeling results to various groupings. We found that robustness, accuracy, and parsimony of the mod- els were influenced by the maximum number of groups. Models based on two sets of candidate predictors had similar sensitivity.