The Confounding Effect of Class Size on the Validity of Object-Oriented Metrics

Authors:
Khaled El Emam;Saïda Benlarbi;Nishith Goel;Shesh N. Rai
Affiliations:
Canada Institute for Information Technology, Ottawa, Ont., Canada;Cistel Technology, Nepean, Ont., Canada;Cistel Technology, Nepean, Ont., Canada;St. Jude Children's Research Hospital, Memphis, TN
Venue:
IEEE Transactions on Software Engineering
Year:
2001

Abstract

Much effort has been devoted to the development and empirical validation of object-oriented metrics. The empirical validations performed thus far suggest that a core set of validated metrics is close to being identified. However, none of these studies allows for the potentially confounding effect of class size. In this paper, we demonstrate a strong size confounding effect and question the results of previous object-oriented metrics validation studies. We first investigate whether there is a confounding effect of class size in validation studies of object-oriented metrics and show that, based on previous work, there is reason to believe that such an effect exists. We then describe a detailed empirical methodology for identifying such effects. Finally, we perform a study on a large C++ telecommunications framework to examine whether size is really a confounder. This study considered the Chidamber and Kemerer metrics and a subset of the Lorenz and Kidd metrics. The dependent variable was the incidence of a fault attributable to a field failure (fault-proneness of a class). Our findings indicate that, before controlling for size, the results are very similar to those of previous studies: the metrics that are expected to be validated are indeed associated with fault-proneness. After controlling for size, none of the metrics we studied was associated with fault-proneness anymore. This demonstrates a strong size confounding effect and casts doubt on the results of previous object-oriented metrics validation studies. It is recommended that previous validation studies be reexamined to determine whether their conclusions would still hold after controlling for size, and that future validation studies always control for size.
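
The idea of "controlling for size" can be illustrated with a small sketch. It is not the authors' analysis or data: it assumes a logistic regression model (the abstract does not name one), uses purely synthetic data, and all names (`size`, `metric`, `fault`, `fit_logistic`) are hypothetical. In the simulated data, faults depend only on class size, while the metric merely correlates with size. Fitting the metric alone yields a clearly positive coefficient; adding size as a covariate drives it toward zero.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def solve(a, b):
    # Solve the small linear system a x = b by Gaussian elimination
    # with partial pivoting.
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = s / m[i][i]
    return x


def fit_logistic(rows, y, iterations=25):
    # Fit logistic regression by Newton-Raphson.
    # An intercept column is prepended, so beta[0] is the intercept.
    xs = [[1.0] + list(r) for r in rows]
    k = len(xs[0])
    beta = [0.0] * k
    for _ in range(iterations):
        grad = [0.0] * k
        hess = [[0.0] * k for _ in range(k)]
        for x, yi in zip(xs, y):
            p = sigmoid(sum(b * v for b, v in zip(beta, x)))
            w = p * (1.0 - p)
            for i in range(k):
                grad[i] += (yi - p) * x[i]
                for j in range(k):
                    hess[i][j] += w * x[i] * x[j]
        step = solve(hess, grad)
        beta = [b + s for b, s in zip(beta, step)]
    return beta


# Synthetic data (an assumption for illustration only): faults are driven
# by size alone; the metric is just a noisy proxy for size.
random.seed(0)
size, metric, fault = [], [], []
for _ in range(2000):
    s = random.gauss(0.0, 1.0)        # standardized class size
    m = s + random.gauss(0.0, 1.0)    # metric correlated with size
    p = sigmoid(-1.0 + 1.5 * s)       # fault probability ignores the metric
    size.append(s)
    metric.append(m)
    fault.append(1 if random.random() < p else 0)

# Model without size: the metric appears to be associated with faults.
unadjusted = fit_logistic([[m] for m in metric], fault)
# Model with size as a covariate: the metric's coefficient shrinks.
adjusted = fit_logistic([[m, s] for m, s in zip(metric, size)], fault)

print("metric coefficient, size ignored:   ", round(unadjusted[1], 3))
print("metric coefficient, size controlled:", round(adjusted[1], 3))
print("size coefficient:                   ", round(adjusted[2], 3))
```

Comparing the metric's coefficient across the two fits is the core of the check. A real study would also report standard errors or significance tests and would use measured class size (for example, lines of code) rather than a simulated variable.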