Identifying and characterizing change-prone classes in two large-scale open-source products

Authors:
A. Güneş Koru; Hongfang Liu
Affiliations:
Department of Information Systems, University of Maryland, Baltimore County, UMBC-EASEL, Empirical and Applied Software Engineering Laboratory, 1000 Hilltop Circle, Baltimore, MD 21250, USA; Georgetown University Medical Center, Department of Biostatistics, Bioinformatics, and Biomathematics, 4000 Reservoir Road, NW Suite 120, Washington, DC 20007, USA
Venue:
Journal of Systems and Software
Year:
2007

Abstract

Developing and maintaining open-source software has become an important source of profit for many companies. Change-prone classes in open-source products increase project costs because they demand additional developer effort and time. Identifying and characterizing change-prone classes can enable developers to focus timely preventive actions, for example, peer reviews and inspections, on classes with similar characteristics in future releases or products. In this study, we collected a set of static metrics and change data at the class level from two open-source projects, KOffice and Mozilla. Using these data, we first tested and validated Pareto's Law, which implies that a great majority (around 80%) of change is rooted in a small proportion (around 20%) of classes. Then, we identified and characterized the change-prone classes in the two products by producing tree-based models. In addition, using the tree-based models, we suggested a prioritization strategy for using project resources efficiently on focused preventive actions. Our empirical results showed that this strategy was effective for prioritization purposes. This study should provide useful guidance to practitioners involved in the development and maintenance of large-scale open-source products.
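
As a rough illustration of the Pareto check mentioned in the abstract, the Python sketch below ranks classes by change count and reports the share of total change accounted for by the top 20% of classes. The function name `pareto_share`, the class names, and the change counts are hypothetical and are not taken from the study; the procedure the authors applied to KOffice and Mozilla may differ.

```python
from typing import Dict, Tuple


def pareto_share(change_counts: Dict[str, int],
                 top_fraction: float = 0.2) -> Tuple[int, float]:
    """Return (number of top classes, share of total change they hold).

    change_counts maps a class name to its number of recorded changes.
    This is an illustrative helper, not the study's actual procedure.
    """
    if not change_counts:
        raise ValueError("change_counts must not be empty")
    if not 0 < top_fraction <= 1:
        raise ValueError("top_fraction must be in (0, 1]")

    # Rank classes from most changed to least changed.
    ranked = sorted(change_counts.values(), reverse=True)

    # Size of the top group; keep at least one class.
    top_n = max(1, round(len(ranked) * top_fraction))

    total = sum(ranked)
    if total == 0:
        # No change recorded at all, so the share is defined as zero here.
        return top_n, 0.0
    return top_n, sum(ranked[:top_n]) / total


# Hypothetical change counts for ten classes (made-up values).
changes = {
    "A": 50, "B": 30, "C": 5, "D": 5, "E": 4,
    "F": 2, "G": 2, "H": 1, "I": 1, "J": 0,
}

top_n, share = pareto_share(changes)
print(f"Top {top_n} of {len(changes)} classes hold {share:.0%} of all changes")
```

A share near 80% for the top 20% of classes would be consistent with Pareto's Law as stated in the abstract; a much lower share would suggest that change is spread more evenly across classes.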