Network Versus Code Metrics to Predict Defects: A Replication Study

Authors:
Rahul Premraj;Kim Herzig
Affiliations:
-;-
Venue:
ESEM '11 Proceedings of the 2011 International Symposium on Empirical Software Engineering and Measurement
Year:
2011

Citing 0
Cited 3

Graph-based analysis and prediction for software evolution

Proceedings of the 34th International Conference on Software Engineering
Transfer defect learning

Proceedings of the 2013 International Conference on Software Engineering
It's not a bug, it's a feature: how misclassification impacts bug prediction

Proceedings of the 2013 International Conference on Software Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Several defect prediction models have been proposed to identify which entities in a software system are likely to have defects before its release. This paper presents a replication of one such study conducted by Zimmermann and Nagappan on Windows Server 2003 where the authors leveraged dependency relationships between software entities captured using social network metrics to predict whether they are likely to have defects. They found that network metrics perform significantly better than source code metrics at predicting defects. In order to corroborate the generality of their findings, we replicate their study on three open source Java projects, viz., JRuby, ArgoUML, and Eclipse. Our results are in agreement with the original study by Zimmermann and Nagappan when using a similar experimental setup as them (random sampling). However, when we evaluated the metrics using setups more suited for industrial use--forward-release and cross-project prediction--we found network metrics to offer no vantage over code metrics. Moreover, code metrics may be preferable to network metrics considering the data is easier to collect and we used only 8 code metrics compared to approximately 58 network metrics.