Comparison of similarity metrics for refactoring detection

Authors:
Benjamin Biegel;Quinten David Soetens;Willi Hornig;Stephan Diehl;Serge Demeyer
Affiliations:
University of Trier, Germany, Germany;University of Antwerp, Belgium, Belgium;University of Trier, Germany, Germany;University of Trier, Germany, Germany;University of Antwerp, Belgium, Belgium
Venue:
Proceedings of the 8th Working Conference on Mining Software Repositories
Year:
2011

Citing 17
Cited 3

Finding refactorings via change metrics

OOPSLA '00 Proceedings of the 15th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
CCFinder: a multilinguistic token-based code clone detection system for large scale source code

IEEE Transactions on Software Engineering
On the Resemblance and Containment of Documents

SEQUENCES '97 Proceedings of the Compression and Complexity of Sequences 1997
Reconstruction of Successful Software Evolution Using Clone Detection

IWPSE '03 Proceedings of the 6th International Workshop on Principles of Software Evolution
CatchUp!: capturing and replaying refactorings to support API evolution

Proceedings of the 27th international conference on Software engineering
Error detection by refactoring reconstruction

MSR '05 Proceedings of the 2005 international workshop on Mining software repositories
Digging the Development Dust for Refactorings

ICPC '06 Proceedings of the 14th IEEE International Conference on Program Comprehension
Are refactorings less error-prone than other changes?

Proceedings of the 2006 international workshop on Mining software repositories
Identifying Refactorings from Source-Code Changes

ASE '06 Proceedings of the 21st IEEE/ACM International Conference on Automated Software Engineering
Refactoring Detection based on UMLDiff Change-Facts Queries

WCRE '06 Proceedings of the 13th Working Conference on Reverse Engineering
Guidelines for conducting and reporting case study research in software engineering

Empirical Software Engineering
Comparison and evaluation of code clone detection techniques and tools: A qualitative approach

Science of Computer Programming
Using differences among replications of software engineering experiments to gain knowledge

ESEM '09 Proceedings of the 2009 3rd International Symposium on Empirical Software Engineering and Measurement
Template-based reconstruction of complex refactorings

ICSM '10 Proceedings of the 2010 IEEE International Conference on Software Maintenance
Highly Configurable and Extensible Code Clone Detection

WCRE '10 Proceedings of the 2010 17th Working Conference on Reverse Engineering
Studying the Effect of Refactorings: A Complexity Metrics Perspective

QUATIC '10 Proceedings of the 2010 Seventh International Conference on the Quality of Information and Communications Technology
Automated detection of refactorings in evolving components

ECOOP'06 Proceedings of the 20th European conference on Object-Oriented Programming

Issues arising from refactoring studies: an experience report

ACM SIGSOFT Software Engineering Notes
A multidimensional empirical study on refactoring activity

CASCON '13 Proceedings of the 2013 Conference of the Center for Advanced Studies on Collaborative Research
A method to evaluate differences between student UML class diagrams

Journal of Computing Sciences in Colleges

Quantified Score

Hi-index	0.00

Visualization

Abstract

Identifying refactorings in software archives has been an active research topic in the last decade, mainly because it is a prerequisite for various software evolution analyses (e.g., error detection, capturing intent of change, capturing and replaying changes, and relating refactorings and software metrics). Many of these techniques rely on similarity measures to identify structurally equivalent code, however, up until now the effect of this similarity measure on the performance of the refactoring identification algorithm is largely unexplored. In this paper we replicate a well-known experiment from Weißgerber and Diehl, plugging in three different similarity measures (text-based, AST-based, token-based). We look at the overlap of the results obtained by the different metrics, and we compare the results using recall and the computation time. We conclude that the different result sets have a large overlap and that the three metrics perform with a comparable quality.