Evaluating Clone Detection Techniques from a Refactoring Perspective

  • Authors:
  • Filip Van Rysselberghe;Serge Demeyer

  • Affiliations:
  • University Of Antwerp, Belgium;University Of Antwerp, Belgium

  • Venue:
  • Proceedings of the 19th IEEE international conference on Automated software engineering
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the last decade, several researchers have investigated techniques to automatically detect duplicated code in programs exceeding hundreds of thousands lines of code. All of these techniques have known merits and deficiencies, but as of today, little is known on how these techniques fit into the refactoring process of object-oriented systems. This paper compares three representative detection techniques (simple line matching, parameterized matching, and metric fingerprints) by means of five small to medium sized cases and analyses the differences between the reported matches. Based on this comparison, we conclude that (1) simple line matching is best suited for a partial, yet advanced restructuring with little effort; (2) metric fingerprints work best for refactoring a system with minimal effort; (3) parameterized matching demands more effort yet allows a more profound, less obvious restructuring of the code.