Detecting Move Operations in Versioning Information

  • Authors:
  • Filip Van Rysselberghe;Matthias Rieger;Serge Demeyer

  • Affiliations:
  • University Of Antwerp;University Of Antwerp;University Of Antwerp

  • Venue:
  • CSMR '06 Proceedings of the Conference on Software Maintenance and Reengineering
  • Year:
  • 2006

Quantified Score

Hi-index 0.01

Visualization

Abstract

Recently, there is an increasing research interest in mining versioning information, i.e. the analysis of the transactions made on version systems to understand how and when a software system evolves. One particular area of interest is the identification of move operations as these are key indicators for refactorings. Unfortunately, there exists no evaluation which identifies the quality (expressed in precision and recall) of the most commonly used detection technique and its underlying principle of name identity. To overcome this problem, the paper compares the precision and recall values of the name-based technique with two alternative techniques, one based on line matching and one based on identifier matching, by means of two case studies. From the results of these studies we conclude that the name-based technique is very precise, yet misses a significant number of move operations (low recall value). To improve the trade-off it is worthwhile to consider the line-based technique since it detects more matches with a slightly worse precision, or to use the number of overlapping identifiers when combined with an additional filter.