Clone Detection in Source Code by Frequent Itemset Techniques

  • Authors:
  • Vera Wahler;Dietmar Seipel;Jurgen Wolff v. Gudenberg;Gregor Fischer

  • Affiliations:
  • University of Würzburg, Germany;University of Würzburg, Germany;University of Würzburg, Germany;University of Würzburg, Germany

  • Venue:
  • SCAM '04 Proceedings of the Source Code Analysis and Manipulation, Fourth IEEE International Workshop
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we describe a new approach for the detection of clones in source code, which is inspired by the concept of frequent itemsets from data mining. The source code is represented as an abstract syntax tree in XML. Currently, such XML representations exist for instance for Java, C++, or PROLOG. Our approach is very flexible; it can be configured easily to work with multiple programming languages.