Clone Detection Using Abstract Syntax Suffix Trees

Authors:
Rainer Koschke;Raimar Falke;Pierre Frenzel
Affiliations:
University of Bremen, Germany;University of Bremen, Germany;University of Bremen, Germany
Venue:
WCRE '06 Proceedings of the 13th Working Conference on Reverse Engineering
Year:
2006

Citing 0
Cited 28

Efficient token based clone detection with flexible tokenization

Proceedings of the the 6th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
Structural analysis and visualization of C++ code evolution using syntax trees

Ninth international workshop on Principles of software evolution: in conjunction with the 6th ESEC/FSE joint meeting
Efficient token based clone detection with flexible tokenization

The 6th Joint Meeting on European software engineering conference and the ACM SIGSOFT symposium on the foundations of software engineering: companion papers
Comparison and Evaluation of Clone Detection Tools

IEEE Transactions on Software Engineering
Clone detection in automotive model-based development

Proceedings of the 30th international conference on Software engineering
Towards a mutation-based automatic framework for evaluating code clone detection tools

Proceedings of the 2008 C3S2E conference
Empirical evaluation of clone detection using syntax suffix trees

Empirical Software Engineering
"Cloning considered harmful" considered harmful: patterns of cloning in software

Empirical Software Engineering
Clone detection and removal for Erlang/OTP within a refactoring environment

Proceedings of the 2009 ACM SIGPLAN workshop on Partial evaluation and program manipulation
An evaluation of code similarity identification for the grow-and-prune model

Journal of Software Maintenance and Evolution: Research and Practice - Special Issue on the 12th Conference on Software Maintenance and Reengineering (CSMR 2008)
Comparison and evaluation of code clone detection techniques and tools: A qualitative approach

Science of Computer Programming
Accurate and Efficient Structural Characteristic Feature Extraction for Clone Detection

FASE '09 Proceedings of the 12th International Conference on Fundamental Approaches to Software Engineering: Held as Part of the Joint European Conferences on Theory and Practice of Software, ETAPS 2009
Do code clones matter?

ICSE '09 Proceedings of the 31st International Conference on Software Engineering
CloneDetective - A workbench for clone detection research

ICSE '09 Proceedings of the 31st International Conference on Software Engineering
COMPASS: A Community-driven Parallelization Advisor for Sequential Software

IWMSE '09 Proceedings of the 2009 ICSE Workshop on Multicore Software Engineering
Clone detection via structural abstraction

Software Quality Control
Tree-pattern-based duplicate code detection

Proceedings of the ACM first international workshop on Data-intensive software management and mining
Tracking the evolution of code clones

SOFSEM'11 Proceedings of the 37th international conference on Current trends in theory and practice of computer science
Scalable clone detection using description logic

Proceedings of the 5th International Workshop on Software Clones
Similar code detection and elimination for erlang programs

PADL'10 Proceedings of the 12th international conference on Practical Aspects of Declarative Languages
An empirical study on inconsistent changes to code clones at the release level

Science of Computer Programming
CBCD: cloned buggy code detector

Proceedings of the 34th International Conference on Software Engineering
Code flows: visualizing structural evolution of source code

EuroVis'08 Proceedings of the 10th Joint Eurographics / IEEE - VGTC conference on Visualization
Identification of generalization refactoring opportunities

Automated Software Engineering
Resource requirement prediction using clone detection technique

Future Generation Computer Systems
Language independent framework for static code analysis

Proceedings of the 6th Balkan Conference in Informatics
Tuning research tools for scalability and performance: The NiCad experience

Science of Computer Programming
Anti-unification for Unranked Terms and Hedges

Journal of Automated Reasoning

Quantified Score

Hi-index	0.00

Visualization

Abstract

Reusing software through copying and pasting is a continuous plague in software development despite the fact that it creates serious maintenance problems. Various techniques have been proposed to find duplicated redundant code (also known as software clones). A recent study has compared these techniques and shown that token-based clone detection based on suffix trees is extremely fast but yields clone candidates that are often no syntactic units. Current techniques based on abstract syntax trees--on the other hand--find syntactic clones but are considerably less efficient. This paper describes how we can make use of suffix trees to find clones in abstract syntax trees. This new approach is able to find syntactic clones in linear time and space. The paper reports the results of several large case studies in which we empirically compare the new technique to other techniques using the Bellon benchmark for clone detectors.