A code clone is a portion of code in source files that is identical or similar to another portion. Because code clones are believed to reduce the maintainability of software, several clone detection techniques and tools have been proposed. This paper proposes a new clone detection technique that consists of a transformation of the input source text and a token-by-token comparison. We have implemented the technique, together with several optimizations, in a tool named CCFinder, which extracts code clones from C, C++, Java, COBOL, and other source files. We have also developed metrics for code clones. To evaluate the usefulness of CCFinder and the metrics, we conducted several case studies in which we applied the tool to the source code of JDK, FreeBSD, NetBSD, Linux, and many other systems. In these studies, CCFinder found clones effectively, and the metrics identified the characteristics of the systems. In addition, we compared the proposed technique with other clone detection techniques.
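
The two stages named in the abstract, a transformation of the source text and a token-by-token comparison, can be illustrated with a minimal Python sketch. It is not CCFinder's implementation: the tokenizer, the keyword subset, the `$id` and `$num` placeholders, the function names `normalize` and `find_clone_pairs`, and the `min_len` parameter are all assumptions made for illustration, and the matching step is a naive fixed-window lookup rather than whatever optimized matching the tool uses.

```python
import re
from collections import defaultdict

# Hypothetical, simplified tokenizer: identifiers, integer literals,
# and any other single non-space character.
TOKEN_RE = re.compile(r"[A-Za-z_]\w*|\d+|\S")

# Illustrative keyword subset only; a real tool would use the full language grammar.
KEYWORDS = {"if", "else", "for", "while", "return", "int", "void"}


def normalize(source):
    """Transform source text into tokens, replacing names and numbers with placeholders."""
    tokens = []
    for tok in TOKEN_RE.findall(source):
        if tok in KEYWORDS:
            tokens.append(tok)        # keywords are kept as-is
        elif re.match(r"[A-Za-z_]", tok):
            tokens.append("$id")      # any identifier becomes one placeholder
        elif tok.isdigit():
            tokens.append("$num")     # any integer literal becomes one placeholder
        else:
            tokens.append(tok)        # punctuation and operators are kept
    return tokens


def find_clone_pairs(tokens_a, tokens_b, min_len=5):
    """Return (i, j) start positions where min_len consecutive tokens are equal."""
    # Index every window of tokens_a by its contents.
    index = defaultdict(list)
    for i in range(len(tokens_a) - min_len + 1):
        index[tuple(tokens_a[i:i + min_len])].append(i)

    # Look up every window of tokens_b in that index.
    pairs = []
    for j in range(len(tokens_b) - min_len + 1):
        for i in index.get(tuple(tokens_b[j:j + min_len]), []):
            pairs.append((i, j))
    return pairs


a = normalize("int total = count + 1;")
b = normalize("int sum = n + 2;")
print(a)                                   # ['int', '$id', '=', '$id', '+', '$num', ';']
print(find_clone_pairs(a, b, min_len=7))   # [(0, 0)]
```

In this sketch the two statements differ only in names and literals, so they normalize to the same token sequence and are reported as a clone pair. Replacing identifiers with a placeholder is one possible transformation; it trades precision for the ability to match renamed copies.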