CCFinder: a multilinguistic token-based code clone detection system for large scale source code
IEEE Transactions on Software Engineering
Using Slicing to Identify Duplication in Source Code
SAS '01 Proceedings of the 8th International Symposium on Static Analysis
On finding duplication and near-duplication in large software systems
WCRE '95 Proceedings of the Second Working Conference on Reverse Engineering
Clone Detection Using Abstract Syntax Trees
ICSM '98 Proceedings of the International Conference on Software Maintenance
Winnowing: local algorithms for document fingerprinting
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Control flow based obfuscation
Proceedings of the 5th ACM workshop on Digital rights management
Deobfuscation: Reverse Engineering Obfuscated Code
WCRE '05 Proceedings of the 12th Working Conference on Reverse Engineering
CP-Miner: Finding Copy-Paste and Related Bugs in Large-Scale Software Code
IEEE Transactions on Software Engineering
GPLAG: detection of software plagiarism by program dependence graph analysis
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
DECKARD: Scalable and Accurate Tree-Based Detection of Code Clones
ICSE '07 Proceedings of the 29th international conference on Software Engineering
Tracking Code Clones in Evolving Software
ICSE '07 Proceedings of the 29th international conference on Software Engineering
Scalable detection of semantic clones
Proceedings of the 30th international conference on Software engineering
ICSE '09 Proceedings of the 31st International Conference on Software Engineering
Shared information and program plagiarism detection
IEEE Transactions on Information Theory
Hi-index | 0.00 |
Analyzing software similarity has emerged as a key ingredient for various applications such as software maintenance, bug finding, malware clustering and copyright protection. In this paper, we propose a novel software similarity analysis tool for plagiarism detection. The tool, which we refer to it as RAMC (Runtime Abstract Memory Context based tool), has the following three characteristics. First, it is based on runtime semantic information, which makes it feasible to investigate similarity in binary codes, without source codes. Second, among runtime semantic information, it focuses on AMC (Abstract Memory Context) that can represent the intrinsic features of the analyzed code. Finally, it introduces two advanced techniques, namely region filtering and sequence alignment for AMC comparison. Real implementation based experiments have shown that RAMC can identify similarity appropriately between the original and plagiarized binaries.