In this paper, we propose a system for detecting software plagiarism using a birthmark. A birthmark is a set of representative features of a program that can be used to identify it. We use the frequencies of the APIs called by a program as its birthmark. The proposed system consists of three components. First, it extracts the frequency of each API used in a program. Next, it generates the program's birthmark from these API frequencies, applying a weight to each API so that the birthmark captures the features unique to the program. Finally, it decides whether plagiarism has occurred based on the cosine similarity between two birthmarks. In extensive experiments, the proposed system achieved 97.2% precision and 95.7% recall in plagiarism detection.
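The three components can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the weighting scheme or the decision threshold, so the smoothed inverse-document-frequency weights and the `threshold=0.9` default below are assumptions, as are all function names. API extraction from a binary is also out of scope here; each program is assumed to be already reduced to a list of API names.

```python
import math
from collections import Counter


def api_frequencies(api_calls):
    """Step 1: count how often each API name appears in a program."""
    return Counter(api_calls)


def idf_weights(corpus_frequencies):
    """Assumed weighting (not from the paper): smoothed inverse document
    frequency, so APIs that appear in few programs count for more."""
    n = len(corpus_frequencies)
    doc_freq = Counter()
    for freqs in corpus_frequencies:
        doc_freq.update(freqs.keys())
    return {api: math.log((1 + n) / (1 + df)) + 1.0
            for api, df in doc_freq.items()}


def birthmark(freqs, weights):
    """Step 2: weighted API-frequency vector, stored as a sparse dict."""
    return {api: count * weights.get(api, 1.0)
            for api, count in freqs.items()}


def cosine_similarity(a, b):
    """Step 3: cosine of the angle between two sparse vectors."""
    dot = sum(value * b.get(api, 0.0) for api, value in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def is_plagiarized(a, b, threshold=0.9):
    """Hypothetical decision rule; the threshold value is an assumption."""
    return cosine_similarity(a, b) >= threshold
```

Storing birthmarks as sparse dictionaries keeps the comparison proportional to the number of distinct APIs a program actually uses, rather than to the size of the whole API vocabulary.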