PDE4Java: Plagiarism Detection Engine for Java source code: a clustering approach
International Journal of Business Intelligence and Data Mining
Clustering dense graphs: A web site graph paradigm
Information Processing and Management: an International Journal
Plagiarism detection for Java: a tool comparison
Computer Science Education Research Conference
Instructor-centric source code plagiarism detection and plagiarism corpus
Proceedings of the 17th ACM annual conference on Innovation and technology in computer science education
Hi-index | 0.00 |
Efficient detection of plagiarism in programming assignments of students is of a great importance to the educational procedure. This paper presents a clustering oriented approach for facing the problem of source code plagiarism. The implemented software, called PDetect, accepts as input a set of program sources and extracts subsets (the clusters of plagiarism) such that each program within a particular subset has been derived from the same original. PDetect proposes the use of an appropriate measure for evaluating plagiarism detection performance and supports the idea of combining different plagiarism detection schemes. Furthermore, a cluster analysis is performed in order to provide information beneficial to the plagiarism detection process. PDetect is designed such that it may be easily adapted over any keyword-based programming language and it is quite beneficial when compared with earlier (state-of-the-art) plagiarism detection approaches.