In this paper, we outline the research method of our ongoing work on plagiarism detection for Indonesian texts. The method must address three problems: handling the equivalence classes of Indonesian tokens, selecting the source documents to be compared, and minimizing the gap between the similarity measures used in the selection module and in the comparison module. To this end, we propose a novel document representation for the candidate document retrieval module and a hybrid of segmentation and hash-based similarity techniques for the comparison module.
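
As a rough illustration of what a comparison step combining segmentation with hashed similarity might look like, the following Python sketch splits each document into fixed-size token segments, hashes the word k-grams of each segment, and scores a suspicious segment by its best Jaccard overlap with any source segment. It is not the method proposed in the paper: the function names, the segment size, the k-gram length, the use of SHA-256, and the Jaccard measure are all assumptions made for this example, and the tokenizer does not model Indonesian equivalence classes.

```python
import hashlib
import re


def tokenize(text):
    # Assumption: lowercase alphabetic tokens only. A real system would also
    # map Indonesian word variants to one equivalence class (e.g. by stemming).
    return re.findall(r"[a-z]+", text.lower())


def segments(tokens, size=8):
    # Cut the token stream into fixed-size, non-overlapping segments.
    return [tokens[i:i + size] for i in range(0, len(tokens), size)]


def shingle_hashes(tokens, k=3):
    # Hash every k-token window; a segment shorter than k yields one window.
    if len(tokens) < k:
        windows = [tokens] if tokens else []
    else:
        windows = [tokens[i:i + k] for i in range(len(tokens) - k + 1)]
    return {
        hashlib.sha256(" ".join(w).encode("utf-8")).hexdigest()
        for w in windows
    }


def jaccard(a, b):
    # Set overlap in [0, 1]; two empty sets are treated as dissimilar.
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def best_segment_matches(suspicious, source, size=8, k=3):
    # For each suspicious segment, return its best score against any
    # source segment (0.0 when the source has no segments).
    source_sets = [shingle_hashes(s, k) for s in segments(tokenize(source), size)]
    scores = []
    for seg in segments(tokenize(suspicious), size):
        hashes = shingle_hashes(seg, k)
        scores.append(max((jaccard(hashes, s) for s in source_sets), default=0.0))
    return scores
```

Calling `best_segment_matches(suspicious_text, source_text)` returns one score per suspicious segment, so high-scoring segments can be flagged individually instead of judging the document as a whole.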