Copy detection mechanisms for digital documents
SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Methods for identifying versioned and plagiarized documents
Journal of the American Society for Information Science and Technology
Authorship verification as a one-class classification problem
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Addressing plagiarism and IPR violation
Information Services and Use - APE 2007
Reducing the Plagiarism Detection Search Space on the Basis of the Kullback-Leibler Distance
CICLing '09 Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing
Multilayer SOM with tree-structured data for efficient document retrieval and plagiarism detection
IEEE Transactions on Neural Networks
Finding inner copy communities using social network analysis
KES'10 Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part II
Improving understandability of semantic search explanations
International Journal of Knowledge Engineering and Data Mining
An evolutionary neural network approach to intrinsic plagiarism detection
AICS'09 Proceedings of the 20th Irish conference on Artificial intelligence and cognitive science
Constructing understandable explanations for semantic search results
EKAW'10 Proceedings of the 17th international conference on Knowledge engineering and management by the masses
Language Resources and Evaluation
Cross-language plagiarism detection
Language Resources and Evaluation
High performance technique for database applications using a hybrid GPU/CPU platform
Proceedings of the 21st edition of the great lakes symposium on Great lakes symposium on VLSI
Unsupervised decomposition of a document into authorial components
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Proceedings of the 11th ACM symposium on Document engineering
Progress in information retrieval
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Finding and exploring memes in social media
Proceedings of the 23rd ACM conference on Hypertext and social media
Expert Systems with Applications: An International Journal
Research on intrinsic plagiarism detection resolution: a supervised learning approach
CLSW'12 Proceedings of the 13th Chinese conference on Chinese Lexical Semantics
Hi-index | 0.00 |
Current research in the field of automatic plagiarism detection for text documents focuses on algorithms that compare plagiarized documents against potential original documents. Though these approaches perform well in identifying copied or even modified passages, they assume a closed world: a reference collection must be given against which a plagiarized document can be compared. This raises the question whether plagiarized passages within a document can be detected automatically if no reference is given, e. g. if the plagiarized passages stem from a book that is not available in digital form. We call this problem class intrinsic plagiarism detection. The paper is devoted to this problem class; it shows that it is possible to identify potentially plagiarized passages by analyzing a single document with respect to variations in writing style. Our contributions are fourfold: (i) a taxonomy of plagiarism delicts along with detection methods, (ii) new features for the quantification of style aspects, (iii) a publicly available plagiarism corpus for benchmark comparisons, and (iv) promising results in non-trivial plagiarism detection settings: in our experiments we achieved recall values of 85% with a precision of 75% and better.