Outlier-based approaches for intrinsic and external plagiarism detection

  • Authors:
  • Gabriel Oberreuter;Gaston L'Huillier;Sebastián A. Ríos;Juan D. Velásquez

  • Affiliations:
  • Department of Industrial Engineering, University of Chile;Department of Industrial Engineering, University of Chile;Department of Industrial Engineering, University of Chile;Department of Industrial Engineering, University of Chile

  • Venue:
  • KES'11 Proceedings of the 15th international conference on Knowledge-based and intelligent information and engineering systems - Volume Part II
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Plagiarism detection, one of the main problems that educational institutions have been dealing with since the massification of Internet, can be considered as a classification problem using both self-based information and text processing algorithms whose computational complexity is intractable without using space search reduction algorithms. First, self-based information algorithms treat plagiarism detection as an outlier detection problem for which the classifier must decide plagiarism using only the text in a given document. Then, external plagiarism detection uses text matching algorithms where it is fundamental to reduce the matching space with text search space reduction techniques, which can be represented as another outlier detection problem. The main contribution of this work is the inclusion of text outlier detection methodologies to enhance both intrinsic and external plagiarism detection. Results shows that our approach is highly competitive with respect to the leading research teams in plagiarism detection.