Comparative evaluation of text- and citation-based plagiarism detection approaches using GuttenPlag

  • Authors:
  • Bela Gipp; Norman Meuschke; Joeran Beel

  • Affiliations:
  • UC Berkeley, California, USA / OVGU Magdeburg, Germany, Berkeley, CA, USA; OVGU Magdeburg, Germany, Magdeburg, Germany; OVGU Magdeburg, Germany / UC Berkeley, California, USA, Magdeburg, Germany

  • Venue:
  • Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
  • Year:
  • 2011

Abstract

Various approaches for plagiarism detection exist. All are based on more or less sophisticated text analysis methods such as string matching, fingerprinting or style comparison. In this paper, a new approach called Citation-based Plagiarism Detection is evaluated using a doctoral thesis in which a volunteer crowd-sourcing project called GuttenPlag identified substantial amounts of plagiarism through careful manual inspection. This new approach identifies similar and plagiarized documents based on the citations used in the text. It is shown that citation-based plagiarism detection performs significantly better than text-based procedures in identifying strong paraphrasing, translation and some idea plagiarism. Detection rates can be improved by combining citation-based with text-based plagiarism detection.
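
The idea of comparing documents by the citations they use, rather than by their wording, can be illustrated with a small sketch. The abstract does not specify which citation measures the paper applies, so the following is only an assumption for illustration: it scores two documents by the longest common subsequence of their in-text citation sequences. The function names, the normalization by the shorter sequence, and the citation keys are all hypothetical.

```python
def longest_common_citation_sequence(a, b):
    """Length of the longest common subsequence of two citation lists.

    Illustrative measure only; not taken from the paper's abstract.
    """
    # Classic dynamic-programming LCS, keeping one previous row at a time.
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def citation_overlap_score(a, b):
    """Shared citation order, normalized by the shorter sequence (assumed)."""
    shorter = min(len(a), len(b))
    if shorter == 0:
        return 0.0
    return longest_common_citation_sequence(a, b) / shorter


# Hypothetical citation sequences, in order of appearance in each text.
source = ["Smith2001", "Lee1999", "Kim2005", "Diaz2003"]
suspect = ["Smith2001", "Jones2010", "Lee1999", "Kim2005"]
unrelated = ["Wong2008", "Patel2002"]

print(citation_overlap_score(source, suspect))
print(citation_overlap_score(source, unrelated))
```

Because the score depends only on which sources are cited and in what order, it is unaffected by rewording or translation of the surrounding text, which is consistent with the abstract's claim about paraphrased and translated passages.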