Citation based plagiarism detection: a new approach to identify plagiarized work language independently

  • Authors:
  • Bela Gipp;Jöran Beel

  • Affiliations:
  • University of California, Berkeley & OvGU, Berkeley, USA;University of California, Berkeley & OvGU, Berkeley, USA

  • Venue:
  • Proceedings of the 21st ACM conference on Hypertext and hypermedia
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a new approach towards detecting plagiarism and scientific documents that have been read but not cited. In contrast to existing approaches, which analyze documents' words but ignore their citations, this approach is based on citation analysis and allows duplicate and plagiarism detection even if a document has been paraphrased or translated, since the relative position of citations remains similar. Although this approach allows in many cases the detection of plagiarized work that could not be detected automatically with the traditional approaches, it should be considered as an extension rather than a substitute. Whereas the known text analysis methods can detect copied or, to a certain degree, modified passages, the proposed approach requires longer passages with at least two citations in order to create a digital fingerprint.