The plagiarism detection by compression method

  • Authors:
  • Daniela Chudá; Martin Uhlík

  • Affiliations:
  • Slovak University of Technology in Bratislava, Slovak Republic; Slovak University of Technology in Bratislava, Slovak Republic

  • Venue:
  • Proceedings of the 12th International Conference on Computer Systems and Technologies
  • Year:
  • 2011


Abstract

Existing tools for detecting plagiarized text do not support texts written in Slovak. Our work on plagiarism detection therefore also focuses on support for Slovak texts. The aim of this paper is to introduce our own method for detecting plagiarism. The paper explains the principle of the dictionary-based data compression method known as Lempel-Ziv, whose idea of building a dictionary serves as the basis for our proposed method for detecting plagiarism in texts. The self-designed dictionary method was implemented in our own version of the tool. In conclusion, we present the results achieved and compare them with the results of the analyzed tools on a sample of Slovak texts.
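
The abstract does not spell out the method, so the following is only a minimal sketch of how a Lempel-Ziv style dictionary could be used to compare two texts; it is not the authors' implementation. It assumes LZ78-style parsing at character level. The function names `lz78_phrases` and `dictionary_similarity` and the overlap score are hypothetical and chosen purely for illustration.

```python
def lz78_phrases(text):
    """Build an LZ78-style phrase dictionary for `text`.

    Each new phrase is the shortest prefix of the remaining input
    that is not yet in the dictionary.
    """
    dictionary = set()
    phrase = ""
    for ch in text:
        phrase += ch
        if phrase not in dictionary:
            dictionary.add(phrase)
            phrase = ""
    # A leftover phrase at the end is already in the dictionary,
    # so it contributes nothing new.
    return dictionary


def dictionary_similarity(suspect, source):
    """Hypothetical score (an assumption, not from the paper):
    the fraction of the suspect text's phrases that also occur
    in the source text's dictionary, in the range 0.0 to 1.0.
    """
    suspect_phrases = lz78_phrases(suspect)
    if not suspect_phrases:
        return 0.0
    source_phrases = lz78_phrases(source)
    return len(suspect_phrases & source_phrases) / len(suspect_phrases)
```

A score near 1.0 would suggest that most dictionary phrases of the suspect text also appear in the source; any threshold for flagging a document would have to be tuned on real data. A practical tool would probably normalize the text first and might parse words rather than characters.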