Why inverse document frequency?
NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Hi-index | 0.00 |
Tools to detect plagiarized text written in Slovak is not supported. For detection of plagiarized texts will also focus on support for Slovak texts. The aim of this paper is to introduce our own method for detect plagiarism. It explains the principle of dictionary method for data compression known as the Lempel-Ziv, which idea of creating the dictionary is used as the basis for our method proposal to detect plagiarism in texts. Self-designed dictionary method was implemented in our version of the tool. In conclusion, achievements are presented, which were compared with results of analyzed tools on a sample of the Slovak texts.