A source code linearization technique for detecting plagiarized programs

  • Authors:
  • Jeong-Hoon Ji; Gyun Woo; Hwan-Gue Cho

  • Affiliations:
  • Pusan National University; Pusan National University; Pusan National University

  • Venue:
  • Proceedings of the 12th annual SIGCSE conference on Innovation and technology in computer science education
  • Year:
  • 2007

Abstract

Detecting plagiarized programs is an important problem in computer science education, and many tools and algorithms have been developed for this purpose. These tools generally operate in two phases. In phase 1, a plagiarism detection tool generates an intermediate representation from a given set of programs. The intermediate representation should reflect the structural characteristics of each program; most tools use a parse tree or a token sequence as the intermediate representation. In phase 2, the tool searches for plagiarized material and evaluates the similarity of each pair of programs. It is also helpful to report the plagiarized material found between two programs to the instructor. In this paper, we present a static tracing method to improve the accuracy of program plagiarism detection. The static tracing method statically executes a program at the syntax level and then extracts predefined keywords in the order in which the functions are executed. Experimental results show that this method detects plagiarism more effectively than previously released plagiarism detection methods.
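
The idea of static tracing can be illustrated with a minimal Python sketch. It is not the authors' implementation: the keyword set, the regex tokenizer, the toy C-like programs, and the helper names `tokenize` and `linearize` are all assumptions made for illustration, and the similarity step uses Python's `difflib.SequenceMatcher` as a stand-in for whatever comparison the paper actually uses. The sketch walks a program from `main`, expands each called function in place, and emits keywords in the order they are reached.

```python
import re
from difflib import SequenceMatcher

# Hypothetical keyword set; the abstract does not list the predefined keywords.
KEYWORDS = {"if", "else", "while", "for", "return"}


def tokenize(body):
    """Split a C-like function body into identifier and keyword tokens."""
    return re.findall(r"[A-Za-z_]\w*", body)


def linearize(functions, entry="main", _stack=()):
    """Emit keywords in the order a syntax-level walk from `entry` reaches them.

    `functions` maps a function name to its token list. A token that names
    another function is expanded in place; functions already on the call
    stack are skipped so that recursion cannot loop forever.
    """
    stack = _stack + (entry,)
    trace = []
    for tok in functions[entry]:
        if tok in KEYWORDS:
            trace.append(tok)
        elif tok in functions and tok not in stack:
            trace.extend(linearize(functions, tok, stack))
    return trace


# Two toy programs: prog_b renames and reorders the helper functions of prog_a.
prog_a = {
    "main": tokenize("init(); while (x) { step(); } return 0;"),
    "init": tokenize("if (a) b = 1; else b = 2;"),
    "step": tokenize("for (;;) { if (c) return; }"),
}
prog_b = {
    "advance": tokenize("for (;;) { if (k) return; }"),
    "setup": tokenize("if (p) q = 1; else q = 2;"),
    "main": tokenize("setup(); while (y) { advance(); } return 0;"),
}

trace_a = linearize(prog_a)
trace_b = linearize(prog_b)

# Stand-in similarity measure (an assumption, not the paper's metric).
similarity = SequenceMatcher(None, trace_a, trace_b).ratio()
print(trace_a)
print(similarity)
```

Because the trace follows call order rather than the order of definitions in the file, reordering or renaming functions leaves the linearized sequence unchanged in this toy example, so the two programs compare as identical.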