An anti-plagiarism editor for software development courses
ACE '05 Proceedings of the 7th Australasian conference on Computing education - Volume 42
A case study in efficient microcontroller education
ACM SIGBED Review - Special issue: The first workshop on embedded system education (WESE)
Information distance from a question to an answer
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the 2007 ACM symposium on Document engineering
Detecting outsourced student programming assignments
Journal of Computing Sciences in Colleges
Distance measures for biological sequences: Some recent approaches
International Journal of Approximate Reasoning
Design of a P2P infrastructure to support plagiarism detection mechanisms
CSTST '08 Proceedings of the 5th international conference on Soft computing as transdisciplinary science and technology
Information shared by many objects
Proceedings of the 17th ACM conference on Information and knowledge management
Towards a Normal Form for Mercury Programs
Logic-Based Program Synthesis and Transformation
Efficient overlap and content reuse detection in blogs and online news articles
Proceedings of the 18th international conference on World wide web
Reading beside the lines: Using indentation to rank revisions by complexity
Science of Computer Programming
New information distance measure and its application in question answering system
Journal of Computer Science and Technology
Software reuse and plagiarism: a code of practice
ITiCSE '09 Proceedings of the 14th annual ACM SIGCSE conference on Innovation and technology in computer science education
Forensic Authorship Attribution Using Compression Distances to Prototypes
IWCF '09 Proceedings of the 3rd International Workshop on Computational Forensics
Finding Similarities in Source Code Through Factorization
Electronic Notes in Theoretical Computer Science (ENTCS)
Structural Entropic Difference: A Bounded Distance Metric for Unordered Trees
SISAP '09 Proceedings of the 2009 Second International Workshop on Similarity Search and Applications
Towards the validation of plagiarism detection tools by means of grammar evolution
IEEE Transactions on Evolutionary Computation
Program plagiarism detection using parse tree Kernels
PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
Measuring the non-compositionality of multiword expressions
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Code analyzer for an online course management system
Journal of Systems and Software
A Fast Quartet tree heuristic for hierarchical clustering
Pattern Recognition
A bounded distance metric for comparing tree structure
Information Systems
Plagiarism detection among source codes using adaptive local alignment of keywords
Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication
An extended assessment of type-3 clones as detected by state-of-the-art tools
Software Quality Control
Reuse in the wild: an empirical and ethnographic study of organizational content reuse
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
A new multiword expression metric and its applications
Journal of Computer Science and Technology - Special issue on natural language processing
Studying software evolution using artefacts' shared information content
Science of Computer Programming
Plagiarism detection for Java: a tool comparison
Computer Science Education Research Conference
Information distance and its extensions
DS'11 Proceedings of the 14th international conference on Discovery science
A fast compression-based similarity measure with applications to content-based image retrieval
Journal of Visual Communication and Image Representation
Similarity of objects and the meaning of words
TAMC'06 Proceedings of the Third international conference on Theory and Applications of Models of Computation
Information distance and its applications
CIAA'06 Proceedings of the 11th international conference on Implementation and Application of Automata
Proceedings of the 50th Annual Southeast Regional Conference
Criticality of spatiotemporal dynamics in contact mediated pattern formation
IPCAT'12 Proceedings of the 9th international conference on Information Processing in Cells and Tissues
Impugning Randomness, Convincingly
Studia Logica
Classifying stem cell differentiation images by information distance
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I
Experiments with filtered detection of similar academic papers
AIMSA'12 Proceedings of the 15th international conference on Artificial Intelligence: methodology, systems, and applications
RAMC: runtime abstract memory context based plagiarism detection in binary code
Proceedings of the 7th International Conference on Ubiquitous Information Management and Communication
Information distance between what I said and what it heard
Communications of the ACM
An application for plagiarized source code detection based on a parse tree kernel
Engineering Applications of Artificial Intelligence
Similarity in languages and programs
Theoretical Computer Science
Automated Assessment of Programming Assignments
Proceedings of the 3rd Computer Science Education Research Conference on Computer Science Education Research
Pattern mining of cloned codes in software systems
Information Sciences: an International Journal
Hi-index | 754.85 |
A fundamental question in information theory and in computer science is how to measure similarity or the amount of shared information between two sequences. We have proposed a metric, based on Kolmogorov complexity, to answer this question and have proven it to be universal. We apply this metric in measuring the amount of shared information between two computer programs, to enable plagiarism detection. We have designed and implemented a practical system SID (Software Integrity Diagnosis system) that approximates this metric by a heuristic compression algorithm. Experimental results demonstrate that SID has clear advantages over other plagiarism detection systems. SID system server is online at http://software.bioinformatics.uwaterloo.ca/SID/.