Creating, destroying, and restoring value in wikipedia
Proceedings of the 2007 international ACM conference on Supporting group work
Using dynamic markov compression to detect vandalism in the wikipedia
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
The work of sustaining order in wikipedia: the banning of a vandal
Proceedings of the 2010 ACM conference on Computer supported cooperative work
Automatic vandalism detection in Wikipedia
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Proceedings of the 20th international conference companion on World wide web
Proceedings of the 20th international conference on World wide web
Wikipedia vandalism detection: combining natural language, metadata, and reputation features
CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
Language of vandalism: improving Wikipedia vandalism detection via stylometric analysis
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Proceedings of the 7th International Symposium on Wikis and Open Collaboration
Autonomous link spam detection in purely collaborative environments
Proceedings of the 7th International Symposium on Wikis and Open Collaboration
Automatic vandalism detection in wikipedia with active associative classification
TPDL'12 Proceedings of the Second international conference on Theory and Practice of Digital Libraries
Automatic vandalism detection in wikipedia with active associative classification
TPDL'12 Proceedings of the Second international conference on Theory and Practice of Digital Libraries
WHAD: Wikipedia historical attributes data
Language Resources and Evaluation
Hi-index | 0.00 |
We report on the construction of the PAN Wikipedia vandalism corpus, PAN-WVC-10, using Amazon's Mechanical Turk. The corpus compiles 32452 edits on 28468 Wikipedia articles, among which 2391 vandalism edits have been identified. 753 human annotators cast a total of 193022 votes on the edits, so that each edit was reviewed by at least 3 annotators, whereas the achieved level of agreement was analyzed in order to label an edit as "regular" or "vandalism." The corpus is available free of charge.