Machine Learning
Machine Learning
Mopping up: modeling wikipedia promotion decisions
Proceedings of the 2008 ACM conference on Computer supported cooperative work
Is Wikipedia growing a longer tail?
Proceedings of the ACM 2009 international conference on Supporting group work
The singularity is not near: slowing growth of Wikipedia
Proceedings of the 5th International Symposium on Wikis and Open Collaboration
The WEKA data mining software: an update
ACM SIGKDD Explorations Newsletter
Natural Language Processing with Python
Natural Language Processing with Python
Detecting Wikipedia vandalism via spatio-temporal analysis of revision metadata?
Proceedings of the Third European Workshop on System Security
Automatic vandalism detection in Wikipedia
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
The effects of group composition on decision quality in a social production community
Proceedings of the 16th ACM international conference on Supporting group work
Beyond Notability. Collective Deliberation on Content Inclusion in Wikipedia
SASOW '10 Proceedings of the 2010 Fourth IEEE International Conference on Self-Adaptive and Self-Organizing Systems Workshop
Wikipedia vandalism detection: combining natural language, metadata, and reputation features
CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
What Wikipedia deletes: characterizing dangerous collaborative content
Proceedings of the 7th International Symposium on Wikis and Open Collaboration
Don't bite the newbies: how reverts affect the quantity and quality of Wikipedia work
Proceedings of the 7th International Symposium on Wikis and Open Collaboration
Participation in Wikipedia's article deletion processes
Proceedings of the 7th International Symposium on Wikis and Open Collaboration
Predicting quality flaws in user-generated content: the case of wikipedia
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
"Writing up rather than writing down": becoming Wikipedia literate
Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration
Deletion discussions in Wikipedia: decision factors and outcomes
Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration
Hi-index | 0.00 |
Wikipedia's low barriers to participation have the unintended effect of attracting a large number of articles whose topics do not meet Wikipedia's inclusion standards. Many are quickly deleted, often causing their creators to stop contributing to the site. We collect and make available several datasets of deleted articles, heretofore inaccessible, and use them to create a model that can predict with high precision whether or not an article will be deleted. We report precision of 98.6% and recall of 97.5% in the best case and high precision with lower, but still useful, recall, in the most difficult case. We propose to deploy a system utilizing this model on Wikipedia as a set of decision-support tools to help article creators evaluate and improve their articles before posting, and new article patrollers make more informed decisions about which articles to delete and which to improve.