In this paper, we propose an annotation schema for the discourse analysis of Wikipedia Talk pages, focusing on the coordination efforts aimed at article improvement. We apply the schema to a corpus of 100 Talk pages from the Simple English Wikipedia and make the resulting dataset freely available for download. Furthermore, we perform automatic dialog act classification on Wikipedia discussions and achieve an average F1-score of 0.82 with our classification pipeline.