Some advances in transformation-based part of speech tagging
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
ICTAI '02 Proceedings of the 14th IEEE International Conference on Tools with Artificial Intelligence
Towards a syntactic account of punctuation
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
Introduction to the CoNLL-2000 shared task: chunking
ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Introduction to the CoNLL-2001 shared task: clause identification
ConLL '01 Proceedings of the 2001 workshop on Computational Natural Language Learning - Volume 7
Design and development of a system for the detection of agreement errors in basque
CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Natural language generation for sponsored-search advertisements
Proceedings of the 9th ACM conference on Electronic commerce
Correcting comma errors in learner essays, and restoring commas in newswire text
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Hi-index | 0.00 |
In this paper, we describe the research using machine learning techniques to build a comma checker to be integrated in a grammar checker for Basque. After several experiments, and trained with a little corpus of 100,000 words, the system guesses correctly not placing commas with a precision of 96% and a recall of 98%. It also gets a precision of 70% and a recall of 49% in the task of placing commas. Finally, we have shown that these results can be improved using a bigger and a more homogeneous corpus to train, that is, a bigger corpus written by one unique author.