TnT: a statistical part-of-speech tagger
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
A study of the influence of pos tagging on WSD
TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue
Hi-index | 0.00 |
Modern part-of-speech (POS) tagging tools can provide high quality markup for grammatically correct documents, but ungrammatical sentences can be challenging for them. In the present paper we study the problem of POS-tagging for the texts that contain grammatical errors, and show how POS-taggers can be improved for the use in this context. Specifically, we propose to include ungrammatical POS-tagged sentences into the text corpus used to train a tagger (presumably, a tagger is based on a certain variation of machine learning).