A maximum entropy approach to natural language processing
Computational Linguistics
A maximum entropy approach to named entity recognition
A maximum entropy approach to named entity recognition
ACM Transactions on Asian Language Information Processing (TALIP)
Syntax-based semi-supervised named entity tagging
ACLdemo '05 Proceedings of the ACL 2005 on Interactive poster and demonstration sessions
Hi-index | 0.00 |
Scarcity of annotated data is a challenge in building high performance named entity recognition (NER) systems in resource poor languages. We use a semi-supervised approach which uses a small annotated corpus and a large raw corpus for the Hindi NER task using maximum entropy classifier. A novel statistical annotation confidence measure is proposed for the purpose. The confidence measure is used in selective sampling based semi-supervised NER. Also a prior modulation of maximum entropy classifier is used where the annotation confidence values are used as `prior weight'. The superiority of the proposed technique over baseline classifier is demonstrated extensively through experiments.