Japanese named entity recognition based on a simple rule generator and decision tree learning
ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
A bootstrapping approach to named entity classification using successive learners
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
A bootstrapping method for learning semantic lexicons using extraction pattern contexts
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Use of support vector machines in extended named entity recognition
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Syntax-based semi-supervised named entity tagging
ACLdemo '05 Proceedings of the ACL 2005 on Interactive poster and demonstration sessions
VN-KIM IE: automatic extraction of Vietnamese named-entities on the web
New Generation Computing
A simple semi-supervised algorithm for named entity recognition
SemiSupLearn '09 Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing
Introduction to Semi-Supervised Learning
Introduction to Semi-Supervised Learning
Named entity recognition for Vietnamese
ACIIDS'10 Proceedings of the Second international conference on Intelligent information and database systems: Part II
PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
A bootstrapping approach for training a NER with conditional random fields
EPIA'11 Proceedings of the 15th Portugese conference on Progress in artificial intelligence
VNLP: an open source framework for Vietnamese natural language processing
Proceedings of the Fourth Symposium on Information and Communication Technology
Hi-index | 0.00 |
Requiring a large hand-annotated corpus in supervised learning of contemporary Vietnamese Named Entity Recognition researches is challenging. We therefore propose a hybrid approach of pattern extraction and semi-supervised learning. Applied rule-based method helps generating patterns automatically. Part-of-speech tagger, lexical diversity and chunking are explored to define rules in pattern extractions which are used for identifying potential named entities. Semi-supervised learning trains a small amount of seed named entities to categorize named entities in extracted patterns. In experiments, our approach shows good increasing the system accuracy with others in Vietnamese.