Gazetteers, or entity dictionaries, are important knowledge resources for solving a wide range of NLP problems, such as entity extraction. We introduce a novel method that automatically generates gazetteers from seed lists using an external knowledge resource, Wikipedia. Unlike previous methods, ours exploits the rich content and various structural elements of Wikipedia and does not rely on language- or domain-specific knowledge. Furthermore, when we applied the extended gazetteers to an entity extraction task in a scientific domain, we observed a significant improvement in system accuracy compared with systems using the seed gazetteers.
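
The abstract does not spell out the expansion procedure, so the following is only an illustrative sketch of the general idea of growing a gazetteer from a seed list by using one structural element of an encyclopedia, here category membership. It is not the authors' algorithm. The toy mapping `TOY_CATEGORIES`, the function `expand_gazetteer`, and the `min_seed_support` threshold are all hypothetical names and values introduced for illustration; a real system would read category data from Wikipedia rather than from a hard-coded dictionary.

```python
from collections import Counter

# Hypothetical toy data standing in for Wikipedia: article title -> categories.
TOY_CATEGORIES = {
    "Python": {"Programming languages", "Scripting languages"},
    "Ruby": {"Programming languages", "Scripting languages"},
    "Haskell": {"Programming languages", "Functional languages"},
    "Perl": {"Programming languages", "Scripting languages"},
    "Paris": {"Capitals in Europe"},
    "Berlin": {"Capitals in Europe"},
}


def expand_gazetteer(seeds, categories, min_seed_support=2):
    """Return the seeds plus every title that shares a trusted category.

    A category is trusted (an assumption of this sketch) when at least
    `min_seed_support` seed entries belong to it.
    """
    # Count how many seeds fall into each category.
    support = Counter()
    for seed in seeds:
        for category in categories.get(seed, ()):
            support[category] += 1

    trusted = {c for c, n in support.items() if n >= min_seed_support}

    # Add every title that belongs to at least one trusted category.
    expanded = set(seeds)
    for title, title_categories in categories.items():
        if title_categories & trusted:
            expanded.add(title)
    return expanded


seeds = {"Python", "Ruby"}
extended = expand_gazetteer(seeds, TOY_CATEGORIES)
print(sorted(extended))
```

Requiring support from more than one seed is one simple way to keep a single ambiguous seed from pulling in an unrelated category; raising `min_seed_support` makes the expansion more conservative.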