Learning dictionaries for information extraction by multi-level bootstrapping
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Extracting Patterns and Relations from the World Wide Web
WebDB '98 Selected papers from the International Workshop on The World Wide Web and Databases
Named Entity recognition without gazetteers
EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Using the web to overcome data sparseness
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Disambiguating toponyms in news
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Extracting geographic features from the Internet to automatically build detailed regional gazetteers
International Journal of Geographical Information Science
Unsupervised named-entity extraction from the Web: An experimental study
Artificial Intelligence
An agenda for the next generation gazetteer: geographic information contribution and retrieval
Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Bottom-Up Gazetteers: Learning from the Implicit Semantics of Geotags
GeoS '09 Proceedings of the 3rd International Conference on GeoSpatial Semantics
Pattern-based extraction of addresses from web page content
APWeb'08 Proceedings of the 10th Asia-Pacific web conference on Progress in WWW research and development
Multi-source toponym data integration and mediation for a meta-gazetteer service
GIScience'10 Proceedings of the 6th international conference on Geographic information science
Toward traffic-driven location-based web search
Proceedings of the 20th ACM international conference on Information and knowledge management
Heuristic methods for reducing errors of geographic named entities learned by bootstrapping
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
A bootstrapping approach for geographic named entity annotation
AIRS'04 Proceedings of the 2004 international conference on Asian Information Retrieval Technology
Expert Systems with Applications: An International Journal
Decision making aid in mobile environment by behavioral characteristic
Proceedings of the 13th International Conference on Electronic Commerce
Hi-index | 0.00 |
In this paper we present an approach to the acquisition of geographical gazetteers. Instead of creating these resources manually, we propose to extract gazetteers from the World Wide Web, using Data Mining techniques.The bootstrapping approach, investigated in our study, allows us to create new gazetteers using only a small seed dataset (1260 words). In addition to gazetteers, the system produces classifiers. They can be used online to determine a class (CITY, ISLAND, RIVER, MOUNTAIN, REGION, COUNTRY) of any geographical name. Our classifiers perform with the average accuracy of 86.5%.