A method of geographical name extraction from Japanese text for thematic geographical search

Authors:
Yasusi Kanada
Affiliations:
Central Research Laboratory, Hitachi Ltd., Higashi-Koigakubo 1-280, Kokubunji, Tokyo 185, Japan
Venue:
Proceedings of the eighth international conference on Information and knowledge management
Year:
1999

Citing 1
Cited 7

Axis-specified search: a fine-grained full-text search method for gathering and structuring excerpts

Proceedings of the third ACM conference on Digital libraries

Disambiguating Geographic Names in a Historical Digital Library

ECDL '01 Proceedings of the 5th European Conference on Research and Advanced Technology for Digital Libraries
Bootstrapping toponym classifiers

HLT-NAACL-GEOREF '03 Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references - Volume 1
Toward tighter integration of web search with a geographic information system

Proceedings of the 15th international conference on World Wide Web
BEIRA: An Area-based User Interface for Map Services

World Wide Web
Extracting Geographic Context from the Web: GeoReferencing in MyMoSe

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
BEIRA: a geo-semantic clustering method for area summary

WISE'07 Proceedings of the 8th international conference on Web information systems engineering
A metadata geoparsing system for place name recognition and resolution in metadata records

Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries

Quantified Score

Hi-index	0.00

Visualization

Abstract

A text retrieval method called the thematic geographical search method has been developed and applied to a Japanese encyclopedia called the World Encyclopædia. In this method, the user specifies a search theme using free words, then obtains a sorted list of excerpts and hyperlinks to encyclopedia sentences that contain geographical names. Using this list, the user can also open maps that indicate the locations of the names. To generate an index of names for this searching, a method of extracting geographical names has been developed. In this method, geographical names are extracted, matched to names in a geographical name database, and identified. Geographical names, however, often have several types of ambiguities. Ambiguities are resolved by using non-local context analysis, which uses a stack and several other techniques. As a result, the precision of extracted names is more than 96% on average. This method depends on features of the Japanese language, but the strategy and most of the techniques can be applied to texts in English or other languages.