Simple word strings as compound keywords: an indexing and ranking method for Japanese texts

Authors:
Yasushi Ogawa;Ayako Bessho;Masako Hirose
Affiliations:
RICOH Co., Ltd., Yokohama, Japan;RICOH Co., Ltd., Yokohama, Japan;RICOH Co., Ltd., Yokohama, Japan
Venue:
SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Year:
1993

Citing 8
Cited 4

Automatic phrase indexing for document retrieval

SIGIR '87 Proceedings of the 10th annual international ACM SIGIR conference on Research and development in information retrieval
The constituent object parser: syntactic structure matching for information retrieval

SIGIR '89 Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
On the application of syntactic methodologies in automatic text analysis

SIGIR '89 Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
A fuzzy document retrieval system using the keyword connection matrix and a learning method

Fuzzy Sets and Systems - Special issue on applications of fuzzy systems theory, Iizuka '88
ECLAIR: an extensible class library for information retrieval

The Computer Journal - Special issue on information retrieval
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
A New Indexing and Text Ranking Method for Japanese Text Databases Using Simple-Word Compounds as Keywords

Proceedings of the 3rd International Conference on Database Systems for Advanced Applications (DASFAA)
An interactive Japanese parser for machine translation

COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 2

A new character-based indexing method using frequency data for Japanese documents

SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
On Chinese text retrieval

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Overlapping statistical word indexing: a new indexing method for Japanese text

Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
A probabilistic approach to compound noun indexing in Korean texts

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes a new indexing method for Japanese text databases using the simple keyword string, in which a compound word is treated as a string of simple words, which are the smallest units in Japanese grammar which still maintain their meanings. This method allows retrieved texts to be ranked, according to the similarity of their meaning to the query, without using a control vocabulary or thesaurus. This paper also introduces the keyword feature, which describes the syntactic and semantic characteristics of a word, and results in more precise keyword extraction and text retrieval as well as simple dictionary maintenance.