Simple word strings as compound keywords: an indexing and ranking method for Japanese texts

  • Authors:
  • Yasushi Ogawa;Ayako Bessho;Masako Hirose

  • Affiliations:
  • RICOH Co., Ltd., Yokohama, Japan;RICOH Co., Ltd., Yokohama, Japan;RICOH Co., Ltd., Yokohama, Japan

  • Venue:
  • SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
  • Year:
  • 1993

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a new indexing method for Japanese text databases using the simple keyword string, in which a compound word is treated as a string of simple words, which are the smallest units in Japanese grammar which still maintain their meanings. This method allows retrieved texts to be ranked, according to the similarity of their meaning to the query, without using a control vocabulary or thesaurus. This paper also introduces the keyword feature, which describes the syntactic and semantic characteristics of a word, and results in more precise keyword extraction and text retrieval as well as simple dictionary maintenance.