Retrieving collocations by co-occurrences and word order constraints

  • Authors:
  • Sayori Shimohata;Toshiyuki Sugio;Junji Nagata

  • Affiliations:
  • Oki Electric Industry Co., Ltd., Shiromi, Chuo-ku, Osaka, Japan;Oki Electric Industry Co., Ltd., Shiromi, Chuo-ku, Osaka, Japan;Oki Electric Industry Co., Ltd., Shiromi, Chuo-ku, Osaka, Japan

  • Venue:
  • ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
  • Year:
  • 1997

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we describe a method for automatically retrieving collocations from large text corpora. This method retrieve collocations in the following stages: 1) extracting strings of characters as units of collocations 2) extracting recurrent combinations of strings in accordance with their word order in a corpus as collocations. Through the method, various range of collocations, especially domain specific collocations, are retrieved. The method is practical because it uses plain texts without any information dependent on a language such as lexical knowledge and parts of speech.