A comprehensive dictionary of multiword expressions

  • Authors:
  • Kosho Shudo; Akira Kurahone; Toshifumi Tanabe

  • Affiliations:
  • Fukuoka University, Nanakuma, Jonan-ku, Fukuoka, Japan; TechTran Ltd., Ikebukuro, Naka-ku, Yokohama, Japan; Fukuoka University, Nanakuma, Jonan-ku, Fukuoka, Japan

  • Venue:
  • HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
  • Year:
  • 2011

Abstract

It is widely recognized that one of the most difficult and intriguing problems in natural language processing (NLP) is how to handle idiosyncratic multiword expressions. This paper presents an overview of JDMWE, a comprehensive dictionary of Japanese multiword expressions. The JDMWE is characterized by the wide notational, syntactic, and semantic diversity of the expressions it contains, as well as by detailed descriptions of their syntactic functions, structures, and flexibilities. The dictionary contains about 104,000 expressions, and potentially as many as 750,000. The paper supports the JDMWE's validity by comparing the dictionary with a large-scale Japanese N-gram frequency dataset, LDC2009T08, generated by Google Inc. (Kudo et al. 2009).
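As a rough illustration of what comparing a dictionary with an N-gram frequency dataset can look like, the sketch below computes the share of dictionary entries that are attested in an N-gram count table. It is not the procedure used in the paper: the function name `coverage`, the `min_count` threshold, the tuple-of-tokens representation, and the toy counts are all assumptions made for this example.

```python
from typing import Dict, Iterable, Tuple

Ngram = Tuple[str, ...]


def coverage(entries: Iterable[Ngram],
             ngram_counts: Dict[Ngram, int],
             min_count: int = 1) -> float:
    """Fraction of dictionary entries seen at least `min_count` times.

    `entries` are token tuples from a (hypothetical) MWE dictionary and
    `ngram_counts` maps token tuples to corpus frequencies.
    """
    entry_list = list(entries)
    if not entry_list:
        return 0.0
    attested = sum(
        1 for entry in entry_list
        if ngram_counts.get(entry, 0) >= min_count
    )
    return attested / len(entry_list)


# Toy data: the counts are invented, not taken from LDC2009T08.
toy_entries = [
    ("腹", "が", "立つ"),
    ("気", "に", "する"),
    ("目", "を", "通す"),
]
toy_ngrams = {
    ("腹", "が", "立つ"): 120,
    ("気", "に", "する"): 45,
    ("今日", "は"): 300,
}

toy_coverage = coverage(toy_entries, toy_ngrams)
strict_coverage = coverage(toy_entries, toy_ngrams, min_count=100)
```

Raising `min_count` restricts the check to entries that are frequent in the corpus, which is one simple way to separate well-attested expressions from rare ones.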