Multiword Expressions: A Pain in the Neck for NLP
CICLing '02 Proceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing
A Statistical Corpus-Based Term Extractor
AI '01 Proceedings of the 14th Biennial Conference of the Canadian Society on Computational Studies of Intelligence: Advances in Artificial Intelligence
Large scale collocation data and their application to Japanese word processor technology
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Morphological aspect of Japanese language processing
COLING '80 Proceedings of the 8th conference on Computational linguistics
Lexicon-grammar: the representation of compound words
COLING '86 Proceedings of the 11th coference on Computational linguistics
A disambiguation method for Japanese compound verbs
MWE '03 Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, acquisition and treatment - Volume 18
MWEs as non-propositional content indicators
MWE '04 Proceedings of the Workshop on Multiword Expressions: Integrating Processing
Frozen sentences of Portuguese: formal descriptions for NLP
MWE '04 Proceedings of the Workshop on Multiword Expressions: Integrating Processing
MWE '04 Proceedings of the Workshop on Multiword Expressions: Integrating Processing
A measure of syntactic flexibility for automatically identifying multiword expressions in corpora
MWE '07 Proceedings of the Workshop on a Broader Perspective on Multiword Expressions
Combining resources for MWE-token classification
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Hi-index | 0.00 |
It has been widely recognized that one of the most difficult and intriguing problems in natural language processing (NLP) is how to cope with idiosyncratic multiword expressions. This paper presents an overview of the comprehensive dictionary (JDMWE) of Japanese multiword expressions. The JDMWE is characterized by a large notational, syntactic, and semantic diversity of contained expressions as well as a detailed description of their syntactic functions, structures, and flexibilities. The dictionary contains about 104,000 expressions, potentially 750,000 expressions. This paper shows that the JDMWE's validity can be supported by comparing the dictionary with a large-scale Japanese N-gram frequency dataset, namely the LDC2009T08, generated by Google Inc. (Kudo et al. 2009).