Multiword Expressions: A Pain in the Neck for NLP
CICLing '02 Proceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing
Retrieving collocations from text: Xtract
Computational Linguistics - Special issue on using large corpora: I
Using small random samples for the manual evaluation of statistical association measures
Computer Speech and Language
Web-based and combined language models: a case study on noun compound identification
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
A hybrid approach for multiword expression identification
PROPOR'10 Proceedings of the 9th international conference on Computational Processing of the Portuguese Language
Identifying and analyzing Brazilian Portuguese complex predicates
MWE '11 Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World
Identification and treatment of multiword expressions applied to information retrieval
MWE '11 Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World
Detecting noun compounds and light verb constructions: a contrastive study
MWE '11 Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World
jMWE: a Java toolkit for detecting multi-word expressions
MWE '11 Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World
Fast and flexible MWE candidate generation with the mwetoolkit
MWE '11 Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World
A broad evaluation of techniques for automatic acquisition of multiword expressions
ACL '12 Proceedings of ACL 2012 Student Research Workshop
A generic framework for multiword expressions treatment: from acquisition to applications
ACL '12 Proceedings of ACL 2012 Student Research Workshop
Learning to detect english and hungarian light verb constructions
ACM Transactions on Speech and Language Processing (TSLP) - Special issue on multiword expressions: From theory to practice and use, part 1
Hi-index | 0.00 |
The mwetoolkit is a tool for automatic extraction of Multiword Expressions (MWEs) from monolingual corpora. It both generates and validates MWE candidates. The generation is based on surface forms, while for the validation, a series of criteria for removing noise are provided, such as some (language independent) association measures. In this paper, we present the use of the mwetoolkit in a standard configuration, for extracting MWEs from a corpus of general-purpose English. The functionalities of the toolkit are discussed in terms of a set of selected examples, comparing it with related work on MWE extraction.