Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Multiword Expressions: A Pain in the Neck for NLP
CICLing '02 Proceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing
Using the web to obtain frequencies for unseen bigrams
Computational Linguistics - Special issue on web as corpus
Corpus-based method for automatic identification of support verbs for nominalizations
EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
Automatic identification of non-compositional phrases
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Detecting a continuum of compositionality in phrasal verbs
MWE '03 Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, acquisition and treatment - Volume 18
Statistical measures of the semi-productivity of light verb constructions
MWE '04 Proceedings of the Workshop on Multiword Expressions: Integrating Processing
Hi-index | 0.00 |
We develop statistical measures for assessing the acceptability of a frequent class of multiword expressions. We also use the measures to estimate the degree of productivity of the expressions over semantically related nouns. We show that a linguistically-inspired measure outperforms a standard measure of collocation in its match with human judgments. The measure uses simple extraction techniques over non-marked-up web data.