Estimating lexical priors for low-frequency morphologically ambiguous forms
Computational Linguistics
Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Monolingual Document Retrieval for European Languages
Information Retrieval
Unsupervised learning of the morphology of a natural language
Computational Linguistics
Applied morphological processing of English
Natural Language Engineering
Automatic retrieval and clustering of similar words
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Linguistic structure as composition and perturbation
ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
A freely available wide coverage morphological analyzer for English
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
Memory-based morphological analysis
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Automatic identification of non-compositional phrases
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Knowledge-free induction of inflectional morphologies
NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Minimally supervised morphological analysis by multimodal alignment
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Knowledge-free induction of morphology using latent semantic analysis
ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
MPL '02 Proceedings of the ACL-02 workshop on Morphological and phonological learning - Volume 6
A statistical approach to the semantics of verb-particles
MWE '03 Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, acquisition and treatment - Volume 18
An empirical model of multiword expression decomposability
MWE '03 Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, acquisition and treatment - Volume 18
Generating query substitutions
Proceedings of the 15th international conference on World Wide Web
Unsupervised models for morpheme segmentation and morphology learning
ACM Transactions on Speech and Language Processing (TSLP)
Speech and Language Processing (2nd Edition)
Speech and Language Processing (2nd Edition)
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Introduction to Information Retrieval
Introduction to Information Retrieval
LIBLINEAR: A Library for Large Linear Classification
The Journal of Machine Learning Research
Unsupervised type and token identification of idiomatic expressions
Computational Linguistics
Latent-variable modeling of string transductions with finite-state methods
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Unsupervised morphological segmentation with log-linear models
NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Multilingual noise-robust supervised morphological analysis using the WordFrame model
SIGMorPhon '04 Proceedings of the 7th Meeting of the ACL Special Interest Group in Computational Phonology: Current Themes in Computational Phonology and Morphology
A global model for joint lemmatization and part-of-speech prediction
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Search engine statistics beyond the n-gram: application to noun compound bracketing
CONLL '05 Proceedings of the Ninth Conference on Computational Natural Language Learning
Shared task system description: frustratingly hard compositionality prediction
DiSCo '11 Proceedings of the Workshop on Distributional Semantics and Compositionality
Modeling covert event retrieval in logical metonymy: probabilistic and distributional accounts
CMCL '12 Proceedings of the 3rd Workshop on Cognitive Modeling and Computational Linguistics
Hi-index | 0.00 |
In many applications, replacing a complex word form by its stem can reduce sparsity, revealing connections in the data that would not otherwise be apparent. In this paper, we focus on prefix verbs: verbs formed by adding a prefix to an existing verb stem. A prefix verb is considered compositional if it can be decomposed into a semantically equivalent expression involving its stem. We develop a classifier to predict compositionality via a range of lexical and distributional features, including novel features derived from web-scale N-gram data. Results on a new annotated corpus show that prefix verb compositionality can be predicted with high accuracy. Our system also performs well when trained and tested on conventional morphological segmentations of prefix verbs.