Generalized probabilistic LR parsing of natural language (Corpora) with unification-based grammars
Computational Linguistics - Special issue on using large corpora: I
PrepLex: a lexicon of French prepositions for parsing
SigSem '07 Proceedings of the Fourth ACL-SIGSEM Workshop on Prepositions
IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
Classifying French verbs using French and English lexical resources
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
TreeLex is a subcategorization lexicon of French, automatically extracted from a syntactically annotated corpus. The lexicon comprises 2,006 verbs (25,076 occurrences). The goal of the project is to obtain a list of subcategorization frames of contemporary French verbs and to estimate the number of different verb frames available in French in general. A few more frames are discovered when the corpus size changes, but the average number of frames per verb remains relatively stable (between about 1.91 and 2.09 frames per verb).
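The "average number of frames per verb" statistic can be illustrated with a minimal sketch. It assumes the extraction step has already produced (verb, frame) pairs. The toy data, the frame labels, and the function names `frames_per_verb` and `average_frames` are hypothetical and are not taken from TreeLex itself.

```python
from collections import defaultdict

# Hypothetical toy observations: one (verb lemma, frame) pair per verb occurrence.
# The frame strings are placeholder labels, not TreeLex's actual notation.
observations = [
    ("manger", "SUJ:NP, OBJ:NP"),
    ("manger", "SUJ:NP"),
    ("manger", "SUJ:NP, OBJ:NP"),
    ("dormir", "SUJ:NP"),
    ("donner", "SUJ:NP, OBJ:NP, A-OBJ:PP"),
    ("donner", "SUJ:NP, OBJ:NP"),
]


def frames_per_verb(pairs):
    """Collect the set of distinct frames observed for each verb."""
    frames = defaultdict(set)
    for verb, frame in pairs:
        frames[verb].add(frame)
    return dict(frames)


def average_frames(pairs):
    """Mean number of distinct frames per verb (0.0 for empty input)."""
    lexicon = frames_per_verb(pairs)
    if not lexicon:
        return 0.0
    return sum(len(f) for f in lexicon.values()) / len(lexicon)


if __name__ == "__main__":
    for verb, frames in sorted(frames_per_verb(observations).items()):
        print(verb, len(frames))
    print(round(average_frames(observations), 2))
```

Running the same computation on corpus samples of different sizes is one way to check how stable the average is. Repeated occurrences of a frame do not raise a verb's count, since only distinct frames are kept.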