Growing TreeLex

  • Authors:
  • Anna Kupść;Anne Abeillé

  • Affiliations:
  • Université de Bordeaux, ERSSàB, SIGNES and IPIPAN and Université Michel de Montaigne, UFRL, Pessac Cedex, France;Université Paris 7, LLF, CNRS, UMR, Paris Cedex 05, France

  • Venue:
  • CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

TreeLex is a subcategorization lexicon of French, automatically extracted from a syntactically annotated corpus. The lexicon comprises 2006 verbs (25076 occurrences). The goal of the project is to obtain a list of subcategorization frames of contemporary French verbs and to estimate the number of different verb frames available in French in general. A few more frames are discovered when the corpus size changes, but the average number of frames per verb remains relatively stable (about 1.91-2.09 frames per verb).