The development of the Index Thomisticus Treebank valency lexicon

  • Authors:
  • Barbara McGillivray;Marco Passarotti

  • Affiliations:
  • University of Pisa, Italy;Catholic University of the Sacred Heart, Milan, Italy

  • Venue:
  • LaTeCH-SHELT&R '09 Proceedings of the EACL 2009 Workshop on Language Technology and Resources for Cultural Heritage, Social Sciences, Humanities, and Education
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a valency lexicon for Latin verbs extracted from the Index Thomisticus Treebank, a syntactically annotated corpus of Medieval Latin texts by Thomas Aquinas. In our corpus-based approach, the lexicon reflects the empirical evidence of the source data. Verbal arguments are induced directly from annotated data. The lexicon contains 432 Latin verbs with 270 valency frames. The lexicon is useful for NLP applications and is able to support annotation.