Cross-lingual bootstrapping of semantic lexicons: the case of FrameNet

  • Authors:
  • Sebastian Padó;Mirella Lapata

  • Affiliations:
  • Computational Linguistics, Saarland University, Saarbrücken, Germany;School of Informatics, University of Edinburgh, Edinburgh, UK

  • Venue:
  • AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper considers the problem of unsupervised semantic lexicon acquisition. We introduce a fully automatic approach which exploits parallel corpora, relies on shallow text properties, and is relatively inexpensive. Given the English FrameNet lexicon, our method exploits word alignments to generate frame candidate lists for new languages, which are subsequently pruned automatically using a small set of linguistically motivated filters. Evaluation shows that our approach can produce high-precision multilingual FrameNet lexicons without recourse to bilingual dictionaries or deep syntactic and semantic analysis.