Finding short definitions of terms on web pages

  • Authors:
  • Gerasimos Lampouras;Ion Androutsopoulos

  • Affiliations:
  • Athens University of Economics and Business, Greece;Athens University of Economics and Business, Greece and Research Centre "Athena", Athens, Greece

  • Venue:
  • EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

We present a system that finds short definitions of terms on Web pages. It employs a Maximum Entropy classifier, but it is trained on automatically generated examples; hence, it is in effect unsupervised. We use rouge-w to generate training examples from encyclopedias and Web snippets, a method that outperforms an alternative centroid-based one. After training, our system can be used to find definitions of terms that are not covered by encyclopedias. The system outperforms a comparable publicly available system, as well as a previously published form of our system.