"More like these": growing entity classes from seeds

  • Authors:
  • Luis Sarmento;Valentin Jijkoun;Maarten de Rijke;Eugenio Oliveira

  • Affiliations:
  • Universidade do Porto, Porto, Portugal;University of Amsterdam, Amsterdam, Netherlands;University of Amsterdam, Amsterdam, Netherlands;Universidade do Porto, Porto, Portugal

  • Venue:
  • Proceedings of the sixteenth ACM Conference on Information and Knowledge Management
  • Year:
  • 2007

Abstract

We present a corpus-based approach to the class expansion task. For a given set of seed entities, we use co-occurrence statistics taken from a text collection to define a membership function that is used to rank candidate entities for inclusion in the set. We describe an evaluation framework that uses data from Wikipedia. The performance of our class expansion method improves as the size of the text collection increases.
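
The idea of ranking candidates by a co-occurrence-based membership function can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not specify how co-occurrences are extracted or how the membership function is defined, so everything below is an assumption made for illustration. In particular, the input format (one list of entity mentions per text unit), the scoring rule (the fraction of seeds a candidate co-occurs with at least once), and the names `cooccurrence_counts`, `membership_score`, and `expand_class` are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(entity_lists):
    """Count how often each unordered pair of entities appears in the same unit."""
    counts = Counter()
    for entities in entity_lists:
        # Deduplicate and sort so each pair has one canonical key.
        for a, b in combinations(sorted(set(entities)), 2):
            counts[(a, b)] += 1
    return counts


def membership_score(candidate, seeds, counts):
    """Assumed membership function: fraction of seeds seen with the candidate."""
    hits = sum(1 for s in seeds if counts[tuple(sorted((candidate, s)))] > 0)
    return hits / len(seeds)


def expand_class(seeds, entity_lists, top_k=5):
    """Rank non-seed entities by membership score (ties broken alphabetically)."""
    seeds = set(seeds)
    counts = cooccurrence_counts(entity_lists)
    candidates = {e for entities in entity_lists for e in entities} - seeds
    ranked = sorted(
        candidates,
        key=lambda c: (-membership_score(c, seeds, counts), c),
    )
    return ranked[:top_k]


# Toy "text collection": each inner list holds the entities mentioned together.
toy_collection = [
    ["Porto", "Lisbon", "Amsterdam"],
    ["Amsterdam", "Rotterdam"],
    ["Lisbon", "Madrid"],
    ["Porto", "wine"],
    ["Amsterdam", "Madrid", "Porto"],
]

print(expand_class(["Porto", "Amsterdam"], toy_collection, top_k=2))
```

In this toy collection, "Lisbon" and "Madrid" each co-occur with both seeds, so they rank above "Rotterdam" and "wine", which each co-occur with only one. A larger collection gives the counts more evidence to work with, which is consistent with the abstract's observation that performance improves as the text collection grows.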