L3S at INEX 2007: Query Expansion for Entity Ranking Using a Highly Accurate Ontology

  • Authors:
  • Gianluca Demartini;Claudiu S. Firan;Tereza Iofciu

  • Affiliations:
  • L3S Research Center, Leibniz Universität Hannover, Hannover, Germany D-30167;L3S Research Center, Leibniz Universität Hannover, Hannover, Germany D-30167;L3S Research Center, Leibniz Universität Hannover, Hannover, Germany D-30167

  • Venue:
  • Focused Access to XML Documents
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Entity ranking on Web scale datasets is still an open challenge. Several resources, as for example Wikipedia-based ontologies, can be used to improve the quality of the entity ranking produced by a system. In this paper we focus on the Wikipedia corpus and propose algorithms for finding entities based on query relaxation using category information. The main contribution is a methodology for expanding the user query by exploiting the semantic structure of the dataset. Our approach focuses on constructing queries using not only keywords from the topic, but also information about relevant categories. This is done leveraging on a highly accurate ontology which is matched to the character strings of the topic. The evaluation is performed using the INEX 2007 Wikipedia collection and entity ranking topics. The results show that our approach performs effectively, especially for early precision metrics.