Retrieving attributes using web tables

  • Authors: Arlind Kopliku; Karen Pinel-Sauvagnat; Mohand Boughanem
  • Affiliations: IRIT, University of Toulouse, Toulouse, France (all authors)
  • Venue: Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
  • Year: 2011

Abstract

In this paper, we propose an attribute retrieval approach that extracts and ranks attributes from Web tables. We combine simple heuristics to filter out improbable attributes, then rank the remaining attributes by their frequencies and a table match score. Ranking is reinforced with external evidence from Web search, DBpedia and Wikipedia. The approach can be applied to any instance (e.g. Canada) to retrieve its attributes (e.g. capital, GDP). We show that it achieves much higher recall than DBpedia and Wikipedia, and that it outperforms lexico-syntactic rules on the same task.
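
The filter-then-rank idea can be illustrated with a minimal sketch. The abstract does not give the heuristics or the scoring formula, so everything below is an assumption made for illustration: the table representation (a dict with "rows" and "headers"), the function names `is_plausible`, `table_match_score` and `rank_attributes`, the filtering thresholds, and the binary match score are all hypothetical. The external evidence from Web search, DBpedia and Wikipedia is left out.

```python
from collections import defaultdict


def is_plausible(attr):
    """Hypothetical heuristic filter (not the paper's): reject labels that
    are empty, longer than 30 characters, or less than half letters."""
    label = attr.strip()
    if not label or len(label) > 30:
        return False
    letters = sum(ch.isalpha() for ch in label)
    return letters / len(label) >= 0.5


def table_match_score(instance, table):
    """Hypothetical match score (not the paper's): 1.0 if the instance
    names one of the table's rows, otherwise 0.0."""
    row_names = {name.strip().lower() for name in table["rows"]}
    return 1.0 if instance.strip().lower() in row_names else 0.0


def rank_attributes(instance, tables):
    """Rank candidate attributes by match-weighted frequency across tables."""
    scores = defaultdict(float)
    for table in tables:
        match = table_match_score(instance, table)
        if match == 0.0:
            continue  # this table does not mention the instance
        for attr in table["headers"]:
            if is_plausible(attr):
                # Normalise case so "Capital" and "capital" count together.
                scores[attr.strip().lower()] += match
    # Highest score first; ties are broken alphabetically.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Toy tables, invented for this example.
tables = [
    {"rows": ["Canada", "France"], "headers": ["Capital", "GDP", "12345"]},
    {"rows": ["Canada"], "headers": ["capital", "Population"]},
    {"rows": ["Brazil"], "headers": ["Currency"]},
]
print(rank_attributes("Canada", tables))
```

On these toy tables, "capital" ranks first because it appears in both tables that mention Canada, the numeric header "12345" is filtered out, and "Currency" is never counted because its table does not match the instance. A graded match score or additional evidence sources would slot into `table_match_score` and the accumulation step without changing the overall structure.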