Retrieving attributes using web tables

Authors:
Arlind Kopliku;Karen Pinel-Sauvagnat;Mohand Boughanem
Affiliations:
IRIT, University of Toulouse, Toulouse, France;IRIT, University of Toulouse, Toulouse, France;IRIT, University of Toulouse, Toulouse, France
Venue:
Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
Year:
2011

Citing 3
Cited 3

A Survey of Web Information Extraction Systems

IEEE Transactions on Knowledge and Data Engineering
WebTables: exploring the power of tables on the web

Proceedings of the VLDB Endowment
Acquisition of instance attributes via labeled and related instances

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval

Attribute retrieval from relational web tables

SPIRE'11 Proceedings of the 18th international conference on String processing and information retrieval
Towards a framework for attribute retrieval

Proceedings of the 20th ACM international conference on Information and knowledge management
Aggregated search: A new information retrieval paradigm

ACM Computing Surveys (CSUR)

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we propose an attribute retrieval approach which extracts and ranks attributes from Web tables. We combine simple heuristics to filter out improbable attributes and we rank attributes based on frequencies and a table match score. Ranking is reinforced with external evidence from Web search, DBPedia and Wikipedia. Our approach can be applied to whatever instance (e.g. Canada) to retrieve its attributes (capital, GDP). It is shown it has a much higher recall than DBPedia and Wikipedia and that it works better than lexico-syntactic rules for the same purpose.