A node indexing scheme for web entity retrieval

  • Authors:
  • Renaud Delbru;Nickolai Toupikov;Michele Catasta;Giovanni Tummarello

  • Affiliations:
  • Digital Enterprise Research Institute, National University of Ireland, Galway, Galway, Ireland;Digital Enterprise Research Institute, National University of Ireland, Galway, Galway, Ireland;School of Computer and Communication Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland;Digital Enterprise Research Institute, National University of Ireland, Galway, Galway, Ireland

  • Venue:
  • ESWC'10 Proceedings of the 7th international conference on The Semantic Web: research and Applications - Volume Part II
  • Year:
  • 2010

Abstract

Motivated in part by the partial support of major search engines, hundreds of millions of documents embedding semi-structured data in RDF, RDFa and Microformats are now being published on the web. This scenario calls for novel information search systems that provide effective means of retrieving relevant semi-structured information. In this paper, we present an “entity retrieval system” designed to provide entity search capabilities over datasets as large as the entire Web of Data. Our system supports full-text search, semi-structural queries and top-k query results while maintaining a concise index and efficient incremental updates. We advocate the use of a node indexing scheme and show that, compared with three other indexing techniques, it offers a good compromise between query expressiveness, query processing time and update complexity. We then demonstrate how such a system can effectively answer queries over 10 billion triples on a single commodity machine.