SIREn: entity retrieval system for the web of data

  • Authors:
  • Renaud Delbru

  • Affiliations:
  • Digital Enterprise Research Institute, National University of Ireland, Galway, Ireland

  • Venue:
  • FDIA'09 Proceedings of the Third BCS-IRSG conference on Future Directions in Information Access
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present ongoing work on the Semantic Information Retrieval Engine (SIREn), an "entity retrieval system" specifically designed to meet the requirements of indexing and searching a large amount of semi-structured data, e.g. the entire Web of Data. SIREn supports efficient full text search with semi-structural queries and exhibits a concise index, constant time updates and inherits Information Retrieval features such as top-k queries, efficient caching and scalability via distribution over shards. We demonstrate how SIREn can effectively answer queries over 10 billion triples on single commodity machine. The prototype is currently in use in the Sindice search engine which index at the present time more than 50 million harvested documents containing semi-structured data.