Optimizing query shortcuts in RDF databases

  • Authors:
  • Vicky Dritsou;Panos Constantopoulos;Antonios Deligiannakis;Yannis Kotidis

  • Affiliations:
  • Athens University of Economics and Business, Athens, Greece and Digital Curation Unit, IMIS, Athena Research Centre, Greece;Athens University of Economics and Business, Athens, Greece and Digital Curation Unit, IMIS, Athena Research Centre, Greece;Technical University of Crete, Crete, Greece and Digital Curation Unit, IMIS, Athena Research Centre, Greece;Athens University of Economics and Business, Athens, Greece and Digital Curation Unit, IMIS, Athena Research Centre, Greece

  • Venue:
  • ESWC'11 Proceedings of the 8th extended semantic web conference on The semanic web: research and applications - Volume Part II
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The emergence of the Semantic Web has led to the creation of large semantic knowledge bases, often in the form of RDF databases. Improving the performance of RDF databases necessitates the development of specialized data management techniques, such as the use of shortcuts in the place of path queries. In this paper we deal with the problem of selecting the most beneficial shortcuts that reduce the execution cost of path queries in RDF databases given a space constraint. We first demonstrate that this problem is an instance of the quadratic knapsack problem. Given the computational complexity of solving such problems, we then develop an alternative formulation based on a bi-criterion linear relaxation, which essentially seeks to minimize a weighted sum of the query cost and of the required space consumption. As we demonstrate in this paper, this relaxation leads to very efficient classes of linear programming solutions. We utilize this bi-criterion linear relaxation in an algorithm that selects a subset of shortcuts to materialize. This shortcut selection algorithm is extensively evaluated and compared with a greedy algorithm that we developed in prior work. The reported experiments show that the linear relaxation algorithm manages to significantly reduce the query execution times, while also outperforming the greedy solution.