SPARQL basic graph pattern optimization using selectivity estimation

  • Authors:
  • Markus Stocker;Andy Seaborne;Abraham Bernstein;Christoph Kiefer;Dave Reynolds

  • Affiliations:
  • HP Laboratories, Bristol, United Kingdom;HP Laboratories, Bristol, United Kingdom;University of Zurich, Zurich, Switzerland;University of Zurich, Zurich, Switzerland;HP Laboratories, Bristol, United Kingdom

  • Venue:
  • Proceedings of the 17th international conference on World Wide Web
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we formalize the problem of Basic Graph Pattern (BGP) optimization for SPARQL queries and main memory graph implementations of RDF data. We define and analyze the characteristics of heuristics for selectivity-based static BGP optimization. The heuristics range from simple triple pattern variable counting to more sophisticated selectivity estimation techniques. Customized summary statistics for RDF data enable the selectivity estimation of joined triple patterns and the development of efficient heuristics. Using the Lehigh University Benchmark (LUBM), we evaluate the performance of the heuristics for the queries provided by the LUBM and discuss some of them in more details.