Querying RDF dictionaries in compressed space

  • Authors:
  • Miguel A. Martínez-Prieto;Javier D. Fernández;Rodrigo Cánovas

  • Affiliations:
  • Univ. of Valladolid, Spain and Univ. of Chile, Chile;Univ. of Valladolid, Spain and Univ. of Chile, Chile;Univ. of Melbourne, Australia and Univ. of Chile, Chile

  • Venue:
  • ACM SIGAPP Applied Computing Review
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

The use of dictionaries is a common practice among those applications performing on huge RDF datasets. It allows long terms occurring in the RDF triples to be replaced by short IDs which reference them. This decision greatly compacts the dataset and mitigates the scalability issues underlying to its management. However, the dictionary size is not negligible and the techniques used for its representation also suffer from scalability limitations. This paper focuses on this scenario by adapting compression techniques for string dictionaries to the case of RDF. We propose a novel technique: Dcomp, which can be tuned to represent the dictionary in compressed space (22--64%) and to perform basic lookup operations in a few microseconds (1--50μs). In addition, we propose Dcomp as a basis for specific SPARQL query optimizations leveraging its ability for early FILTER resolution.