Efficiently querying rdf data in triple stores

  • Authors:
  • Ying Yan;Chen Wang;Aoying Zhou;Weining Qian;Li Ma;Yue Pan

  • Affiliations:
  • Fudan University, Shanghai, China;IBM China Research Laboratory, Beijing, China;Fudan University, East China Normal University, Shanghai, China;East China Normal University, Shanghai, China;IBM China Research Laboratory, Beijing, China;IBM China Research Laboratory, Beijing, China

  • Venue:
  • Proceedings of the 17th international conference on World Wide Web
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Efficiently querying RDF data is being an important factor in applying Semantic Web technologies to real-world applications. In this context, many efforts have been made to store and query RDF data in relational database using particular schemas. In this paper, we propose a new scheme to store, index, and query RDF data in triple stores. Graph feature of RDF data is taken into considerations which might help reduce the join costs on the vertical database structure. We would partition RDF triples into overlapped groups, store them in a triple table with one more column of group identity, and build up a signature tree to index them. Based on this infrastructure, a complex RDF query is decomposed into multiple pieces of sub-queries which could be easily filtered into some RDF groups using signature tree index, and finally is evaluated with a composed and optimized SQL with specific constraints. We compare the performance of our method with prior art on typical queries over a large scaled LUBM and UOBM benchmark data (more than 10 million triples). For some extreme cases, they can promote 3 to 4 orders of magnitude.