An Experimental Comparison of RDF Data Management Approaches in a SPARQL Benchmark Scenario

Authors:
Michael Schmidt;Thomas Hornung;Norbert Küchlin;Georg Lausen;Christoph Pinkel
Affiliations:
Freiburg University, Freiburg, Germany 79106;Freiburg University, Freiburg, Germany 79106;Freiburg University, Freiburg, Germany 79106;Freiburg University, Freiburg, Germany 79106;MTC Infomedia OHG, Saarbrücken, Germany 66121
Venue:
ISWC '08 Proceedings of the 7th International Conference on The Semantic Web
Year:
2008

Citing 8
Cited 17

Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema

ISWC '02 Proceedings of the First International Semantic Web Conference on The Semantic Web
Storing RDF as a Graph

LA-WEB '03 Proceedings of the First Conference on Latin American Web Congress
C-store: a column-oriented DBMS

VLDB '05 Proceedings of the 31st international conference on Very large data bases
An efficient SQL-based RDF querying scheme

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Scalable semantic web data management using vertical partitioning

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Column-store support for RDF data management: not all swans are white

Proceedings of the VLDB Endowment
Benchmarking database representations of RDF/S stores

ISWC'05 Proceedings of the 4th international conference on The Semantic Web
SPARQL query processing with conventional relational database systems

WISE'05 Proceedings of the 2005 international conference on Web Information Systems Engineering

Benchmarking Fulltext Search Performance of RDF Stores

ESWC 2009 Heraklion Proceedings of the 6th European Semantic Web Conference on The Semantic Web: Research and Applications
Foundations of SPARQL query optimization

Proceedings of the 13th International Conference on Database Theory
An evaluation of approaches to federated query processing over linked data

Proceedings of the 6th International Conference on Semantic Systems
XML-based RDF data management for efficient query processing

Procceedings of the 13th International Workshop on the Web and Databases
Atlas: Storing, updating and querying RDF(S) data on top of DHTs

Web Semantics: Science, Services and Agents on the World Wide Web
To cache or not to cache: the effects of warming cache in complex SPARQL queries

OTM'11 Proceedings of the 2011th Confederated international conference on On the move to meaningful internet systems - Volume Part II
Efficient RDFS entailment in external memory

OTM'11 Proceedings of the 2011th Confederated international conference on On the move to meaningful internet systems
PoweRGen: A power-law based generator of RDFS schemas

Information Systems
FlexTable: using a dynamic relation model to store RDF data

DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part I
RDFPath: path query processing on large RDF graphs with mapreduce

ESWC'11 Proceedings of the 8th international conference on The Semantic Web
Static analysis and optimization of semantic web queries

PODS '12 Proceedings of the 31st symposium on Principles of Database Systems
Foundational aspects of semantic web optimization

PhD '12 Proceedings of the on SIGMOD/PODS 2012 PhD Symposium
Evaluating graph traversal algorithms for distributed SPARQL query optimization

JIST'11 Proceedings of the 2011 joint international conference on The Semantic Web
SSTDE: an open source semantic spatiotemporal data engine for sensor web

Proceedings of the First ACM SIGSPATIAL Workshop on Sensor Web Enablement
Binary RDF representation for publication and exchange (HDT)

Web Semantics: Science, Services and Agents on the World Wide Web
Static analysis and optimization of semantic web queries

ACM Transactions on Database Systems (TODS) - Invited papers issue
Ultrawrap: SPARQL execution on relational data

Web Semantics: Science, Services and Agents on the World Wide Web

Quantified Score

Hi-index	0.00

Visualization

Abstract

Efficient RDF data management is one of the cornerstones in realizing the Semantic Web vision. In the past, different RDF storage strategies have been proposed, ranging from simple triple stores to more advanced techniques like clustering or vertical partitioning on the predicates. We present an experimental comparison of existing storage strategies on top of the SP2Bench SPARQL performance benchmark suite and put the results into context by comparing them to a purely relational model of the benchmark scenario. We observe that (1) in terms of performance and scalability, a simple triple store built on top of a column-store DBMS is competitive to the vertically partitioned approach when choosing a physical (predicate, subject, object) sort order, (2) in our scenario with real-world queries, none of the approaches scales to documents containing tens of millions of RDF triples, and (3) none of the approaches can compete with a purely relational model. We conclude that future research is necessary to further bring forward RDF data management.