SPOVC: a scalable RDF store using horizontal partitioning and column oriented DBMS

Authors:
Kunal Mulay;P. Sreenivasa Kumar
Affiliations:
Indian Institute of Technology, Madras, Chennai, India;Indian Institute of Technology, Madras, Chennai, India
Venue:
SWIM '12 Proceedings of the 4th International Workshop on Semantic Web Information Management
Year:
2012

Citing 11
Cited 0

The design and implementation of the redland RDF application framework

Proceedings of the 10th international conference on World Wide Web
Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema

ISWC '02 Proceedings of the First International Semantic Web Conference on The Semantic Web
An efficient SQL-based RDF querying scheme

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Yago: a core of semantic knowledge

Proceedings of the 16th international conference on World Wide Web
Scalable semantic web data management using vertical partitioning

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
RDF-3X: a RISC-style engine for RDF

Proceedings of the VLDB Endowment
Hexastore: sextuple indexing for semantic web data management

Proceedings of the VLDB Endowment
SW-Store: a vertically partitioned DBMS for Semantic Web data management

The VLDB Journal — The International Journal on Very Large Data Bases
LUBM: A benchmark for OWL knowledge base systems

Web Semantics: Science, Services and Agents on the World Wide Web
Matrix "Bit" loaded: a scalable lightweight join query processor for RDF data

Proceedings of the 19th international conference on World wide web
x-RDF-3X: fast querying, high update rates, and consistency for RDF databases

Proceedings of the VLDB Endowment

Quantified Score

Hi-index	0.00

Visualization

Abstract

Organizing and indexing RDF data for efficient evaluation of SPARQL queries has been attracting a lot of attention in the recent past. Most of the techniques proposed in this context leverage the existing RDBMS or column oriented DB technologies. In this paper, we propose an organization SPOVC that uses five indexes, namely, Subject, Predicate, Object, Value and Class, on top of any column oriented DB. The main techniques used by the proposed scheme are horizontal partitioning of the logical indices and special indices for values and classes. The SPOVC approach has the advantage of delivering better performance if the underlying column store technology improves. The proposed approach is conceptually much simpler than the state-of-the-art native-storage based proposals and roughly gives the same performance. Our proposal extends an existing approach, SW-Store, that uses column oriented DBs and vertical partitioning and obtains a two/three fold performance improvement. In addition, the proposed system is the only system that can effectively tackle SPARQL queries with filter patterns having range conditions and regular expressions.