Scalable reconstruction of RDF-archived relational databases

Authors:
Silvia Stefanova;Tore Risch
Affiliations:
Uppsala University;Uppsala University
Venue:
Proceedings of the Fifth Workshop on Semantic Web Information Management
Year:
2013

Citing 4
Cited 0

Archiving scientific data

ACM Transactions on Database Systems (TODS)
Information Sharing on the Semantic Web

Information Sharing on the Semantic Web
R2D: Extracting Relational Structure from RDF Stores

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
R2D: A Bridge between the Semantic Web and Relational Visualization Tools

ICSC '09 Proceedings of the 2009 IEEE International Conference on Semantic Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

We have investigated approaches for scalable reconstruction of relational databases (RDBs) archived as RDF files. An archived RDB is reconstructed from a data archive file and a schema archive file, both in N-Triples formats. The archives contain RDF triples representing the archived relational data content and the relational schema describing the content, respectively. When an archived RDB is to be reconstructed, the schema archive is first read to automatically create the RDB schema using a schema reconstruction algorithm which identifies RDB elements by queries to the schema archive. The RDB thus created is then populated by reading the data archive. To populate the RDB we have developed two approaches, the naive Insert Attribute Value (IAV) and Triple Bulk Load (TBL). With the IAV approach the data is populated by stored procedures that execute SQL INSERT or UPDATE statements to insert attribute values in the RDB tables. In the more complex TBL approach the database is populated by bulk loading CSV files generated by sorting the data archive triples joined with schema information. Our experiments show that the TBL approach is substantially faster than the IAV approach.