Parallel database systems: the future of high performance database systems
Communications of the ACM
An overview of data warehousing and OLAP technology
ACM SIGMOD Record
STOC '97 Proceedings of the twenty-ninth annual ACM symposium on Theory of computing
Database Systems: A Practical Approach to Design, Implementation, and Management
Database Systems: A Practical Approach to Design, Implementation, and Management
A survey of approaches to automatic schema matching
The VLDB Journal — The International Journal on Very Large Data Bases
The description logic handbook: theory, implementation, and applications
The description logic handbook: theory, implementation, and applications
RDFPeers: a scalable distributed RDF repository based on a structured peer-to-peer network
Proceedings of the 13th international conference on World Wide Web
Jena: implementing the semantic web recommendations
Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters
Eventually consistent failure detectors
Journal of Parallel and Distributed Computing
Bigtable: a distributed storage system for structured data
OSDI '06 Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation - Volume 7
Survey of graph database models
ACM Computing Surveys (CSUR)
Queue - Object-Relational Mapping
Communications of the ACM - Rural engineering development
RDF-3X: a RISC-style engine for RDF
Proceedings of the VLDB Endowment
Marvin: Distributed reasoning over large-scale Semantic Web data
Web Semantics: Science, Services and Agents on the World Wide Web
Cassandra: a decentralized structured storage system
ACM SIGOPS Operating Systems Review
The Hadoop Distributed File System
MSST '10 Proceedings of the 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)
A case study of linked enterprise data
ISWC'10 Proceedings of the 9th international semantic web conference on The semantic web - Volume Part II
On enhancing scalability for distributed RDF/S stores
Proceedings of the 14th International Conference on Extending Database Technology
Scalable SQL and NoSQL data stores
ACM SIGMOD Record
Ontology alignment evaluation initiative: six years of experience
Journal on data semantics XV
Web Semantics: Science, Services and Agents on the World Wide Web
H2RDF: adaptive query processing on RDF data in the cloud.
Proceedings of the 21st international conference companion on World Wide Web
Managing large dynamic graphs efficiently
SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Hi-index | 0.00 |
In light of the challenges of effectively managing Big Data, we are witnessing a gradual shift towards the increasingly popular Linked Open Data (LOD) paradigm. LOD aims to impose a machine-readable semantic layer over structured as well as unstructured data and hence automate some data analysis tasks that are not designed for computers. The convergence of Big Data and LOD is, however, not straightforward: the semantic layer of LOD and the Big Data large scale storage do not get along easily. Meanwhile, the sheer data size envisioned by Big Data denies certain computationally expensive semantic technologies, rendering the latter much less efficient than their performance on relatively small data sets. In this paper, we propose a mechanism allowing LOD to take advantage of existing large-scale data stores while sustaining its "semantic" nature. We demonstrate how RDF-based semantic models can be distributed across multiple storage servers and we examine how a fundamental semantic operation can be tuned to meet the requirements on distributed and parallel data processing. Our future work will focus on stress test of the platform in the magnitude of tens of billions of triples, as well as comparative studies in usability and performance against similar offerings.