Many science archive centres publish very large volumes of image, simulation, and experiment data. To integrate and analyse the available data, scientists need to be able to (i) identify and locate all the data relevant to their work; (ii) understand the multiple heterogeneous data models in which the data is published; and (iii) interpret and process the data they retrieve. RDF has been shown to be a generally successful framework within which to perform such data integration work. It can be equally successful in the context of scientific data, provided it is demonstrably practical to expose that data as RDF. In this paper we investigate the capabilities of RDF to enable the integration of scientific data sources. Specifically, we discuss the suitability of SPARQL for expressing scientific queries, and the performance of several triple stores and RDB2RDF tools for executing queries over a moderately sized sample of a large astronomical data set. We found that further research into SPARQL and RDB2RDF tools, and improvements to them, are required before existing science archives can be exposed efficiently for data integration.