Improving the recall of live linked data querying through reasoning

Authors:
Jürgen Umbrich;Aidan Hogan;Axel Polleres;Stefan Decker
Affiliations:
Digital Enterprise Research Institute, National University of Ireland, Galway, Ireland;Digital Enterprise Research Institute, National University of Ireland, Galway, Ireland;Siemens AG Österreich, Vienna, Austria;Digital Enterprise Research Institute, National University of Ireland, Galway, Ireland
Venue:
RR'12 Proceedings of the 6th international conference on Web Reasoning and Rule Systems
Year:
2012

Citing 20
Cited 3

Index structures and algorithms for querying distributed RDF repositories

Proceedings of the 13th international conference on World Wide Web
Sindice.com: a document-oriented lookup index for open linked data

International Journal of Metadata, Semantics and Ontologies
Simple and Efficient Minimal RDFS

Web Semantics: Science, Services and Agents on the World Wide Web
Executing SPARQL Queries over the Web of Linked Data

ISWC '09 Proceedings of the 8th International Semantic Web Conference
YARS2: a federated repository for querying graph structured data from the web

ISWC'07/ASWC'07 Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference
Querying distributed RDF data sources with SPARQL

ESWC'08 Proceedings of the 5th European semantic web conference on The semantic web: research and applications
When owl: sameAs isn't the same: an analysis of identity in linked data

ISWC'10 Proceedings of the 9th international semantic web conference on The semantic web - Volume Part I
Linked data query processing strategies

ISWC'10 Proceedings of the 9th international semantic web conference on The semantic web - Volume Part I
Using reformulation trees to optimize queries over distributed heterogeneous sources

ISWC'10 Proceedings of the 9th international semantic web conference on The semantic web - Volume Part I
Summary models for routing keywords to linked data sources

ISWC'10 Proceedings of the 9th international semantic web conference on The semantic web - Volume Part I
Linked Data

Linked Data
SIHJoin: querying remote and local linked data

ESWC'11 Proceedings of the 8th extended semantic web conference on The semantic web: research and applications - Volume Part I
Zero-knowledge query planning for an iterator implementation of link traversal based query execution

ESWC'11 Proceedings of the 8th extended semantic web conference on The semantic web: research and applications - Volume Part I
Semantics and optimization of the SPARQL 1.1 federation extension

ESWC'11 Proceedings of the 8th extended semantic web conference on The semanic web: research and applications - Volume Part II
FedX: a federation layer for distributed query processing on linked open data

ESWC'11 Proceedings of the 8th extended semantic web conference on The semanic web: research and applications - Volume Part II
OWLIM: A family of scalable semantic repositories

Semantic Web
Comparing data summaries for processing live queries over Linked Data

World Wide Web
FactForge: a fast track to the web of data

Semantic Web
SPARQL for a web of linked data: semantics and computability

ESWC'12 Proceedings of the 9th international conference on The Semantic Web: research and applications
Freshening up while staying fast: towards hybrid SPARQL queries

EKAW'12 Proceedings of the 18th international conference on Knowledge Engineering and Knowledge Management

Hybrid SPARQL queries: fresh vs. fast results

ISWC'12 Proceedings of the 11th international conference on The Semantic Web - Volume Part I
RDFS and OWL reasoning for linked data

RW'13 Proceedings of the 9th international conference on Reasoning Web: semantic technologies for intelligent data access
Semantic-based QoS management in cloud systems: Current status and future challenges

Future Generation Computer Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Linked Data principles allow for processing SPARQL queries on-the-fly by dereferencing URIs. Link-traversal query approaches for Linked Data have the benefit of up-to-date results and decentralised execution, but operate only on explicit data from dereferenced documents, affecting recall. In this paper, we show how inferable knowledge--specifically that found through owl:sameAs and RDFS reasoning--can improve recall in this setting. We first analyse a corpus featuring 7 million Linked Data sources and 2.1 billion quadruples: we (1) measure expected recall by only considering dereferenceable information, (2) measure the improvement in recall given by considering rdfs:seeAlso links as previous proposals did. We further propose and measure the impact of additionally considering (3) owl:sameAs links, and (4) applying lightweight RDFS reasoning for finding more results, relying on static schema information. We evaluate different configurations for live queries covering different shapes and domains, generated from random walks over our corpus.