Discovering keys in RDF/OWL dataset with KD2R

Authors:
Danai Symeonidou;Nathalie Pernelle;Fatiha Saïs
Affiliations:
LRI, Université Paris-Sud, Orsay, France;LRI, Université Paris-Sud, Orsay, France;LRI, Université Paris-Sud, Orsay, France
Venue:
Proceedings of the 2nd International Workshop on Open Data
Year:
2013

Citing 9
Cited 0

A knowledge-based approach for duplicate elimination in data cleaning

Information Systems - Data extraction, cleaning and reconciliation
Large-Scale Deduplication with Constraints Using Dedupalog

ICDE '09 Proceedings of the 2009 IEEE International Conference on Data Engineering
Combining a Logical and a Numerical Method for Data Reconciliation

Journal on Data Semantics XII
Discovering and Maintaining Links on the Web of Data

ISWC '09 Proceedings of the 8th International Semantic Web Conference
KD2R: a key discovery method for semantic reference reconciliation

OTM'11 Proceedings of the 2011th Confederated international conference on On the move to meaningful internet systems
EAGLE: efficient active learning of link specifications using genetic programming

ESWC'12 Proceedings of the 9th international conference on The Semantic Web: research and applications
Learning expressive linkage rules using genetic programming

Proceedings of the VLDB Endowment
Keys and pseudo-keys detection for web datasets cleansing and interlinking

EKAW'12 Proceedings of the 18th international conference on Knowledge Engineering and Knowledge Management
Data Linking for the Semantic Web

International Journal on Semantic Web & Information Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

KD2R allows the automatic discovery of composite key constraints in RDF data sources that conform to a given ontology. We consider data sources for which the Unique Name Assumption is fulfilled. KD2R allows this discovery without having to scan all the data. Indeed, the proposed system looks for maximal non keys and derives minimal keys from this set of non keys. KD2R has been tested on several datasets available on the web of data and it has obtained promising results when the discovered keys are used to link data. In the demo, we will demonstrate the functionality of our tool and we will show on several datasets that the keys can be used in a datalinking tool.