Mining Linked Open Data through Semi-supervised Learning Methods Based on Self-Training

Authors:
Nicola Fanizzi;Claudia d'Amato;Floriana Esposito
Affiliations:
-;-;-
Venue:
ICSC '12 Proceedings of the 2012 IEEE Sixth International Conference on Semantic Computing
Year:
2012

Citing 0
Cited 1

A Domain-based Approach to Publish Data on the Web

Proceedings of International Conference on Information Integration and Web-based Applications & Services

Quantified Score

Hi-index	0.00

Visualization

Abstract

The paper tackles the problem of mining linked open data. The inherent lack of knowledge caused by the open-world assumption made on the semantic of the data model determines an abundance of data of uncertain classification. We present a semi-supervised machine learning approach. Specifically a self-training strategy is adopted which iteratively uses labeled instances to predict a label also for unlabeled instances. The approach is empirically evaluated with an extensive experimentation involving several different algorithms demonstrating the added value yielded by a semi-supervised approach over standard supervised methods.