Transductive Learning from Relational Data

  • Authors:
  • Michelangelo Ceci;Annalisa Appice;Nicola Barile;Donato Malerba

  • Affiliations:
  • Dipartimento di Informatica, Università degli Studi di Bari via Orabona, 4 - 70126 Bari, Italy;Dipartimento di Informatica, Università degli Studi di Bari via Orabona, 4 - 70126 Bari, Italy;Dipartimento di Informatica, Università degli Studi di Bari via Orabona, 4 - 70126 Bari, Italy;Dipartimento di Informatica, Università degli Studi di Bari via Orabona, 4 - 70126 Bari, Italy

  • Venue:
  • MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Transduction is an inference mechanism "from particular to particular". Its application to classification tasks implies the use of both labeled (training) data and unlabeled (working) data to build a classifier whose main goal is that of classifying (only) unlabeled data as accurately as possible. Unlike the classical inductive setting, no general rule valid for all possible instances is generated. Transductive learning is most suited for those applications where the examples for which a prediction is needed are already known when training the classifier. Several approaches have been proposed in the literature on building transductive classifiers from data stored in a single table of a relational database. Nonetheless, no attention has been paid to the application of the transduction principle in a (multi-)relational setting, where data are stored in multiple tables of a relational database. In this paper we propose a new transductive classifier, named TRANSC, which is based on a probabilistic approach to making transductive inferences from relational data. This new method works in a transductive setting and employs a principled probabilistic classification in multi-relational data mining to face the challenges posed by some spatial data mining problems. Probabilistic inference allows us to compute the class probability and return, in addition to result of transductive classification, the confidence in the classification. The predictive accuracy of TRANSC has been compared to that of its inductive counterpart in an empirical study involving both a benchmark relational dataset and two spatial datasets. The results obtained are generally in favor of TRANSC, although improvements are small by a narrow margin.