Empirical comparison of "hard" and "soft" label propagation for relational classification

  • Authors:
  • Aram Galstyan;Paul R. Cohen

  • Affiliations:
  • USC Information Sciences Institute, Center for Research on Unexpected Events, Marina del Rey, CA;USC Information Sciences Institute, Center for Research on Unexpected Events, Marina del Rey, CA

  • Venue:
  • ILP'07 Proceedings of the 17th international conference on Inductive logic programming
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we differentiate between hard and soft label propagation for classification of relational (networked) data. The latter method assigns probabilities or class-membership scores to data instances, then propagates these scores throughout the networked data, whereas the former works by explicitly propagating class labels at each iteration. We present a comparative empirical study of these methods applied to a relational binary classification task, and evaluate two approaches on both synthetic and real-world relational data. Our results indicate that while neither approach dominates the other over the entire range of input data parameters, there are some interesting and non-trivial tradeoffs between them.