2009 Special Issue: A hybrid random field model for scalable statistical learning

  • Authors:
  • A. Freno;E. Trentin;M. Gori

  • Affiliations:
  • Dipartimento di Ingegneria dell'Informazione, Universití degli Studi di Siena, Via Roma 56, 53100 Siena (SI), Italy;Dipartimento di Ingegneria dell'Informazione, Universití degli Studi di Siena, Via Roma 56, 53100 Siena (SI), Italy;Dipartimento di Ingegneria dell'Informazione, Universití degli Studi di Siena, Via Roma 56, 53100 Siena (SI), Italy

  • Venue:
  • Neural Networks
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper introduces hybrid random fields, which are a class of probabilistic graphical models aimed at allowing for efficient structure learning in high-dimensional domains. Hybrid random fields, along with the learning algorithm we develop for them, are especially useful as a pseudo-likelihood estimation technique (rather than a technique for estimating strict joint probability distributions). In order to assess the generality of the proposed model, we prove that the class of pseudo-likelihood distributions representable by hybrid random fields strictly includes the class of joint probability distributions representable by Bayesian networks. Once we establish this result, we develop a scalable algorithm for learning the structure of hybrid random fields, which we call 'Markov Blanket Merging'. On the one hand, we characterize some complexity properties of Markov Blanket Merging both from a theoretical and from the experimental point of view, using a series of synthetic benchmarks. On the other hand, we evaluate the accuracy of hybrid random fields (as learned via Markov Blanket Merging) by comparing them to various alternative statistical models in a number of pattern classification and link-prediction applications. As the results show, learning hybrid random fields by the Markov Blanket Merging algorithm not only reduces significantly the computational cost of structure learning with respect to several considered alternatives, but it also leads to models that are highly accurate as compared to the alternative ones.