Learning mixtures of truncated basis functions from data

  • Authors:
  • Helge Langseth;Thomas D. Nielsen;Inmaculada Pérez-Bernabé;Antonio Salmerón

  • Affiliations:
  • Department of Computer and Information Science, The Norwegian University of Science and Technology, Trondheim, Norway;Department of Computer Science, Aalborg University, Aalborg, Denmark;Department of Mathematics, University of Almería, Almería, Spain;Department of Mathematics, University of Almería, Almería, Spain

  • Venue:
  • International Journal of Approximate Reasoning
  • Year:
  • 2014

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we investigate methods for learning hybrid Bayesian networks from data. First we utilize a kernel density estimate of the data in order to translate the data into a mixture of truncated basis functions (MoTBF) representation using a convex optimization technique. When utilizing a kernel density representation of the data, the estimation method relies on the specification of a kernel bandwidth. We show that in most cases the method is robust wrt. the choice of bandwidth, but for certain data sets the bandwidth has a strong impact on the result. Based on this observation, we propose an alternative learning method that relies on the cumulative distribution function of the data. Empirical results demonstrate the usefulness of the approaches: Even though the methods produce estimators that are slightly poorer than the state of the art (in terms of log-likelihood), they are significantly faster, and therefore indicate that the MoTBF framework can be used for inference and learning in reasonably sized domains. Furthermore, we show how a particular sub-class of MoTBF potentials (learnable by the proposed methods) can be exploited to significantly reduce complexity during inference.