Parameter Learning in Probabilistic Databases: A Least Squares Approach

  • Authors:
  • Bernd Gutmann;Angelika Kimmig;Kristian Kersting;Luc Raedt

  • Affiliations:
  • Dept. of Computer Science, Katholieke Universiteit Leuven, Heverlee, Belgium BE-3001;Dept. of Computer Science, Katholieke Universiteit Leuven, Heverlee, Belgium BE-3001;Fraunhofer IAIS, Schloß Birlinghoven, Sankt Augustin, Germany 53754;Dept. of Computer Science, Katholieke Universiteit Leuven, Heverlee, Belgium BE-3001

  • Venue:
  • ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We introduce the problem of learning the parameters of the probabilistic database ProbLog. Given the observed success probabilities of a set of queries, we compute the probabilities attached to facts that have a low approximation error on the training examples as well as on unseen examples. Assuming Gaussian error terms on the observed success probabilities, this naturally leads to a least squares optimization problem. Our approach, called LeProbLog, is able to learn both from queries and from proofs and even from both simultaneously. This makes it flexible and allows faster training in domains where the proofs are available. Experiments on real world data show the usefulness and effectiveness of this least squares calibration of probabilistic databases.