Scalable, efficient and correct learning of markov boundaries under the faithfulness assumption

  • Authors:
  • Jose M. Peña;Johan Björkegren;Jesper Tegnér

  • Affiliations:
  • Computational Biology, Department of Physics and Measurement Technology, Linköping University, Sweden;Center for Genomics and Bioinformatics, Karolinska Institutet, Sweden;Computational Biology, Department of Physics and Measurement Technology, Linköping University, Sweden

  • Venue:
  • ECSQARU'05 Proceedings of the 8th European conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose an algorithm for learning the Markov boundary of a random variable from data without having to learn a complete Bayesian network. The algorithm is correct under the faithfulness assumption, scalable and data efficient. The last two properties are important because we aim to apply the algorithm to identify the minimal set of random variables that is relevant for probabilistic classification in databases with many random variables but few instances. We report experiments with synthetic and real databases with 37, 441 and 139352 random variables showing that the algorithm performs satisfactorily.