Probabilistic reasoning in intelligent systems: networks of plausible inference
Theory refinement on Bayesian networks
Proceedings of the seventh conference (1991) on Uncertainty in artificial intelligence
IEEE Transactions on Pattern Analysis and Machine Intelligence
Adaptive Probabilistic Networks with Hidden Variables
Machine Learning - Special issue on learning with probabilistic representations
Probabilistic Networks and Expert Systems
Bayesian Networks for Data Mining
Data Mining and Knowledge Discovery
A Guide to the Literature on Learning Probabilistic Networks from Data
IEEE Transactions on Knowledge and Data Engineering
Stochastic Local Algorithms for Learning Belief Networks: Searching in the Space of the Orderings
ECSQARU '01 Proceedings of the 6th European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty
Equivalence and synthesis of causal models
UAI '90 Proceedings of the Sixth Annual Conference on Uncertainty in Artificial Intelligence
Optimal structure identification with greedy search
The Journal of Machine Learning Research
Learning Bayesian Networks
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
The Journal of Machine Learning Research
Towards scalable and data efficient learning of Markov boundaries
International Journal of Approximate Reasoning
IEEE Transactions on Knowledge and Data Engineering
Bayesian Substructure Learning - Approximate Learning of Very Large Network Structures
ECML '07 Proceedings of the 18th European conference on Machine Learning
Bayesian Networks and Decision Graphs
Journal of Artificial Intelligence Research
A hybrid anytime algorithm for the construction of causal models from sparse data
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Learning Bayesian network structure from massive datasets: the "sparse candidate" algorithm
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
MIDAS - an influence diagram for management of mildew in winter wheat
UAI'96 Proceedings of the Twelfth international conference on Uncertainty in artificial intelligence
Constrained score+(local)search methods for learning Bayesian networks
ECSQARU'05 Proceedings of the 8th European conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty
IEEE Transactions on Evolutionary Computation
ECSQARU'11 Proceedings of the 11th European conference on Symbolic and quantitative approaches to reasoning with uncertainty
New skeleton-based approaches for Bayesian structure learning of Bayesian networks
Applied Soft Computing
International Journal of Approximate Reasoning
Learning Bayesian networks is known to be an NP-hard problem, which is why heuristic search has proven advantageous in many domains. This learning approach is computationally efficient and, although it does not guarantee an optimal result, many previous studies have shown that it obtains very good solutions. Hill climbing algorithms are particularly popular because of their good trade-off between computational demands and the quality of the models learned. Despite this efficiency, these algorithms can still be improved when dealing with high-dimensional datasets, and that is the goal of this paper. We therefore present an approach that improves hill climbing algorithms by dynamically restricting the candidate solutions to be evaluated during the search process. This proposal, dynamic restriction, is novel because previous studies on restricted search in the literature are based on two stages rather than the single stage presented here. In addition to the aforementioned advantages of hill climbing algorithms, we show that under certain conditions the model they return is a minimal I-map of the joint probability distribution underlying the training data, a theoretical property with practical implications. We provide theoretical results guaranteeing that, under these same conditions, the proposed algorithms also output a minimal I-map. Furthermore, we experimentally test the proposed algorithms on a set of different domains, some of them quite large (up to 800 variables), in order to study their behavior in practice.
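To illustrate the kind of procedure the abstract describes, the following is a minimal Python sketch of score-based hill climbing over Bayesian network structures in which each node's candidate parent set is pruned dynamically as the search proceeds. It is not the paper's algorithm: the binary data, the BIC scoring, the add-edge-only operator set, and the pruning rule (discard a candidate whose current score gain is non-positive) are all simplifying assumptions made here for brevity.

```python
import math
import random

def bic_local(data, child, parents):
    """Decomposable BIC term for one binary node given its parent set."""
    n = len(data)
    counts = {}
    for row in data:
        key = tuple(row[p] for p in sorted(parents))
        counts.setdefault(key, [0, 0])
        counts[key][row[child]] += 1
    ll = 0.0
    for c0, c1 in counts.values():
        tot = c0 + c1
        for c in (c0, c1):
            if c:
                ll += c * math.log(c / tot)
    # one free parameter per parent configuration (binary variables)
    return ll - 0.5 * math.log(n) * (2 ** len(parents))

def creates_cycle(parents, frm, to):
    """Adding frm->to closes a cycle iff 'to' is an ancestor of 'frm'."""
    stack, seen = [frm], set()
    while stack:
        node = stack.pop()
        if node == to:
            return True
        if node not in seen:
            seen.add(node)
            stack.extend(parents[node])
    return False

def hill_climb(data, n_vars):
    parents = {v: set() for v in range(n_vars)}
    # dynamic restriction: every other variable starts as a candidate parent
    candidates = {v: set(range(n_vars)) - {v} for v in range(n_vars)}
    local = {v: bic_local(data, v, parents[v]) for v in range(n_vars)}
    while True:
        best = None  # (gain, child, new_parent)
        for child in range(n_vars):
            for cand in sorted(candidates[child]):
                if cand in parents[child] or creates_cycle(parents, cand, child):
                    continue
                gain = bic_local(data, child, parents[child] | {cand}) - local[child]
                if gain <= 0:
                    # prune unhelpful candidates so later passes skip them
                    candidates[child].discard(cand)
                elif best is None or gain > best[0]:
                    best = (gain, child, cand)
        if best is None:
            return parents
        _, child, cand = best
        parents[child].add(cand)
        local[child] = bic_local(data, child, parents[child])

# Usage on toy synthetic data: X1 depends strongly on X0, X2 is independent.
random.seed(1)
data = []
for _ in range(500):
    x0 = random.randint(0, 1)
    x1 = x0 if random.random() < 0.9 else 1 - x0
    x2 = random.randint(0, 1)
    data.append((x0, x1, x2))
parents = hill_climb(data, 3)  # expect an edge linking X0 and X1
```

In a full implementation the operator set would also include edge deletion and reversal, and a pruned candidate might be re-admitted when the current parent sets change; both are omitted here to keep the sketch short.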