In this paper we consider the problem of performing Bayesian model averaging over a class of discrete Bayesian network structures consistent with a partial ordering and with bounded in-degree k. We show that for N nodes this class contains in the worst case at least distinct network structures, and yet model averaging over these structures can be performed using operations. Furthermore, we show that there exists a single Bayesian network that defines a joint distribution over the variables that is equivalent to model averaging over these structures. Although constructing this network is computationally prohibitive, we show that it can be approximated by a tractable network, allowing approximate model-averaged probability calculations to be performed in O(N) time. Our result also leads to an exact, linear-time solution to the problem of averaging over the 2^N possible feature sets in a naive Bayes model, providing an exact Bayesian solution to the troublesome feature-selection problem for naive Bayes classifiers. We demonstrate the utility of these techniques in the context of supervised classification, showing empirically that model averaging consistently outperforms other generative Bayesian-network-based models, even when the generating model is not guaranteed to be a member of the class being averaged over. We characterize the performance over several parameters on simulated and real-world data.
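
The linear-time averaging over feature sets mentioned above depends on a factorization that a small sketch can illustrate. The sketch assumes that each feature i contributes one nonnegative weight when it is included in the naive Bayes model and another when it is excluded, and that the weight of a feature set is the product of these per-feature terms. Under that assumption, the sum over all 2^N feature sets collapses to a product of N two-term sums. The function names and numeric weights below are hypothetical, chosen only for illustration; the sketch does not show how such weights would be derived from data and priors, and it is not the authors' implementation.

```python
from itertools import product
import math


def averaged_score_bruteforce(included, excluded):
    """Sum the weight of every feature subset explicitly (2^N terms)."""
    n = len(included)
    total = 0.0
    for mask in product((False, True), repeat=n):
        term = 1.0
        for i, use_feature in enumerate(mask):
            # Each feature contributes one factor, depending on membership.
            term *= included[i] if use_feature else excluded[i]
        total += term
    return total


def averaged_score_linear(included, excluded):
    """Same quantity in O(N): the subset sum factorizes per feature."""
    return math.prod(a + b for a, b in zip(included, excluded))


# Hypothetical per-feature weights (assumed values, for illustration only).
included = [0.12, 0.30, 0.05, 0.40]
excluded = [0.20, 0.10, 0.25, 0.15]

brute = averaged_score_bruteforce(included, excluded)
fast = averaged_score_linear(included, excluded)
print(math.isclose(brute, fast))
```

The brute-force loop visits 2^N subsets, while the factorized form does one addition and one multiplication per feature, which is where the linear cost comes from under the stated product assumption.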