Model Averaging for Prediction with Discrete Bayesian Networks

Authors:
Denver Dash;Gregory F. Cooper
Affiliations:
-;-
Venue:
The Journal of Machine Learning Research
Year:
2004

Citing 14
Cited 11

Probabilistic reasoning in intelligent systems: networks of plausible inference

Probabilistic reasoning in intelligent systems: networks of plausible inference
Theory refinement on Bayesian networks

Proceedings of the seventh conference (1991) on Uncertainty in artificial intelligence
A Bayesian Method for the Induction of Probabilistic Networks from Data

Machine Learning
Learning Bayesian Networks: The Combination of Knowledge and Statistical Data

Machine Learning
Wrappers for feature subset selection

Artificial Intelligence - Special issue on relevance
On the Optimality of the Simple Bayesian Classifier under Zero-One Loss

Machine Learning - Special issue on learning with probabilistic representations
Bayesian Network Classifiers

Machine Learning - Special issue on learning with probabilistic representations
SMILE: Structural Modeling, Inference, and Learning Engine and GeNIe: a development environment for graphical decision-theoretic models

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
On Bias, Variance, 0/1—Loss, and the Curse-of-Dimensionality

Data Mining and Knowledge Discovery
Exact model averaging with naive Bayesian classifiers

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Bayesian Averaging of Classifiers and the Overfitting Problem

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Equivalence and synthesis of causal models

UAI '90 Proceedings of the Sixth Annual Conference on Uncertainty in Artificial Intelligence
Tractable Bayesian Learning of Tree Belief Networks

UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Cached sufficient statistics for efficient machine learning with large datasets

Journal of Artificial Intelligence Research

A framework for agent-based distributed machine learning and data mining

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Constructing Bayesian networks for criminal profiling from limited data

Knowledge-Based Systems
MALEF: Framework for distributed machine learning and data mining

International Journal of Intelligent Information and Database Systems
Forecasting Click-Through Rates Based on Sponsored Search Advertiser Bids and Intermediate Variable Regression

ACM Transactions on Internet Technology (TOIT)
An agent-based framework for distributed learning

Engineering Applications of Artificial Intelligence
Learning Instance-Specific Predictive Models

The Journal of Machine Learning Research
Distributed learning with data reduction

Transactions on computational collective intelligence IV
Robust bayesian linear classifier ensembles

ECML'05 Proceedings of the 16th European conference on Machine Learning
Review: learning bayesian networks: Approaches and issues

The Knowledge Engineering Review
Credal ensembles of classifiers

Computational Statistics & Data Analysis
Learning optimal bayesian networks: a shortest path perspective

Journal of Artificial Intelligence Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we consider the problem of performing Bayesian model-averaging over a class of discrete Bayesian network structures consistent with a partial ordering and with bounded in-degree k. We show that for N nodes this class contains in the worst-case at least distinct network structures, and yet model averaging over these structures can be performed using operations. Furthermore we show that there exists a single Bayesian network that defines a joint distribution over the variables that is equivalent to model averaging over these structures. Although constructing this network is computationally prohibitive, we show that it can be approximated by a tractable network, allowing approximate model-averaged probability calculations to be performed in O(N) time. Our result also leads to an exact and linear-time solution to the problem of averaging over the 2N possible feature sets in a naive Bayes model, providing an exact Bayesian solution to the troublesome feature-selection problem for naive Bayes classifiers. We demonstrate the utility of these techniques in the context of supervised classification, showing empirically that model averaging consistently beats other generative Bayesian-network-based models, even when the generating model is not guaranteed to be a member of the class being averaged over. We characterize the performance over several parameters on simulated and real-world data.