Exploiting Data Missingness in Bayesian Network Modeling

Authors:
Sérgio Rodrigues De Morais;Alex Aussem
Affiliations:
LIESP, INSA-Lyon, University of Lyon, Villeurbanne, France 69622;LIESP, UCBL, University of Lyon, Villeurbanne, France 69622
Venue:
IDA '09 Proceedings of the 8th International Symposium on Intelligent Data Analysis: Advances in Intelligent Data Analysis VIII
Year:
2009

Citing 17
Cited 1

Statistical analysis with missing data

Statistical analysis with missing data
Probabilistic reasoning in intelligent systems: networks of plausible inference

Probabilistic reasoning in intelligent systems: networks of plausible inference
Robust Learning with Missing Data

Machine Learning
Speculative Markov Blanket Discovery for Optimal Feature Selection

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
The max-min hill-climbing Bayesian network structure learning algorithm

Machine Learning
Growing Bayesian network models of gene networks from seed genes

Bioinformatics
Learning Bayesian Networks

Learning Bayesian Networks
Towards scalable and data efficient learning of Markov boundaries

International Journal of Approximate Reasoning
Handling Missing Values when Applying Classification Models

The Journal of Machine Learning Research
Exploiting missing clinical data in Bayesian network modeling for predicting medical problems

Journal of Biomedical Informatics
Learning Reliable Classifiers From Small or Incomplete Data Sets: The Naive Credal Classifier 2

The Journal of Machine Learning Research
Impact of imputation of missing values on classification error for discrete data

Pattern Recognition
A Novel Scalable and Data Efficient Feature Subset Selection Algorithm

ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Using an Approximate Bayesian Bootstrap to multiply impute nonignorable missing data

Computational Statistics & Data Analysis
A Conservative Feature Subset Selection Algorithm with Missing Data

ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
Graphical Models in Applied Multivariate Statistics

Graphical Models in Applied Multivariate Statistics
Learning Bayesian networks from incomplete data with stochastic search algorithms

UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence

Learning probabilistic Description logic concepts: under different Assumptions on missing knowledge

Proceedings of the 27th Annual ACM Symposium on Applied Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper proposes a framework built on the use of Bayesian networks (BN) for representing statistical dependencies between the existing random variables and additional dummy boolean variables, which represent the presence/absence of the respective random variable value. We show how augmenting the BN with these additional variables helps pinpoint the mechanism through which missing data contributes to the classification task. The missing data mechanism is thus explicitly taken into account to predict the class variable using the data at hand. Extensive experiments on synthetic and real-world incomplete data sets reveals that the missingness information improves classification accuracy.