Statistical analysis with missing data
Statistical analysis with missing data
Probabilistic reasoning in intelligent systems: networks of plausible inference
Probabilistic reasoning in intelligent systems: networks of plausible inference
Robust Learning with Missing Data
Machine Learning
Speculative Markov Blanket Discovery for Optimal Feature Selection
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Learning Bayesian Networks
Towards scalable and data efficient learning of Markov boundaries
International Journal of Approximate Reasoning
Handling Missing Values when Applying Classification Models
The Journal of Machine Learning Research
Exploiting missing clinical data in Bayesian network modeling for predicting medical problems
Journal of Biomedical Informatics
Learning Reliable Classifiers From Small or Incomplete Data Sets: The Naive Credal Classifier 2
The Journal of Machine Learning Research
Impact of imputation of missing values on classification error for discrete data
Pattern Recognition
A Novel Scalable and Data Efficient Feature Subset Selection Algorithm
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Using an Approximate Bayesian Bootstrap to multiply impute nonignorable missing data
Computational Statistics & Data Analysis
A Conservative Feature Subset Selection Algorithm with Missing Data
ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
Graphical Models in Applied Multivariate Statistics
Graphical Models in Applied Multivariate Statistics
Learning Bayesian networks from incomplete data with stochastic search algorithms
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Learning probabilistic Description logic concepts: under different Assumptions on missing knowledge
Proceedings of the 27th Annual ACM Symposium on Applied Computing
Hi-index | 0.00 |
This paper proposes a framework built on the use of Bayesian networks (BN) for representing statistical dependencies between the existing random variables and additional dummy boolean variables, which represent the presence/absence of the respective random variable value. We show how augmenting the BN with these additional variables helps pinpoint the mechanism through which missing data contributes to the classification task. The missing data mechanism is thus explicitly taken into account to predict the class variable using the data at hand. Extensive experiments on synthetic and real-world incomplete data sets reveals that the missingness information improves classification accuracy.