Robust Learning with Missing Data

Authors:
Marco Ramoni;Paola Sebastiani
Affiliations:
Children's Hospital Informatics Program, Harvard Medical School, Boston, MA 02115, USA. marco_ramoni@harvard.edu;Department of Mathematics and Statistics, University of Massachusetts, Amherst, MA 01002, USA. sebas@math.umass.edu
Venue:
Machine Learning
Year:
2001

Citing 14
Cited 29

Probabilistic reasoning in intelligent systems: networks of plausible inference

Probabilistic reasoning in intelligent systems: networks of plausible inference
A Bayesian Method for the Induction of Probabilistic Networks from Data

Machine Learning
C4.5: programs for machine learning

C4.5: programs for machine learning
The EM algorithm for graphical association models with missing data

Computational Statistics & Data Analysis - Special issue dedicated to Toma´sˇ Havra´nek
Learning Bayesian Networks: The Combination of Knowledge and Statistical Data

Machine Learning
Irrelevance and parameter learning in Bayesian networks

Artificial Intelligence
Bayesian classification (AutoClass): theory and results

Advances in knowledge discovery and data mining
On the Optimality of the Simple Bayesian Classifier under Zero-One Loss

Machine Learning - Special issue on learning with probabilistic representations
Bayesian Network Classifiers

Machine Learning - Special issue on learning with probabilistic representations
Expert Systems and Probabiistic Network Models

Expert Systems and Probabiistic Network Models
Probability Intervals Over Influence Diagrams

IEEE Transactions on Pattern Analysis and Machine Intelligence
Bayesian methods

Intelligent data analysis
Local learning in probabilistic networks with hidden variables

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Ignorant influence diagrams

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2

Updating beliefs with incomplete observations

Artificial Intelligence
Robust inference of trees

Annals of Mathematics and Artificial Intelligence
Fault diagnosis for airplane engines using Bayesian networks and distributed particle swarm optimization

Parallel Computing
Query processing over incomplete autonomous databases

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Maximum entropy and least square error minimizing procedures for estimating missing conditional probabilities in Bayesian networks

Computational Statistics & Data Analysis
Learning Bayesian networks from incomplete databases using a novel evolutionary algorithm

Decision Support Systems
Nasopharyngeal Carcinoma Data Analysis with a Novel Bayesian Network Skeleton Learning Algorithm

AIME '07 Proceedings of the 11th conference on Artificial Intelligence in Medicine
POP algorithm: Kernel-based imputation to treat missing values in knowledge discovery from databases

Expert Systems with Applications: An International Journal
Learning Bayesian network parameters under incomplete data with domain knowledge

Pattern Recognition
Approximation Methods for Efficient Learning of Bayesian Networks

Proceedings of the 2008 conference on Approximation Methods for Efficient Learning of Bayesian Networks
Exploiting Data Missingness in Bayesian Network Modeling

IDA '09 Proceedings of the 8th International Symposium on Intelligent Data Analysis: Advances in Intelligent Data Analysis VIII
Conservative inference rule for uncertain reasoning under incompleteness

Journal of Artificial Intelligence Research
Query processing over incomplete autonomous databases: query rewriting using learned data dependencies

The VLDB Journal — The International Journal on Very Large Data Bases
Credible classification for environmental problems

Environmental Modelling & Software
Extensions of belief functions and possibility distributions by using the imprecise Dirichlet model

Fuzzy Sets and Systems
Partial identification with missing data: concepts and findings

International Journal of Approximate Reasoning
A conservative feature subset selection algorithm with missing data

Neurocomputing
Optimized parameters for missing data imputation

PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
Missing value imputation based on data clustering

Transactions on computational science I
Learn++.MF: A random subspace approach for the missing feature problem

Pattern Recognition
Missing data imputation by utilizing information within incomplete instances

Journal of Systems and Software
A comparison of imputation methods for handling missing scores in biometric fusion

Pattern Recognition
Review: learning bayesian networks: Approaches and issues

The Knowledge Engineering Review
Reliable diagnoses of dementia by the naive credal classifier inferred from incomplete cognitive data

Artificial Intelligence in Medicine
Learning probabilistic Description logic concepts: under different Assumptions on missing knowledge

Proceedings of the 27th Annual ACM Symposium on Applied Computing
Tutorial and selected approaches on parameter learning in bayesian network with incomplete data

ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
WebPut: efficient web-based data imputation

WISE'12 Proceedings of the 13th international conference on Web Information Systems Engineering
A selective Bayes classifier with meta-heuristics for incomplete data

Neurocomputing
Robust predictive model for evaluating breast cancer survivability

Engineering Applications of Artificial Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper introduces a new method, called the robust Bayesian estimator (RBE), to learn conditional probability distributions from incomplete data sets. The intuition behind the RBE is that, when no information about the pattern of missing data is available, an incomplete database constrains the set of all possible estimates and this paper provides a characterization of these constraints. An experimental comparison with two popular methods to estimate conditional probability distributions from incomplete data—Gibbs sampling and the EM algorithm—shows a gain in robustness. An application of the RBE to quantify a naive Bayesian classifier from an incomplete data set illustrates its practical relevance.