An Efficient Prediction Model for Diabetic Database Using Soft Computing Techniques

Authors:
Veena H. Bhat;Prasanth G. Rao;P. Deepa Shenoy;K. R. Venugopal;L. M. Patnaik
Affiliations:
University Visvesvaraya College of Engineering, Bangalore University, Bangalore, India;University Visvesvaraya College of Engineering, Bangalore University, Bangalore, India;University Visvesvaraya College of Engineering, Bangalore University, Bangalore, India;University Visvesvaraya College of Engineering, Bangalore University, Bangalore, India;Vice Chancellor, Defence Institute of Advanced Technology, Deemed University, Pune, India
Venue:
RSFDGrC '09 Proceedings of the 12th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
Year:
2009

Citing 2
Cited 0

Statistical analysis with missing data

Statistical analysis with missing data
"Missing Is Useful': Missing Values in Cost-Sensitive Decision Trees

IEEE Transactions on Knowledge and Data Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Organizations aim at harnessing predictive insights, using the vast real-time data stores that they have accumulated through the years, using data mining techniques. Health sector, has an extremely large source of digital data - patient-health related data-store, which can be effectively used for predictive analytics. This data, may consists of missing, incorrect and sometimes incomplete values sets that can have a detrimental effect on the decisions that are outcomes of data analytics. Using the PIMA Indians Diabetes dataset, we have proposed an efficient imputation method using a hybrid combination of CART and Genetic Algorithm, as a preprocessing step. The classical neural network model is used for prediction, on the preprocessed dataset. The accuracy achieved by the proposed model far exceeds the existing models, mainly because of the soft computing preprocessing adopted. This approach is simple, easy to understand and implement and practical in its approach.