A selective classifier for incomplete data

Authors:
Jingnian Chen;Houkuan Huang;Fengzhan Tian;Shengfeng Tian
Affiliations:
School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China and Dept. of Information and computing Science, Shandong University of Finance, Jinan, Shandong, China;School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China;School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China;School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China
Venue:
PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Year:
2008

Citing 7
Cited 0

Statistical analysis with missing data

Statistical analysis with missing data
C4.5: programs for machine learning

C4.5: programs for machine learning
Estimating attributes: analysis and extensions of RELIEF

ECML-94 Proceedings of the European conference on machine learning on Machine Learning
Robust Bayes classifiers

Artificial Intelligence
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
On Classification with Incomplete Data

IEEE Transactions on Pattern Analysis and Machine Intelligence
Local learning in probabilistic networks with hidden variables

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2

Quantified Score

Hi-index	0.01

Visualization

Abstract

Classifiers based on feature selection (selective classifiers) are a kind of algorithms that can effectively improve the accuracy and efficiency of classification by deleting irrelevant or redundant attributes of a data set. Due to the complexity of processing incomplete data, however, most of them deal with complete data. Yet actual data are often incomplete and have many redundant or irrelevant attributes. So constructing selective classifiers for incomplete data is an important problem. With the analysis of main methods of processing incomplete data for classification, a selective classifier for incomplete data named RBSR (ReliefF algorithm-Based Selective Robust Bayes Classifier), which is based on the Robust Bayes Classifiers (RBC) and ReliefF algorithm, is presented. The proposed algorithm needs no assumptions about data sets that are necessary for previous methods of processing incomplete data in classification. This algorithm can deal with incomplete data sets with many attributes and instances. Experiments were performed on twelve benchmark incomplete data sets. We compared RBSR with the very effective RBC and several other classifiers for incomplete data. The experimental results show that RBSR can not only enormously reduce the number of redundant or irrelevant attributes, but greatly improve the accuracy and stability of classification as well.