Mining association rules between sets of items in large databases
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Database Mining: A Performance Perspective
IEEE Transactions on Knowledge and Data Engineering
Probabilistic Approach to Association Rules in Incomplete Databases
WAIM '00 Proceedings of the First International Conference on Web-Age Information Management
Fast Algorithms for Mining Association Rules in Large Databases
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Treatment of Missing Values for Association Rules
PAKDD '98 Proceedings of the Second Pacific-Asia Conference on Research and Development in Knowledge Discovery and Data Mining
Approximate Association Rule Mining
Proceedings of the Fourteenth International Florida Artificial Intelligence Research Society Conference
Using Association Rules for Completing Missing Data
HIS '04 Proceedings of the Fourth International Conference on Hybrid Intelligent Systems
Knowledge management performance evaluation: a decade review from 1995 to 2004
Journal of Information Science
Combined association rules for dealing with missing values
Journal of Information Science
Hi-index | 12.05 |
The problem of recovering missing values from a dataset has become an important research issue in the field of data mining and machine learning. In this thesis, we introduce an iterative missing-value completion method based on the RAR (Robust Association Rules) support values to extract useful association rules for inferring missing values in an iterative way. It consists of three phases. The first phase uses the association rules to roughly complete the missing values. The second phase iteratively reduces the minimum support to gather more association rules to complete the rest of missing values. The third phase uses the association rules from the completed dataset to correct the missing values that have been filled in. Experimental results show the proposed approaches have good accuracy and data recovery even when the missing-value rate is high.