Information enhancement for data mining

  • Authors: Shichao Zhang

  • Affiliations: Department of Computer Science, Zhejiang Normal University, PR China, and State Key Laboratory for Novel Software Technology, Nanjing University, PR China

  • Venue: Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
  • Year: 2011

Abstract

Information enhancement techniques are desired in many areas, such as data mining, machine learning, business intelligence, and web data analysis. Information enhancement mainly includes the following topics: data cleaning, data preparation and transformation, missing value imputation, feature and instance selection, feature construction, treatment of noisy and inconsistent data, data integration, data collection and housing, information enhancement, web data availability, web data capture and representation, and others. It is impossible to outline all of these research topics in a single paper. In this study, we discuss information enhancement for data mining using existing missing data imputation techniques. We first review the current research on imputing missing values, then experimentally evaluate the techniques and demonstrate the efficiency of missing data imputation in enhancing information during pattern discovery from datasets with missing values. © 2011 John Wiley & Sons, Inc. WIREs Data Mining Knowl Discov 2011 1 284–295 DOI: 10.1002/widm.21
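
For readers unfamiliar with the term, missing value imputation means filling in absent entries of a dataset before mining it. The abstract does not name the specific techniques the paper evaluates, so the following is only a minimal sketch of one common baseline, mean imputation, and should not be read as the paper's method. The function name `impute_mean` and the sample column `ages` are hypothetical, chosen for illustration.

```python
from statistics import mean
from typing import List, Optional, Sequence


def impute_mean(column: Sequence[Optional[float]]) -> List[float]:
    """Replace each missing entry (None) with the mean of the observed entries.

    Illustrative baseline only; not taken from the paper.
    """
    # Collect the values that are actually present.
    observed = [v for v in column if v is not None]
    if not observed:
        raise ValueError("cannot impute a column with no observed values")
    fill = mean(observed)
    # Keep observed values as they are and substitute the mean for gaps.
    return [fill if v is None else v for v in column]


# Hypothetical numeric column with two missing entries.
ages = [20.0, None, 30.0, 40.0, None]
completed = impute_mean(ages)
print(completed)  # [20.0, 30.0, 30.0, 40.0, 30.0]
```

Mean imputation keeps every record available for pattern discovery, but it shrinks the variance of the column, which is one reason more elaborate imputation methods are studied.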