Handling of incomplete data sets using ICA and SOM in data mining

  • Authors:
  • Hongyi Peng; Siming Zhu

  • Affiliations:
  • Sun Yat-sen University, Department of Applied Mathematics, 510275, Guangzhou, China (both authors)

  • Venue:
  • Neural Computing and Applications
  • Year:
  • 2007

Abstract

Based on independent component analysis (ICA) and self-organizing maps (SOM), this paper proposes an ISOM-DH model for handling incomplete data in data mining. When the data are dependent and non-Gaussian, the model can make full use of the information in the given data to estimate the missing values, and it can visualize the processed high-dimensional data. Compared with the mixture of principal component analyzers (MPCA), the mean method, and the standard SOM-based fuzzy map model, the ISOM-DH model applies to a wider range of cases, which demonstrates its advantage. The correctness and soundness of the ISOM-DH model are also validated experimentally in the paper.
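The abstract does not give the details of ISOM-DH, so the following is only a minimal sketch of the generic idea behind SOM-based imputation, not the authors' model. It trains a small SOM using distances computed over the observed coordinates only, then fills each missing entry from the best-matching unit. The ICA step is omitted, and the function names (`train_som`, `impute`), the grid size, the learning schedule, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np

def train_som(X, mask, grid=(4, 4), epochs=20, lr0=0.5, sigma0=2.0, seed=0):
    """Train a small SOM on rows of X using only observed entries (mask == True)."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    rows, cols = grid
    # Grid coordinates of each unit, used by the neighbourhood function.
    coords = np.array([(r, c) for r in range(rows) for c in range(cols)], dtype=float)
    # Initialise weights near the per-column mean of the observed values
    # (assumes every column has at least one observed value).
    col_mean = np.nanmean(np.where(mask, X, np.nan), axis=0)
    W = col_mean + 0.1 * rng.standard_normal((rows * cols, d))
    total, step = epochs * n, 0
    for _ in range(epochs):
        for i in rng.permutation(n):
            frac = step / total
            lr = lr0 * (1.0 - frac)                 # decaying learning rate
            sigma = sigma0 * (1.0 - frac) + 0.5     # shrinking neighbourhood width
            m = mask[i]
            # Best-matching unit: squared distance over observed coordinates only.
            bmu = np.argmin(((W[:, m] - X[i, m]) ** 2).sum(axis=1))
            # Gaussian neighbourhood on the grid around the BMU.
            g = np.exp(-((coords - coords[bmu]) ** 2).sum(axis=1) / (2 * sigma ** 2))
            # Update only the observed coordinates of each unit.
            W[:, m] += lr * g[:, None] * (X[i, m] - W[:, m])
            step += 1
    return W

def impute(X, mask, W):
    """Fill missing entries of each row with the values of its best-matching unit."""
    X_filled = X.copy()
    for i in range(X.shape[0]):
        m = mask[i]
        if m.all():
            continue  # nothing missing in this row
        bmu = np.argmin(((W[:, m] - X[i, m]) ** 2).sum(axis=1))
        X_filled[i, ~m] = W[bmu, ~m]
    return X_filled

# Synthetic, dependent, non-Gaussian data: uniform sources mixed linearly (assumed setup).
rng = np.random.default_rng(1)
S = rng.uniform(-1.0, 1.0, size=(200, 3))
A = np.array([[1.0, 0.5, 0.2], [0.3, 1.0, 0.4], [0.2, 0.6, 1.0]])
X_true = S @ A.T

# Hide roughly 10% of the entries at random.
mask = rng.random(X_true.shape) > 0.1
X_missing = np.where(mask, X_true, np.nan)

W = train_som(X_missing, mask)
X_filled = impute(X_missing, mask, W)

# Mean-method baseline for comparison (one of the baselines named in the abstract).
col_mean = np.nanmean(X_missing, axis=0)
X_mean = np.where(mask, X_true, col_mean)

rmse_som = np.sqrt(np.mean((X_filled[~mask] - X_true[~mask]) ** 2))
rmse_mean = np.sqrt(np.mean((X_mean[~mask] - X_true[~mask]) ** 2))
print(f"RMSE on missing entries: SOM={rmse_som:.3f}, mean={rmse_mean:.3f}")
```

In a pipeline closer in spirit to the paper, one would presumably transform the data with ICA before the map is trained and map the estimates back afterwards; how ISOM-DH actually combines the two steps is not stated in the abstract.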