The linear factorial smoothing for the analysis of incomplete data

  • Authors:
  • Basavanneppa Tallur

  • Affiliations:
  • IRISA, Université de Rennes 1, Rennes, France

  • Venue:
  • PReMI'05 Proceedings of the First international conference on Pattern Recognition and Machine Intelligence
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Huge amounts of data are generated in every field of science and technology and the need for the proper data analysis tools and their adaptation to the ever-increasing data size is more and more crucial. Statistical exploratary data analysis techniques –such as principal component analysis, correspondence analysis, clustering and classification among others– are greatly useful in discovering useful information –or knowledge– hidden in data but they require the data set to be complete. In many situations the data is incomplete for various reasons. Erroneous and uncertain data may also be considered as missing since their use may lead to incorrect results. Many research works have addressed this issue in specific applications. This paper presents a simple and efficient iterative method for estimating the missing values in the data set based on linear factorial smoothing. Though this work was prompted by the recurrent problem faced in the field of bioinformatics while analysing the gene expression data, the method proposed for missing value imputation in this paper may be useful in any area.