Bayesian Robust PCA for Incomplete Data

  • Authors:
  • Jaakko Luttinen;Alexander Ilin;Juha Karhunen

  • Affiliations:
  • Department of Information and Computer Science, Helsinki University of Technology TKK, Espoo, Finland FI-02015 TKK;Department of Information and Computer Science, Helsinki University of Technology TKK, Espoo, Finland FI-02015 TKK;Department of Information and Computer Science, Helsinki University of Technology TKK, Espoo, Finland FI-02015 TKK

  • Venue:
  • ICA '09 Proceedings of the 8th International Conference on Independent Component Analysis and Signal Separation
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a probabilistic model for robust principal component analysis (PCA) in which the observation noise is modelled by Student-t distributions that are independent for different data dimensions. A heavy-tailed noise distribution is used to reduce the negative effect of outliers. Intractability of posterior evaluation is solved using variational Bayesian approximation methods. We show experimentally that the proposed model can be a useful tool for PCA preprocessing for incomplete noisy data. We also demonstrate that the assumed noise model can yield more accurate reconstructions of missing values: Corrupted dimensions of a "bad" sample may be reconstructed well from other dimensions of the same data vector. The model was motivated by a real-world weather dataset which was used for comparison of the proposed technique to relevant probabilistic PCA models.