A pure L1-norm principal component analysis

  • Authors:
  • J. P. Brooks;J. H. Dulá;E. L. Boone

  • Affiliations:
  • Department of Statistical Sciences and Operations Research, Virginia Commonwealth University, Richmond, VA 23284, United States;Department of Management, Virginia Commonwealth University, Richmond, VA 23284, United States;Department of Statistical Sciences and Operations Research, Virginia Commonwealth University, Richmond, VA 23284, United States

  • Venue:
  • Computational Statistics & Data Analysis
  • Year:
  • 2013

Quantified Score

Hi-index 0.03

Visualization

Abstract

The L"1 norm has been applied in numerous variations of principal component analysis (PCA). An L"1-norm PCA is an attractive alternative to traditional L"2-based PCA because it can impart robustness in the presence of outliers and is indicated for models where standard Gaussian assumptions about the noise may not apply. Of all the previously-proposed PCA schemes that recast PCA as an optimization problem involving the L"1 norm, none provide globally optimal solutions in polynomial time. This paper proposes an L"1-norm PCA procedure based on the efficient calculation of the optimal solution of the L"1-norm best-fit hyperplane problem. We present a procedure called L"1-PCA^* based on the application of this idea that fits data to subspaces of successively smaller dimension. The procedure is implemented and tested on a diverse problem suite. Our tests show that L"1-PCA^* is the indicated procedure in the presence of unbalanced outlier contamination.