Semi-Supervised Dimensionality Reduction based on Partial Least Squares for Visual Analysis of High Dimensional Data

  • Authors:
  • Jose Gustavo S. Paiva;William Robson Schwartz;Helio Pedrini;Rosane Minghim

  • Affiliations:
  • USP, Sao Carlos, Brazil and UFU, Uberlandia, Brazil;UNICAMP, Campinas, Brazil and UFMG, Belo Horizonte, Brazil;UNICAMP, Campinas, Brazil;USP, Sao Carlos, Brazil

  • Venue:
  • Computer Graphics Forum
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Dimensionality reduction is employed for visual data analysis as a way to obtaining reduced spaces for high dimensional data or to mapping data directly into 2D or 3D spaces. Although techniques have evolved to improve data segregation on reduced or visual spaces, they have limited capabilities for adjusting the results according to user's knowledge. In this paper, we propose a novel approach to handling both dimensionality reduction and visualization of high dimensional data, taking into account user's input. It employs Partial Least Squares (PLS), a statistical tool to perform retrieval of latent spaces focusing on the discriminability of the data. The method employs a training set for building a highly precise model that can then be applied to a much larger data set very effectively. The reduced data set can be exhibited using various existing visualization techniques. The training data is important to code user's knowledge into the loop. However, this work also devises a strategy for calculating PLS reduced spaces when no training data is available. The approach produces increasingly precise visual mappings as the user feeds back his or her knowledge and is capable of working with small and unbalanced training sets. © 2012 Wiley Periodicals, Inc.