Biplots of fuzzy coded data

  • Authors: Zerrin Aşan; Michael Greenacre

  • Affiliations: Department of Statistics, Anadolu University, Eskişehir, Turkey; Department of Economics and Business, Universitat Pompeu Fabra, Barcelona, Spain and Barcelona Graduate School of Economics, Barcelona, Spain

  • Venue: Fuzzy Sets and Systems
  • Year: 2011

Abstract

A biplot, which is the multivariate generalization of the two-variable scatterplot, can be used to visualize the results of many multivariate techniques, especially those that are based on the singular value decomposition. We consider data sets consisting of continuous-scale measurements, their fuzzy coding and the biplots that visualize them, using a fuzzy version of multiple correspondence analysis. Of special interest is the way quality of fit of the biplot is measured, since it is well known that regular (i.e., crisp) multiple correspondence analysis seriously underestimates this measure. We show how the results of fuzzy multiple correspondence analysis can be defuzzified to obtain estimated values of the original data, and prove that this implies an orthogonal decomposition of variance. This permits a measure-of-fit to be calculated in the familiar form of a percentage of explained variance, which is directly comparable to the corresponding fit measure used in principal component analysis of the original data. The approach is motivated initially by its application to a simulated data set, showing how the fuzzy approach can lead to diagnosing nonlinear relationships, and finally it is applied to a real set of meteorological data.
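
To make the terms "fuzzy coding" and "defuzzification" concrete, the following is a minimal sketch and not the authors' implementation. It assumes triangular membership functions defined by a set of hinge points, a choice the abstract does not specify. The function names `fuzzy_code` and `defuzzify` and the example hinge values are hypothetical. Each continuous value is spread over at most two adjacent categories with memberships that sum to 1, and taking the membership-weighted average of the hinge points returns the original value.

```python
import numpy as np


def fuzzy_code(x, hinges):
    """Fuzzy-code a 1-D array with triangular membership functions.

    Assumption: hinges are strictly increasing, and values outside
    [hinges[0], hinges[-1]] are clipped to that range.
    """
    x = np.asarray(x, dtype=float)
    hinges = np.asarray(hinges, dtype=float)
    x = np.clip(x, hinges[0], hinges[-1])

    memberships = np.zeros((x.size, hinges.size))
    # Index of the hinge at or immediately to the left of each value.
    left = np.searchsorted(hinges, x, side="right") - 1
    left = np.clip(left, 0, hinges.size - 2)

    # Linear interpolation weight towards the right-hand hinge.
    width = hinges[left + 1] - hinges[left]
    right_weight = (x - hinges[left]) / width

    rows = np.arange(x.size)
    memberships[rows, left] = 1.0 - right_weight
    memberships[rows, left + 1] = right_weight
    return memberships


def defuzzify(memberships, hinges):
    """Map memberships back to the original scale as a weighted average of hinges."""
    return np.asarray(memberships, dtype=float) @ np.asarray(hinges, dtype=float)


# Illustrative values only; three hinges give "low", "medium", "high" categories.
values = np.array([0.0, 2.5, 5.0, 7.5, 10.0])
hinges = np.array([0.0, 5.0, 10.0])

Z = fuzzy_code(values, hinges)
recovered = defuzzify(Z, hinges)
```

In this sketch defuzzification is applied to the exact memberships, so the values are recovered without error. In the setting the abstract describes, the same kind of mapping would instead be applied to memberships estimated from a low-dimensional solution, giving estimated values whose explained variance can be compared with the original data.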