A comparison of three methods for principal component analysis of fuzzy interval data

Authors:
Paolo Giordani;Henk A. L. Kiers
Affiliations:
Department of Statistics, Probability and Applied Statistics, University of Rome "La Sapienza", P.Le Aldo Moro 5, 00185 Rome, Italy;Heymans Institute (DPMG), University of Groningen, Grote Kruisstraat 2/1, 9712 TS Groningen, The Netherlands
Venue:
Computational Statistics & Data Analysis
Year:
2006

Citing 5
Cited 4

Neural networks and principal component analysis: learning from examples without local minima

Neural Networks
Fuzzy set theory—and its applications (3rd ed.)

Fuzzy set theory—and its applications (3rd ed.)
Analysis of Symbolic Data: Exploratory Methods for Extracting Statistical Information from Complex Data

Analysis of Symbolic Data: Exploratory Methods for Extracting Statistical Information from Complex Data
Editorial: special issue: Interfaces between fuzzy set theory and interval analysis

Fuzzy Sets and Systems - Special issue: Interfaces between fuzzy set theory and interval analysis
Principal component analysis of fuzzy data using autoassociative neural networks

IEEE Transactions on Fuzzy Systems

Book review

Fuzzy Sets and Systems
The fuzzy approach to statistical analysis

Computational Statistics & Data Analysis
Three-way analysis of imprecise data

Journal of Multivariate Analysis
Estimation of a flexible simple linear model for interval data based on set arithmetic

Computational Statistics & Data Analysis

Quantified Score

Hi-index	0.03

Visualization

Abstract

Vertices Principal Component Analysis (V-PCA), and Centers Principal Component Analysis (C-PCA) generalize Principal Component Analysis (PCA) in order to summarize interval valued data. Neural Network Principal Component Analysis (NN-PCA) represents an extension of PCA for fuzzy interval data. However, also the first two methods can be used for analyzing fuzzy interval data, but they then ignore the spread information. In the literature, the V-PCA method is usually considered computationally cumbersome because it requires the transformation of the interval valued data matrix into a single valued data matrix the number of rows of which depends exponentially on the number of variables and linearly on the number of observation units. However, it has been shown that this problem can be overcome by considering the cross-products matrix which is easy to compute. A review of C-PCA and V-PCA (which hence also includes the computational short-cut to V-PCA) and NN-PCA is provided. Furthermore, a comparison is given of the three methods by means of a simulation study and by an application to an empirical data set. In the simulation study, fuzzy interval data are generated according to various models, and it is reported in which conditions each method performs best.