Quantifiable data mining using principal component analysis
Quantifiable data mining using principal component analysis
LAPACK Users' guide (third ed.)
LAPACK Users' guide (third ed.)
Principal Direction Divisive Partitioning
Data Mining and Knowledge Discovery
Enhancing Clustering Quality through Landmark-Based Dimensionality Reduction
ACM Transactions on Knowledge Discovery from Data (TKDD)
Distributed knowledge discovery with non linear dimensionality reduction
PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
ACM Transactions on Knowledge Discovery from Data (TKDD)
Hi-index | 0.00 |
Principal component analysis (PCA) is a statistical technique to identify the dependency structure of multivariate stochastic observations. PCA is frequently used in data mining applications. This paper considers PCA in the context of the emerging network-based computing environments. It offers a technique to perform PCA from distributed and heterogeneous data sets with relatively small communication overhead. The technique is evaluated against different data sets, including a data set for a web mining application. This approach is likely to facilitate the development of distributed clustering, associative link analysis, and other heterogeneous data mining applications that frequently use PCA.