Principal component analysis of measures, with special emphasis on grain-size curves

  • Authors:
  • Claude Manté;Anne-Françoise Yao;Claude Degiovanni

  • Affiliations:
  • LMGEM, Modelling, Statistics and Biological Systems Data Analysis Team, UMR C.N.R.S. 6117, Campus de Luminy, Case 901, F13288 Marseille Cedex 09, France;LMGEM, Modelling, Statistics and Biological Systems Data Analysis Team, UMR C.N.R.S. 6117, Campus de Luminy, Case 901, F13288 Marseille Cedex 09, France;DIMAR, UMR C.N.R.S. 6540, Campus de Luminy, Case 901, F13288 Marseille Cedex 09, France

  • Venue:
  • Computational Statistics & Data Analysis
  • Year:
  • 2007

Quantified Score

Hi-index 0.03

Visualization

Abstract

A new exploratory method for analyzing a set F^1,...,F^n of distribution functions is proposed. It consists in analyzing an approximation of the table of distances between the densities dF^1/d@m,...,dF^n/d@m, where @m is a fixed reference probability. This probability makes it possible to take into account changes of scale, or methodological options linked with the phenomenon investigated. It is shown that this analysis is (in some sense) invariant under change of scale, once the reference probabilistic framework has been fixed. It practically consists in conducting an ordinary principal components analysis on raw data (i.e. cumulative curves), using a metrics depending on the reference probability space. It is noteworthy that this method is immediately extensible to any set of absolutely continuous bounded signed measures supported by a fixed bounded interval. This method is applied to surface sediments from the Berre lagoon (Southern France), since grain-size curves have the same characteristics as distribution functions. Two reference probabilities were used: the first one stems from the physics of sediment transport, the second one is classical in sedimentology. Both types of analyses evidenced the main features of the lagoon, with some differences due to distinct reference probabilities.