Statistical analysis of microspectroscopy signals for algae classification and phylogenetic comparison

  • Authors:
  • Anna Tonazzini;Primo Coltelli;Paolo Gualtieri

  • Affiliations:
  • Istituto di Scienza e Tecnologie dell'Informazione, CNR, Pisa, Italy;Istituto di Scienza e Tecnologie dell'Informazione, CNR, Pisa, Italy;Istituto di Biofisica, CNR, Pisa, Italy

  • Venue:
  • MDA'06/07 Proceedings of the 2007 international conference on Advances in mass data analysis of signals and images in medicine biotechnology and chemistry
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

We performed microspectroscopic evaluation of the pigment composition of the photosynthetic compartments of algae belonging to different taxonomic divisions and higher plants. In [11], a supervised Gaussian bands decompositions was performed for the pigment spectra, the algae spectrum was modelled as the linear mixture, with unknown coefficients, of the pigment spectra, and a user-guided fitting algorithm was employed. The method provided a reliable discrimination among chlorophylls a, b and c, phycobiliproteins and carotenoids. Comparative analysis of absorption spectra highlighted the evolutionary grouping of the algae into three main lineages in accordance with the most recent endosymbiotic theories. In this paper, we adopt an unsupervised statistical estimation approach to automatically perform both Gaussian bands decomposition of the pigments and algae fitting. In a fully Bayesian setting, we propose estimating both the algae mixture coefficients and the parameters of the pigment spectra decomposition, on the basis of the alga spectrum alone. As a priori information to stabilize this highly underdetermined problem, templates for the pigment spectra are assumed to be available, though, due to their measurements outside the protein moiety, they differ in shape from the real spectra of the pigments present in nature by unknown, slight displacements and contraction/dilatation factors. We propose a classification system subdivided into two phases. In the first, the learning phase, the parameters of the Gaussians decomposition and the shape factors are estimated. In the second phase, the classification phase, the now known real spectra of the pigments are used as a base set to fit any other spectrum of algae. The unsupervised method provided results comparable to those of the previous, supervised method.