Biologically-aware latent dirichlet allocation (BaLDA) for the classification of expression microarray

  • Authors:
  • Alessandro Perina;Pietro Lovato;Vittorio Murino;Manuele Bicego

  • Affiliations:
  • University of Verona, Verona, Italy;University of Verona, Verona, Italy;University of Verona, Verona, Italy and Italian Institute of Technology, Genova, Italy;University of Verona, Verona, Italy and Italian Institute of Technology, Genova, Italy

  • Venue:
  • PRIB'10 Proceedings of the 5th IAPR international conference on Pattern recognition in bioinformatics
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Topic models have recently shown to be really useful tools for the analysis of microarray experiments. In particular they have been successfully applied to gene clustering and, very recently, also to samples classification. In this latter case, nevertheless, the basic assumption of functional independence between genes is limiting, since many other a priori information about genes' interactions may be available (co-regulation, spatial proximity or other a priori knowledge). In this paper a novel topic model is proposed, which enriches and extends the Latent Dirichlet Allocation (LDA) model by integrating such dependencies, encoded in a categorization of genes. The proposed topic model is used to derive a highly informative and discriminant representation for microarray experiments. Its usefulness, in comparison with standard topic models, has been demonstrated in two different classification tests.