Multivariate methods for the integration and visualization of omics data

  • Authors:
  • Alex Sánchez;José Fernández-Real;Esteban Vegas;Francesc Carmona;Jacques Amar;Remy Burcelin;Matteo Serino;Francisco Tinahones;M. Carmen Ruíz de Villa;Antonio Minãrro;Ferran Reverter

  • Affiliations:
  • -;-;-;-;-;-;-;-;-;-;-

  • Venue:
  • JBI'10 Proceedings of the 10th Spanish conference on Bioinformatics for Personalized Medicine
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

As the developments in high throughput technologies have become more common and accessible it is becoming usual to take several distinct simultaneous approaches to study the same problem. In practice, this means that data of different types (expression, proteins, metabolites...) may be available for the same study, highlighting the need for methods and tools to analyze them in a combined way. In recent years there have been developed many methods that allow for the integrated analysis of different types of data. Corresponding to a certain tradition in bioinformatics many methodologies are rooted in machine learning such as bayesian networks, support vector machines or graph-based methods. In contrast with the high number of applications from these fields, another that seems to have contributed less to "omic" data integration is multivariate statistics, which has however a long tradition in being used to combine and visualize multidimensional data. In this work, we discuss the application of multivariate statistical approaches to integrate bio-molecular information by using multiple factorial analysis. The techniques are applied to a real unpublished data set consisting of three different data types: clinical variables, expression microarrays and DNA Gel Electrophoretic bands. We show how these statistical techniques can be used to perform reduction dimension and then visualize data of one type useful to explain those from other types. Whereas this is more or less straightforward when we deal with two types of data it turns to be more complicated when the goal is to visualize simultaneously more than two types. Comparison between the approaches shows that the information they provide is complementary suggesting their combined use yields more information than simply using one of them.