Phylogenetic distances are encoded in networks of interacting pathways

  • Authors:
  • Aurélien Mazurie;Danail Bonchev;Benno Schwikowski;Gregory A. Buck

  • Affiliations:
  • -;-;-;-

  • Venue:
  • Bioinformatics
  • Year:
  • 2008

Quantified Score

Hi-index 3.84

Visualization

Abstract

Motivation: Although metabolic reactions are unquestionably shaped by evolutionary processes, the degree to which the overall structure and complexity of their interconnections are linked to the phylogeny of species has not been evaluated in depth. Here, we apply an original metabolome representation, termed Network of Interacting Pathways or NIP, with a combination of graph theoretical and machine learning strategies, to address this question. NIPs compress the information of the metabolic network exhibited by a species into much smaller networks of overlapping metabolic pathways, where nodes are pathways and links are the metabolites they exchange. Results: Our analysis shows that a small set of descriptors of the structure and complexity of the NIPs combined into regression models reproduce very accurately reference phylogenetic distances derived from 16S rRNA sequences (10-fold cross-validation correlation coefficient higher than 0.9). Our method also showed better scores than previous work on metabolism-based phylogenetic reconstructions, as assessed by branch distances score, topological similarity and second cousins score. Thus, our metabolome representation as network of overlapping metabolic pathways captures sufficient information about the underlying evolutionary events leading to the formation of metabolic networks and species phylogeny. It is important to note that precise knowledge of all of the reactions in these pathways is not required for these reconstructions. These observations underscore the potential for the use of abstract, modular representations of metabolic reactions as tools in studying the evolution of species. Contact: aurelien.mazurie@pasteur.fr Supplementary information: Supplementary data are available at Bioinformatics online.