Computing fragmentation trees from metabolite multiple mass spectrometry data

  • Authors:
  • Kerstin Scheubert;Franziska Hufsky;Florian Rasche;Sebastian Böcker

  • Affiliations:
  • Lehrstuhl für Bioinformatik, Friedrich-Schiller-Universität Jena, Jena, Germany;Lehrstuhl für Bioinformatik, Friedrich-Schiller-Universität Jena, Jena, Germany and Max Planck Institute for Chemical Ecology, Beutenberg Campus, Jena, Germany;Lehrstuhl für Bioinformatik, Friedrich-Schiller-Universität Jena, Jena, Germany;Lehrstuhl für Bioinformatik, Friedrich-Schiller-Universität Jena, Jena, Germany

  • Venue:
  • RECOMB'11 Proceedings of the 15th Annual international conference on Research in computational molecular biology
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Since metabolites cannot be predicted from the genome sequence, high-throughput de-novo identification of small molecules is highly sought. Mass spectrometry (MS) in combination with a fragmentation technique is commonly used for this task. Unfortunately, automated analysis of such data is in its infancy. Recently, fragmentation trees have been proposed as an analysis tool for such data. Additional fragmentation steps (MSn) reveal more information about the molecule. We propose to use MSn data for the computation of fragmentation trees, and present the COLORFUL SUBTREE CLOSURE problem to formalize this task: There, we search for a colorful subtree inside a vertexcolored graph, such that the weight of the transitive closure of the subtree is maximal. We give several negative results regarding the tractability and approximability of this and related problems. We then present an exact dynamic programming algorithm, which is parameterized by the number of colors in the graph and is swift in practice. Evaluation of our method on a dataset of 45 reference compounds showed that the quality of constructed fragmentation trees is improved by using MSn instead of MS2 measurements.