Finding maximum colorful subtrees in practice

  • Authors:
  • Imran Rauf;Florian Rasche;François Nicolas;Sebastian Böcker

  • Affiliations:
  • Department of Computer Science, University of Karachi, Karachi, Pakistan;Lehrstuhl für Bioinformatik, Friedrich-Schiller-Universität Jena, Jena, Germany;Lehrstuhl für Bioinformatik, Friedrich-Schiller-Universität Jena, Jena, Germany;Lehrstuhl für Bioinformatik, Friedrich-Schiller-Universität Jena, Jena, Germany

  • Venue:
  • RECOMB'12: Proceedings of the 16th Annual International Conference on Research in Computational Molecular Biology
  • Year:
  • 2012

Abstract

In metabolomics and other fields dealing with small compounds, mass spectrometry is applied as a sensitive high-throughput technique. Recently, fragmentation trees have been proposed to automatically analyze the fragmentation mass spectra recorded by such instruments. Computationally, this leads to the problem of finding a maximum-weight subtree in an edge-weighted and vertex-colored graph, such that every color appears at most once in the solution. We introduce new heuristics and an exact algorithm for this Maximum Colorful Subtree problem, and evaluate them against existing algorithms on real-world datasets. Our tree completion heuristic consistently scores better than other heuristics, while the integer programming-based algorithm produces optimal trees with modest running times. Our fast and accurate heuristic can help to determine molecular formulas based on fragmentation trees. On the other hand, optimal trees from the integer linear program are useful if the tree structure itself is relevant, e.g., for tree alignments.
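
To make the problem statement concrete, the following is a minimal sketch of a simple greedy baseline for the Maximum Colorful Subtree problem. It is not the tree completion heuristic or the integer linear program from the paper, whose details the abstract does not give. It assumes a directed graph with a designated root; the function name `greedy_colorful_subtree`, the dictionary-based input format, and the toy instance are illustrative assumptions.

```python
def greedy_colorful_subtree(edges, color, root):
    """Grow a colorful tree from `root`, one edge at a time.

    edges: dict mapping (u, v) -> weight (assumed input format)
    color: dict mapping vertex -> color
    Returns (list of chosen edges, total weight).
    """
    in_tree = {root}
    used_colors = {color[root]}
    chosen = []
    total = 0.0
    while True:
        best = None
        for (u, v), w in edges.items():
            # Candidate edges leave the current tree, enter a new vertex
            # whose color is still unused, and have positive weight.
            if (u in in_tree and v not in in_tree
                    and color[v] not in used_colors and w > 0):
                if best is None or w > best[2]:
                    best = (u, v, w)
        if best is None:
            break
        u, v, w = best
        in_tree.add(v)
        used_colors.add(color[v])
        chosen.append((u, v))
        total += w
    return chosen, total


# Toy instance (hypothetical): vertices "a" and "b" share color 1.
color = {"r": 0, "a": 1, "b": 1, "c": 2}
edges = {("r", "a"): 5.0, ("r", "b"): 4.0, ("a", "c"): 1.0, ("b", "c"): 6.0}
tree, score = greedy_colorful_subtree(edges, color, "r")
print(tree, score)
```

On this toy instance the greedy choice takes the edge r→a first, which blocks "b" because both vertices have color 1, and ends with total weight 6. The colorful tree r→b→c has weight 10, which illustrates why simple greedy strategies can miss the optimum and why better heuristics and exact methods are of interest.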