Efficient bubble enumeration in directed graphs

  • Authors:
  • Etienne Birmelé;Pierluigi Crescenzi;Rui Ferreira;Roberto Grossi;Vincent Lacroix;Andrea Marino;Nadia Pisanti;Gustavo Sacomoto;Marie-France Sagot

  • Affiliations:
  • INRIA Grenoble Rhône-Alpes, France,Université d'Évry, France;Dipartimento di Sistemi e Informatica, Università di Firenze, Firenze, Italy;Dipartimento di Informatica, Università di Pisa, Pisa, Italy;Dipartimento di Informatica, Università di Pisa, Pisa, Italy;INRIA Grenoble Rhône-Alpes, France,Université de Lyon 1, Villeurbanne, France;INRIA Grenoble Rhône-Alpes, France,Dipartimento di Sistemi e Informatica, Università di Firenze, Firenze, Italy;Dipartimento di Informatica, Università di Pisa, Pisa, Italy;INRIA Grenoble Rhône-Alpes, France,Université de Lyon 1, Villeurbanne, France;INRIA Grenoble Rhône-Alpes, France,Université de Lyon 1, Villeurbanne, France

  • Venue:
  • SPIRE'12 Proceedings of the 19th international conference on String Processing and Information Retrieval
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Polymorphisms in DNA- or RNA-seq data lead to recognisable patterns in a de Bruijn graph representation of the reads obtained by sequencing. Such patterns have been called mouths, or bubbles in the literature. They correspond to two vertex-disjoint directed paths between a source s and a target t. Due to the high number of such bubbles that may be present in real data, their enumeration is a major issue concerning the efficiency of dedicated algorithms. We propose in this paper the first linear delay algorithm to enumerate all bubbles with a given source.