The greedy path-merging algorithm for contig scaffolding

  • Authors:
  • Daniel H. Huson; Knut Reinert; Eugene W. Myers

  • Affiliations:
  • Tübingen University, Tübingen, Germany; Free University Berlin, Berlin, Germany; University of California Berkeley, Berkeley, CA

  • Venue:
  • Journal of the ACM (JACM)
  • Year:
  • 2002

Abstract

Given a collection of contigs and mate-pairs, the Contig Scaffolding Problem is to order and orient the given contigs in a manner that is consistent with as many mate-pairs as possible. This paper describes an efficient heuristic, the greedy path-merging algorithm, for solving this problem. The method was originally developed as a key component of the compartmentalized assembly strategy at Celera Genomics. This interim approach was used at an early stage of the sequencing of the human genome to produce a preliminary assembly from preliminary whole-genome shotgun data produced at Celera and preliminary human contigs produced by the Human Genome Project.
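
To make the objective stated in the abstract concrete, the following is a minimal sketch of how one might count the mate-pairs that a candidate ordering and orientation of contigs satisfies. It is not the greedy path-merging algorithm itself and is not taken from the paper. All names (Read, MatePair, Placement, count_consistent) are hypothetical, and the consistency rule used here (mates face each other, and their distance lies within three standard deviations of the library mean) is an assumed simplification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Read:
    contig: str     # name of the contig the read lies on
    offset: int     # 5' coordinate of the read within that contig
    forward: bool   # True if the read points toward increasing contig coordinates


@dataclass(frozen=True)
class MatePair:
    a: Read
    b: Read
    mean: float     # expected distance between the two 5' ends
    sd: float       # standard deviation of that distance


@dataclass(frozen=True)
class Placement:
    start: int      # leftmost scaffold coordinate of the contig
    flipped: bool   # True if the contig is reverse-complemented in the scaffold


def place(read, lengths, layout):
    """Map a read to (scaffold coordinate, points-right?) under a layout."""
    p = layout[read.contig]
    if p.flipped:
        # Reverse-complementing a contig mirrors coordinates and flips direction.
        return p.start + lengths[read.contig] - read.offset, not read.forward
    return p.start + read.offset, read.forward


def is_consistent(mp, lengths, layout, tolerance=3.0):
    """Assumed rule: mates face each other at roughly the expected distance."""
    pos_a, fwd_a = place(mp.a, lengths, layout)
    pos_b, fwd_b = place(mp.b, lengths, layout)
    if fwd_a == fwd_b:
        return False  # mates must point in opposite directions
    # The right-pointing read must lie to the left of the left-pointing one.
    left, right = (pos_a, pos_b) if fwd_a else (pos_b, pos_a)
    distance = right - left
    return distance >= 0 and abs(distance - mp.mean) <= tolerance * mp.sd


def count_consistent(mate_pairs, lengths, layout):
    """Number of mate-pairs satisfied by the given ordering and orientation."""
    return sum(is_consistent(mp, lengths, layout) for mp in mate_pairs)


# Toy instance: two contigs linked by one mate-pair (illustrative values only).
lengths = {"c1": 1000, "c2": 800}
pairs = [MatePair(Read("c1", 900, True), Read("c2", 200, False), mean=400, sd=40)]

layout_ok = {"c1": Placement(0, False), "c2": Placement(1100, False)}
layout_flipped = {"c1": Placement(0, False), "c2": Placement(1100, True)}

satisfied_ok = count_consistent(pairs, lengths, layout_ok)
satisfied_flipped = count_consistent(pairs, lengths, layout_flipped)
```

In this toy instance the first layout places the two mates 400 positions apart and facing each other, so the pair counts as satisfied. Flipping c2 makes both reads point the same way, so the pair is violated. A scaffolding heuristic searches for a layout that maximizes this count.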