Parallel Position Weight Matrices algorithms

  • Authors:
  • Mathieu Giraud;Jean-Stéphane Varré

  • Affiliations:
  • LIFL, UMR CNRS 8022, Université Lille 1, INRIA Lille-Nord Europe, Lille, France;LIFL, UMR CNRS 8022, Université Lille 1, INRIA Lille-Nord Europe, Lille, France

  • Venue:
  • Parallel Computing
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Position Weight Matrices (PWMs) are broadly used in computational biology. The basic problems, Scan and MultipleScan, aim to find all the occurrences of a given PWM or a set of PWMs in long sequences. Some other PWM tasks share a common NP-hard subproblem, ScoreDistribution. The existing algorithms rely on the enumeration on a large set of scores or words, and they are mostly not suitable for parallelization. We propose a new algorithm, BucketScoreDistribution, that is both very efficient and suitable for parallelization. We bound the error induced by this algorithm. We realized a GPU prototype for Scan, MultipleScan and BucketScoreDistribution with the CUDA libraries, and report for the different problems speedups larger than 10x on several Nvidia cards.