Rapid Large-Scale Oligonucleotide Selection for Microarrays

  • Authors:
  • Sven Rahmann

  • Affiliations:
  • -

  • Venue:
  • WABI '02 Proceedings of the Second International Workshop on Algorithms in Bioinformatics
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present the first program that selects short oligonucleotide probes (e.g. 25-mers) for microarray experiments on a large scale. Our approach is up to two orders of magnitude faster than previous approaches (e.g. [2], [3]) and is the first one that allows handling truly large-scale datasets. For example, oligos for human genes can be found within 50 hours. This becomes possible by using the longest common substring as a specificity measure for candidate oligos. We present an algorithm based on a suffix array [1] with additional information that is efficient both in terms of memory usage and running time to rank all candidate oligos according to their specificity. We also introduce the concept of master sequences to describe the sequences from which oligos are to be selected. Constraints such as oligo length, melting temperature, and self-complementarity are incorporated in the master sequence at a preprocessing stage and thus kept separate from the main selection problem. As a result, custom oligos can be designed for any sequenced genome, just as the technology for on-site chip synthesis is becoming increasingly mature. Details will be given in the presentation and can be found in [4].