Sequential Forward Selection Approach to the Non-unique Oligonucleotide Probe Selection Problem

  • Authors:
  • Lili Wang;Alioune Ngom;Luis Rueda

  • Affiliations:
  • School of Computer Science, 5115 Lambton Tower, University of Windsor, Windsor, Canada N9B 3P4;School of Computer Science, 5115 Lambton Tower, University of Windsor, Windsor, Canada N9B 3P4;Department of Computer Science, University of Concepción, Chile

  • Venue:
  • PRIB '08 Proceedings of the Third IAPR International Conference on Pattern Recognition in Bioinformatics
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In order to accurately measure the gene expression levels in microarray experiments, it is crucial to design unique, highly specific and highly sensitive oligonucleotide probes for the identification of biological agents such as genes in a sample. Unique probes are difficult to obtain for closely related genes such as the known strains of HIV genes. The non-uniqueprobe selection problem is to find one of the smallest probe set that is able to uniquely identify targets in a biological sample. This is an NP-hard problem. We present heuristic for finding near-minimal non-unique probe sets. Our method is a variant of the sequential forward selectionalgorithm, which used for feature subset selection in pattern recognition systems. The heuristic is guided by a probe set selection criterion which evaluates the efficiency and the effectiveness of a probe set in classifying targets genes as present or absent in a biological sample. Our methods outperformed all currently published greedy algorithms for this problem.