An efficient algorithm for the identification of repetitive variable motifs in the regulatory sequences of co-expressed genes

  • Authors:
  • Abanish Singh;Nikola Stojanovic

  • Affiliations:
  • Department of Computer Science and Engineering, The University of Texas at Arlington, Arlington, TX;Department of Computer Science and Engineering, The University of Texas at Arlington, Arlington, TX

  • Venue:
  • ISCIS'06 Proceedings of the 21st international conference on Computer and Information Sciences
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Over the last several years there has been an explosion in the number of computational methods for the detection of transcription factor binding sites in DNA sequences. Although there has been some success in this field, the existing tools are still neither sensitive nor specific enough, usually suffering from the detection of a large number of false positive signals. Given the properties of genomic sequences this is not unexpected, but one can still find interesting features worthy of further computational and laboratory bench study. We present an efficient algorithm developed to find all significant variable motifs in given sequences. In our view, it is important that we generate complete data, upon which separate selection criteria can be applied depending on the nature of the sites one wants to locate, and their biological properties. We discuss our algorithm and our supplementary software, and conclude with an illustration of their application on two eukaryotic data sets.