Discriminating TATA box from putative TATA boxes in plant genome

  • Authors:
  • Raja Loganantharaj

  • Affiliations:
  • The Center for Computer Science, University of Louisiana, P.O. Box 44330, Lafayette, LA 70504, USA

  • Venue:
  • International Journal of Bioinformatics Research and Applications
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The TATA box has been used successfully to identify a transcription start site (TSS) and thereby a promoter. Unfortunately, there are many substrings which fit the profile of a TATA box and such substrings are called putative TATA boxes. We have applied linear and non linear classifiers for discriminating TATA box from putative TATA boxes and have compared their performances. We have also investigated the influence of the length of the pair of sequences flanking a putative TATA box on the prediction accuracy. The techniques we have presented in this paper are general enough to be applicable to other domains or to other genomes.