Discriminating TATA box from putative TATA boxes in plant genome

Authors:
Raja Loganantharaj
Affiliations:
The Center for Computer Science, University of Louisiana, P.O. Box 44330, Lafayette, LA 70504, USA
Venue:
International Journal of Bioinformatics Research and Applications
Year:
2006

Abstract

The TATA box has been used successfully to identify a transcription start site (TSS) and thereby a promoter. Unfortunately, many substrings fit the profile of a TATA box without being one; such substrings are called putative TATA boxes. We have applied linear and nonlinear classifiers to discriminate true TATA boxes from putative TATA boxes and have compared their performance. We have also investigated how the length of the pair of sequences flanking a putative TATA box influences prediction accuracy. The techniques presented in this paper are general enough to be applicable to other domains and to other genomes.
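
To make the setup concrete, the following is a minimal sketch of how putative TATA boxes and their flanking sequences might be extracted and encoded before classification. It is not the paper's implementation. The simplified consensus `TATA[AT]A[AT]`, the function names `find_putative_tata_boxes` and `one_hot`, and the one-hot encoding are all assumptions made for illustration; the abstract does not specify the profile or the features actually used.

```python
import re

# Assumption: a simplified TATA consensus; the paper's actual profile is not stated.
# The lookahead lets overlapping candidate sites be reported as well.
TATA_PATTERN = re.compile(r"(?=(TATA[AT]A[AT]))")


def find_putative_tata_boxes(sequence, flank_len):
    """Return (start, left_flank, box, right_flank) for every pattern match
    that has flank_len bases available on both sides."""
    sequence = sequence.upper()
    hits = []
    for match in TATA_PATTERN.finditer(sequence):
        start, end = match.start(1), match.end(1)
        # Skip candidates too close to either end to have full flanks.
        if start < flank_len or end + flank_len > len(sequence):
            continue
        left = sequence[start - flank_len:start]
        right = sequence[end:end + flank_len]
        hits.append((start, left, match.group(1), right))
    return hits


def one_hot(flank):
    """Encode a DNA string as a flat 0/1 list, four entries per base (A, C, G, T).
    Any other symbol (for example N) becomes four zeros."""
    alphabet = "ACGT"
    return [1 if base == symbol else 0 for base in flank for symbol in alphabet]


if __name__ == "__main__":
    # Hypothetical toy sequence, not data from the paper.
    for start, left, box, right in find_putative_tata_boxes("GCGCGTATAAATAGGCCG", 4):
        features = one_hot(left) + one_hot(right)
        print(start, left, box, right, len(features))
```

In this sketch `flank_len` plays the role of the flanking-sequence length whose influence on accuracy the abstract mentions. The resulting feature vectors could be passed to any linear or nonlinear classifier, with labels taken from annotated TSS positions.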