International Journal of Bioinformatics Research and Applications
Splice site detection in DNA sequences using a fast classification algorithm
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Declarative belief set merging using merging plans
PADL'11 Proceedings of the 13th international conference on Practical aspects of declarative languages
The classification of human gene sequences into exons and introns is an important but difficult problem. We study the discriminative power of 22 statistical features in terms of their mutual information (MI). Through correlation analysis, we identify a set of features that have high MI values and are also complementary in their information content. Using this set, which consists of the three SZ features, the AMI feature, and the first stop codon feature, we achieve classification accuracy as high as 92%.
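As an illustration of the two quantities the abstract relies on, the following is a minimal sketch, not the authors' implementation. It assumes each feature has already been discretized into a small number of bins and that class labels are exon/intron indicators. The function names `mutual_information` and `pearson` are hypothetical, and Pearson correlation is used here only as one plausible stand-in for the correlation analysis, which the abstract does not specify.

```python
import math
from collections import Counter

def mutual_information(xs, ys):
    """Empirical mutual information (in bits) between two equal-length
    sequences of discrete values, e.g. a binned feature and a class label."""
    n = len(xs)
    px = Counter(xs)            # marginal counts of x
    py = Counter(ys)            # marginal counts of y
    pxy = Counter(zip(xs, ys))  # joint counts of (x, y)
    mi = 0.0
    for (x, y), c in pxy.items():
        # p(x,y) * log2( p(x,y) / (p(x) * p(y)) ), written with counts
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi

def pearson(a, b):
    """Pearson correlation between two equal-length numeric sequences.
    Assumes neither sequence is constant (non-zero variance)."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)
```

Under these assumptions, a feature would be a good candidate when its MI with the class label is high and its correlation with the features already selected is low, which is one way to read "high MI" combined with "complementary information content".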