A high recall DNA splice site prediction based on association analysis

  • Authors:
  • Nittaya Kerdprasop;Kittisak Kerdprasop

  • Affiliations:
  • Data Engineering and Knowledge Discovery Research Unit, School of Computer Engineering, Suranaree University of Technology, Nakhon Ratchasima, Thailand;Data Engineering and Knowledge Discovery Research Unit, School of Computer Engineering, Suranaree University of Technology, Nakhon Ratchasima, Thailand

  • Venue:
  • ACS'10 Proceedings of the 10th WSEAS international conference on Applied computer science
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Genes in complex organisms such as primates and humans are composed of regions that code for protein creation, called exons, and non-coding regions, called introns. During the transcription from the DNA template for later translating into amino acid chain of protein structure, introns are to be removed and exons are then joined to form a continuous messenger-RNA strand. Splice sites are the junctions between introns and exons. Accurate detection of splice sites from the fragments of DNA sequence is important to the success of gene prediction. In this paper, we propose a splice site prediction technique based on association analysis, named assoDNA. We apply association mining to each splice junction types, that is, exon/intron, intron/exon, and none of the two types. The frequent DNA patterns are then combined and prioritized with respect to their annotated confidence and support values. The final result of our method is a set of cascaded rules to be used for gene prediction. From the experimental results, our method can make a high recall prediction on a test set comparative to other classification-based methods.