Accurately predicting transcription start sites using logitlinear model and local oligonucleotide frequencies

  • Authors:
  • Jia Wang;Chuang Ma;Dao Zhou;Libin Zhang;Yanhong Zhou

  • Affiliations:
  • Hubei Bioinformatics and Molecular Imaging Key Laboratory, Huazhong University of Science and Technology, Wuhan, Hubei, China;Hubei Bioinformatics and Molecular Imaging Key Laboratory, Huazhong University of Science and Technology, Wuhan, Hubei, China;Hubei Bioinformatics and Molecular Imaging Key Laboratory, Huazhong University of Science and Technology, Wuhan, Hubei, China;Hubei Bioinformatics and Molecular Imaging Key Laboratory, Huazhong University of Science and Technology, Wuhan, Hubei, China;Hubei Bioinformatics and Molecular Imaging Key Laboratory, Huazhong University of Science and Technology, Wuhan, Hubei, China

  • Venue:
  • ICIC'11 Proceedings of the 7th international conference on Intelligent Computing: bio-inspired computing and applications
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this study, we construct a transcription start site (TSS) prediction model using the logitlinear model and the genomic context features mined in promoter regions. We also develop a computational program named ProKey that is able to accurately predict TSSs in long DNA sequences. Performance evaluation results on the whole human genome show that ProKey could achieve 71.2% sensitivity and 76.3% specificity at the resolution level of 2000bp. Further comparison results exhibit that the correlation coefficient (CC) value of ProKey is higher than that of DragonGSF and Eponine.