SARNA-ensemble-predict: the effect of different dissimilarity metrics on a novel ensemble-based RNA secondary structure prediction algorithm

  • Authors:
  • Herbert H. Tsang;Kay C. Wiese

  • Affiliations:
  • School of Computing Science, Simon Fraser University, Surrey, British Columbia, Canada;School of Computing Science, Simon Fraser University, Surrey, British Columbia, Canada

  • Venue:
  • CIBCB'09 Proceedings of the 6th Annual IEEE conference on Computational Intelligence in Bioinformatics and Computational Biology
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Recently, there is a resurgence of interest in the RNA secondary structure prediction problem due to the discovery of many new families of non-coding RNAs with a variety of functions. This paper describes and presents a novel algorithm for RNA secondary structure prediction based on an ensemble-based approach. An evaluation of the performance in terms of sensitivity and specificity is made. Experiments were performed on eleven structures from four RNA classes (RNaseP, Group I intron 16S rRNA, Group I intron 23S rRNA and 16S rRNA). Three RNA secondary structure similarity metrics (base pair distance, tree edit distance, and thermodynamic energy distance) and their effects on the clustering algorithm were explored. The significant contribution of this paper is in the examining of the various results from employing different dissimilarity metrics. Overall, the base pair distance dissimilarity metric shows better results with the other two distance metrics (tree edit distance and thermodynamic energy distance). The results presented in this paper demonstrate that SARNA-Ensemble-Predict can give comparable performance to a state-of-the-art algorithm Sfold in terms of sensitivity.