The WEKA data mining software: an update
ACM SIGKDD Explorations Newsletter
Designing Filters for Fast-Known NcRNA Identification
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Hi-index | 0.00 |
Generating reliable alignments for ncRNAs is an important step in ncRNA secondary structure prediction and ncRNA gene finding. Existing sequence alignment programs can generate reliable alignments for ncRNAs with high sequence conservation. For highly structured ncRNAs that may lack strong sequence similarity, structural alignment programs are required. However, conducting reliable structural alignment is much more expensive than sequence alignment and is not ideal for large-scale input such as whole genomes or next-generation sequencing data. In this paper, we propose an accurate ncRNA alignment approach to align highly structured ncRNAs using only sequence similarity. By incorporating posterior probability and a machine learning approach, we can generate accurate alignments of highly structured ncRNAs without using structural information. We tested our approach on over three hundreds of pairs of highly structured ncRNAs from BRAliBase 2.1. The experimental results show that our approach can achieve more accurate alignments than commonly used sequence alignment programs and a popular structural alignment tool. The source codes of glu-RNA can be downloaded at http://sourceforge.net/projects/glu-rna/.