Improved Algorithms for Parsing ESLTAGs: A Grammatical Model Suitable for RNA Pseudoknots

  • Authors:
  • Sanguthevar Rajasekaran;Sahar Al Seesi;Reda A. Ammar

  • Affiliations:
  • University of Connecticut, Storrs;University of Connecticut, Storrs;University of Connecticut, Storrs

  • Venue:
  • IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Formal grammars have been employed in biology to solve various important problems. In particular, grammars have been used to model and predict RNA structures. Two such grammars are Simple Linear Tree Adjoining Grammars (SLTAGs) and Extended SLTAGs (ESLTAGs). Performances of techniques that employ grammatical formalisms critically depend on the efficiency of the underlying parsing algorithms. In this paper, we present efficient algorithms for parsing SLTAGs and ESLTAGs. Our algorithm for SLTAGs parsing takes O({\rm min} \{m,n^4\} ) time and O({\rm min} \{m,n^4\} ) space, where m is the number of entries that will ever be made in the matrix M (that is normally used by TAG parsing algorithms). Our algorithm for ESLTAGs parsing takes O(n{\rm min} \{m,n^4\} ) time and O({\rm min} \{m,n^4\} ) space. We show that these algorithms perform better, in practice, than the algorithms of Uemura et al. [21].