Flexible RNA design under structure and sequence constraints using formal languages

  • Authors:
  • Yu Zhou;Yann Ponty;Stéphane Vialette;Jérôme Waldispuhl;Yi Zhang;Alain Denise

  • Affiliations:
  • LRI, Univ. Paris-Sud, Orsay F-91405, France;LIX, Ecole Polytechnique and AMIB, INRIA Saclay, Palaiseau F-91128, France;LIGM, Univ. Paris-Est, Marne-la-Vallée F-77454, France;McGill University, Montreal, Quebec H3A 2B2, Canada;State Key Laboratory of Virology, Wuhan University, and ABLife Inc, Wuhan, Hubei 430072, China;LRI and IGM, Univ. Paris-Sud, Orsay F-91405 and AMIB, INRIA Saclay, France

  • Venue:
  • Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

The problem of RNA secondary structure design is the following: given a target secondary structure, one aims to create a sequence that folds into, or is compatible with, a given structure. In several practical applications in biology, additional constraints must be taken into account, such as the presence/absence of regulatory motifs, either at a specific location or anywhere in the sequence. In this study, we investigate the design of RNA sequences from their targeted secondary structure, given these additional sequence constraints. To this purpose, we develop a general framework based on concepts of language theory, namely context-free grammars and finite state automata. We efficiently combine a comprehensive set of constraints into a unifying context-free grammar of moderate size. From there, we use generic algorithms to perform a (weighted) random generation, or an exhaustive enumeration, of candidate sequences. The resulting method, whose complexity scales linearly with the length of the RNA, was implemented as a standalone program. The resulting software was embedded into a publicly available dedicated web server. The applicability of the method was demonstrated on a concrete case study dedicated to Exon Splicing Enhancers, in which our approach was successfully used in the design of in vitro experiments.