Acta Informatica
Efficiently mining frequent trees in a forest
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
gSpan: Graph-Based Substructure Pattern Mining
ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
CloseGraph: mining closed frequent graph patterns
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Hi-index | 0.00 |
RNA plays key roles in many biological processes, and its functiondepends largely on its three-dimensional structure. We describe a comparativeapproach to learning biologically important RNA structures, including thosethat are not the predicted minimum free energy (MFE) structure. Our approachidentifies the greatest conserved structure(s) in a set of RNA sequences, even inthe presence of sequences that have no conserved features. We convert RNAstructures to a graph representation (XIOS RNA graph) that includes pseudoknots,and mutually exclusive structures, thereby simultaneously representingensembles of RNA structures. By modifying existing algorithms for maximalsubgraph isomorphism, we can identify the similar portions of the graphs andintegrate this with MFE structure prediction tools to identify biologically relevantnear-MFE conserved structures.