Structural alignment of biomolecules by text modeling techniques

Authors:
Jafar Razmara;Safaai B. Deris;Rosli Md Illias
Affiliations:
Faculty of Computer Science and Information Systems, Universiti Teknologi Malaysia, Johor Bahru, Malaysia;Faculty of Computer Science and Information Systems, Universiti Teknologi Malaysia, Johor Bahru, Malaysia;Faculty of Chemical and Natural Resources Engineering, Universiti Teknologi Malaysia, Johor Bahru, Malaysia
Venue:
ACE'10 Proceedings of the 9th WSEAS international conference on Applications of computer engineering
Year:
2010

Abstract

In the era of structural biology, efficient and effective tools are needed to compare and align the 3D structures of biomolecules. Although a great number of structural comparison and alignment methods have been developed, none of them gives an exact solution to the problem. In this paper, we introduce a novel method for the structural alignment of proteins based on language modelling techniques. The method summarizes the secondary and tertiary structure of a protein in two textual sequences. The first sequence is used for the initial superposition of secondary structure elements, and the second is employed to align the 3D structures of the two compared proteins. To compare sequences, the method applies a technique from computational linguistics for analysing and comparing textual data: the cross-entropy measure over n-gram models is used to capture regularities between the sequences of protein structures. Experiments were performed to compare the performance of the method with that of other structure alignment methods. The results reported here provide evidence for the usefulness of the new approach and for its advantages and applicability compared with other related methods.
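
The abstract does not give the exact form of the cross-entropy measure, so the following is only a minimal sketch of how a cross-entropy over n-gram models can score one structure string against another. The function names (`ngram_counts`, `cross_entropy`), the three-letter alphabet `HEC` (helix, strand, coil), the bigram order, and the add-one smoothing are all assumptions made for illustration; they are not taken from the paper.

```python
import math
from collections import Counter


def ngram_counts(seq, n):
    """Count every contiguous n-gram of length n in seq."""
    return Counter(seq[i:i + n] for i in range(len(seq) - n + 1))


def cross_entropy(reference, query, n=2, alphabet="HEC"):
    """Cross-entropy (bits per n-gram) of `query` under an n-gram model
    estimated from `reference`.

    Assumptions (not from the paper): add-one smoothing, and every
    symbol of both strings belongs to `alphabet`.
    """
    total = len(query) - n + 1
    if total <= 0:
        raise ValueError("query must contain at least one n-gram")

    grams = ngram_counts(reference, n)
    # Context count = number of reference n-grams that start with that context.
    contexts = Counter()
    for gram, count in grams.items():
        contexts[gram[:-1]] += count

    vocab = len(alphabet)
    log_prob = 0.0
    for i in range(total):
        gram = query[i:i + n]
        # Smoothed conditional probability of the last symbol given its context.
        p = (grams[gram] + 1) / (contexts[gram[:-1]] + vocab)
        log_prob += math.log2(p)
    return -log_prob / total


if __name__ == "__main__":
    reference = "HHHHHHCCEEEEEECCHHHHHH"
    similar = "HHHHHCCEEEEECCHHHHH"
    dissimilar = "HEHEHEHEHEHEHEHE"
    print(cross_entropy(reference, similar))     # lower value: closer match
    print(cross_entropy(reference, dissimilar))  # higher value: poorer match
```

In this sketch a lower cross-entropy means the query string is better predicted by the model built from the reference string, which is how such a score could be used to rank candidate structures by similarity.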