DDPIn: distance and density based protein indexing

Authors:
David Hoksza
Affiliations:
Venue:
CIBCB'09 Proceedings of the 6th Annual IEEE conference on Computational Intelligence in Bioinformatics and Computational Biology
Year:
2009

Citing 5
Cited 0

Geometric Hashing: An Overview

IEEE Computational Science & Engineering
M-tree: An Efficient Access Method for Similarity Search in Metric Spaces

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Towards Index-based Similarity Search for Protein Structure Databases

CSB '03 Proceedings of the IEEE Computer Society Conference on Bioinformatics
CTSS: A Robust and Efficient Method for Protein Structure Alignment Based on Local Geometrical and Biological Features

CSB '03 Proceedings of the IEEE Computer Society Conference on Bioinformatics
PSIST: Indexing Protein Structures Using Suffix Trees

CSB '05 Proceedings of the 2005 IEEE Computational Systems Bioinformatics Conference

Quantified Score

Hi-index	0.00

Visualization

Abstract

Protein structure similarity and classification methods have many applications in protein function prediction and associated fields (e.g. drug discovery). In this paper, we propose a new protein structure representation method enabling fast and accurate classification. In our approach, each protein structure is represented by number of vectors (based on histogram of distances) equivalent to the number of its Cα residues. Each Cα residue represents a viewpoint from which the distances to each of the other residues are computed. Consequently, we use several methods to convert these distances into a n-dimensional feature vector which is indexed using a metric indexing structure (M-tree is the structure of our choice). While searching, we use single or multi-step approach which provides us with classification accuracy and speed comparable to the best contemporary classification methods.