Relevant and Non-Redundant Amino Acid Sequence Selection for Protein Functional Site Identification

Authors:
Chandra Das;Pradipta Maji
Affiliations:
West Bengal University of Technology, India;Indian Statistical Institute, India
Venue:
International Journal of Software Science and Computational Intelligence
Year:
2010

Citing 14
Cited 0

The nature of statistical learning theory

The nature of statistical learning theory
Artificial neural network model for predicting HIV protease cleavage sites in protein

Advances in Engineering Software
Bioinformatics: the machine learning approach

Bioinformatics: the machine learning approach
Matching Protein b-Sheet Partners by Feedforward and Recurrent Neural Networks

Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology
Fast and accurate database homology search using upper bounds of local alignment scores

Bioinformatics
Prediction of caspase cleavage sites using Bayesian bio-basis function neural networks

Bioinformatics
RONN: the bio-basis function neural network technique applied to the detection of natively disordered regions in proteins

Bioinformatics
Paradigms of Denotational Mathematics for Cognitive Informatics and Cognitive Computing

Fundamenta Informaticae - Cognitive Informatics, Cognitive Computing, and Their Denotational Mathematical Foundations (I)
Contemporary cybernetics and its facets of cognitive informatics and computational intelligence

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics - Special issue on cybernetics and cognitive informatics
Editorial Recent Advances in Cognitive Informatics

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Cognitive informatics models of the brain

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Orthogonal kernel Machine for the prediction of functional sites in proteins

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Brief communication: Reduced bio basis function neural network for identification of protein phosphorylation sites: comparison with pattern recognition algorithms

Computational Biology and Chemistry
Bio-basis function neural network for prediction of protease cleavage sites in proteins

IEEE Transactions on Neural Networks

Quantified Score

Hi-index	0.00

Visualization

Abstract

In order to apply a powerful pattern recognition algorithm to predict functional sites in proteins, amino acids cannot be used directly as inputs since they are non-numerical variables. Therefore, they need encoding prior to input. In this regard, the bio-basis function maps a non-numerical sequence space to a numerical feature space. One of the important issues for the bio-basis function is how to select a minimum set of bio-basis strings with maximum information. In this paper, an efficient method to select bio-basis strings for the bio-basis function is described integrating the concepts of the Fisher ratio and "degree of resemblance". The integration enables the method to select a minimum set of most informative bio-basis strings. The "degree of resemblance" enables efficient selection of a set of distinct bio-basis strings. In effect, it reduces the redundant features in numerical feature space. Quantitative indices are proposed for evaluating the quality of selected bio-basis strings. The effectiveness of the proposed bio-basis string selection method, along with a comparison with existing methods, is demonstrated on different data sets.