Data fusion in robotics and machine intelligence
Data fusion in robotics and machine intelligence
On Measuring Uncertainty and Uncertainty-Based Information: Recent Developments
Annals of Mathematics and Artificial Intelligence
Combining Pattern Classifiers: Methods and Algorithms
Combining Pattern Classifiers: Methods and Algorithms
Uncertainty and Information: Foundations of Generalized Information Theory
Uncertainty and Information: Foundations of Generalized Information Theory
Ensemble classifier for protein fold pattern recognition
Bioinformatics
A new expertness index for assessment of secondary structure prediction engines
Computational Biology and Chemistry
Supervised machine learning algorithms for protein structure classification
Computational Biology and Chemistry
An evidence-theoretic k-NN rule with parameter optimization
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
On equivalence classes of fuzzy connectives-the case of fuzzy integrals
IEEE Transactions on Fuzzy Systems
Computational Biology and Chemistry
Ensemble of diversely trained support vector machines for protein fold recognition
ACIIDS'13 Proceedings of the 5th Asian conference on Intelligent Information and Database Systems - Volume Part I
Protein fold recognition using segmentation-based feature extraction model
ACIIDS'13 Proceedings of the 5th Asian conference on Intelligent Information and Database Systems - Volume Part I
Protein fold recognition with a two-layer method based on SVM-SA, WP-NN and C4.5 TLM-SNC
International Journal of Data Mining and Bioinformatics
Hi-index | 0.00 |
Protein function is related to its chemical reaction to the surrounding environment including other proteins. On the other hand, this depends on the spatial shape and tertiary structure of protein and folding of its constituent components in space. The correct identification of protein domain fold solely using extracted information from protein sequence is a complicated and controversial task in the current computational biology. In this article a combined classifier based on the information content of extracted features from the primary structure of protein has been introduced to face this challenging problem. In the first stage of our proposed two-tier architecture, there are several classifiers each of which is trained with a different sequence based feature vector. Apart from the application of the predicted secondary structure, hydrophobicity, van der Waals volume, polarity, polarizability, and different dimensions of pseudo-amino acid composition vectors in similar studies, the position specific scoring matrix (PSSM) has also been used to improve the correct classification rate (CCR) in this study. Using K-fold cross validation on training dataset related to 27 famous folds of SCOP, the 28 dimensional probability output vector from each evidence theoretic K-NN classifier is used to determine the information content or expertness of corresponding feature for discrimination in each fold class. In the second stage, the outputs of classifiers for test dataset are fused using Sugeno fuzzy integral operator to make better decision for target fold class. The expertness factor of each classifier in each fold class has been used to calculate the fuzzy integral operator weights. Results make it possible to provide deeper interpretation about the effectiveness of each feature for discrimination in target classes for query proteins.