Computational Biology and Chemistry
Comparing SVM sequence kernels: a protein subcellular localization theme
WISB '06 Proceedings of the 2006 workshop on Intelligent systems for bioinformatics - Volume 73
A Unified String Kernel for Biology Sequence
ICIC '08 Proceedings of the 4th international conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications - with Aspects of Artificial Intelligence
WABI '08 Proceedings of the 8th international workshop on Algorithms in Bioinformatics
PairProSVM: Protein Subcellular Localization Based on Local Pairwise Profile Alignment and SVM
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Peptide programs: applying fragment programs to protein classification
Proceedings of the 2nd international workshop on Data and text mining in bioinformatics
A Class of Evolution-Based Kernels for Protein Homology Analysis: A Generalization of the PAM Model
ISBRA '09 Proceedings of the 5th International Symposium on Bioinformatics Research and Applications
CIBCB'09 Proceedings of the 6th Annual IEEE conference on Computational Intelligence in Bioinformatics and Computational Biology
A Study of Hierarchical and Flat Classification of Proteins
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Computational Biology and Chemistry
Conotoxin protein classification using pairwise comparison and amino acid composition: toxin-aam
Proceedings of the 13th annual conference on Genetic and evolutionary computation
Protein remote homology detection based on auto-cross covariance transformation
Computers in Biology and Medicine
Computers in Biology and Medicine
Comparison of one-class SVM and two-class SVM for fold recognition
ICONIP'06 Proceedings of the 13th international conference on Neural Information Processing - Volume Part II
A solution to the curse of dimensionality problem in pairwise scoring techniques
ICONIP'06 Proceedings of the 13 international conference on Neural Information Processing - Volume Part I
A hidden Markov model variant for sequence classification
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
A family of feed-forward models for protein sequence classification
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Remote homology detection incorporating the context of physicochemical properties
Computers in Biology and Medicine
Hi-index | 3.84 |
Motivation: Protein remote homology detection is a central problem in computational biology. Supervised learning algorithms based on support vector machines are currently one of the most effective methods for remote homology detection. The performance of these methods depends on how the protein sequences are modeled and on the method used to compute the kernel function between them. Results: We introduce two classes of kernel functions that are constructed by combining sequence profiles with new and existing approaches for determining the similarity between pairs of protein sequences. These kernels are constructed directly from these explicit protein similarity measures and employ effective profile-to-profile scoring schemes for measuring the similarity between pairs of proteins. Experiments with remote homology detection and fold recognition problems show that these kernels are capable of producing results that are substantially better than those produced by all of the existing state-of-the-art SVM-based methods. In addition, the experiments show that these kernels, even when used in the absence of profiles, produce results that are better than those produced by existing non-profile-based schemes. Availability: The programs for computing the various kernel functions are available on request from the authors. Contact: karypis@cs.umn.edu