The nature of statistical learning theory
The nature of statistical learning theory
Bioinformatics
Working Set Selection Using Second Order Information for Training Support Vector Machines
The Journal of Machine Learning Research
PairProSVM: Protein Subcellular Localization Based on Local Pairwise Profile Alignment and SVM
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
A novel class dependent feature selection method for cancer biomarker discovery
Computers in Biology and Medicine
Hi-index | 0.00 |
Knowledge of protein subcellular locations can help decipher a protein's biological function. This work proposes new features: sequence-based: Hybrid Amino Acid Pair (HAAP) and two structure-based: Secondary Structural Element Composition (SSEC) and solvent accessibility state frequency. A multi-class Support Vector Machine is developed to predict the locations. Testing on two established data sets yields better prediction accuracies than the best available systems. Comparisons with existing methods show comparable results to ESLPred2. When StruLocPred is applied to the entire Arabidopsis proteome, over 77% of proteins with known locations match the prediction results. An implementation of this system is at http://wgzhou.ece. iastate.edu/StruLocPred/.