Subcellular Localization Prediction with New Protein Encoding Schemes
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Neuro-fuzzy modeling of nucleation kinetics of protein
FS'06 Proceedings of the 7th WSEAS International Conference on Fuzzy Systems
ADMA '07 Proceedings of the 3rd international conference on Advanced Data Mining and Applications
PairProSVM: Protein Subcellular Localization Based on Local Pairwise Profile Alignment and SVM
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Computational Biology and Chemistry
Guest editorial: Computational intelligence and machine learning in bioinformatics
Artificial Intelligence in Medicine
Hybrid SVM-ANFIS for protein subcellular location prediction
International Journal of Computational Intelligence in Bioinformatics and Systems Biology
Computational Biology and Chemistry
Using decision templates to predict subcellular localization of protein
PRIB'07 Proceedings of the 2nd IAPR international conference on Pattern recognition in bioinformatics
Three challenges in data mining
Frontiers of Computer Science in China
PAKDD'05 Proceedings of the 9th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
The binary Multi-SVM voting system for protein subcellular localization prediction
ICCSA'05 Proceedings of the 2005 international conference on Computational Science and Its Applications - Volume Part III
Prediction of protein subcellular localizations using moment descriptors and support vector machine
PRIB'06 Proceedings of the 2006 international conference on Pattern Recognition in Bioinformatics
Computers in Biology and Medicine
International Journal of Data Mining and Bioinformatics
Expert Systems with Applications: An International Journal
Computers in Biology and Medicine
Multilabel Learning via Random Label Selection for Protein Subcellular Multilocations Prediction
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Fuzzy nearest neighbor algorithms: Taxonomy, experimental analysis and prospects
Information Sciences: an International Journal
Hi-index | 3.84 |
Motivation: Protein localization data are a valuable information resource helpful in elucidating protein functions. It is highly desirable to predict a protein's subcellular locations automatically from its sequence. Results: In this paper, fuzzy k-nearest neighbors (k-NN) algorithm has been introduced to predict proteins' subcellular locations from their dipeptide composition. The prediction is performed with a new data set derived from version 41.0 SWISS-PROT databank, the overall predictive accuracy about 80% has been achieved in a jackknife test. The result demonstrates the applicability of this relative simple method and possible improvement of prediction accuracy for the protein subcellular locations. We also applied this method to annotate six entirely sequenced proteomes, namely Saccharomyces cerevisiae, Caenorhabditis elegans, Drosophila melanogaster, Oryza sativa, Arabidopsis thaliana and a subset of all human proteins. Availability: Supplementary information and subcellular location annotations for eukaryotes are available at http://166.111.30.65/hying/fuzzy_loc.htm