Prediction of protein solvent accessibility using fuzzy k-nearest neighbor method

Authors:
Jaehyun Sim;Seung-Yeon Kim;Julian Lee
Affiliations:
Department of Bioinformatics and Life Science, Bioinformatics and Molecular Design Technology Innovation Center, and Computer Aided Molecular Design Research Center, Soongsil University Seoul 156 ...;School of Computational Sciences, Korea Institute for Advanced Study Seoul 130-722, South Korea;Department of Bioinformatics and Life Science, Bioinformatics and Molecular Design Technology Innovation Center, and Computer Aided Molecular Design Research Center, Soongsil University Seoul 156 ...
Venue:
Bioinformatics
Year:
2005

Citing 0
Cited 6

Fuzzy K-Nearest Neighbor Classifier to Predict Protein Solvent Accessibility

Neural Information Processing
Modeling and implementing an agent-based system for prediction of protein relative solvent accessibility

Expert Systems with Applications: An International Journal
Fuzzy k-nearest neighbor method for protein secondary structure prediction and its parallel implementation

ICIC'06 Proceedings of the 2006 international conference on Computational Intelligence and Bioinformatics - Volume Part III
An efficient diagnosis system for detection of Parkinson's disease using fuzzy k-nearest neighbor approach

Expert Systems with Applications: An International Journal
Design of an Enhanced Fuzzy k-nearest Neighbor Classifier Based Computer Aided Diagnostic System for Thyroid Disease

Journal of Medical Systems
A novel class dependent feature selection method for cancer biomarker discovery

Computers in Biology and Medicine

Quantified Score

Hi-index	3.84

Visualization

Abstract

Motivation: The solvent accessibility of amino acid residues plays an important role in tertiary structure prediction, especially in the absence of significant sequence similarity of a query protein to those with known structures. The prediction of solvent accessibility is less accurate than secondary structure prediction in spite of improvements in recent researches. The k-nearest neighbor method, a simple but powerful classification algorithm, has never been applied to the prediction of solvent accessibility, although it has been used frequently for the classification of biological and medical data. Results: We applied the fuzzy k-nearest neighbor method to the solvent accessibility prediction, using PSI-BLAST profiles as feature vectors, and achieved high prediction accuracies. With leave-one-out cross-validation on the ASTRAL SCOP reference dataset constructed by sequence clustering, our method achieved 64.1% accuracy for a 3-state (buried/intermediate/exposed) prediction (thresholds of 9% for buried/intermediate and 36% for intermediate/exposed) and 86.7, 82.0, 79.0 and 78.5% accuracies for 2-state (buried/exposed) predictions (thresholds of each 0, 5, 16 and 25% for buried/exposed), respectively. Our method also showed slightly better accuracies than other methods by about 2--5% on the RS126 dataset and a benchmarking dataset with 229 proteins. Availability: Program and datasets are available at http://biocom1.ssu.ac.kr/FKNNacc/ Contact: jul@ssu.ac.kr