Detecting disease genes based on semi-supervised learning and protein-protein interaction networks
Artificial Intelligence in Medicine
Hi-index | 0.00 |
The discovery of human genes that contribute to the appearance and growth of hereditary diseases is an important problem in bioinformatics research. Many techniques have been devised for classifying genes based on information from a variety of sources such as sequence and functional annotation. Recently, the use of topological information in protein-protein interaction networks has shown promise in disease- genes. In this paper, we develop a disease-gene classification system that integrates topological features of protein interaction networks with sequence- derived and other features, utilizing support vector machines for disease-gene classification. We identified several novel topological, sequence, and function-based features that can help to characterize hereditary disease-genes. We also found that using a more complex classifier can contribute to disease-gene classification. We validated our methods by selecting previously unclassified genes that were predicted with high probabilities as disease-genes, and searching for evidence in recent literature of their involvements in disease.