A prediction model of substrates and non-substrates of breast cancer resistance protein (BCRP) developed by GA-CG-SVM method

  • Authors:
  • Lei Zhong;Chang-Ying Ma;Hui Zhang;Li-Jun Yang;Hua-Lin Wan;Qing-Qing Xie;Lin-Li Li;Sheng-Yong Yang

  • Affiliations:
  • State Key Laboratory of Biotherapy, West China Hospital, West China Medical School, Sichuan University, Chengdu, Sichuan 610041, PR China and West China School of pharmacy, Sichuan University, Sic ...;State Key Laboratory of Biotherapy, West China Hospital, West China Medical School, Sichuan University, Chengdu, Sichuan 610041, PR China;State Key Laboratory of Biotherapy, West China Hospital, West China Medical School, Sichuan University, Chengdu, Sichuan 610041, PR China;State Key Laboratory of Biotherapy, West China Hospital, West China Medical School, Sichuan University, Chengdu, Sichuan 610041, PR China;State Key Laboratory of Biotherapy, West China Hospital, West China Medical School, Sichuan University, Chengdu, Sichuan 610041, PR China;State Key Laboratory of Biotherapy, West China Hospital, West China Medical School, Sichuan University, Chengdu, Sichuan 610041, PR China;West China School of pharmacy, Sichuan University, Sichuan 610041, PR China;State Key Laboratory of Biotherapy, West China Hospital, West China Medical School, Sichuan University, Chengdu, Sichuan 610041, PR China

  • Venue:
  • Computers in Biology and Medicine
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Breast cancer resistance protein (BCRP) is one of the key multi-drug resistance proteins, which significantly influences the therapeutic effects of many drugs, particularly anti-cancer drugs. Thus, distinguishing between substrates and non-substrates of BCRP is important not only for clinical use but also for drug discovery and development. In this study, a prediction model of the substrates and non-substrates of BCRP was developed using a modified support vector machine (SVM) method, namely GA-CG-SVM. The overall prediction accuracy of the established GA-CG-SVM model is 91.3% for the training set and 85.0% for an independent validation set. For comparison, two other machine learning methods, namely, C4.5 DT and k-NN, were also adopted to build prediction models. The results show that the GA-CG-SVM model is significantly superior to C4.5 DT and k-NN models in terms of the prediction accuracy. To sum up, the prediction model of BCRP substrates and non-substrates generated by the GA-CG-SVM method is sufficiently good and could be used as a screening tool for identifying the substrates and non-substrates of BCRP.