Meta-analysis of Genomic and Proteomic Features to Predict Synthetic Lethality of Yeast and Human Cancer

  • Authors:
  • Min Wu;Xuejuan Li;Fan Zhang;Xiaoli Li;Chee-Keong Kwoh;Jie Zheng

  • Affiliations:
  • School of Computer Engineering, Nanyang Technological University, Singapore 639798 and Data Analytic Department, Institute for Infocomm Research, A*STAR, Singapore 138632;School of Computer Engineering, Nanyang Technological University, Singapore 639798;School of Computer Engineering, Nanyang Technological University, Singapore 639798;Data Analytic Department, Institute for Infocomm Research, A*STAR, Singapore 138632;School of Computer Engineering, Nanyang Technological University, Singapore 639798;School of Computer Engineering, Nanyang Technological University, Singapore 639798 and Genome Institute of Singapore, A*STAR, Singapore 138672

  • Venue:
  • Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

A major goal in cancer medicine is to find selective drugs with reduced side-effect. A pair of genes is called synthetic lethality (SL) if mutations of both genes will kill a cell while mutation of either gene alone will not. Hence, a gene in SL interactions with a cancer-specific mutated gene will be a promising drug target with anti-cancer selectivity. Wet-lab screening approach is still so costly that even for yeast only a small fraction of gene pairs has been covered. Computational methods are therefore important for large-scale discovery of SL interactions. Most existing approaches focus on individual features or machine learning methods, which are prone to noise or overfitting. In this paper, we propose an approach of meta-analysis that integrates 17 genomic and proteomic features and the outputs of 10 classification methods. It thus combines the strengths of existing methods. It also adjusts relative contributions of multiple methods with weights learned from the training data. Running on a dataset of the yeast strain of S. cerevisiae, our method achieves AUC (area under ROC curve) of 87.2% the highest among all competitors. Moreover, through orthologous mapping from yeast to human genes, we predicted a list of SL pairs in human that contain top mutated genes in lung and breast cancers recently reported by The Cancer Genome Atlas (TCGA). Our method and predictions would shed light on mechanisms of SL and lead to discovery of novel anti-cancer drugs.