Algorithms for detecting significantly mutated pathways in cancer

Authors:
Fabio Vandin;Eli Upfal;Benjamin J. Raphael
Affiliations:
Dipartimento di Ingegneria dell'Informazione, Università di Padova, Padova, Italy;Department of Computer Science, Brown University, Providence, RI;Department of Computer Science, Brown University, Providence, RI
Venue:
RECOMB'10 Proceedings of the 14th Annual international conference on Research in Computational Molecular Biology
Year:
2010

Citing 11
Cited 1

Approximation algorithms for NP-hard problems

Approximation algorithms for NP-hard problems
Diffusion Kernels on Graphs and Other Discrete Input Spaces

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Learning kernels from biological networks by maximizing entropy

Bioinformatics
Whole-proteome prediction of protein function via graph-theoretic analysis of interaction maps

Bioinformatics
Global topological features of cancer proteins in the human interactome

Bioinformatics
CGI: a new approach for prioritizing genes by combining gene expression and protein--protein interaction data

Bioinformatics
Gene expression network analysis and applications to immunology

Bioinformatics
An efficient rigorous approach for identifying statistically significant frequent itemsets

Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Evaluating Between-Pathway Models with Expression Data

RECOMB 2'09 Proceedings of the 13th Annual International Conference on Research in Computational Molecular Biology
Detecting disease-specific dysregulated pathways via analysis of clinical expression profiles

RECOMB'08 Proceedings of the 12th annual international conference on Research in computational molecular biology
Connected set cover problem and its applications

AAIM'06 Proceedings of the Second international conference on Algorithmic Aspects in Information and Management

Network-based classification of recurrent endometrial cancers using high-throughput DNA methylation data

Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recent genome sequencing studies have shown that the somatic mutations that drive cancer development are distributed across a large number of genes This mutational heterogeneity complicates efforts to distinguish functional mutations from sporadic, passenger mutations Since cancer mutations are hypothesized to target a relatively small number of cellular signaling and regulatory pathways, a common approach is to assess whether known pathways are enriched for mutated genes However, restricting attention to known pathways will not reveal novel cancer genes or pathways An alterative strategy is to examine mutated genes in the context of genome-scale interaction networks that include both well characterized pathways and additional gene interactions measured through various approaches We introduce a computational framework for de novo identification of subnetworks in a large gene interaction network that are mutated in a significant number of patients This framework includes two major features First, we introduce a diffusion process on the interaction network to define a local neighborhood of “influence” for each mutated gene in the network Second, we derive a two-stage multiple hypothesis test to bound the false discovery rate (FDR) associated with the identified subnetworks We test these algorithms on a large human protein-protein interaction network using mutation data from two recent studies: glioblastoma samples from The Cancer Genome Atlas and lung adenocarcinoma samples from the Tumor Sequencing Project We successfully recover pathways that are known to be important in these cancers, such as the p53 pathway We also identify additional pathways, such as the Notch signaling pathway, that have been implicated in other cancers but not previously reported as mutated in these samples Our approach is the first, to our knowledge, to demonstrate a computationally efficient strategy for de novo identification of statistically significant mutated subnetworks We anticipate that our approach will find increasing use as cancer genome studies increase in size and scope.