Supervised enzyme network inference from the integration of genomic data and chemical information

Authors:
Yoshihiro Yamanishi;Jean-Philippe Vert;Minoru Kanehisa
Affiliations:
Bioinformatics Center, Institute for Chemical Research, Kyoto University Gokasho, Uji, Kyoto 611-0011, Japan;Computational Biology Group, Ecole des Mines de Paris 35 rue Saint-Honoré, 77305 Fontainebleau cedex, France;Bioinformatics Center, Institute for Chemical Research, Kyoto University Gokasho, Uji, Kyoto 611-0011, Japan
Venue:
Bioinformatics
Year:
2005

Citing 0
Cited 7

Kernelizing the output of tree-based methods

ICML '06 Proceedings of the 23rd international conference on Machine learning
Gradient boosting for kernelized output spaces

Proceedings of the 24th international conference on Machine learning
On Pairwise Kernels: An Efficient Alternative and Generalization Analysis

PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Link prediction using probabilistic group models of network structure

Proceedings of the 2010 ACM Symposium on Applied Computing
Fast and scalable algorithms for semi-supervised link prediction on static and dynamic graphs

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part III
lp-Norm Multiple Kernel Learning

The Journal of Machine Learning Research
Link prediction via matrix factorization

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II

Quantified Score

Hi-index	3.84

Visualization

Abstract

Motivation: The metabolic network is an important biological network which relates enzyme proteins and chemical compounds. A large number of metabolic pathways remain unknown nowadays, and many enzymes are missing even in known metabolic pathways. There is, therefore, an incentive to develop methods to reconstruct the unknown parts of the metabolic network and to identify genes coding for missing enzymes. Results: This paper presents new methods to infer enzyme networks from the integration of multiple genomic data and chemical information, in the framework of supervised graph inference. The originality of the methods is the introduction of chemical compatibility as a constraint for refining the network predicted by the network inference engine. The chemical compatibility between two enzymes is obtained automatically from the information encoded by their Enzyme Commission (EC) numbers. The proposed methods are tested and compared on their ability to infer the enzyme network of the yeast Saccharomyces cerevisiae from four datasets for enzymes with assigned EC numbers: gene expression data, protein localization data, phylogenetic profiles and chemical compatibility information. It is shown that the prediction accuracy of the network reconstruction consistently improves owing to the introduction of chemical constraints, the use of a supervised approach and the weighted integration of multiple datasets. Finally, we conduct a comprehensive prediction of a global enzyme network consisting of all enzyme candidate proteins of the yeast to obtain new biological findings. Availability: Softwares are available upon request. Contact: yoshi@kuicr.kyoto-u.ac.jp