Multi-target protein-chemical interaction prediction using task-regularized and boosted multi-task learning

Authors:
Jintao Zhang;Gerald H. Lushington;Jun Huan
Affiliations:
University of Kansas, Lawrence, KS;University of Kansas, Lawrence, KS;University of Kansas, Lawrence, KS
Venue:
Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine
Year:
2012

Citing 13
Cited 0

Multitask Learning

Machine Learning - Special issue on inductive transfer
A decision-theoretic generalization of on-line learning and an application to boosting

EuroCOLT '95 Proceedings of the Second European Conference on Computational Learning Theory
A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data

The Journal of Machine Learning Research
An Interior-Point Method for Large-Scale l1-Regularized Logistic Regression

The Journal of Machine Learning Research
Multi-task learning for HIV therapy screening

Proceedings of the 25th international conference on Machine learning
Protein-ligand interaction prediction

Bioinformatics
Linear dimensionality reduction for multi-label classification

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Sparse multitask regression for identifying common mechanism of response to therapeutic targets

Bioinformatics
Multi-population GWA mapping via multi-task regularized regression

Bioinformatics
Dealing with sparse data in predicting outcomes of HIV combination therapies

Bioinformatics
Multi-label boosting for image annotation by structural grouping sparsity

Proceedings of the international conference on Multimedia
LIBSVM: A library for support vector machines

ACM Transactions on Intelligent Systems and Technology (TIST)
Multitask Learning for Protein Subcellular Location Prediction

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Interactions between proteins and small-molecule chemicals modulate many protein functions and biological processes, and identifying these interactions is a crucial step in modern drug discovery. Supervised learning methods for predicting protein-chemical interactions (PCI) have been widely studied, but their performance is largely limited by insufficient availability of binding data for many proteins. In addition, many complex diseases such as Alzheimer's disease and cancers are found associated with multiple target proteins. Chemicals that selectively modulate only one of these target proteins are unable to effectively conquer these diseases. In this paper we propose two multi-task learning (MTL) algorithms for predicting active compounds of multiple proteins related to the same diseases, some of which may have very few binding examples. In the first method we optimize the likelihood of compound features with a Gaussian prior, while the second method boosts compound features using a number of independent boosting classifiers. Experimental studies demonstrate significant performance improvement of our MTL methods over baseline methods. Our MTL methods are also able to accurately identify promiscuous compounds that interact with multiple related proteins.