Bridging the Gap Between Neural Network and Kernel Methods: Applications to Drug Discovery

Authors:
Pierre Baldi;Chloe Azencott;S. Joshua Swamidass
Affiliations:
Department of Computer Science, University of California, Irvine and Institute for Genomics and Bioinformatics, University of California, Irvine;Department of Computer Science, University of California, Irvine and Institute for Genomics and Bioinformatics, University of California, Irvine;Department of Computer Science, University of California, Irvine and Institute for Genomics and Bioinformatics, University of California, Irvine
Venue:
Proceedings of the 2011 conference on Neural Nets WIRN10: Proceedings of the 20th Italian Workshop on Neural Nets
Year:
2011

Citing 4
Cited 0

SVMTorch: support vector machines for large-scale regression problems

The Journal of Machine Learning Research
Sparse bayesian learning and the relevance vector machine

The Journal of Machine Learning Research
Kernels for small molecules and the prediction of mutagenicity, toxicity and anti-cancer activity

Bioinformatics
An Introduction to Chemoinformatics

An Introduction to Chemoinformatics

Quantified Score

Hi-index	0.00

Visualization

Abstract

We develop a hybrid machine learning architecture, the Influence Relevance Voter (IRV), where an initial geometry-or kernel-based step is followed by a feature-based step to derive the final prediction. While other implementations of the general idea are possible, we use a k-Nearest-Neighbor approach to implement the first step, and a Neural Network approach to implement the second step for a classification problem. In this version of the IRV, the rank and similarities of the k nearest neighbors of an input are used to compute their individual relevances. Relevances are combined multiplicatively with the class membership values to produce influences. Finally the influences of all the neighbors are aggregated to produce the final probabilistic prediction. IRVs have several advantages: they can be trained fast, they are easily interpretable and modifiable, and they are not prone to overfitting since they rely on extensive weight sharing across neighbors. The IRV approach is applied to the problem of predicting whether a given compound is active or not with respect to a particular biochemical assay in drug discovery and shown to perform well in comparison to other predictors. In addition, we also introduce and demonstrate a new approach, the Concentrated ROC (CROC), for assessing prediction performance in situations ranging from drug discovery to information retrieval, where ROC curves are not adequate, because only a very small subset of the top ranked positives is practically useful. The CROC approach uses a change of coordinates to smoothly magnify the relevant portion of the ROC curve.