The definition of object (e.g., data point) similarity is critical to the performance of many machine learning algorithms, both in terms of accuracy and computational efficiency. However, the similarity function is often unknown or chosen by hand. This paper introduces a formulation that, given relative similarity comparisons among triples of points of the form "object i is more like object j than object k," constructs a kernel function that preserves the given relationships. Our approach learns a kernel as a combination of functions drawn from a set of base functions (which may themselves be kernels). The formulation defines an optimization problem that can be solved with linear programming rather than the semidefinite program usually required for kernel learning. We show how to construct a convex problem from the given set of similarity comparisons and then arrive at a linear programming formulation by restricting attention to a subset of the positive definite matrices. We extend this formulation to address representation/evaluation efficiency by formulating a novel form of kernel-based feature selection that is not much more expensive to solve. Using publicly available data, we experimentally demonstrate that the proposed formulation performs very well in practice compared with a baseline method and a related state-of-the-art approach, while also being much more efficient computationally.
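As a concrete illustration of this style of linear-programming kernel learning, the sketch below fits non-negative combination weights for a set of base Gram matrices from relative comparisons. It is a minimal sketch, not the paper's exact formulation: the unit margin, slack penalty C, L1 objective, and the names (learn_kernel_weights, base_kernels, triplets) are illustrative assumptions.

```python
# Minimal sketch: learn non-negative weights mu for base kernels K_m so that
# the combined kernel K = sum_m mu_m * K_m tends to satisfy each relative
# comparison "i is more like j than k" via d_K(i,k) - d_K(i,j) >= 1 - slack.
# The margin, slack penalty C, and L1 objective are illustrative assumptions.
import numpy as np
from scipy.optimize import linprog

def learn_kernel_weights(base_kernels, triplets, C=1.0):
    """base_kernels: list of (n x n) PSD Gram matrices K_m.
    triplets: list of (i, j, k) index triples meaning "i is more like j than k".
    Returns non-negative combination weights mu (length = number of base kernels)."""
    M, T = len(base_kernels), len(triplets)

    def sq_dist(K, a, b):
        # Squared distance induced by kernel K.
        return K[a, a] + K[b, b] - 2.0 * K[a, b]

    # One linear inequality per comparison, written for linprog as A_ub x <= b_ub:
    #   -sum_m mu_m * (d_m(i,k) - d_m(i,j)) - xi_t <= -1
    A_ub = np.zeros((T, M + T))
    for t, (i, j, k) in enumerate(triplets):
        for m, K in enumerate(base_kernels):
            A_ub[t, m] = -(sq_dist(K, i, k) - sq_dist(K, i, j))
        A_ub[t, M + t] = -1.0  # slack variable xi_t for this comparison
    b_ub = -np.ones(T)

    # Objective: L1 norm of the weights (encourages sparse combinations)
    # plus C times the total slack over all comparisons.
    c = np.concatenate([np.ones(M), C * np.ones(T)])
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=(0, None), method="highs")
    return res.x[:M]
```

Because the combined squared distance is linear in the weights, the whole problem stays a linear program; the L1 objective drives many weights to zero, which is one way to read the feature-selection extension mentioned in the abstract.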