Convex and scalable weakly labeled SVMs

Authors:
Yu-Feng Li;Ivor W. Tsang;James T. Kwok;Zhi-Hua Zhou
Affiliations:
National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China;School of Computer Engineering, Nanyang Technological University, Singapore;Department of Computer Science and Engineering, Hong Kong University of Science & Technology, Hong Kong;National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China
Venue:
The Journal of Machine Learning Research
Year:
2013

Citing 46
Cited 0

Algorithms for clustering data

Algorithms for clustering data
Solving the multiple instance problem with axis-parallel rectangles

Artificial Intelligence
Fast training of support vector machines using sequential minimal optimization

Advances in kernel methods
DC programming: overview

Journal of Optimization Theory and Applications
Normalized Cuts and Image Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Learning From Noisy Examples

Machine Learning
Multi-Instance Kernels

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Multiple-Instance Learning for Natural Scene Classification

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Transductive Inference for Text Classification using Support Vector Machines

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
On the algorithmic implementation of multiclass kernel-based vector machines

The Journal of Machine Learning Research
Convex Optimization

Convex Optimization
Learning the Kernel Matrix with Semidefinite Programming

The Journal of Machine Learning Research
Multiple kernel learning, conic duality, and the SMO algorithm

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Core Vector Machines: Fast SVM Training on Very Large Data Sets

The Journal of Machine Learning Research
Large Margin Methods for Structured and Interdependent Output Variables

The Journal of Machine Learning Research
A continuation method for semi-supervised SVMs

ICML '06 Proceedings of the 23rd international conference on Machine learning
A regularization framework for multiple-instance learning

ICML '06 Proceedings of the 23rd international conference on Machine learning
Deterministic annealing for semi-supervised kernel machines

ICML '06 Proceedings of the 23rd international conference on Machine learning
Large scale semi-supervised linear SVMs

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Training linear SVMs in linear time

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Working Set Selection Using Second Order Information for Training Support Vector Machines

The Journal of Machine Learning Research
Training a Support Vector Machine in the Primal

Neural Computation
Statistical Comparisons of Classifiers over Multiple Data Sets

The Journal of Machine Learning Research
Large Scale Multiple Kernel Learning

The Journal of Machine Learning Research
Large Scale Transductive SVMs

The Journal of Machine Learning Research
Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples

The Journal of Machine Learning Research
Pegasos: Primal Estimated sub-GrAdient SOlver for SVM

Proceedings of the 24th international conference on Machine learning
Maximum margin clustering made practical

Proceedings of the 24th international conference on Machine learning
On the relation between multi-instance learning and semi-supervised learning

Proceedings of the 24th international conference on Machine learning
A dual coordinate descent method for large-scale linear SVM

Proceedings of the 25th international conference on Machine learning
Adaptive p-posterior mixture-model kernels for multiple instance learning

Proceedings of the 25th international conference on Machine learning
Optimization Techniques for Semi-Supervised Support Vector Machines

The Journal of Machine Learning Research
Get another label? improving data quality and data mining using multiple, noisy labelers

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Semi-supervised learning using label mean

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Prototype vector machine for large scale semi-supervised learning

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
A Convex Method for Locating Regions of Interest with Multi-instance Learning

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Unsupervised and semi-supervised multi-class support vector machines

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
A Minimax Theorem with Applications to Machine Learning, Signal Processing, and Finance

SIAM Journal on Optimization
Maximum margin clustering made practical

IEEE Transactions on Neural Networks
Max-margin Multiple-Instance Learning via Semidefinite Programming

ACML '09 Proceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning
Semi-Supervised Learning

Semi-Supervised Learning
Semi-supervised learning by disagreement

Knowledge and Information Systems
Multi-instance multi-label learning

Artificial Intelligence
Locating regions of interest in CBIR with multi-instance learning techniques

AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
Multi-label learning with incomplete class assignments

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Multi-instance multi-label learning with weak label

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we study the problem of learning from weakly labeled data, where labels of the training examples are incomplete. This includes, for example, (i) semi-supervised learning where labels are partially known; (ii) multi-instance learning where labels are implicitly known; and (iii) clustering where labels are completely unknown. Unlike supervised learning, learning with weak labels involves a difficult Mixed-Integer Programming (MIP) problem. Therefore, it can suffer from poor scalability and may also get stuck in local minimum. In this paper, we focus on SVMs and propose the WELLSVM via a novel label generation strategy. This leads to a convex relaxation of the original MIP, which is at least as tight as existing convex Semi-Definite Programming (SDP) relaxations. Moreover, the WELLSVM can be solved via a sequence of SVM subproblems that are much more scalable than previous convex SDP relaxations. Experiments on three weakly labeled learning tasks, namely, (i) semi-supervised learning; (ii) multi-instance learning for locating regions of interest in content-based information retrieval; and (iii) clustering, clearly demonstrate improved performance, and WELLSVM is also readily applicable on large data sets.