Graph construction and b-matching for semi-supervised learning

Authors:
Tony Jebara;Jun Wang;Shih-Fu Chang
Affiliations:
Columbia University, New York, NY;Columbia University, New York, NY;Columbia University, New York, NY
Venue:
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Year:
2009

Citing 7
Cited 29

Learning from Labeled and Unlabeled Data using Graph Mincuts

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study

International Journal of Computer Vision
Label Propagation through Linear Neighborhoods

IEEE Transactions on Knowledge and Data Engineering
Graph transduction via alternating minimization

Proceedings of the 25th international conference on Machine learning
Towards a theoretical foundation for Laplacian-based manifold methods

Journal of Computer and System Sciences
Optimality of belief propagation for random assignment problem

SODA '09 Proceedings of the twentieth Annual ACM-SIAM Symposium on Discrete Algorithms
B-Matching for spectral clustering

ECML'06 Proceedings of the 17th European conference on Machine Learning

Brain state decoding for rapid image retrieval

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Graph-optimized locality preserving projections

Pattern Recognition
Semi-supervised sequence classification using abstraction augmented Markov models

Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology
A graph-based semi-supervised learning for question semantic labeling

SS '10 Proceedings of the NAACL HLT 2010 Workshop on Semantic Search
Sample-dependent graph construction with application to dimensionality reduction

Neurocomputing
Joint training for open-domain extraction on the web: exploiting overlap when supervision is limited

Proceedings of the fourth ACM international conference on Web search and data mining
Hubs in Space: Popular Nearest Neighbors in High-Dimensional Data

The Journal of Machine Learning Research
Social content matching in MapReduce

Proceedings of the VLDB Endowment
Using the mutual k-nearest neighbor graphs for semi-supervised classification of natural language data

CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Linear programming in the semi-streaming model with application to the maximum matching problem

ICALP'11 Proceedings of the 38th international conference on Automata, languages and programming - Volume Part II
Pick your neighborhood: improving labels and neighborhood structure for label propagation

DAGM'11 Proceedings of the 33rd international conference on Pattern recognition
Semi-supervised classification based on random subspace dimensionality reduction

Pattern Recognition
Graph optimization for dimensionality reduction with sparsity constraints

Pattern Recognition
Supervised neighborhood graph construction for semi-supervised classification

Pattern Recognition
Graph transduction as a noncooperative game

Neural Computation
Dense Neighborhoods on Affinity Graph

International Journal of Computer Vision
Constrained large Margin Local Projection algorithms and extensions for multimodal dimensionality reduction

Pattern Recognition
Learning neighborhoods for metric learning

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I
Linear programming in the semi-streaming model with application to the maximum matching problem

Information and Computation
Graph-based semi-supervised learning with multi-modality propagation for large-scale image datasets

Journal of Visual Communication and Image Representation
Semi-supervised learning using greedy max-cut

The Journal of Machine Learning Research
Structured exploration of who, what, when, and where in heterogeneous multimedia news sources

Proceedings of the 21st ACM international conference on Multimedia
A distributed algorithm for large-scale generalized matching

Proceedings of the VLDB Endowment
Low-rank coding with b-matching constraint for semi-supervised classification

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Semi-supervised learning with manifold fitted graphs

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Dimensionality reduction with adaptive graph

Frontiers of Computer Science: Selected Publications from Chinese Universities
Perceptual relativity-based semi-supervised dimensionality reduction algorithm

Applied Soft Computing
Single network relational transductive learning

Journal of Artificial Intelligence Research
Editor's Choice Article: Sparse feature selection based on graph Laplacian for web image annotation

Image and Vision Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Graph based semi-supervised learning (SSL) methods play an increasingly important role in practical machine learning systems. A crucial step in graph based SSL methods is the conversion of data into a weighted graph. However, most of the SSL literature focuses on developing label inference algorithms without extensively studying the graph building method and its effect on performance. This article provides an empirical study of leading semi-supervised methods under a wide range of graph construction algorithms. These SSL inference algorithms include the Local and Global Consistency (LGC) method, the Gaussian Random Field (GRF) method, the Graph Transduction via Alternating Minimization (GTAM) method as well as other techniques. Several approaches for graph construction, sparsification and weighting are explored including the popular k-nearest neighbors method (kNN) and the b-matching method. As opposed to the greedily constructed kNN graph, the b-matched graph ensures each node in the graph has the same number of edges and produces a balanced or regular graph. Experimental results on both artificial data and real benchmark datasets indicate that b-matching produces more robust graphs and therefore provides significantly better prediction accuracy without any significant change in computation time.