Two-view feature generation model for semi-supervised learning

Authors:
Rie Kubota Ando; Tong Zhang
Affiliations:
IBM T. J. Watson Research Center, Hawthorne, New York; Yahoo Inc., New York, New York
Venue:
Proceedings of the 24th international conference on Machine learning
Year:
2007

Citing 3

Combining labeled and unlabeled data with co-training
COLT '98 Proceedings of the eleventh annual conference on Computational learning theory

Text Classification from Labeled and Unlabeled Documents using EM
Machine Learning - Special issue on information retrieval

A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data
The Journal of Machine Learning Research

Cited 14

Multi-view clustering via canonical correlation analysis
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning

Surrogate learning: from feature independence to semi-supervised classification
SemiSupLearn '09 Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing

A Multi-view Approach for Relation Extraction
WISM '09 Proceedings of the International Conference on Web Information Systems and Mining

Exploiting tag and word correlations for improved webpage clustering
SMUC '10 Proceedings of the 2nd international workshop on Search and mining user-generated contents

Semi-supervised Bayesian ARTMAP
Applied Intelligence

A semi-supervised approach for reject inference in credit scoring using SVMs
ICDM'10 Proceedings of the 10th industrial conference on Advances in data mining: applications and theoretical aspects

Regularized tensor factorization for multi-modality medical image classification
MICCAI'11 Proceedings of the 14th international conference on Medical image computing and computer-assisted intervention - Volume Part III

Leveraging Social Bookmarks from Partially Tagged Corpus for Improved Web Page Clustering
ACM Transactions on Intelligent Systems and Technology (TIST)

CoNet: feature generation for multi-view semi-supervised learning with partially observed views
Proceedings of the 21st ACM international conference on Information and knowledge management

Metafraud: a meta-learning framework for detecting financial fraud
MIS Quarterly

Multi-source learning with block-wise missing data for Alzheimer's disease prediction
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining

Neighborhood Correlation Analysis for Semi-paired Two-View Data
Neural Processing Letters

Sparse Discriminative Information Preservation for Chinese character font categorization
Neurocomputing

Improving multi-view semi-supervised learning with agreement-based sampling
Intelligent Data Analysis - Combined Learning Methods and Mining Complex Data

Abstract

We consider a setting for discriminative semi-supervised learning where unlabeled data are used with a generative model to learn effective feature representations for discriminative training. Within this framework, we revisit the two-view feature generation model of co-training and prove that the optimum predictor can be expressed as a linear combination of a few features constructed from unlabeled data. From this analysis, we derive methods that employ two views but are very different from co-training. Experiments show that our approach is more robust than co-training and EM, under various data generation conditions.
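
The idea in the abstract, that a few features built from unlabeled two-view data can support a linear predictor, can be illustrated with a small sketch. This is not the authors' algorithm: it assumes synthetic Gaussian data whose two views are independent given a binary label, and it substitutes a plain ridge regression from view 1 to view 2 followed by an SVD as the feature construction step. All function names (`make_views`, `fit_view1_projection`, `fit_linear`, `predict`) and all parameter values are hypothetical choices made for illustration.

```python
import numpy as np

def make_views(y, rng, d=20, informative=5, shift=1.0):
    """Draw two views that are independent given the label y (0 or 1).

    Assumption for this sketch: Gaussian noise around a label-dependent mean.
    """
    sign = 2.0 * y[:, None] - 1.0          # map {0, 1} -> {-1, +1}
    mean = np.zeros(d)
    mean[:informative] = shift
    z1 = sign * mean + rng.normal(size=(y.size, d))
    z2 = sign * mean + rng.normal(size=(y.size, d))
    return z1, z2

def fit_view1_projection(z1, z2, k=1, ridge=1.0):
    """Learn a k-dimensional map of view 1 from unlabeled (z1, z2) pairs."""
    mu1 = z1.mean(axis=0)
    a = z1 - mu1
    b = z2 - z2.mean(axis=0)
    # Ridge regression that predicts view 2 from view 1 (no labels involved).
    w = np.linalg.solve(a.T @ a + ridge * np.eye(a.shape[1]), a.T @ b)
    # The leading left singular vectors span the view-1 directions
    # that carry information about view 2.
    u, _, _ = np.linalg.svd(w, full_matrices=False)
    return mu1, u[:, :k]

def fit_linear(features, y):
    """Least-squares linear classifier with a bias term, targets in {-1, +1}."""
    x = np.column_stack([features, np.ones(len(features))])
    coef, *_ = np.linalg.lstsq(x, 2.0 * y - 1.0, rcond=None)
    return coef

def predict(coef, features):
    """Return 0/1 predictions from the sign of the linear score."""
    x = np.column_stack([features, np.ones(len(features))])
    return (x @ coef > 0).astype(int)

rng = np.random.default_rng(0)

# Unlabeled pool: labels are drawn only to simulate the data, never used for fitting.
y_unlab = rng.integers(0, 2, size=2000)
z1_u, z2_u = make_views(y_unlab, rng)
mu1, proj = fit_view1_projection(z1_u, z2_u, k=1)

# Ten labeled examples, classified using the single generated feature.
y_lab = np.array([0, 1] * 5)
z1_l, _ = make_views(y_lab, rng)
coef = fit_linear((z1_l - mu1) @ proj, y_lab)

# Held-out evaluation on view 1 only.
y_test = rng.integers(0, 2, size=1000)
z1_t, _ = make_views(y_test, rng)
accuracy = np.mean(predict(coef, (z1_t - mu1) @ proj) == y_test)
print(f"test accuracy with 1 generated feature: {accuracy:.3f}")
```

The reasoning behind this stand-in is that, when the views are independent given the label, the best prediction of view 2 from view 1 depends on view 1 only through the label posterior, so the regression matrix is approximately low rank and its leading singular directions give a compact representation. The paper's own derivation and methods may differ from this simplification.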