Multi-view regression via canonical correlation analysis

Authors:
Sham M. Kakade;Dean P. Foster
Affiliations:
Toyota Technological Institute at Chicago, Chicago, IL;University of Pennsylvania, Philadelphia, PA
Venue:
COLT'07 Proceedings of the 20th annual conference on Learning theory
Year:
2007

Citing 7
Cited 15

Combining labeled and unlabeled data with co-training

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Unsupervised word sense disambiguation rivaling supervised methods

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Understanding the Yarowsky Algorithm

Computational Linguistics
Learning Bounds for Kernel Regression Using Effective Data Dimensionality

Neural Computation
Canonical Correlation Analysis: An Overview with Application to Learning Methods

Neural Computation
Efficient co-regularised least squares regression

ICML '06 Proceedings of the 23rd international conference on Machine learning
A PAC-Style model for learning from labeled and unlabeled data

COLT'05 Proceedings of the 18th annual conference on Learning Theory

Multi-view clustering via canonical correlation analysis

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Posterior Regularization for Structured Latent Variable Models

The Journal of Machine Learning Research
Exploiting tag and word correlations for improved webpage clustering

SMUC '10 Proceedings of the 2nd international workshop on Search and mining user-generated contents
A novel ensemble construction method for multi-view data using random cross-view correlation between within-class examples

Pattern Recognition
Linear Algorithms for Online Multitask Classification

The Journal of Machine Learning Research
Multitask Bregman clustering

Neurocomputing
Multiview semi-supervised learning for ranking multilingual documents

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III
Regularized tensor factorization for multi-modality medical image classification

MICCAI'11 Proceedings of the 14th international conference on Medical image computing and computer-assisted intervention - Volume Part III
A multi-view regularization method for semi-supervised learning

ISNN'10 Proceedings of the 7th international conference on Advances in Neural Networks - Volume Part I
Multi-view learning via probabilistic latent semantic analysis

Information Sciences: an International Journal
Leveraging Social Bookmarks from Partially Tagged Corpus for Improved Web Page Clustering

ACM Transactions on Intelligent Systems and Technology (TIST)
On multiview-based meta-learning for automatic quality assessment of wiki articles

TPDL'12 Proceedings of the Second international conference on Theory and Practice of Digital Libraries
Large-margin multi-view Gaussian process for image classification

Proceedings of the Fifth International Conference on Internet Multimedia Computing and Service
Fractional-order embedding canonical correlation analysis and its applications to multi-view dimensionality reduction and recognition

Pattern Recognition
Improving multi-view semi-supervised learning with agreement-based sampling

Intelligent Data Analysis - Combined Learning Methods and Mining Complex Data

Quantified Score

Hi-index	0.01

Visualization

Abstract

In the multi-view regression problem, we have a regression problem where the input variable (which is a real vector) can be partitioned into two different views, where it is assumed that either view of the input is sufficient to make accurate predictions -- this is essentially (a significantly weaker version of) the co-training assumption for the regression problem. We provide a semi-supervised algorithm which first uses unlabeled data to learn a norm (or, equivalently, a kernel) and then uses labeled data in a ridge regression algorithm (with this induced norm) to provide the predictor. The unlabeled data is used via canonical correlation analysis (CCA, which is a closely related to PCA for two random variables) to derive an appropriate norm over functions. We are able to characterize the intrinsic dimensionality of the subsequent ridge regression problem (which uses this norm) by the correlation coefficients provided by CCA in a rather simple expression. Interestingly, the norm used by the ridge regression algorithm is derived from CCA, unlike in standard kernel methods where a special apriori norm is assumed (i.e. a Banach space is assumed). We discuss how this result shows that unlabeled data can decrease the sample complexity.