Combining labeled and unlabeled data with co-training
COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Unsupervised word sense disambiguation rivaling supervised methods
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Understanding the Yarowsky Algorithm
Computational Linguistics
Learning Bounds for Kernel Regression Using Effective Data Dimensionality
Neural Computation
Efficient co-regularised least squares regression
ICML '06 Proceedings of the 23rd international conference on Machine learning
A PAC-Style model for learning from labeled and unlabeled data
COLT'05 Proceedings of the 18th annual conference on Learning Theory
Multi-view clustering via canonical correlation analysis
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Posterior Regularization for Structured Latent Variable Models
The Journal of Machine Learning Research
Exploiting tag and word correlations for improved webpage clustering
SMUC '10 Proceedings of the 2nd international workshop on Search and mining user-generated contents
Linear Algorithms for Online Multitask Classification
The Journal of Machine Learning Research
Neurocomputing
Multiview semi-supervised learning for ranking multilingual documents
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III
Regularized tensor factorization for multi-modality medical image classification
MICCAI'11 Proceedings of the 14th international conference on Medical image computing and computer-assisted intervention - Volume Part III
A multi-view regularization method for semi-supervised learning
ISNN'10 Proceedings of the 7th international conference on Advances in Neural Networks - Volume Part I
Multi-view learning via probabilistic latent semantic analysis
Information Sciences: an International Journal
Leveraging Social Bookmarks from Partially Tagged Corpus for Improved Web Page Clustering
ACM Transactions on Intelligent Systems and Technology (TIST)
On multiview-based meta-learning for automatic quality assessment of wiki articles
TPDL'12 Proceedings of the Second international conference on Theory and Practice of Digital Libraries
Large-margin multi-view Gaussian process for image classification
Proceedings of the Fifth International Conference on Internet Multimedia Computing and Service
Improving multi-view semi-supervised learning with agreement-based sampling
Intelligent Data Analysis - Combined Learning Methods and Mining Complex Data
Hi-index | 0.01 |
In the multi-view regression problem, we have a regression problem where the input variable (which is a real vector) can be partitioned into two different views, where it is assumed that either view of the input is sufficient to make accurate predictions -- this is essentially (a significantly weaker version of) the co-training assumption for the regression problem. We provide a semi-supervised algorithm which first uses unlabeled data to learn a norm (or, equivalently, a kernel) and then uses labeled data in a ridge regression algorithm (with this induced norm) to provide the predictor. The unlabeled data is used via canonical correlation analysis (CCA, which is a closely related to PCA for two random variables) to derive an appropriate norm over functions. We are able to characterize the intrinsic dimensionality of the subsequent ridge regression problem (which uses this norm) by the correlation coefficients provided by CCA in a rather simple expression. Interestingly, the norm used by the ridge regression algorithm is derived from CCA, unlike in standard kernel methods where a special apriori norm is assumed (i.e. a Banach space is assumed). We discuss how this result shows that unlabeled data can decrease the sample complexity.