Multiview semi-supervised learning for ranking multilingual documents

Authors:
Nicolas Usunier;Massih-Reza Amini;Cyril Goutte
Affiliations:
Université Pierre et Marie Curie, LIP6, Paris cedex, France;National Research Council Canada, IIT, Gatineau, QC, Canada;National Research Council Canada, IIT, Gatineau, QC, Canada
Venue:
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III
Year:
2011

Citing 17
Cited 2

Combining labeled and unlabeled data with co-training

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Text Classification from Labeled and Unlabeled Documents using EM

Machine Learning - Special issue on information retrieval
Optimizing search engines using clickthrough data

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Laplacian Eigenmaps for dimensionality reduction and data representation

Neural Computation
An efficient boosting algorithm for combining preferences

The Journal of Machine Learning Research
Semi-Supervised Learning on Riemannian Manifolds

Machine Learning
Multiple kernel learning, conic duality, and the SMO algorithm

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Generalization Bounds for the Area Under the ROC Curve

The Journal of Machine Learning Research
A support vector method for multivariate performance measures

ICML '05 Proceedings of the 22nd international conference on Machine learning
Training linear SVMs in linear time

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
A boosting algorithm for learning bipartite ranking functions with partially labeled data

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Introduction to Information Retrieval

Introduction to Information Retrieval
Semi-supervised ensemble ranking

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Learning to order things

Journal of Artificial Intelligence Research
Multi-view regression via canonical correlation analysis

COLT'07 Proceedings of the 20th annual conference on Learning theory
Semi-Supervised Learning

Semi-Supervised Learning
Ranking and scoring using empirical risk minimization

COLT'05 Proceedings of the 18th annual conference on Learning Theory

Temporal web image retrieval

SPIRE'12 Proceedings of the 19th international conference on String Processing and Information Retrieval
Democracy is good for ranking: towards multi-view rank learning and adaptation in web search

Proceedings of the 7th ACM international conference on Web search and data mining

Quantified Score

Hi-index	0.01

Visualization

Abstract

We address the problem of learning to rank documents in a multilingual context, when reference ranking information is only partially available. We propose a multiview learning approach to this semisupervised ranking task, where the translation of a document in a given language is considered as a view of the document. Although both multiview and semi-supervised learning of classifiers have been studied extensively in recent years, their application to the problem of ranking has received much less attention. We describe a semi-supervised multiview ranking algorithm that exploits a global agreement between viewspecific ranking functions on a set of unlabeled observations. We show that our proposed algorithm achieves significant improvements over both semi-supervised multiview classification and semi-supervised single-view rankers on a large multilingual collection of Reuters news covering 5 languages. Our experiments also suggest that our approach is most effective when few labeled documents are available and the classes are imbalanced.