Learning aspect models with partially labeled data

Authors:
Anastasia Krithara;Massih R. Amini;Cyril Goutte;Jean-Michel Renders
Affiliations:
Xerox Research Centre Europe, 6, Chemin de Maupertuis, 38240 Meylan, France and University Pierre et Marie Curie, 4, Place Jussieu, 75252 Paris cedex 05, France;University Pierre et Marie Curie, 4, Place Jussieu, 75252 Paris cedex 05, France and National Research Council Canada, 283, Bd. Alexandre-Taché, QC, Canada J8X 3X7;National Research Council Canada, 283, Bd. Alexandre-Taché, QC, Canada J8X 3X7;Xerox Research Centre Europe, 6, Chemin de Maupertuis, 38240 Meylan, France
Venue:
Pattern Recognition Letters
Year:
2011

Citing 17
Cited 0

Discriminant analysis with a stochastic supervisor

Pattern Recognition
Efficiency of learning with imperfect supervision

Pattern Recognition
An alternative stochastic supervisor in discriminant analysis

Pattern Recognition
On the exponential value of labeled samples

Pattern Recognition Letters
Combining labeled and unlabeled data with co-training

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Text Classification from Labeled and Unlabeled Documents using EM

Machine Learning - Special issue on information retrieval
Unsupervised learning by probabilistic latent semantic analysis

Machine Learning
The use of unlabeled data to improve supervised learning for text summarization

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
A Hierarchical Model for Clustering and Categorising Documents

Proceedings of the 24th BCS-IRSG European Colloquium on IR Research: Advances in Information Retrieval
Learning Classification with Both Labeled and Unlabeled Data

ECML '02 Proceedings of the 13th European Conference on Machine Learning
Estimating a Kernel Fisher Discriminant in the Presence of Label Noise

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Transductive Inference for Text Classification using Support Vector Machines

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Semi-supervised learning with explicit misclassification modeling

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Semi-supervised document classification with a mislabeling error model

ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Expectation-propagation for the generative aspect model

UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
The relative value of labeled and unlabeled samples in pattern recognition with an unknown mixing parameter

IEEE Transactions on Information Theory - Part 2
Learning to recognize patterns without a teacher

IEEE Transactions on Information Theory

Quantified Score

Hi-index	0.10

Visualization

Abstract

In this paper, we address the problem of learning aspect models with partially labeled data for the task of document categorization. The motivation of this work is to take advantage of the amount of available unlabeled data together with the set of labeled examples to learn latent models whose structure and underlying hypotheses take more accurately into account the document generation process, compared to other mixture-based generative models. We present one semi-supervised variant of the Probabilistic Latent Semantic Analysis (PLSA) model (Hofmann, 2001). In our approach, we try to capture the possible data mislabeling errors which occur during the training of our model. This is done by iteratively assigning class labels to unlabeled examples using the current aspect model and re-estimating the probabilities of the mislabeling errors. We perform experiments over the 20Newsgroups, WebKB and Reuters document collections, as well as over a real world dataset coming from a Business Group of Xerox and show the effectiveness of our approach compared to a semi-supervised version of Naive Bayes, another semi-supervised version of PLSA and to transductive Support Vector Machines.