Soft-supervised learning for text classification

Authors:
Amarnag Subramanya;Jeff Bilmes
Affiliations:
University of Washington, Seattle, WA;University of Washington, Seattle, WA
Venue:
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Year:
2008

Citing 15
Cited 13

A critical investigation of recall and precision as measures of retrieval system performance

ACM Transactions on Information Systems (TOIS)
Elements of information theory

Elements of information theory
Combining labeled and unlabeled data with co-training

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Inductive learning algorithms and representations for text categorization

Proceedings of the seventh international conference on Information and knowledge management
Learning to classify text from labeled and unlabeled documents

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Transductive Inference for Text Classification using Support Vector Machines

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Learning from Labeled and Unlabeled Data using Graph Mincuts

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Distributional word clusters vs. words for text categorization

The Journal of Machine Learning Research
Unsupervised word sense disambiguation rivaling supervised methods

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Beyond the point cloud: from transductive to semi-supervised learning

ICML '05 Proceedings of the 22nd international conference on Machine learning
Propagating distributions on a hypergraph by dual information regularization

ICML '05 Proceedings of the 22nd international conference on Machine learning
Semi-supervised learning with graphs

Semi-supervised learning with graphs
Graph transduction via alternating minimization

Proceedings of the 25th international conference on Machine learning
Semi-Supervised Learning

Semi-Supervised Learning
On information regularization

UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence

Classifying Amharic webnews

Information Retrieval
New Regularized Algorithms for Transductive Learning

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Learning better data representation using inference-driven metric learning

ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
Semi-Supervised Learning with Measure Propagation

The Journal of Machine Learning Research
Graph-based lexicon expansion with sparsity-inducing penalties

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Graph-based semi-supervised learning algorithms for NLP

ACL '12 Tutorial Abstracts of ACL 2012
Bidirectional semi-supervised learning with graphs

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Graph-Based transduction with confidence

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Stock price prediction based on a complex interrelation network of economic factors

Engineering Applications of Artificial Intelligence
Prediction of movement direction in crude oil prices based on semi-supervised learning

Decision Support Systems
Robust predictive model for evaluating breast cancer survivability

Engineering Applications of Artificial Intelligence
Large-scale multilabel propagation based on efficient sparse graph construction

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Sharpened graph ensemble for semi-supervised learning

Intelligent Data Analysis

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a new graph-based semi-supervised learning (SSL) algorithm and demonstrate its application to document categorization. Each document is represented by a vertex within a weighted undirected graph and our proposed framework minimizes the weighted Kullback-Leibler divergence between distributions that encode the class membership probabilities of each vertex. The proposed objective is convex with guaranteed convergence using an alternating minimization procedure. Further, it generalizes in a straightforward manner to multi-class problems. We present results on two standard tasks, namely Reuters-21578 and WebKB, showing that the proposed algorithm significantly outperforms the state-of-the-art.