Learning sentiment classification model from labeled features

Authors:
Yulan He
Affiliations:
The Open University, Milton Keynes, United Kingdom
Venue:
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Year:
2010

Citing 9
Cited 1

Question Answering via Bayesian inference on lexical relations

MultiSumQA '03 Proceedings of the ACL 2003 workshop on Multilingual summarization and question answering - Volume 12
Learning from labeled features using generalized expectation criteria

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Combining learn-based and lexicon-based techniques for sentiment detection without using labeled examples

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Sentiment analysis of blogs by combining lexical knowledge with text classification

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Automatic seed word selection for unsupervised sentiment classification of Chinese text

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
SELC: a self-supervised model for sentiment classification

Proceedings of the 18th ACM conference on Information and knowledge management
A non-negative matrix tri-factorization approach to sentiment classification with lexical prior knowledge

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Topic-wise, sentiment-wise, or otherwise?: Identifying the hidden dimension for unsupervised text classification

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
A comparative study of Bayesian models for unsupervised sentiment detection

CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning

Self-training from labeled features for sentiment analysis

Information Processing and Management: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a novel framework where an initial classifier is learned by incorporating prior information extracted from an existing sentiment lexicon. Preferences on expectations of sentiment labels of those lexicon words are expressed using generalized expectation criteria. Documents classified with high confidence are then used as pseudo-labeled examples for automatical domain-specific feature acquisition. The word-class distributions of such self-learned features are estimated from the pseudo-labeled examples and are used to train another classifier by constraining the model's predictions on unlabeled instances. Experiments on both the movie review data and the multi-domain sentiment dataset show that our approach attains comparable or better performance than exiting weakly-supervised sentiment classification methods despite using no labeled documents.