Leveraging web 2.0 data for scalable semi-supervised learning of domain-specific sentiment lexicons

Authors:
Raymond Yiu Keung Lau;Chun Lam Lai;Peter B. Bruza;Kam F. Wong
Affiliations:
City University of Hong Kong, Kowloon, Hong Kong;City University of Hong Kong, Kowloon, Hong Kong;Queensland University of Technology, Brisbane, Australia;Chinese University of Hong Kong, Shatin, Hong Kong
Venue:
Proceedings of the 20th ACM international conference on Information and knowledge management
Year:
2011

Citing 18
Cited 1

A language modeling approach to information retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Measuring praise and criticism: Inference of semantic orientation from association

ACM Transactions on Information Systems (TOIS)
Belief revision for adaptive information retrieval

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Determining the semantic orientation of terms through gloss classification

Proceedings of the 14th ACM international conference on Information and knowledge management
Thumbs up?: sentiment classification using machine learning techniques

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
OpinionFinder: a system for subjectivity analysis

HLT-Demo '05 Proceedings of HLT/EMNLP on Interactive Demonstrations
Show me the money!: deriving the pricing power of product features by mining consumer reviews

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Towards a belief-revision-based adaptive and context-sensitive information retrieval system

ACM Transactions on Information Systems (TOIS)
An effective statistical approach to blog post opinion retrieval

Proceedings of the 17th ACM conference on Information and knowledge management
Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web

Management Science
Toward a Fuzzy Domain Ontology Extraction Method for Adaptive e-Learning

IEEE Transactions on Knowledge and Data Engineering
Semi-supervised polarity lexicon induction

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Fully automatic lexicon expansion for domain-oriented sentiment analysis

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Recognizing contextual polarity: An exploration of features for phrase-level sentiment analysis

Computational Linguistics
Expanding domain sentiment lexicon through double propagation

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Automatic construction of an opinion-term vocabulary for ad hoc retrieval

ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Generating focused topic-specific sentiment lexicons

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Opinion word expansion and target extraction through double propagation

Computational Linguistics

Automatic construction of domain-specific sentiment lexicon based on constrained label propagation

Knowledge-Based Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Since manually constructing domain-specific sentiment lexicons is extremely time consuming and it may not even be feasible for domains where linguistic expertise is not available, research on automatic construction of domain-specific sentiment lexicons has become a hot topic in recent years. The main contribution of this paper is the illustration of a novel semi-supervised learning method which exploits both term-to-term and document-to-term relations hidden in a corpus for the construction of domain-specific sentiment lexicons. More specifically, the proposed two-pass pseudo labeling method combines shallow linguistic parsing and corpus-base statistical learning to make domain-specific sentiment extraction scalable with respect to the sheer volume of opinionated documents archived on the Internet these days. Our experiments show that the proposed method can generate high quality domain-specific sentiment lexicons according to users' evaluation.