Sentiment analysis of blogs by combining lexical knowledge with text classification

Authors:
Prem Melville;Wojciech Gryc;Richard D. Lawrence
Affiliations:
IBM Research, Yorktown Heights, NY, USA;Oxford University, Oxford, United Kingdom;IBM Research, Yorktown Heights, NY, USA
Venue:
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Year:
2009

Citing 22
Cited 52

The Strength of Weak Learnability

Machine Learning
Combining Symbolic and Neural Learning

Machine Learning
An algorithm for suffix stripping

Readings in information retrieval
Incorporating Prior Knowledge into Boosting

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Using unlabeled data to improve text classification

Using unlabeled data to improve text classification
Mining and summarizing customer reviews

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Incorporating prior knowledge with weighted margin support vector machines

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Thumbs up?: sentiment classification using machine learning techniques

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Question Answering via Bayesian inference on lexical relations

MultiSumQA '03 Proceedings of the ACL 2003 workshop on Multilingual summarization and question answering - Volume 12
Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Constructing informative prior distributions from domain knowledge in text classification

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Movie review mining and summarization

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Determining the sentiment of opinions

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Recognizing contextual polarity in phrase-level sentiment analysis

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Examining the role of linguistic knowledge sources in the automatic identification and classification of reviews

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
COBRA - Mining Web for Corporate Brand and Reputation Analysis

WI '07 Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence
Learning from labeled features using generalized expectation criteria

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Document-Word Co-regularization for Semi-supervised Sentiment Analysis

ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
Text classification by labeling words

AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Predicting the political sentiment of web log posts using supervised machine learning techniques coupled with feature selection

WebKDD'06 Proceedings of the 8th Knowledge discovery on the web international conference on Advances in web mining and web usage analysis

Uncertainty sampling and transductive experimental design for active dual supervision

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Data quality from crowdsourcing: a study of annotation selection criteria

HLT '09 Proceedings of the NAACL HLT 2009 Workshop on Active Learning for Natural Language Processing
Active dual supervision: reducing the cost of annotating examples and features

HLT '09 Proceedings of the NAACL HLT 2009 Workshop on Active Learning for Natural Language Processing
Scary films good, scary flights bad: topic driven feature selection for classification of sentiment

Proceedings of the 1st international CIKM workshop on Topic-sentiment analysis for mass opinion
Opinion mining and summarization of reviews in web forums

Proceedings of the Third Annual ACM Bangalore Conference
Classification of Dreams Using Machine Learning

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
Multi Grain Sentiment Analysis using Collective Classification

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
OpinionIt: a text mining system for cross-lingual opinion analysis

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Learning sentiment classification model from labeled features

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Efficient term cloud generation for streaming web content

ICWE'10 Proceedings of the 10th international conference on Web engineering
A unified approach to active dual supervision for labeling features and examples

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Lexicon based sentiment analysis of Urdu text using SentiUnits

MICAI'10 Proceedings of the 9th Mexican international conference on Advances in artificial intelligence: Part I
Enhanced sentiment learning using Twitter hashtags and smileys

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Quantifying sentiment and influence in blogspaces

Proceedings of the First Workshop on Social Media Analytics
An unsupervised sentiment classifier on summarized or full reviews

WISE'10 Proceedings of the 11th international conference on Web information systems engineering
Self-training from labeled features for sentiment analysis

Information Processing and Management: an International Journal
Using a heterogeneous dataset for emotion analysis in text

Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence
From bias to opinion: a transfer-learning approach to real-time sentiment analysis

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Holistic approaches to identifying the sentiment of blogs using opinion words

WISE'11 Proceedings of the 12th international conference on Web information system engineering
Developing robust models for favourability analysis

WASSA '11 Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis
A cross-corpus study of unsupervised subjectivity identification based on calibrated EM

WASSA '11 Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis
Deploying an interactive machine learning system in an evidence-based practice center: abstrackr

Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium
Twitter polarity classification with label propagation over lexical links and the follower graph

EMNLP '11 Proceedings of the First Workshop on Unsupervised Learning in NLP
A non-negative matrix factorization based approach for active dual supervision from document and word labels

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Closing the loop: fast, interactive semi-supervised annotation with queries on features and instances

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Incorporating Sentiment Prior Knowledge for Weakly Supervised Sentiment Analysis

ACM Transactions on Asian Language Information Processing (TALIP)
Survey on mining subjective data on the web

Data Mining and Knowledge Discovery
An approach of semi-automatic public sentiment analysis for opinion and district

WAIM'11 Proceedings of the 2011 international conference on Web-Age Information Management
A generate-and-test method of detecting negative-sentiment sentences

CICLing'12 Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
Semi-supervised document clustering with dual supervision through seeding

Proceedings of the 27th Annual ACM Symposium on Applied Computing
A unified framework for document clustering with dual supervision

ACM SIGAPP Applied Computing Review
Personalized document clustering with dual supervision

Proceedings of the 2012 ACM symposium on Document engineering
On developing robust models for favourability analysis: Model choice, feature sets and imbalanced data

Decision Support Systems
Behavioral factors in interactive training of text classifiers

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Dual word and document seed selection for semi-supervised sentiment classification

Proceedings of the 21st ACM international conference on Information and knowledge management
Healthy or harmful? polarity analysis applied to biomedical entity relationships

PRICAI'12 Proceedings of the 12th Pacific Rim international conference on Trends in Artificial Intelligence
Sentiment analysis by augmenting expectation maximisation with lexical knowledge

WISE'12 Proceedings of the 13th international conference on Web Information Systems Engineering
Mining Product Reviews in Web Forums

International Journal of Information Retrieval Research
Active learning on sentiment classification by selecting both words and documents

CLSW'12 Proceedings of the 13th Chinese conference on Chinese Lexical Semantics
Polarity Analysis for Food and Disease Relationships

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Ontology-based sentiment analysis of twitter posts

Expert Systems with Applications: An International Journal
Revised mutual information approach for german text sentiment classification

Proceedings of the 22nd international conference on World Wide Web companion
Amplifying the voice of youth in Africa via text analytics

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Cross-media sentiment classification and application to box-office forecasting

Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
Detecting changes in content and posting time distributions in social media

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Using the length of the speech to measure the opinion

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Exploring weakly supervised latent sentiment explanations for aspect-level review analysis

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
End-user feature labeling: Supervised and semi-supervised approaches based on locally-weighted logistic regression

Artificial Intelligence
Sentiment visualization and classification via semi-supervised nonlinear dimensionality reduction

Pattern Recognition
A weakly supervised approach to Chinese sentiment classification using partitioned self-training

Journal of Information Science
Bootstrapping polarity classifiers with rule-based classification

Language Resources and Evaluation
Platform and applications for massive-scale streaming network analytics

IBM Journal of Research and Development

Quantified Score

Hi-index	0.00

Visualization

Abstract

The explosion of user-generated content on the Web has led to new opportunities and significant challenges for companies, that are increasingly concerned about monitoring the discussion around their products. Tracking such discussion on weblogs, provides useful insight on how to improve products or market them more effectively. An important component of such analysis is to characterize the sentiment expressed in blogs about specific brands and products. Sentiment Analysis focuses on this task of automatically identifying whether a piece of text expresses a positive or negative opinion about the subject matter. Most previous work in this area uses prior lexical knowledge in terms of the sentiment-polarity of words. In contrast, some recent approaches treat the task as a text classification problem, where they learn to classify sentiment based only on labeled training data. In this paper, we present a unified framework in which one can use background lexical information in terms of word-class associations, and refine this information for specific domains using any available training examples. Empirical results on diverse domains show that our approach performs better than using background knowledge or training data in isolation, as well as alternative approaches to using lexical knowledge with text classification.