Generating high-coverage semantic orientation lexicons from overtly marked words and a thesaurus

Authors:
Saif Mohammad;Cody Dunne;Bonnie Dorr
Affiliations:
Laboratory for Computational Linguistics and Information Processing and Institute for Advanced Computer Studies and Human Language Technology Center of Excellence;Human-Computer Interaction Lab and University of Maryland;Laboratory for Computational Linguistics and Information Processing and Institute for Advanced Computer Studies and University of Maryland and Human Language Technology Center of Excellence
Venue:
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Year:
2009

Citing 19
Cited 20

Graph drawing by force-directed placement

Software—Practice & Experience
PHOAKS: a system for sharing recommendations

Communications of the ACM
Virtual reviewers for collaborative exploration of movie reviews

Proceedings of the 5th international conference on Intelligent user interfaces
Measuring praise and criticism: Inference of semantic orientation from association

ACM Transactions on Information Systems (TOIS)
Tracking point of view in narrative

Computational Linguistics
Predicting the semantic orientation of adjectives

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Mining and summarizing customer reviews

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Extracting semantic orientations of words using spin model

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Recognizing contextual polarity in phrase-level sentiment analysis

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Opinion Mining and Sentiment Analysis

Foundations and Trends in Information Retrieval
Analyzing (social media) networks with NodeXL

Proceedings of the fourth international conference on Communities and technologies
Distributional measures of concept-distance: a task-oriented evaluation

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Fully automatic lexicon expansion for domain-oriented sentiment analysis

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Learning with compositional semantics as structural inference for subsentential sentiment analysis

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Computing word-pair antonymy

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Qualitative dimensions in question answering: extending the definitional QA task

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 4
Identifying expressions of emotion in text

TSD'07 Proceedings of the 10th international conference on Text, speech and dialogue

The viability of web-derived polarity lexicons

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Emotions evoked by common words and phrases: using mechanical turk to create an emotion lexicon

CAAGET '10 Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text
Aspect and sentiment unification model for online review analysis

Proceedings of the fourth ACM international conference on Web search and data mining
Which clustering do you want? inducing your ideal clustering with minimal feedback

Journal of Artificial Intelligence Research
Automatic construction of a context-aware sentiment lexicon: an optimization approach

Proceedings of the 20th international conference on World wide web
Colourful language: measuring word-colour associations

CMCL '11 Proceedings of the 2nd Workshop on Cognitive Modeling and Computational Linguistics
From once upon a time to happily ever after: tracking emotions in novels and fairy tales

LaTeCH '11 Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
Generating semantic orientation lexicon using large data and thesaurus

WASSA '11 Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis
Tracking sentiment in mail: how genders differ on emotional axes

WASSA '11 Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis
Mining subjective knowledge from customer reviews: a specific case of irony detection

WASSA '11 Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis
Mining slang and urban opinion words and phrases from cQA services: an optimization approach

Proceedings of the fifth ACM international conference on Web search and data mining
Compositional matrix-space models for sentiment analysis

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Learning general connotation of words using graph-based algorithms

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
From humor recognition to irony detection: The figurative language of social media

Data & Knowledge Engineering
Building subjectivity lexicon(s) from scratch for essay data

CICLing'12 Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
From once upon a time to happily ever after: Tracking emotions in mail and books

Decision Support Systems
Making objective decisions from subjective data: Detecting irony in customer reviews

Decision Support Systems
Collocation polarity disambiguation using web-based pseudo contexts

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
A multidimensional approach for detecting irony in Twitter

Language Resources and Evaluation
Some experiments on modeling stock market behavior using investor sentiment analysis and posting volume from Twitter

Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Sentiment analysis often relies on a semantic orientation lexicon of positive and negative words. A number of approaches have been proposed for creating such lexicons, but they tend to be computationally expensive, and usually rely on significant manual annotation and large corpora. Most of these methods use WordNet. In contrast, we propose a simple approach to generate a high-coverage semantic orientation lexicon, which includes both individual words and multi-word expressions, using only a Roget-like thesaurus and a handful of affixes. Further, the lexicon has properties that support the Polyanna Hypothesis. Using the General Inquirer as gold standard, we show that our lexicon has 14 percentage points more correct entries than the leading WordNet-based high-coverage lexicon (SentiWordNet). In an extrinsic evaluation, we obtain significantly higher performance in determining phrase polarity using our thesaurus-based lexicon than with any other. Additionally, we explore the use of visualization techniques to gain insight into the our algorithm beyond the evaluations mentioned above.