The viability of web-derived polarity lexicons

Authors:
Leonid Velikovich;Sasha Blair-Goldensohn;Kerry Hannan;Ryan McDonald
Affiliations:
Google Inc., New York, NY;Google Inc., New York, NY;Google Inc., New York, NY;Google Inc., New York, NY
Venue:
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Year:
2010

Citing 18
Cited 20

Learning Subjective Adjectives from Corpora

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Predicting the semantic orientation of adjectives

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Mining and summarizing customer reviews

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Thumbs up?: sentiment classification using machine learning techniques

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Learning extraction patterns for subjective expressions

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Determining the sentiment of opinions

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Recognizing contextual polarity in phrase-level sentiment analysis

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web

Management Science
Generating a non-English subjectivity lexicon: relations that matter

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Sentiment summarization: evaluating and learning user preferences

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Semi-supervised polarity lexicon induction

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Multilingual subjectivity analysis using machine translation

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Large-scale computation of distributional similarities for queries

NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Adapting a polarity lexicon using integer linear programming for domain-specific sentiment classification

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Generating high-coverage semantic orientation lexicons from overtly marked words and a thesaurus

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Web-scale distributional similarity and entity set expansion

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Pulse: mining customer opinions from free text

IDA'05 Proceedings of the 6th international conference on Advances in Intelligent Data Analysis

What's great and what's not: learning to classify the scope of negation for improved sentiment analysis

NeSp-NLP '10 Proceedings of the Workshop on Negation and Speculation in Natural Language Processing
Semi-supervised latent variable models for sentence-level sentiment analysis

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Generating semantic orientation lexicon using large data and thesaurus

WASSA '11 Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis
Sentiment lexicons for health-related opinion mining

Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium
Mining slang and urban opinion words and phrases from cQA services: an optimization approach

Proceedings of the fifth ACM international conference on Web search and data mining
Semi-supervised recursive autoencoders for predicting sentiment distributions

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Compositional matrix-space models for sentiment analysis

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Cooooooooooooooollllllllllllll!!!!!!!!!!!!!!: using word lengthening to detect sentiment in microblogs

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Learning general connotation of words using graph-based algorithms

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Lexicon-based Comments-oriented News Sentiment Analyzer system

Expert Systems with Applications: An International Journal
Towards building large-scale distributed systems for twitter sentiment analysis

Proceedings of the 27th Annual ACM Symposium on Applied Computing
A generic approach to generate opinion lists of phrases for opinion mining applications

Proceedings of the First International Workshop on Issues of Sentiment Discovery and Opinion Mining
Detecting visual text

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Collocation polarity disambiguation using web-based pseudo contexts

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Excitatory or inhibitory: a new semantic orientation extracts contradiction and causality from the web

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Fast large-scale approximate graph construction for NLP

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Generating contextualized sentiment lexica based on latent topics and user ratings

Proceedings of the 24th ACM Conference on Hypertext and Social Media
Automatic construction of domain and aspect specific sentiment lexicons for customer review mining

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Chinese-English mixed text normalization

Proceedings of the 7th ACM international conference on Web search and data mining
Bootstrapping polarity classifiers with rule-based classification

Language Resources and Evaluation

Quantified Score

Hi-index	0.00

Visualization

Abstract

We examine the viability of building large polarity lexicons semi-automatically from the web. We begin by describing a graph propagation framework inspired by previous work on constructing polarity lexicons from lexical graphs (Kim and Hovy, 2004; Hu and Liu, 2004; Esuli and Sabastiani, 2009; Blair-Goldensohn et al., 2008; Rao and Ravichandran, 2009). We then apply this technique to build an English lexicon that is significantly larger than those previously studied. Crucially, this web-derived lexicon does not require WordNet, part-of-speech taggers, or other language-dependent resources typical of sentiment analysis systems. As a result, the lexicon is not limited to specific word classes -- e.g., adjectives that occur in WordNet -- and in fact contains slang, misspellings, multiword expressions, etc. We evaluate a lexicon derived from English documents, both qualitatively and quantitatively, and show that it provides superior performance to previously studied lexicons, including one derived from WordNet.