Learning opinions in user-generated web content

Authors:
M. Sokolova;G. Lapalme
Affiliations:
Department of pediatrics, faculty of medicine, children's hospital of eastern ontario research institute, university of ottawa, 401 smyth rd., ottawa, ontario, canada, k1h 8l1 email: sokolova@uott ...;Département d'informatique et de recherche opérationnelle, université de montréal, c.p. 6128, succ centre-ville, montréal, quebec, canada, h3c 3j7 email: lapalme@iro.umont ...
Venue:
Natural Language Engineering
Year:
2011

Citing 28
Cited 2

Measuring praise and criticism: Inference of semantic orientation from association

ACM Transactions on Information Systems (TOIS)
Benchmarking Attribute Selection Techniques for Discrete Class Data Mining

IEEE Transactions on Knowledge and Data Engineering
Mining and summarizing customer reviews

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Thumbs up?: sentiment classification using machine learning techniques

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Identifying comparative sentences in text documents

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Language and the Internet

Language and the Internet
Speech and Language Processing (2nd Edition)

Speech and Language Processing (2nd Edition)
Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-Centric Systems and Applications)

Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-Centric Systems and Applications)
Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Extracting product features and opinions from reviews

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Examining the role of linguistic knowledge sources in the automatic identification and classification of reviews

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Opinion spam and analysis

WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Query-Based Summarization of Customer Reviews

CAI '07 Proceedings of the 20th conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence
Experience Mining: Building a Large-Scale Database of Personal Experiences and Opinions from Web Documents

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Opinion Learning without Emotional Words

Canadian AI '09 Proceedings of the 22nd Canadian Conference on Artificial Intelligence: Advances in Artificial Intelligence
Topic identification for fine-grained opinion analysis

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Feature subsumption for opinion analysis

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Identifying expressions of opinion in context

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Recognizing stances in online debates

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Mine the easy, classify the hard: a semi-supervised approach to automatic sentiment classification

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Verbs speak loud: verb categories in learning polarity and strength of opinions

Canadian AI'08 Proceedings of the Canadian Society for computational studies of intelligence, 21st conference on Advances in artificial intelligence
Focusing solutions for data mining: analytical studies and experimental results in real-world domains

Focusing solutions for data mining: analytical studies and experimental results in real-world domains
"Was it good? It was provocative." Learning the meaning of scalar adjectives

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Employing personal/impersonal views in supervised and semi-supervised sentiment classification

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Sentiment classification and polarity shifting

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics

Learning sentiments from tweets with personal health information

Canadian AI'12 Proceedings of the 25th Canadian conference on Advances in Artificial Intelligence
External validity of sentiment mining reports: Can current methods identify demographic biases, event biases, and manipulation of reviews?

Decision Support Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

The user-generated Web content has been intensively analyzed in Information Extraction and Natural Language Processing research. Web-posted reviews of consumer goods are studied to find customer opinions about the products. We hypothesize that nonemotionally charged descriptions can be applied to predict those opinions. The descriptions may include indicators of product size (tall), commonplace (some), frequency of happening (often), and reviewer certainty (maybe). We first construct patterns of how the descriptions are used in consumer-written texts and then represent individual reviews through these patterns. We propose a semantic hierarchy that organizes individual words into opinion types. We run machine learning algorithms on five data sets of user-written product reviews: four are used in classification experiments, another one for regression and classification. The obtained results support the use of non-emotional descriptions in opinion learning.