Learning opinions in user-generated web content

  • Authors:
  • M. Sokolova;G. Lapalme

  • Affiliations:
  • Department of pediatrics, faculty of medicine, children's hospital of eastern ontario research institute, university of ottawa, 401 smyth rd., ottawa, ontario, canada, k1h 8l1 email: sokolova@uott ...;Département d'informatique et de recherche opérationnelle, université de montréal, c.p. 6128, succ centre-ville, montréal, quebec, canada, h3c 3j7 email: lapalme@iro.umont ...

  • Venue:
  • Natural Language Engineering
  • Year:
  • 2011

Abstract

User-generated Web content has been analyzed intensively in Information Extraction and Natural Language Processing research. Web-posted reviews of consumer goods are studied to find customer opinions about the products. We hypothesize that descriptions that are not emotionally charged can be used to predict those opinions. Such descriptions may include indicators of product size (tall), commonplace (some), frequency of occurrence (often), and reviewer certainty (maybe). We first construct patterns of how these descriptions are used in consumer-written texts and then represent individual reviews through these patterns. We propose a semantic hierarchy that organizes individual words into opinion types. We run machine learning algorithms on five data sets of user-written product reviews: four are used in classification experiments, and the fifth is used for both regression and classification. The results support the use of non-emotional descriptions in opinion learning.
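
As a rough illustration of representing a review through descriptor categories, the sketch below counts how often words from each category occur in a review. It is not the authors' implementation: the abstract does not specify the patterns, the semantic hierarchy, or the learning algorithms. The lexicon, the category keys, and the function name `review_to_features` are hypothetical; only the four example words (tall, some, often, maybe) come from the abstract.

```python
import re
from collections import Counter

# Hypothetical lexicon. The four categories and their single example
# words are taken from the abstract; a real lexicon would be far larger.
DESCRIPTOR_LEXICON = {
    "size": {"tall"},
    "commonplace": {"some"},
    "frequency": {"often"},
    "certainty": {"maybe"},
}

# Fixed, alphabetical feature order:
# certainty, commonplace, frequency, size
CATEGORIES = sorted(DESCRIPTOR_LEXICON)


def review_to_features(text):
    """Return per-category descriptor counts for one review (assumed scheme)."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter()
    for token in tokens:
        for category, words in DESCRIPTOR_LEXICON.items():
            if token in words:
                counts[category] += 1
    return [counts[category] for category in CATEGORIES]


if __name__ == "__main__":
    review = "Maybe the tall stand often wobbles, and some screws were missing."
    print(dict(zip(CATEGORIES, review_to_features(review))))
```

Vectors of this kind could then be passed to any standard classifier or regressor. Which learners and features the paper actually used is not stated in the abstract.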