Experiments on hybrid corpus-based sentiment lexicon acquisition

  • Authors:
  • Goran Glavaš;Jan Šnajder;Bojana Dalbelo Bašić

  • Affiliations:
  • University of Zagreb, Zagreb, Croatia;University of Zagreb, Zagreb, Croatia;University of Zagreb, Zagreb, Croatia

  • Venue:
  • HYBRID '12 Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Numerous sentiment analysis applications make usage of a sentiment lexicon. In this paper we present experiments on hybrid sentiment lexicon acquisition. The approach is corpus-based and thus suitable for languages lacking general dictionary-based resources. The approach is a hybrid two-step process that combines semi-supervised graph-based algorithms and supervised models. We evaluate the performance on three tasks that capture different aspects of a sentiment lexicon: polarity ranking task, polarity regression task, and sentiment classification task. Extensive evaluation shows that the results are comparable to those of a well-known sentiment lexicon SentiWordNet on the polarity ranking task. On the sentiment classification task, the results are also comparable to SentiWordNet when restricted to monosentimous (all senses carry the same sentiment) words. This is satisfactory, given the absence of explicit semantic relations between words in the corpus.