Experiments in cross-lingual sentiment analysis in discussion forums

  • Authors:
  • Hatem Ghorbel

  • Affiliations:
  • Information and Communication Systems Lab (ISIC), HES-SO, HE-Arc Ingénierie, St-Imier, Switzerland

  • Venue:
  • SocInfo'12 Proceedings of the 4th international conference on Social Informatics
  • Year:
  • 2012

Abstract

One of the objectives of sentiment analysis is to classify the polarity of conveyed opinions on the basis of textual evidence. Most work in the field has focused on English, and only a few experiments have explored other languages. In this paper, we present a supervised classification of posts in French online forums, where sentiment analysis is based on shallow linguistic features such as POS tagging, chunking and common negation forms. Furthermore, we incorporate word semantic orientation extracted from the English lexical resource SentiWordNet as an additional feature. Since SentiWordNet is an English resource, lexical entries in the studied French corpus must first be translated into English. For this purpose, we propose a number of French-to-English translation experiments, such as machine translation and WordNet synset translation using EuroWordNet. The results show that WordNet synset translation did not significantly improve classification performance over the bag-of-words baseline, owing to its limited coverage. Machine translation did not significantly improve the results either, owing to its insufficient quality. Proposals for improving classification performance are given at the end of the article.
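
The feature construction outlined in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation. The lexicon `FR_TO_EN`, the scores in `EN_POLARITY`, the negation list `NEGATIONS` and the function `featurize` are all hypothetical stand-ins: a toy dictionary replaces machine translation or EuroWordNet lookup, and the polarity values are invented rather than taken from SentiWordNet. The sketch shows how a semantic-orientation score can be added to bag-of-words features, and how limited lexicon coverage leaves most tokens without a score.

```python
from collections import Counter

# Hypothetical toy French-to-English lexicon (assumption); stands in for
# machine translation or EuroWordNet synset lookup.
FR_TO_EN = {"bon": "good", "mauvais": "bad", "excellent": "excellent", "film": "film"}

# Hypothetical (positive, negative) scores in the style of SentiWordNet.
# The numbers are invented for illustration, not read from the resource.
EN_POLARITY = {"good": (0.75, 0.0), "bad": (0.0, 0.625), "excellent": (1.0, 0.0)}

# Assumed short list of French negation markers.
NEGATIONS = {"ne", "pas", "jamais"}


def featurize(post):
    """Return bag-of-words counts plus semantic-orientation features."""
    tokens = post.lower().split()

    # Bag-of-words baseline: one count feature per token.
    features = {f"bow:{tok}": n for tok, n in Counter(tokens).items()}

    orientation = 0.0
    covered = 0
    for tok in tokens:
        english = FR_TO_EN.get(tok)
        scores = EN_POLARITY.get(english) if english is not None else None
        if scores is None:
            continue  # no translation or no polarity entry: a coverage gap
        positive, negative = scores
        orientation += positive - negative
        covered += 1

    features["so_score"] = orientation
    # Fraction of tokens that received a polarity score.
    features["so_coverage"] = covered / len(tokens) if tokens else 0.0
    # Binary flag for the presence of a common negation form.
    features["has_negation"] = int(any(tok in NEGATIONS for tok in tokens))
    return features


if __name__ == "__main__":
    print(featurize("Ce film est bon"))
    print(featurize("pas mauvais"))
```

The resulting dictionaries could be passed to any supervised classifier that accepts sparse feature maps. The `so_coverage` value makes the coverage problem visible: when few tokens are translated and scored, the orientation feature carries little information beyond the baseline.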