Combining supervised and unsupervised polarity classification for non-English reviews

  • Authors:
  • José M. Perea-Ortega;Eugenio Martínez-Cámara;María-Teresa Martín-Valdivia;L. Alfonso Ureña-López

  • Affiliations:
  • SINAI Research Group, Computer Science Department, University of Jaén Escuela Politécnica Superior, Jaén, Spain;SINAI Research Group, Computer Science Department, University of Jaén Escuela Politécnica Superior, Jaén, Spain;SINAI Research Group, Computer Science Department, University of Jaén Escuela Politécnica Superior, Jaén, Spain;SINAI Research Group, Computer Science Department, University of Jaén Escuela Politécnica Superior, Jaén, Spain

  • Venue:
  • CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 2
  • Year:
  • 2013

Abstract

Two main approaches are used to detect the sentiment polarity of reviews. Supervised methods apply machine learning algorithms when training data are provided, whereas unsupervised methods are usually applied when linguistic resources are available but training data are not. Each approach has its own advantages and disadvantages, so we propose the use of meta-classifiers that combine both in order to classify the polarity of reviews. First, the non-English corpus is translated into English in order to take advantage of English linguistic resources. Then, two machine learning models are generated, one over each corpus (original and translated), and an unsupervised technique is applied only to the translated version. Finally, the three models are combined with a voting algorithm. Several experiments have been carried out using Spanish and Arabic corpora, showing that the proposed combination approach achieves better results than those obtained by using the methods separately.
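The final combination step can be sketched as a simple majority vote over the three models' outputs. This is only an illustrative sketch: the abstract does not say which voting scheme the authors use, so plain majority voting is an assumption here, and the names `majority_vote`, `combine`, `ml_original`, `ml_translated` and `lexicon_translated` are hypothetical.

```python
from collections import Counter


def majority_vote(votes):
    """Return the label chosen by most classifiers for a single review.

    Assumption: plain majority voting. With three binary classifiers
    a tie cannot occur.
    """
    label, _count = Counter(votes).most_common(1)[0]
    return label


def combine(ml_original, ml_translated, lexicon_translated):
    """Combine per-review predictions from three models.

    ml_original        -- labels from the supervised model on the original corpus
    ml_translated      -- labels from the supervised model on the translated corpus
    lexicon_translated -- labels from the unsupervised method on the translated corpus
    (all three names are hypothetical placeholders)
    """
    return [
        majority_vote(votes)
        for votes in zip(ml_original, ml_translated, lexicon_translated)
    ]


# Toy predictions for three reviews (made-up values, not results from the paper).
combined = combine(
    ["pos", "neg", "pos"],
    ["pos", "pos", "neg"],
    ["neg", "neg", "neg"],
)
print(combined)
```

In this sketch each review receives the label that at least two of the three models agree on, so a single model's error is outvoted when the other two are correct.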