A Multi-Classifier System for Sentiment Analysis and Opinion Mining

  • Authors:
  • Luana Bezerra Batista;Sylvie Ratte

  • Affiliations:
  • -;-

  • Venue:
  • ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Although successfully employed to reduce error rates of difficult pattern recognition problems, multi-classifier systems (MCS) are not in widespread use in the field of Sentiment Analysis and Opinion Mining. The motivation of using a MCS stems from the fact that different classifiers usually make different errors on different samples. By using just the best classifier, it is possible to loose valuable information contained in the other sub optimal classifiers. In this work, we take advantage of unigrams, big rams and trig rams to design a multi-classifier system for Sentiment Analysis and Opinion Mining. Three different Naive Bayes classifiers are trained--each one with a specific set of features-- , and then combined in the ROC space by using the Iterative Boolean Combination (IBC) technique. IBC iteratively combines the ROC curves produced by different classifiers using all Boolean functions, and does not require prior assumption that the classifiers are statistically independent. An experimental study investigates the advantage of using the proposed MCS, over each individual classifier, in classifying Twitter messages as positive or negative. The Stanford University's Twitter database is employed for this task. As real-world application, the proposed MCS is used to identify the sentiment of electors regarding the main candidates for the 2012 United States Presidential Elections. Results indicate that the proposed MCS can provide useful information about people's opinions that are comparable to conventional opinion polls.