Sentiment classification of web review using association rules

  • Authors:
  • Man Yuan;Yuanxin Ouyang;Zhang Xiong;Hao Sheng

  • Affiliations:
  • School of Computer Science and Technology, Beihang University, Beijing, P.R. China,Research Institute of Beihang University in Shenzhen, Shenzhen, P.R. China;School of Computer Science and Technology, Beihang University, Beijing, P.R. China,Research Institute of Beihang University in Shenzhen, Shenzhen, P.R. China;School of Computer Science and Technology, Beihang University, Beijing, P.R. China,Research Institute of Beihang University in Shenzhen, Shenzhen, P.R. China;School of Computer Science and Technology, Beihang University, Beijing, P.R. China,Research Institute of Beihang University in Shenzhen, Shenzhen, P.R. China

  • Venue:
  • OCSC'13 Proceedings of the 5th international conference on Online Communities and Social Computing
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Sentiment Classification of web reviews or comments is an important and challenging task in Web Mining and Data Mining. This paper presents a novel approach using association rules for sentiment classification of web reviews. A new restraint measure AD-Sup is used to extract discriminative frequent term sets and eliminate terms with no sentiment orientation which contain close frequency in both positive and negative reviews. An optimal classification rule set is then generated which abandons the redundant general rule with lower confidence than the specific one. In the class label prediction procedure, we proposed a new metric voting scheme to solve the problem when the covered rules are not adequately confident or not applicable. The final score of a test review depends on the overall contributions of four metrics. Extensive experiments on multiple domain datasets from web site demonstrate that 50% is the best min-conf to guarantee classification rules both abundant and persuasive and the voting strategy obtains improvements on other baselines of using confidence. Another comparison to popular machine learning algorithms such as SVM, Naïve Bayes and kNN also indicates that the proposed method outperforms these strong benchmarks.