Sentiment classification of Chinese online reviews: analysing and improving supervised machine learning

  • Authors:
  • Pei Yin;Hongwei Wang;Lijuan Zheng

  • Affiliations:
  • Department of Management Science and Engineering, School of Economics and Management, Tongji University, 1239 Siping Road, Shanghai 200092, China.;Department of Management Science and Engineering, School of Economics and Management, Tongji University, 1239 Siping Road, Shanghai 200092, China.;Department of Management Science and Engineering, School of Economics and Management, Tongji University, 1239 Siping Road, Shanghai 200092, China

  • Venue:
  • International Journal of Web Engineering and Technology
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

With the boost of online reviews, a large quantity of consumers' opinions on certain products and services are generated and spread over the internet, thus techniques of sentiment classification for online reviews rise in response to the requirement of retrieving valuable information. This paper is mainly focused on improving sentiment classification of Chinese online reviews through analysing and improving each step in supervised machine learning. At first, adjectives, adverbs, and verbs are selected as the initial text features. Then, three statistic methods (DF, IG and CHI) are utilised to extract features. At last, a Boolean method is applied to set weight to features and a support vector machine (SVM) is employed as the classifier. Several comparative experiments have been conducted on reviews of two domains: mobile phone (product) reviews and hotel (service) reviews. The experimental results indicate that part of speech (POS), the number of features, evaluation domain, feature extraction algorithm and kernel function of SVM have great influences on sentiment classification, while the number of training corpora has a little impact. In addition, further improvements of DF IG and CHI have been made, which demonstrate the theoretical significance and the practical value of this research.