Sentiment classification of Internet restaurant reviews written in Cantonese

Authors:
Ziqiong Zhang;Qiang Ye;Zili Zhang;Yijun Li
Affiliations:
Dept. of Management Science & Engineering, Harbin Institute of Technology, Harbin 150001, China;Dept. of Management Science & Engineering, Harbin Institute of Technology, Harbin 150001, China;Dept. of Management Science & Engineering, Harbin Institute of Technology, Harbin 150001, China;Dept. of Management Science & Engineering, Harbin Institute of Technology, Harbin 150001, China
Venue:
Expert Systems with Applications: An International Journal
Year:
2011

Citing 12
Cited 4

A re-examination of text categorization methods

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

ECML '98 Proceedings of the 10th European Conference on Machine Learning
Mining the peanut gallery: opinion extraction and semantic classification of product reviews

WWW '03 Proceedings of the 12th international conference on World Wide Web
The Digitization of Word of Mouth: Promise and Challenges of Online Feedback Mechanisms

Management Science
Predicting the semantic orientation of adjectives

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Opinion observer: analyzing and comparing opinions on the Web

WWW '05 Proceedings of the 14th international conference on World Wide Web
Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Thumbs up?: sentiment classification using machine learning techniques

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Sentiment classification of online reviews to travel destinations by supervised machine learning approaches

Expert Systems with Applications: An International Journal
Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web

Management Science
Using emoticons to reduce dependency in machine learning techniques for sentiment classification

ACLstudent '05 Proceedings of the ACL Student Research Workshop
A system for summarizing and visualizing arguments in subjective documents: toward supporting decision making

SST '06 Proceedings of the Workshop on Sentiment and Subjectivity in Text

Weakness Finder: Find product weakness from Chinese reviews by using aspects based sentiment analysis

Expert Systems with Applications: An International Journal
A comparative study of feature selection and machine learning techniques for sentiment analysis

Proceedings of the 2012 ACM Research in Applied Computation Symposium
Sentiment classification of Chinese online reviews: analysing and improving supervised machine learning

International Journal of Web Engineering and Technology
RETRACTED: Sentiment Analysis in Decision Sciences Research: An Illustration to IT Governance

Decision Support Systems

Quantified Score

Hi-index	12.05

Visualization

Abstract

Cantonese is an important dialect in some regions of Southern China. Local online users often represent their opinions and experiences on the web with written Cantonese. Although the information in those reviews is valuable to potential consumers and sellers, the huge amount of web reviews make it difficult to give an unbiased evaluation to a product and the Cantonese reviews are unintelligible for Mandarin Chinese speakers. In this paper, standard machine learning techniques naive Bayes and SVM are incorporated into the domain of online Cantonese-written restaurant reviews to automatically classify user reviews as positive or negative. The effects of feature presentations and feature sizes on classification performance are discussed. We find that accuracy is influenced by interaction between the classification models and the feature options. The naive Bayes classifier achieves as well as or better accuracy than SVM. Character-based bigrams are proved better features than unigrams and trigrams in capturing Cantonese sentiment orientation.