Sentiment classification: The contribution of ensemble learning

Authors:
Gang Wang;Jianshan Sun;Jian Ma;Kaiquan Xu;Jibao Gu
Affiliations:
School of Management, Hefei University of Technology, Hefei, Anhui 230009, PR China and Key Laboratory of Process Optimization and Intelligent Decision-making, Ministry of Education, Hefei, Anhui, ...;Department of Information Systems, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong and School of Management, University of Science and Technology of China, Hefei, Anhui, PR China;Department of Information Systems, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong;Department of Electronic Commerce, School of Business, Nanjing University, Nanjing, Jiangsu 210093, PR China;School of Management, University of Science and Technology of China, Hefei, Anhui, PR China
Venue:
Decision Support Systems
Year:
2014

Citing 38
Cited 0

The Strength of Weak Learnability

Machine Learning
C4.5: programs for machine learning

C4.5: programs for machine learning
The nature of statistical learning theory

The nature of statistical learning theory
Bagging predictors

Machine Learning
The Random Subspace Method for Constructing Decision Forests

IEEE Transactions on Pattern Analysis and Machine Intelligence
MultiBoosting: A Technique for Combining Boosting and Wagging

Machine Learning
An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants

Machine Learning
Neural Network Ensembles

IEEE Transactions on Pattern Analysis and Machine Intelligence
Mining the peanut gallery: opinion extraction and semantic classification of product reviews

WWW '03 Proceedings of the 12th international conference on World Wide Web
An extensive empirical study of feature selection metrics for text classification

The Journal of Machine Learning Research
Sentiment Analyzer: Extracting Sentiments about a Given Topic using Natural Language Processing Techniques

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Predicting the semantic orientation of adjectives

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Thumbs up?: sentiment classification using machine learning techniques

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Determining the sentiment of opinions

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Statistical Comparisons of Classifiers over Multiple Data Sets

The Journal of Machine Learning Research
Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums

ACM Transactions on Information Systems (TOIS)
Affect Analysis of Web Forums and Blogs Using Correlation Ensembles

IEEE Transactions on Knowledge and Data Engineering
AVA: Adjective-Verb-Adverb Combinations for Sentiment Analysis

IEEE Intelligent Systems
Opinion Mining and Sentiment Analysis

Foundations and Trends in Information Retrieval
Feature selection for text classification with Naïve Bayes

Expert Systems with Applications: An International Journal
Building a General Purpose Cross-Domain Sentiment Mining Model

CSIE '09 Proceedings of the 2009 WRI World Congress on Computer Science and Information Engineering - Volume 04
A machine learning approach to sentiment analysis in multilingual Web texts

Information Retrieval
Constructing ensembles of classifiers by means of weighted instance selection

IEEE Transactions on Neural Networks
Sentiment analysis of Chinese documents: From sentence to document level

Journal of the American Society for Information Science and Technology
Encyclopedia of Database Systems

Encyclopedia of Database Systems
A Lexicon-Enhanced Method for Sentiment Classification: An Experiment on Online Product Reviews

IEEE Intelligent Systems
Document sentiment classification by exploring description model of topical terms

Computer Speech and Language
Aspect-based sentiment analysis of movie reviews on discussion boards

Journal of Information Science
Ensemble of feature sets and classification algorithms for sentiment classification

Information Sciences: an International Journal
Data Mining: Practical Machine Learning Tools and Techniques

Data Mining: Practical Machine Learning Tools and Techniques
Sentiment classification: a lexical similarity based approach for extracting subjectivity in documents

Information Retrieval
Nearest neighbor pattern classification

IEEE Transactions on Information Theory
Special Issue on Social Media Analytics: Understanding the Pulse of the Society

IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
On developing robust models for favourability analysis: Model choice, feature sets and imbalanced data

Decision Support Systems
Ensemble Methods: Foundations and Algorithms

Ensemble Methods: Foundations and Algorithms
Ensemble learning for sentiment classification

CLSW'12 Proceedings of the 13th Chinese conference on Chinese Lexical Semantics

Quantified Score

Hi-index	0.00

Visualization

Abstract

With the rapid development of information technologies, user-generated contents can be conveniently posted online. While individuals, businesses, and governments are interested in evaluating the sentiments behind this content, there are no consistent conclusions on which sentiment classification technologies are best. Recent studies suggest that ensemble learning methods may have potential applicability in sentiment classification. In this study, we conduct a comparative assessment of the performance of three popular ensemble methods (Bagging, Boosting, and Random Subspace) based on five base learners (Naive Bayes, Maximum Entropy, Decision Tree, K Nearest Neighbor, and Support Vector Machine) for sentiment classification. Moreover, ten public sentiment analysis datasets were investigated to verify the effectiveness of ensemble learning for sentiment analysis. Based on a total of 1200 comparative group experiments, empirical results reveal that ensemble methods substantially improve the performance of individual base learners for sentiment classification. Among the three ensemble methods, Random Subspace has the better comparative results, although it was seldom discussed in the literature. These results illustrate that ensemble learning methods can be used as a viable method for sentiment classification.