A feature selection method based on improved fisher's discriminant ratio for text sentiment classification

Authors:
Suge Wang;Deyu Li;Xiaolei Song;Yingjie Wei;Hongxia Li
Affiliations:
School of Computer and Information Technology, Shanxi University, Taiyuan, 030006 Shanxi, China and Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of E ...;Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education, Taiyuan, 030006 Shanxi, China and School of Mathematics Science, Shanxi University, Taiyua ...;School of Computer and Information Technology, Shanxi University, Taiyuan, 030006 Shanxi, China;Science Press, 100717 Beijing, China;School of Computer and Information Technology, Shanxi University, Taiyuan, 030006 Shanxi, China
Venue:
Expert Systems with Applications: An International Journal
Year:
2011

Citing 19
Cited 6

A Comparative Study on Feature Selection in Text Categorization

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Learning Subjective Adjectives from Corpora

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Measuring praise and criticism: Inference of semantic orientation from association

ACM Transactions on Information Systems (TOIS)
Sentiment Analyzer: Extracting Sentiments about a Given Topic using Natural Language Processing Techniques

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Effects of adjective orientation and gradability on sentence subjectivity

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Movie Review Mining: a Comparison between Supervised and Unsupervised Classification Approaches

HICSS '05 Proceedings of the Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS'05) - Track 4 - Volume 04
Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Learning Subjective Language

Computational Linguistics
Thumbs up?: sentiment classification using machine learning techniques

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Learning subjective nouns using extraction pattern bootstrapping

CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Sentiment classification on customer feedback data: noisy data, large feature vectors, and the role of linguistic analysis

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Recognizing contextual polarity in phrase-level sentiment analysis

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
A Hybrid Method of Feature Selection for Chinese Text Sentiment Classification

FSKD '07 Proceedings of the Fourth International Conference on Fuzzy Systems and Knowledge Discovery - Volume 03
An empirical study of sentiment analysis for chinese documents

Expert Systems with Applications: An International Journal
Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums

ACM Transactions on Information Systems (TOIS)
Sentiment classification of online reviews to travel destinations by supervised machine learning approaches

Expert Systems with Applications: An International Journal
Feature subsumption for opinion analysis

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Affect analysis of text using fuzzy semantic typing

IEEE Transactions on Fuzzy Systems

Comparison of term frequency and document frequency based feature selection metrics in text categorization

Expert Systems with Applications: An International Journal
Automatic foldering of email messages: a combination approach

ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
Sample cutting method for imbalanced text sentiment classification based on BRC

Knowledge-Based Systems
A comparative study of feature selection and machine learning techniques for sentiment analysis

Proceedings of the 2012 ACM Research in Applied Computation Symposium
Projected-prototype based classifier for text categorization

Knowledge-Based Systems
Sentiment visualization and classification via semi-supervised nonlinear dimensionality reduction

Pattern Recognition

Quantified Score

Hi-index	12.05

Visualization

Abstract

Owing to its openness, virtualization and sharing criterion, the Internet has been rapidly becoming a platform for people to express their opinion, attitude, feeling and emotion. As the subjectivity texts are often too many for people to go through, how to automatically classify them into different sentiment orientation categories (e.g. positive/negative) has become an important research problem. In this paper, based on Fisher's discriminant ratio, an effective feature selection method is proposed for subjectivity text sentiment classification. In order to validate the proposed method, we compared it with the method based on Information Gain while Support Vector Machine is adopted as the classifier. Two experiments are conducted by combining different feature selection methods with two kinds of candidate feature sets. Under 2739 subjectivity documents of COAE2008s and 1006 car-related subjectivity documents, the experimental results indicate that the Fisher's discriminant ratio based on word frequency estimation has the best performance respectively with accuracy 86.61% and 82.80% under two corpus while the candidate features are the words which appear in both positive and negative texts.