If you are happy and you know it... tweet

Authors:
Amir Asiaee T.;Mariano Tepper;Arindam Banerjee;Guillermo Sapiro
Affiliations:
University of Minnesota, Minneapolis, MN, USA;Duke University, Durham, NC, USA;University of Minnesota, Minneapolis, MN, USA;Duke University, Durham, NC, USA
Venue:
Proceedings of the 21st ACM international conference on Information and knowledge management
Year:
2012

Citing 7
Cited 0

Text Classification from Labeled and Unlabeled Documents using EM

Machine Learning - Special issue on information retrieval
Opinion Mining and Sentiment Analysis

Foundations and Trends in Information Retrieval
Feature hashing for large scale multitask learning

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
From Sparse Solutions of Systems of Equations to Sparse Modeling of Signals and Images

SIAM Review
Robust sentiment detection on Twitter from biased and noisy data

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Target-dependent Twitter sentiment classification

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Sentiment analysis of Twitter data

LSM '11 Proceedings of the Workshop on Languages in Social Media

Quantified Score

Hi-index	0.00

Visualization

Abstract

Extracting sentiment from Twitter data is one of the fundamental problems in social media analytics. Twitter's length constraint renders determining the positive/negative sentiment of a tweet difficult, even for a human judge. In this work we present a general framework for per-tweet (in contrast with batches of tweets) sentiment analysis which consists of: (1) extracting tweets about a desired target subject, (2) separating tweets with sentiment, and (3) setting apart positive from negative tweets. For each step, we study the performance of a number of classical and new machine learning algorithms. We also show that the intrinsic sparsity of tweets allows performing classification in a low dimensional space, via random projections, without losing accuracy. In addition, we present weighted variants of all employed algorithms, exploiting the available labeling uncertainty, which further improve classification accuracy. Finally, we show that spatially aggregating our per-tweet classification results produces a very satisfactory outcome, making our approach a good candidate for batch tweet sentiment analysis.