Reading tweeting minds: real-time analysis of short text for computational social science

Authors:
Zhe Wang;Daniele Quercia;Diarmuid Ó Séaghdha
Affiliations:
University of Cambridge, United Kingdom;Yahoo! Research, Barcelona Spain;University of Cambridge, United Kingdom
Venue:
Proceedings of the 24th ACM Conference on Hypertext and Social Media
Year:
2013

Citing 3
Cited 0

Tweets from Justin Bieber's heart: the dynamics of the location field in user profiles

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Privacy dictionary: a linguistic taxonomy of privacy for content analysis

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Blogs as a collective war diary

Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work

Quantified Score

Hi-index	0.00

Visualization

Abstract

Twitter status updates (tweets) have great potential for unobtrusive analysis of users' perceptions in real time, providing a way of investigating social patterns at scale. Here we present a tool that performs textual analysis of tweets mentioning a topic of interest and outputs words statistically associated with it in the form of word lists and word graphs. Such a tool could be of value for helping social scientists to navigate the overwhelming amounts of data that are produced on Twitter. To evaluate our tool, we select three concepts of interest to social scientists (i.e., privacy, serendipity, and Occupy Wall Street), build ground truths for each concept using the Grounded Theory approach, and perform a quantitative assessment based on two widely-used information retrieval metrics. To then offer qualitative assessments complementary to the quantitative ones, we run a user study involving 32 individuals. We find that simple information-theoretic association measures are more accurate than frequency-based measures. We also spell out under which conditions these metrics tend to work best.