#Santiago is not #Chile, or is it?: a model to normalize social media impact

  • Authors:
  • Eduardo Graells-Garrido;Bárbara Poblete

  • Affiliations:
  • Universitat Pompeu Fabra, Barcelona, Spain;Universidad de Chile, Santiago, Chile

  • Venue:
  • Proceedings of the 2013 Chilean Conference on Human - Computer Interaction
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Online social networks are known to be demographically biased. Currently there are questions about what degree of representativity of the physical population they have, and how population biases impact user-generated content. In this paper we focus on centralism, a problem affecting Chile. Assuming that local differences exist in a country, in terms of vocabulary, we built a methodology based on the vector space model to find distinctive content from different locations, and used it to create classifiers to predict whether the content of a micro-post is related to a particular location, having in mind a geographically diverse selection of micro-posts. We evaluate them in a case study where we analyze the virtual population of Chile that participated in the Twitter social network during an event of national relevance: the municipal (local governments) elections held in 2012. We observe that the participating virtual population is spatially representative of the physical population, implying that there is centralism in Twitter. Our classifiers out-perform a non geographically-diverse baseline at the regional level, and have the same accuracy at a provincial level. However, our approach makes assumptions that need to be tested in multi-thematic and more general datasets. We leave this for future work.