Discovering geographical topics in the twitter stream

Authors:
Liangjie Hong;Amr Ahmed;Siva Gurumurthy;Alexander J. Smola;Kostas Tsioutsiouliklis
Affiliations:
Lehigh University, Bethlehem, PA, USA;Yahoo! Research, Sunnyvale, CA, USA;Twitter, San Francisco, CA, USA;Yahoo! Research, Sunnyvale, CA, USA;Twitter, San Francisco, CA, USA
Venue:
Proceedings of the 21st international conference on World Wide Web
Year:
2012

Citing 13
Cited 26

Unsupervised learning by probabilistic latent semantic analysis

Machine Learning
A probabilistic approach to spatiotemporal theme pattern mining on weblogs

Proceedings of the 15th international conference on World Wide Web
Topic modeling: beyond bag-of-words

ICML '06 Proceedings of the 23rd international conference on Machine learning
Mining geographic knowledge using location aware topic model

Proceedings of the 4th ACM workshop on Geographical information retrieval
Structured correspondence topic models for mining captioned figures in biological literature

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems

SIAM Journal on Imaging Sciences
A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems

SIAM Journal on Imaging Sciences
GeoFolk: latent spatial semantics in web 2.0 social media

Proceedings of the third ACM international conference on Web search and data mining
Equip tourists with knowledge mined from travelogues

Proceedings of the 19th international conference on World wide web
A latent variable model for geographic lexical variation

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Geographical topic discovery and comparison

Proceedings of the 20th international conference on World wide web
Simple supervised document geolocation with geodesic grids

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Friendship and mobility: user movement in location-based social networks

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining

Latent geographic feature extraction from social media

Proceedings of the 20th International Conference on Advances in Geographic Information Systems
Location Comparison through Geographical Topics

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Sumblr: continuous summarization of evolving tweet streams

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
A location-based news article recommendation with explicit localized semantic analysis

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Mining web search topics with diverse spatiotemporal patterns

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Who, where, when and what: discover spatio-temporal topics for twitter users

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Learning geographical preferences for point-of-interest recommendation

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
One theme in all views: modeling consensus topics in multiple contexts

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Leveraging geographical metadata to improve search over social media

Proceedings of the 22nd international conference on World Wide Web companion
Hierarchical geographical modeling of user locations from social media posts

Proceedings of the 22nd international conference on World Wide Web
A novel method for geographical social event detection in social media

Proceedings of the Fifth International Conference on Internet Multimedia Computing and Service
Discovering hot topics using Twitter streaming data: social topic detection and geographic clustering

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Robust detection of hyper-local events from geotagged social media data

Proceedings of the Thirteenth International Workshop on Multimedia Data Mining
An architecture for detecting events in real-time using massive heterogeneous data sources

Proceedings of the 2nd International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
How the live web feels about events

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Exploring venue-based city-to-city similarity measures

Proceedings of the 2nd ACM SIGKDD International Workshop on Urban Computing
Spatial topic modeling in online social media for location recommendation

Proceedings of the 7th ACM conference on Recommender systems
A unified generative model for characterizing microblogs' topics

WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management
A probablistic model for spatio-temporal signal extraction from social media

Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Spatio-temporal characteristics of bursty words in Twitter streams

Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Inferring the origin locations of tweets with quantitative confidence

Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing
Detecting non-gaussian geographical topics in tagged photo collections

Proceedings of the 7th ACM international conference on Web search and data mining
A few good predictions: selective node labeling in a social network

Proceedings of the 7th ACM international conference on Web search and data mining
Fast topic discovery from web search streams

Proceedings of the 23rd international conference on World wide web
CoBaFi: collaborative bayesian filtering

Proceedings of the 23rd international conference on World wide web
Activity-based topic discovery

Web Intelligence and Agent Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Micro-blogging services have become indispensable communication tools for online users for disseminating breaking news, eyewitness accounts, individual expression, and protest groups. Recently, Twitter, along with other online social networking services such as Foursquare, Gowalla, Facebook and Yelp, have started supporting location services in their messages, either explicitly, by letting users choose their places, or implicitly, by enabling geo-tagging, which is to associate messages with latitudes and longitudes. This functionality allows researchers to address an exciting set of questions: 1) How is information created and shared across geographical locations, 2) How do spatial and linguistic characteristics of people vary across regions, and 3) How to model human mobility. Although many attempts have been made for tackling these problems, previous methods are either complicated to be implemented or oversimplified that cannot yield reasonable performance. It is a challenge task to discover topics and identify users' interests from these geo-tagged messages due to the sheer amount of data and diversity of language variations used on these location sharing services. In this paper we focus on Twitter and present an algorithm by modeling diversity in tweets based on topical diversity, geographical diversity, and an interest distribution of the user. Furthermore, we take the Markovian nature of a user's location into account. Our model exploits sparse factorial coding of the attributes, thus allowing us to deal with a large and diverse set of covariates efficiently. Our approach is vital for applications such as user profiling, content recommendation and topic tracking. We show high accuracy in location estimation based on our model. Moreover, the algorithm identifies interesting topics based on location and language.