Transient crowd discovery on the real-time social web

Authors:
Krishna Yeshwanth Kamath;James Caverlee
Affiliations:
Texas A&M University, College Station, TX, USA;Texas A&M University, College Station, TX, USA
Venue:
Proceedings of the fourth ACM international conference on Web search and data mining
Year:
2011

Abstract

In this paper, we study the problem of automatically discovering and tracking transient crowds in highly dynamic social messaging systems like Twitter and Facebook. Unlike the more static and long-lived group-based membership offered on many social networks (e.g., fan of the LA Lakers), a transient crowd is a short-lived ad-hoc collection of users, representing a "hotspot" on the real-time web. Successful detection of these hotspots can positively impact related research directions in online event detection, content personalization, social information discovery, etc. Concretely, we propose to model crowd formation and dispersion through a message-based communication clustering approach over time-evolving graphs that captures the natural conversational nature of social messaging systems. Two of the salient features of the proposed approach are (i) an efficient locality-based clustering approach for identifying crowds of users in near real-time compared to more heavyweight static clustering algorithms; and (ii) a novel crowd tracking and evolution approach for linking crowds across time periods. We find that the locality-based clustering approach results in empirically high-quality clusters relative to static graph clustering techniques at a fraction of the computational cost. Based on a three-month snapshot of Twitter consisting of 711,612 users and 61.3 million messages, we show how the proposed approach can successfully identify and track interesting crowds based on the Twitter communication structure and uncover crowd-based topics of interest.
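
The abstract describes two steps: clustering users by message exchange within a time period, and linking the resulting crowds across periods. The sketch below illustrates that general idea only and is not the authors' algorithm. It assumes messages are (sender, receiver, timestamp) tuples, swaps in plain connected components over frequently used edges for the paper's locality-based clustering, and links crowds by Jaccard overlap. All function names and thresholds (`build_graph`, `find_crowds`, `link_crowds`, `min_weight`, `min_jaccard`) are hypothetical.

```python
from collections import defaultdict


def build_graph(messages, window_start, window_end):
    """Count messages exchanged by each user pair inside one time window."""
    weights = defaultdict(int)
    for sender, receiver, t in messages:
        if window_start <= t < window_end and sender != receiver:
            weights[frozenset((sender, receiver))] += 1
    return weights


def find_crowds(weights, min_weight=2):
    """Stand-in clustering (an assumption, not the paper's method):
    connected components over edges with at least min_weight messages."""
    adj = defaultdict(set)
    for pair, w in weights.items():
        if w >= min_weight:
            a, b = tuple(pair)
            adj[a].add(b)
            adj[b].add(a)
    seen, crowds = set(), []
    for start in adj:
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            user = stack.pop()
            if user in seen:
                continue
            seen.add(user)
            component.add(user)
            stack.extend(adj[user] - seen)
        crowds.append(frozenset(component))
    return crowds


def link_crowds(prev_crowds, curr_crowds, min_jaccard=0.5):
    """Link each current crowd to its best-overlapping previous crowd."""
    links = []
    for curr in curr_crowds:
        best, best_score = None, 0.0
        for prev in prev_crowds:
            score = len(curr & prev) / len(curr | prev)
            if score > best_score:
                best, best_score = prev, score
        if best is not None and best_score >= min_jaccard:
            links.append((best, curr, best_score))
    return links
```

In this sketch a crowd that finds no sufficiently similar predecessor is treated as newly formed, and a previous crowd with no successor is treated as dispersed. Re-clustering every window from scratch is the simplest choice here; the abstract's emphasis on near real-time operation suggests the actual approach avoids that cost.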