Making vast amounts of online social media data comprehensible to an analyst is a key question in operational analytics. Twitter and micro-blog conversations can easily be gathered from Internet services such as Spinn3r to create graphs, containing billions of vertices and tens of billions of edges, that represent the interactions between the entities in an online community. Graphs of this size can easily be represented in a modern laptop or workstation. The challenge lies in making them comprehensible. This paper focuses on methods to assemble social network graphs from online social media to reveal nodes that are ‘interesting’ in the context of operational analysis, meaning that the computational results can be interpreted by a human analyst wishing to answer operational questions. Only metrics based on the structure of the graph are used, which avoids the challenges and costs involved in message content analysis. We further restrict ourselves to metrics that are computationally tractable on billion-node graphs. The reported results demonstrate that nodes with a high impact or disproportionately large agency on the whole network (e.g., an online community) can be found in a variety of online communities. We discuss validation of the importance of these high-agency nodes by human and computational methods, and report the efficacy of our approach as assessed by both quantitative methods and tests against the null hypothesis. © 2012 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 5: 205–217, 2012