An in-depth analysis of stochastic Kronecker graphs

Authors:
C. Seshadhri;Ali Pinar;Tamara G. Kolda
Affiliations:
Sandia National Laboratories, Livermore, CA;Sandia National Laboratories, Livermore, CA;Sandia National Laboratories, Livermore, CA
Venue:
Journal of the ACM (JACM)
Year:
2013

Abstract

Graph analysis is playing an increasingly important role in science and industry. Because real-world graphs often cannot be shared, models for generating massive graphs are critical for developing better algorithms. In this article, we analyze the stochastic Kronecker graph (SKG) model, which is the foundation of the Graph500 supercomputer benchmark due to its favorable properties and easy parallelization. Our goal is to provide a deeper understanding of the parameters and properties of this model so that it becomes more useful as a benchmark. We develop a rigorous mathematical analysis that shows this model cannot generate a power-law degree distribution or even a lognormal distribution. However, we formalize an enhanced version of the SKG model that uses random noise for smoothing. We show, both in theory and in practice, that this enhancement leads to a lognormal distribution. Additionally, we provide a precise analysis of isolated vertices, showing that the graphs produced by SKG might be quite different from what was intended. For example, between 50% and 75% of the vertices in the Graph500 benchmarks will be isolated. Finally, we show that, for common parameter choices, this model tends to produce extremely small core numbers compared to most social networks and other real graphs.
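The isolated-vertex claim can be explored empirically with a small simulation. The following is a minimal sketch, not the authors' implementation: it assumes the commonly used edge-by-edge sampling procedure for SKG with a 2x2 initiator matrix, in which each edge picks one quadrant per level. The function names `skg_edges` and `isolated_fraction` are hypothetical, and the initiator values (0.57, 0.19, 0.19, 0.05) and edge factor of 16 are the values usually quoted for Graph500, not figures taken from the abstract.

```python
import random


def skg_edges(initiator, levels, num_edges, seed=0):
    """Sample edges of a stochastic Kronecker graph on 2**levels vertices.

    Assumption: each edge is placed independently by choosing one quadrant
    of the 2x2 initiator per level, with probability proportional to the
    entries of the initiator. Duplicate edges are not removed.
    """
    rng = random.Random(seed)
    (a, b), (c, d) = initiator
    weights = [a, b, c, d]
    quadrants = [(0, 0), (0, 1), (1, 0), (1, 1)]
    edges = []
    for _ in range(num_edges):
        row = col = 0
        for _ in range(levels):
            # One (row bit, column bit) pair per Kronecker level.
            r_bit, c_bit = rng.choices(quadrants, weights=weights)[0]
            row = 2 * row + r_bit
            col = 2 * col + c_bit
        edges.append((row, col))
    return edges


def isolated_fraction(edges, levels):
    """Fraction of the 2**levels vertices with no incident non-loop edge."""
    n = 1 << levels
    touched = set()
    for u, v in edges:
        if u != v:  # a self-loop does not connect a vertex to anything else
            touched.add(u)
            touched.add(v)
    return 1 - len(touched) / n


if __name__ == "__main__":
    # Assumed Graph500-style parameters (not stated in the abstract).
    initiator = ((0.57, 0.19), (0.19, 0.05))
    levels = 14
    edges = skg_edges(initiator, levels, 16 * (1 << levels), seed=1)
    print(f"isolated fraction at {levels} levels: "
          f"{isolated_fraction(edges, levels):.3f}")
```

Running the sketch for increasing values of `levels` gives a rough sense of how the isolated fraction changes with graph size. The range reported in the abstract comes from the authors' analysis, not from this simulation.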