The little engine(s) that could: scaling online social networks

Authors:
Josep M. Pujol;Vijay Erramilli;Georgos Siganos;Xiaoyuan Yang;Nikolaos Laoutaris;Parminder Chhabra;Pablo Rodriguez
Affiliations:
3scale, Barcelona, Spain and Telefonica Research, Barcelona, Spain;Telefonica Research, Barcelona, Spain;Telefonica Research, Barcelona, Spain;Telefonica Research, Barcelona, Spain;Telefonica Research, Barcelona, Spain;Telefonica Research, Barcelona, Spain;Telefonica Research, Barcelona, Spain
Venue:
IEEE/ACM Transactions on Networking (TON)
Year:
2012

Citing 16
Cited 3

Coda: A Highly Available File System for a Distributed Workstation Environment

IEEE Transactions on Computers
Managing update conflicts in Bayou, a weakly connected replicated storage system

SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
The dangers of replication and a solution

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs

SIAM Journal on Scientific Computing
Computers and Intractability: A Guide to the Theory of NP-Completeness

Computers and Intractability: A Guide to the Theory of NP-Completeness
A Polylogarithmic Approximation of the Minimum Bisection

SIAM Journal on Computing
Approximating metrics by tree metrics

ACM SIGACT News
Farsite: federated, available, and reliable storage for an incompletely trusted environment

OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
Graph evolution: Densification and shrinking diameters

ACM Transactions on Knowledge Discovery from Data (TKDD)
Dynamo: amazon's highly available key-value store

Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles
Measurement and analysis of online social networks

Proceedings of the 7th ACM SIGCOMM conference on Internet measurement
Expander flows, geometric embeddings and graph partitioning

Journal of the ACM (JACM)
On the evolution of user interaction in Facebook

Proceedings of the 2nd ACM workshop on Online social networks
Understanding online social network usage from a network perspective

Proceedings of the 9th ACM SIGCOMM conference on Internet measurement conference
Characterizing user behavior in online social networks

Proceedings of the 9th ACM SIGCOMM conference on Internet measurement conference
Mining communities in networks: a solution for consistency and its evaluation

Proceedings of the 9th ACM SIGCOMM conference on Internet measurement conference

Mizan: a system for dynamic load balancing in large-scale graph processing

Proceedings of the 8th ACM European Conference on Computer Systems
On benchmarking online social media analytical queries

First International Workshop on Graph Data Management Experiences and Systems
Analysis of partitioning strategies for graph processing in bulk synchronous parallel models

Proceedings of the fifth international workshop on Cloud data management

Quantified Score

Hi-index	0.00

Visualization

Abstract

The difficulty of partitioning social graphs has introduced new system design challenges for scaling of online social networks (OSNs). Vertical scaling by resorting to full replication can be a costly proposition. Scaling horizontally by partitioning and distributing data among multiple servers using, for e.g., distributed hash tables (DHTs), can suffer from expensive interserver communication. Such challenges have often caused costly rearchitecting efforts for popular OSNs like Twitter and Facebook. We design, implement, and evaluate SPAR, a Social Partitioning and Replication middleware that mediates transparently between the application and the database layer of an OSN. SPAR leverages the underlying social graph structure in order to minimize the required replication overhead for ensuring that users have their neighbors' data colocated in the same machine. The gains from this aremultifold: Application developers can assume local semantics, i.e., develop as they would for a single machine; scalability is achieved by adding commodity machines with low memory and network I/O requirements; and N+K redundancy is achieved at a fraction of the cost. We provide a complete system design, extensive evaluation based on datasets from Twitter, Orkut, and Facebook, and a working implementation. We show that SPAR incurs minimum overhead, can help a well-known Twitter clone reach Twitter's scale without changing a line of its application logic, and achieves higher throughput than Cassandra, a popular key-value store database.