Cassandra: a structured storage system on a P2P network

Authors:
Avinash Lakshman;Prashant Malik
Affiliations:
Facebook Inc, Palo Alto, CA, USA;Facebook Inc, Palo Alto, CA, USA
Venue:
Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures
Year:
2009

Citing 0
Cited 3

Where event processing grand challenge meets real-time web: PLAY event marketplace

Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems
The data partition strategy based on hybrid range consistent hash in NoSQL database

Proceedings of the Fifth International Conference on Internet Multimedia Computing and Service
Low-latency multi-datacenter databases using replicated commit

Proceedings of the VLDB Endowment

Quantified Score

Hi-index	0.00

Visualization

Abstract

Cassandra is a distributed storage system for managing structured data that is designed to scale to a very large size across many commodity servers, with no single point of failure. Reliability at massive scale is a very big challenge. Outages in the service can have significant negative impact. Hence Cassandra aims to run on top of an infrastructure of hundreds of nodes (possibly spread across different datacenters). At this scale, small and large components fail continuously; the way Cassandra manages the persistent state in the face of these failures drives the reliability and scalability of the software systems relying on this service. Cassandra has achieved several goals -- scalability, high performance, high availability and applicability. In many ways Cassandra resembles a database and shares many design and implementation strategies with databases. Cassandra does not support a full relational data model; instead, it provides clients with a simple data model that supports dynamic control over data layout and format.