We describe PNUTS, a massively parallel and geographically distributed database system for Yahoo!'s web applications. PNUTS provides data storage organized as hashed or ordered tables, low latency for large numbers of concurrent requests including updates and queries, and novel per-record consistency guarantees. It is a hosted, centrally managed, and geographically distributed service, and utilizes automated load-balancing and failover to reduce operational complexity. The first version of the system is currently serving in production. We describe the motivation for PNUTS and the design and implementation of its table storage and replication layers, and then present experimental results.