Minuet: rethinking concurrency control in storage area networks

Authors:
Andrey Ermolinskiy;Daekyeong Moon;Byung-Gon Chun;Scott Shenker
Affiliations:
University of California at Berkeley;University of California at Berkeley;Intel Research Berkeley;University of California at Berkeley and ICSI
Venue:
FAST '09 Proccedings of the 7th conference on File and storage technologies
Year:
2009

Citing 19
Cited 0

Leases: an efficient fault-tolerant mechanism for distributed file cache consistency

SOSP '89 Proceedings of the twelfth ACM symposium on Operating systems principles
ARIES: a transaction recovery method supporting fine-granularity locking and partial rollbacks using write-ahead logging

ACM Transactions on Database Systems (TODS)
On optimistic methods for concurrency control

Readings in database systems (2nd ed.)
Fine-grained sharing in a page server OODBMS

SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
Impossibility of distributed consensus with one faulty process

Journal of the ACM (JACM)
Efficient optimistic concurrency control using loosely synchronized clocks

SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Transactional client-server cache consistency: alternatives and performance

ACM Transactions on Database Systems (TODS)
Adaptive, fine-grained sharing in a client-server OODBMS: a callback-based approach

ACM Transactions on Database Systems (TODS)
The part-time parliament

ACM Transactions on Computer Systems (TOCS)
A case for intelligent disks (IDISKs)

ACM SIGMOD Record
Active disks: programming model, algorithms and evaluation

Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Efficient locking for concurrent operations on B-trees

ACM Transactions on Database Systems (TODS)
Decentralized extrema-finding in circular configurations of processors

Communications of the ACM
GPFS: A Shared-Disk File System for Large Computing Clusters

FAST '02 Proceedings of the Conference on File and Storage Technologies
Simulation study of cached RAID5 designs

HPCA '95 Proceedings of the 1st IEEE Symposium on High-Performance Computer Architecture
zFS " A Scalable Distributed File System Using Object Disks

MSS '03 Proceedings of the 20 th IEEE/11 th NASA Goddard Conference on Mass Storage Systems and Technologies (MSS'03)
An integrated experimental environment for distributed systems and networks

OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
The Chubby lock service for loosely-coupled distributed systems

OSDI '06 Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation - Volume 7
Scalability and failure recovery in a linux cluster file system

ALS'00 Proceedings of the 4th annual Linux Showcase & Conference - Volume 4

Quantified Score

Hi-index	0.00

Visualization

Abstract

Clustered applications in storage area networks (SANs), widely adopted in enterprise datacenters, have traditionally relied on distributed locking protocols to coordinate concurrent access to shared storage devices. We examine the semantics of traditional lock services for SAN environments and ask whether they are sufficient to guarantee data safety at the application level. We argue that a traditional lock service design that enforces strict mutual exclusion via a globally-consistent view of locking state is neither sufficient nor strictly necessary to ensure application-level correctness in the presence of asynchrony and failures. We also argue that in many cases, strongly-consistent locking imposes an additional and unnecessary constraint on application availability. Armed with these observations, we develop a set of novel concurrency control and recovery protocols for clustered SAN applications that achieve safety and liveness in the face of arbitrary asynchrony, crash failures, and network partitions. Finally, we present and evaluate Minuet- a new synchronization primitive based on these protocols that can serve as a foundational building block for safe and highly-available SAN applications.