Serverless network file systems
SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
The Transis approach to high availability cluster communication
Communications of the ACM
Petal: distributed virtual disks
Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
Frangipani: a scalable distributed file system
Proceedings of the sixteenth ACM symposium on Operating systems principles
An evaluation of flow control in group communication
IEEE/ACM Transactions on Networking (TON)
Practical Byzantine fault tolerance
OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
Reliable Distributed Computing with the ISIS Toolkit
Reliable Distributed Computing with the ISIS Toolkit
Overview of the InterGroup Protocols
ICCS '01 Proceedings of the International Conference on Computational Sciences-Part I
Deceit: A Flexible Distributed File System
Deceit: A Flexible Distributed File System
Total order broadcast and multicast algorithms: Taxonomy and survey
ACM Computing Surveys (CSUR)
Active/Active Replication for Highly Available HPC System Services
ARES '06 Proceedings of the First International Conference on Availability, Reliability and Security
MOLAR: adaptive runtime support for high-end computing operating and runtime systems
ACM SIGOPS Operating Systems Review
A comparison of file system workloads
ATEC '00 Proceedings of the annual conference on USENIX Annual Technical Conference
PVFS: a parallel file system for linux clusters
ALS'00 Proceedings of the 4th annual Linux Showcase & Conference - Volume 4
Total order communications: a practical analysis
EDCC'05 Proceedings of the 5th European conference on Dependable Computing
Symmetric active/active metadata service for high availability parallel file systems
Journal of Parallel and Distributed Computing
Hi-index | 0.00 |
In a typical distributed storage system, metadata is stored and managed by dedicated metadata servers. One way to improve the availability of distributed storage systems is to deploy multiple metadata servers. Past research focused on the active/standby model, where each active server has at least one redundant idle backup. However, interruption of service and loss of service state may occur during a fail-over depending on the used replication technique. The research in this paper targets the symmetric active/active replication model using multiple redundant service nodes running in virtual synchrony. In this model, service node failures do not cause a fail-over to a backup and there is no disruption of service or loss of service state. We use a fast delivery protocol to reduce the latency of total order broadcast. Our prototype implementation shows that high availability of metadata servers can be achieved with an acceptable performance trade-off using the active/active metadata server solution.