The Design and Use of Persistent Memory on the DNCP Hardware Fault-Tolerant Platform

Authors:
Thomas C. Bressoud;Tom Clark;Ti Kan
Affiliations:
-;-;-
Venue:
DSN '01 Proceedings of the 2001 International Conference on Dependable Systems and Networks (formerly: FTCS)
Year:
2001

Citing 13
Cited 2

Principles of transaction-oriented database recovery

ACM Computing Surveys (CSUR)
Checkpointing and Rollback-Recovery for Distributed Systems

IEEE Transactions on Software Engineering - Special issue on distributed systems
Implementing fault-tolerant services using the state machine approach: a tutorial

ACM Computing Surveys (CSUR)
The Rio file cache: surviving operating system crashes

Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
Free transactions with Rio Vista

Proceedings of the sixteenth ACM symposium on Operating systems principles
Reliable Distributed Computing with the ISIS Toolkit

Reliable Distributed Computing with the ISIS Toolkit
Transaction Processing: Concepts and Techniques

Transaction Processing: Concepts and Techniques
The Architecture of the Dalí Main-Memory Storage Manager

Multimedia Tools and Applications
Implementation techniques for main memory database systems

SIGMOD '84 Proceedings of the 1984 ACM SIGMOD international conference on Management of data
Main Memory Database Systems: An Overview

IEEE Transactions on Knowledge and Data Engineering
Integrating reliable memory in databases

The VLDB Journal — The International Journal on Very Large Data Bases
Checkpointing and Its Applications

FTCS '95 Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing
Libckpt: transparent checkpointing under Unix

TCON'95 Proceedings of the USENIX 1995 Technical Conference Proceedings

Mnemosyne: lightweight persistent memory

Proceedings of the sixteenth international conference on Architectural support for programming languages and operating systems
Kiln: closing the performance gap between systems with and without persistence support

Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture

Quantified Score

Hi-index	0.00

Visualization

Abstract

Abstract: Systems that are designed to recover from system failure due to software faults of the operating system and/or application typically require a means of persistently storing a subset of the state of the application. Disk drives are most often used as this persistent storage, but at a performance cost incurred repeatedly during normal execution as well as again at recovery time. Academic work has pioneered the concept of using a region of conventional memory, protecting it, and making it persist across operating system crashes and reboots, and making it as reliable as disk. This can be used in place of disk to alleviate the performance penalties noted above. This paper describes a project to take these concepts and apply them in a RAM disk based realization of persistent memory (PM) as part of the Lucent DNCP hardware fault-tolerant platform and implemented for the HP-UX operating system, focusing on its use by a main-memory database system (MMDB). While we found that the reduction in recovery time was small relative to reboot time, we achieved a nearly 40% reduction in execution time for an MMDB benchmark run on PM as opposed to its normal use of disk for achieving recoverability.