Principles of transaction-oriented database recovery
ACM Computing Surveys (CSUR)
Checkpointing and Rollback-Recovery for Distributed Systems
IEEE Transactions on Software Engineering - Special issue on distributed systems
Implementing fault-tolerant services using the state machine approach: a tutorial
ACM Computing Surveys (CSUR)
The Rio file cache: surviving operating system crashes
Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
Free transactions with Rio Vista
Proceedings of the sixteenth ACM symposium on Operating systems principles
Reliable Distributed Computing with the ISIS Toolkit
Reliable Distributed Computing with the ISIS Toolkit
Transaction Processing: Concepts and Techniques
Transaction Processing: Concepts and Techniques
The Architecture of the Dalí Main-Memory Storage Manager
Multimedia Tools and Applications
Implementation techniques for main memory database systems
SIGMOD '84 Proceedings of the 1984 ACM SIGMOD international conference on Management of data
Main Memory Database Systems: An Overview
IEEE Transactions on Knowledge and Data Engineering
Integrating reliable memory in databases
The VLDB Journal — The International Journal on Very Large Data Bases
Checkpointing and Its Applications
FTCS '95 Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing
Libckpt: transparent checkpointing under Unix
TCON'95 Proceedings of the USENIX 1995 Technical Conference Proceedings
Mnemosyne: lightweight persistent memory
Proceedings of the sixteenth international conference on Architectural support for programming languages and operating systems
Kiln: closing the performance gap between systems with and without persistence support
Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture
Hi-index | 0.00 |
Abstract: Systems that are designed to recover from system failure due to software faults of the operating system and/or application typically require a means of persistently storing a subset of the state of the application. Disk drives are most often used as this persistent storage, but at a performance cost incurred repeatedly during normal execution as well as again at recovery time. Academic work has pioneered the concept of using a region of conventional memory, protecting it, and making it persist across operating system crashes and reboots, and making it as reliable as disk. This can be used in place of disk to alleviate the performance penalties noted above. This paper describes a project to take these concepts and apply them in a RAM disk based realization of persistent memory (PM) as part of the Lucent DNCP hardware fault-tolerant platform and implemented for the HP-UX operating system, focusing on its use by a main-memory database system (MMDB). While we found that the reduction in recovery time was small relative to reboot time, we achieved a nearly 40% reduction in execution time for an MMDB benchmark run on PM as opposed to its normal use of disk for achieving recoverability.