Using Rescue Points to Navigate Software Recovery

Authors:
Stelios Sidiroglou;Oren Laadan;Angelos D. Keromytis;Jason Nieh
Affiliations:
Columbia University;Columbia University;Columbia University;Columbia University
Venue:
SP '07 Proceedings of the 2007 IEEE Symposium on Security and Privacy
Year:
2007

Citing 0
Cited 7

From STEM to SEAD: speculative execution for automated defense

ATC'07 2007 USENIX Annual Technical Conference on Proceedings of the USENIX Annual Technical Conference
Enhancing storage system availability on multi-core architectures with recovery-conscious scheduling

FAST'08 Proceedings of the 6th USENIX Conference on File and Storage Technologies
Return Value Predictability Profiles for Self---healing

IWSEC '08 Proceedings of the 3rd International Workshop on Security: Advances in Information and Computer Security
A systematic approach to system state restoration during storage controller micro-recovery

FAST '09 Proccedings of the 7th conference on File and storage technologies
Automatically patching errors in deployed software

Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles
Recovery scopes, recovery groups, and fine-grained recovery in enterprise storage controllers with multi-core processors

IBM Journal of Research and Development
A role-based approach to self-healing in autonomous monitoring systems

PPAM'09 Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present a new technique that enables software recovery in legacy applications by retrofitting exception-handling capabilities, error virtualization using rescue points. We introduce the idea of "rescue points" as program locations to which an application can recover its execution in the presence of failures. The use of rescue points reduces the chance of unanticipated execution paths thereby making recovery more robust by mimicking system behavior under controlled error conditions. These controlled error conditions can be thought of as a set erroneous inputs, like the ones used by most quality-assurance teams during software development, designed to stress-test an application. To discover rescue points applications are profiled and monitored during tests that bombard the program with bad/random inputs. The intuition is that by monitoring application behavior during these runs, we gain insight into how programmer-tested program points are used to propagate faults gracefully.