Simultaneous logging and replay for recording evidences of system failures

  • Authors:
  • Shuichi Oikawa; Jin Kawasaki

  • Affiliations:
  • Department of Computer Science, University of Tsukuba, Tsukuba, Ibaraki, Japan;Department of Computer Science, University of Tsukuba, Tsukuba, Ibaraki, Japan

  • Venue:
  • SEUS'10 Proceedings of the 8th IFIP WG 10.2 international conference on Software technologies for embedded and ubiquitous systems
  • Year:
  • 2010

Abstract

As embedded systems take on more important roles in many places, it becomes more important for them to be able to provide evidence of system failures. Such evidence makes it easier to investigate the root causes of failures and to identify the responsible parties. This paper proposes simultaneous logging and replay of a system, which enables the recording of evidence of system failures. The proposed system employs two virtual machines, one for the primary execution and the other for the backup execution. The backup virtual machine maintains a past state of the primary virtual machine, along with the log needed to bring the backup to the same state as the primary. When a system failure occurs on the primary virtual machine, the virtual machine monitor (VMM) saves the backup state and the log. The saved backup state and log can be used as evidence. By replaying the backup virtual machine from the saved state following the saved log, the execution path to the failure can be completely analyzed. We developed such a logging and replay feature in a VMM. It can log and replay the execution of the Linux operating system. The experimental results show that the overhead imposed on the primary execution is only fractional.
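
The primary/backup arrangement described in the abstract can be illustrated with a toy model. The following is a minimal sketch and not the authors' VMM implementation: it assumes the guest can be modeled as a deterministic transition function whose only nondeterminism arrives as logged events, and it assumes the backup trails the primary by a fixed number of events. The names `step`, `LoggedPair`, `lag`, `save_evidence`, and `replay` are all hypothetical.

```python
from collections import deque
from dataclasses import dataclass, field


def step(state: int, event: int) -> int:
    """Hypothetical deterministic transition; all nondeterminism is in `event`."""
    return (state * 31 + event) % 1_000_003


@dataclass
class LoggedPair:
    """Toy model of a primary execution and a backup that trails it."""
    lag: int = 3                  # assumed: max events the backup may trail by
    primary: int = 0              # current state of the primary execution
    backup: int = 0               # past state of the primary, held by the backup
    log: deque = field(default_factory=deque)  # events not yet replayed on backup

    def run_primary(self, event: int) -> None:
        # Log the event first, so it is preserved even if applying it fails.
        self.log.append(event)
        self.primary = step(self.primary, event)
        # Replay the oldest events on the backup so it stays within `lag`.
        while len(self.log) > self.lag:
            self.backup = step(self.backup, self.log.popleft())

    def save_evidence(self) -> tuple[int, list[int]]:
        # On a failure, the saved backup state plus the log form the evidence.
        return self.backup, list(self.log)


def replay(saved_state: int, saved_log: list[int]) -> int:
    """Re-execute from the saved backup state by following the saved log."""
    state = saved_state
    for event in saved_log:
        state = step(state, event)
    return state
```

In this sketch, calling `run_primary` for each incoming event and then `save_evidence` at the point of failure yields a state and a log from which `replay` reconstructs the primary's final state, which mirrors the claim that the execution path to the failure can be re-examined. A real VMM would need to log far more than a stream of integers (for example, interrupt timing and device input), which this model deliberately omits.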