Transparent logging as a technique for debugging complex distributed systems

Authors:
M. Satyanarayanan;David C. Steere;Masashi Kudo;Hank Mashburn
Affiliations:
Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University Pittsburgh, PA
Venue:
EW 5 Proceedings of the 5th workshop on ACM SIGOPS European workshop: Models and paradigms for distributed systems structuring
Year:
1992

Citing 4
Cited 2

Reimplementing the Cedar file system using logging and group commit

SOSP '87 Proceedings of the eleventh ACM Symposium on Operating systems principles
Integrating security in a large distributed system

ACM Transactions on Computer Systems (TOCS)
Scalable, Secure, and Highly Available Distributed File Access

Computer
The design and implementation of a log-structured file system

ACM Transactions on Computer Systems (TOCS)

Lightweight recoverable virtual memory

SOSP '93 Proceedings of the fourteenth ACM symposium on Operating systems principles
Lightweight recoverable virtual memory

ACM Transactions on Computer Systems (TOCS) - Special issue on operating systems principles

Quantified Score

Hi-index	0.00

Visualization

Abstract

As any battle-scarred veteran will testify, debugging a distributed system in production use is an enterprise fraught with great difficulty and frustration. By the time the system is released for production use, most of the easy bugs have been found and fixed. The remaining bugs are typically non-deterministic in nature, and will only manifest themselves under conditions of heavy use. Although rare, such bugs cannot be ignored because they often have serious consequences.In this position paper, we put forth the thesis that logging is a flexible, powerful, and convenient tool for debugging complex distributed systems. We substantiate this thesis in three steps. First, we argue that logging is particularly well suited for debugging distributed systems. Next, we observe that logging is already used in distributed systems for reasons independent of debugging. Finally, we show that the latter uses of logging can be transparently extended to support debugging.