Debugging Parallel Programs with Instant Replay
IEEE Transactions on Computers
Hermes: a language for distributed computing
Hermes: a language for distributed computing
Achieving target-system independence in event visualisation
CASCON '95 Proceedings of the 1995 conference of the Centre for Advanced Studies on Collaborative research
CASCON '94 Proceedings of the 1994 conference of the Centre for Advanced Studies on Collaborative research
Integrating real-time and partial-order information in event-data displays
CASCON '94 Proceedings of the 1994 conference of the Centre for Advanced Studies on Collaborative research
The use of process clustering in distributed-system event displays
CASCON '93 Proceedings of the 1993 conference of the Centre for Advanced Studies on Collaborative research: software engineering - Volume 1
Services supporting management of distributed applications and systems
IBM Systems Journal
Hi-index | 0.00 |
Debugging distributed applications presents many challenges in addition to those found in debugging sequential applications. This paper describes a tool, and the principles underlying it, that has been developed to assist in debugging such distributed applications. Although the tool can also be applied in other environments, this paper primarily describes its application to OSF DCE. Special attention is also given to a facility that has presently been implemented only for OSF DCE, the ability to replay an application, that is, to re-execute it with execution constrained to follow the partial order of an initial execution.