A probe effect in concurrent programs
Software—Practice & Experience
On the possibility and impossibility of achieving clock synchronization
Journal of Computer and System Sciences
The rice parallel processing testbed
SIGMETRICS '88 Proceedings of the 1988 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Understanding fault-tolerant distributed systems
Communications of the ACM
PROTEUS: a high-performance parallel-architecture simulator
SIGMETRICS '92/PERFORMANCE '92 Proceedings of the 1992 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Estelle development toolset (EDT)
Computer Networks and ISDN Systems
Fault Injection and Dependability Evaluation of Fault-Tolerant Systems
IEEE Transactions on Computers
Testing of fault-tolerant and real-time distributed systems via protocol fault injection
FTCS '96 Proceedings of the The Twenty-Sixth Annual International Symposium on Fault-Tolerant Computing (FTCS '96)
Centralized Failure Injection for Distributed,Fault-Tolerant Protocol Testing
ICDCS '97 Proceedings of the 17th International Conference on Distributed Computing Systems (ICDCS '97)
Fault-tolerant external clock synchronization
ICDCS '95 Proceedings of the 15th International Conference on Distributed Computing Systems
DOCTOR: an integrated software fault injection environment for distributed real-time systems
IPDS '95 Proceedings of the International Computer Performance and Dependability Symposium on Computer Performance and Dependability Symposium
A Rigorous Approach to Fault-Tolerant Programming
IEEE Transactions on Software Engineering
Hi-index | 0.00 |
Cesium is an object-oriented environment for testing that implementations of real-time, fault-tolerant protocols satisfy the safety and timeliness properties prescribed by their specifications. Protocol implementations are tested under configurable workloads and failure scenarios. A centralized simulator executes all tasks in a single address space while providing the appearance of truly distributed execution. Experiments can be exactly reproduced any number of times; Cesium provides an unprecedented degree of monitoring and control over them. It is not necessary to instrument (or even to have access to) the source code of the protocols under test. The observed behaviors correspond exactly to executions in the real system being simulated, as Cesium does not change the time of occurrence of any event. Besides from providing a testing and performance evaluation environment superior to real distributed systems, Cesium can test properties of existing protocols that can not be tested in any distributed environment.