Optimistically synchronized parallel discrete-event simulation is based on the use of communicating sequential processes. Optimistic synchronization means that the processes proceed under the assumption that a synchronized execution schedule is fortuitous. Periodic checkpointing of the state of a process allows the process to roll back to an earlier state when synchronization errors are detected. This article examines the effects of varying the checkpoint interval on the execution time and memory space needed to perform a parallel simulation.

The empirical results presented in this article were obtained from the simulation of closed stochastic queuing networks with several different topologies. Various intraprocessor process-scheduling algorithms and both lazy and aggressive cancellation strategies are considered. The empirical results are compared with analytical formulae predicting time-optimal checkpoint intervals. Two modes of operation, throttling and thrashing, were observed and their effects examined. As the checkpoint interval is increased from one, there is a throttling effect among processes on the same processor, which improves performance. When the checkpoint interval is made too large, there is a thrashing effect caused by interaction between processes on different processors. It is shown that the time-optimal and space-optimal checkpoint intervals are not the same. Furthermore, a checkpoint interval that is too small adversely affects space more than time, whereas a checkpoint interval that is too large adversely affects time more than space.
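The trade-off behind the checkpoint interval can be illustrated with a minimal sketch. It is not the simulator used in the article: the `LogicalProcess` class, its running-sum state, and the `demo` helper are hypothetical names chosen for illustration, and the sketch assumes a single process that saves its state every `checkpoint_interval` events and, on rollback, restores the latest saved state and re-executes ("coasts forward" through) the events between that state and the straggler.

```python
import copy


class LogicalProcess:
    """Toy process (hypothetical): its state is a running sum of event values."""

    def __init__(self, checkpoint_interval):
        self.interval = checkpoint_interval
        self.state = {"total": 0}
        self.processed = []  # (timestamp, value), in timestamp order
        # Each checkpoint is (number of events executed, saved state copy).
        self.checkpoints = [(0, copy.deepcopy(self.state))]
        self.coast_forward_events = 0

    def execute(self, timestamp, value):
        self.state["total"] += value
        self.processed.append((timestamp, value))
        # Save state only after every `interval`-th event.
        if len(self.processed) % self.interval == 0:
            self.checkpoints.append(
                (len(self.processed), copy.deepcopy(self.state))
            )

    def rollback(self, straggler_time):
        """Undo every event with timestamp >= straggler_time."""
        keep = sum(1 for t, _ in self.processed if t < straggler_time)
        # Discard checkpoints taken after the rollback point.
        while self.checkpoints[-1][0] > keep:
            self.checkpoints.pop()
        done, saved = self.checkpoints[-1]
        self.state = copy.deepcopy(saved)
        replay = self.processed[done:keep]
        undone = self.processed[keep:]
        self.processed = self.processed[:done]
        # Coast forward: re-execute events between the checkpoint and the
        # rollback point. Fewer than `interval` events are replayed, so no
        # new checkpoint falls due here.
        for t, v in replay:
            self.state["total"] += v
            self.processed.append((t, v))
            self.coast_forward_events += 1
        return undone


def demo(interval):
    """Run 10 events, then roll back for a straggler at time 8."""
    lp = LogicalProcess(interval)
    for t in range(1, 11):
        lp.execute(t, t)
    saved_states = len(lp.checkpoints)
    lp.rollback(8)
    return saved_states, lp.coast_forward_events, lp.state["total"]
```

In this toy setting, `demo(1)` stores 11 states and replays no events, whereas `demo(4)` stores 3 states but replays 3 events to reach the same restored state. Small intervals cost space, large intervals cost rollback time. The sketch does not model the interprocess throttling and thrashing effects described above.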