The fuzzy barrier: a mechanism for high speed synchronization of processors
ASPLOS III Proceedings of the third international conference on Architectural support for programming languages and operating systems
Parallel discrete event simulation
Communications of the ACM - Special issue on simulation
An analytical comparison of periodic checkpointing and incremental state saving
PADS '93 Proceedings of the seventh workshop on Parallel and distributed simulation
A case study in simulating PCS networks using Time Warp
PADS '95 Proceedings of the ninth workshop on Parallel and distributed simulation
Background execution of time warp programs
PADS '96 Proceedings of the tenth workshop on Parallel and distributed simulation
Speculative parallel simulation with an adaptive throttle scheme
Proceedings of the eleventh workshop on Parallel and distributed simulation
The implementation of the Cilk-5 multithreaded language
PLDI '98 Proceedings of the ACM SIGPLAN 1998 conference on Programming language design and implementation
Replicating parallel simulation on heterogeneous clusters
Journal of Systems Architecture: the EUROMICRO Journal - Special double issue: cluster computing
Efficient Execution of Time Warp Programs on Heterogeneous, NOW Platforms
IEEE Transactions on Parallel and Distributed Systems
ROSS: a high-performance, low memory, modular time warp system
PADS '00 Proceedings of the fourteenth workshop on Parallel and distributed simulation
Efficient optimistic parallel simulations using reverse computation
ACM Transactions on Modeling and Computer Simulation (TOMACS)
Foundations of Parallel and Distributed Programming
Foundations of Parallel and Distributed Programming
On Process Migration and Load Balancing in Time Warp
IEEE Transactions on Parallel and Distributed Systems
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
System noise, OS clock ticks, and fine-grained parallel applications
Proceedings of the 19th annual international conference on Supercomputing
Controlling over-optimism in time-warp via CPU-based flow control
WSC '04 Proceedings of the 36th conference on Winter simulation
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Optimistic Synchronization of Parallel Simulations in Cloud Computing Environments
CLOUD '09 Proceedings of the 2009 IEEE International Conference on Cloud Computing
Addressing shared resource contention in multicore processors via scheduling
Proceedings of the fifteenth edition of ASPLOS on Architectural support for programming languages and operating systems
Optimization of Parallel Discrete Event Simulator for Multi-core Systems
IPDPS '12 Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium
Dynamically Adjusting Core Frequencies to Accelerate Time Warp Simulations in Many-Core Processors
PADS '12 Proceedings of the 2012 ACM/IEEE/SCS 26th Workshop on Principles of Advanced and Distributed Simulation
Towards Symmetric Multi-threaded Optimistic Simulation Kernels
PADS '12 Proceedings of the 2012 ACM/IEEE/SCS 26th Workshop on Principles of Advanced and Distributed Simulation
Partitioning on Dynamic Behavior for Parallel Discrete Event Simulation
PADS '12 Proceedings of the 2012 ACM/IEEE/SCS 26th Workshop on Principles of Advanced and Distributed Simulation
Assessing load-sharing within optimistic simulation platforms
Proceedings of the Winter Simulation Conference
Hi-index | 0.00 |
Parallel Discrete Event Simulation (PDES) harnesses the power of parallel processing to improve the performance and capacity of simulation, supporting bigger models, in more details and for more scenarios. PDES engines are typically designed and evaluated assuming a homogeneous parallel computing system that is dedicated to the simulation application. In this paper, we first show that the presence of interference from other users, even a single process in an arbitrarily large parallel environment, can lead to dramatic slowdown in the performance of the simulation. We define a new metric, which we call proportional slowdown, that represents the idealized target for graceful slowdown in the presence of interference. We identify some of the reasons why simulators fall far short of proportional slowdown. Based on these observations, we design alternative simulation scheduling and mapping algorithms that are better able to tolerate interference. More precisely, the most resilient simulators will allow dynamic mapping of simulation event execution to processing resources (a work pool model). However, this model has significant overhead and can substantially impact locality. Thus, we propose a locality-aware adaptive dynamic-mapping (LADM) algorithm for PDES on multi-core systems. LADM reduces the number of active threads in the presence of interference, avoiding having threads disabled due to context switching. We show that LADM can substantially reduce the impact of interference while maintaining memory locality reducing the gap with proportional slowdown. LADM and similar techniques can also help in situations where there is load imbalance or processor heterogeneity.