ACM Transactions on Programming Languages and Systems (TOPLAS)
Reducing Null Messages in Misra's Distributed Discrete Event Simulation Method
IEEE Transactions on Software Engineering
Design and Evaluation of the Rollback Chip: Special Purpose Hardware for Time Warp
IEEE Transactions on Computers
A generalized carrier-null method for conservative parallel simulation
PADS '94 Proceedings of the eighth workshop on Parallel and distributed simulation
Effects of the checkpoint interval on time and space in time warp
ACM Transactions on Modeling and Computer Simulation (TOMACS)
Parallel DEVS: a parallel, hierarchical, modular, modeling formalism
WSC '94 Proceedings of the 26th conference on Winter simulation
Memory management techniques for Time Warp on a distributed memory machine
PADS '95 Proceedings of the ninth workshop on Parallel and distributed simulation
Buffer management in shared-memory Time Warp systems
PADS '95 Proceedings of the ninth workshop on Parallel and distributed simulation
Performance Evaluation of Conservative Algorithms in Parallel Simulation Languages
IEEE Transactions on Parallel and Distributed Systems
A scaled version of the elastic time algorithm
Proceedings of the fifteenth workshop on Parallel and distributed simulation
Parallel and Distribution Simulation Systems
Parallel and Distribution Simulation Systems
Early cancellation: an active NIC optimization for time-warp
Proceedings of the sixteenth workshop on Parallel and distributed simulation
Theory of Modeling and Simulation
Theory of Modeling and Simulation
N-dimensional Cell-DEVS Models
Discrete Event Dynamic Systems
CD++: a toolkit to develop DEVS models
Software—Practice & Experience
The DEVS Environment for High-Performance Modeling and Simulation
IEEE Computational Science & Engineering
HPDC '97 Proceedings of the 6th IEEE International Symposium on High Performance Distributed Computing
Nonblocking Checkpointing for Optimistic Parallel Simulation: Description and an Implementation
IEEE Transactions on Parallel and Distributed Systems
Conservative synchronization of large-scale network simulations
Proceedings of the eighteenth workshop on Parallel and distributed simulation
Sequential Performance of Asynchronous Conservative PDES Algorithms
Proceedings of the 19th Workshop on Principles of Advanced and Distributed Simulation
Heterogeneous Chip Multiprocessors
Computer
Chip multiprocessing and the cell broadband engine
Proceedings of the 3rd conference on Computing frontiers
The potential of the cell processor for scientific computing
Proceedings of the 3rd conference on Computing frontiers
Performance, Power Efficiency and Scalability of Asymmetric Cluster Chip Multiprocessors
IEEE Computer Architecture Letters
Discrete-event Execution Alternatives on General Purpose Graphical Processing Units (GPGPUs)
Proceedings of the 20th Workshop on Principles of Advanced and Distributed Simulation
MPI Microtask for programming the cell broadband engineTM processor
IBM Systems Journal
Introduction to the cell multiprocessor
IBM Journal of Research and Development - POWER5 and packaging
Core architecture optimization for heterogeneous chip multiprocessors
Proceedings of the 15th international conference on Parallel architectures and compilation techniques
Parallel and distributed simulation: traditional techniques and recent advances
Proceedings of the 38th conference on Winter simulation
Dynamic multigrain parallelization on the cell broadband engine
Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming
Compilation for explicitly managed memory hierarchies
Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming
Multiprogrammed non-blocking checkpoints in support of optimistic simulation on myrinet clusters
Journal of Systems Architecture: the EUROMICRO Journal
GPU-Accelerated Evaluation Platform for High Fidelity Network Modeling
Proceedings of the 21st International Workshop on Principles of Advanced and Distributed Simulation
An Effective Strategy for Porting C++ Applications on Cell
ICPP '07 Proceedings of the 2007 International Conference on Parallel Processing
Distributed Simulation: A Case Study in Design and Verification of Distributed Programs
IEEE Transactions on Software Engineering
Executing stream joins on the cell processor
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
MEDEA '07 Proceedings of the 2007 workshop on MEmory performance: DEaling with Applications, systems and architecture
Offloading Data Distribution Management to Network Processors in HLA-Based Distributed Simulations
IEEE Transactions on Parallel and Distributed Systems
Cell broadband engine architecture and its first implementation: a performance view
IBM Journal of Research and Development
CellSs: making it easier to program the cell broadband engine processor
IBM Journal of Research and Development
The cell broadband engine: exploiting multiple levels of parallelism in a chip multiprocessor
International Journal of Parallel Programming
Scientific computing Kernels on the cell processor
International Journal of Parallel Programming
Streamware: programming general-purpose multicore processors using streams
Proceedings of the 13th international conference on Architectural support for programming languages and operating systems
Orchestrating the execution of stream programs on multicore platforms
Proceedings of the 2008 ACM SIGPLAN conference on Programming language design and implementation
Entering the petaflop era: the architecture and performance of Roadrunner
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
ICPP '08 Proceedings of the 2008 37th International Conference on Parallel Processing
PRIB '08 Proceedings of the Third IAPR International Conference on Pattern Recognition in Bioinformatics
DS-RT '08 Proceedings of the 2008 12th IEEE/ACM International Symposium on Distributed Simulation and Real-Time Applications
Scientific Programming - High Performance Computing with the Cell Broadband Engine
High-performance regular expression scanning on the Cell/B.E. processor
Proceedings of the 23rd international conference on Supercomputing
Computer generation of fast fourier transforms for the cell broadband engine
Proceedings of the 23rd international conference on Supercomputing
PADS '09 Proceedings of the 2009 ACM/IEEE/SCS 23rd Workshop on Principles of Advanced and Distributed Simulation
Hardware Supported Time Synchronization in Multi-core Architectures
PADS '09 Proceedings of the 2009 ACM/IEEE/SCS 23rd Workshop on Principles of Advanced and Distributed Simulation
Advanced risk analytics on the cell broadband engine
IPDPS '09 Proceedings of the 2009 IEEE International Symposium on Parallel&Distributed Processing
Switching to High Gear: Opportunities for Grand-Scale Real-Time Parallel Simulations
DS-RT '09 Proceedings of the 2009 13th IEEE/ACM International Symposium on Distributed Simulation and Real Time Applications
IISWC '09 Proceedings of the 2009 IEEE International Symposium on Workload Characterization (IISWC)
Modeling multigrain parallelism on heterogeneous multi-core processors: a case study of the cell BE
HiPEAC'08 Proceedings of the 3rd international conference on High performance embedded architectures and compilers
Accelerating large-scale DEVS-based simulation on the cell processor
SpringSim '10 Proceedings of the 2010 Spring Simulation Multiconference
Using genetic algorithms to limit the optimism in time warp
Winter Simulation Conference
Exploring Multi-Grained Parallelism in Compute-Intensive DEVS Simulations
PADS '10 Proceedings of the 2010 IEEE Workshop on Principles of Advanced and Distributed Simulation
Towards Symmetric Multi-threaded Optimistic Simulation Kernels
PADS '12 Proceedings of the 2012 ACM/IEEE/SCS 26th Workshop on Principles of Advanced and Distributed Simulation
Approximate parallel simulation of web search engines
Proceedings of the 2013 ACM SIGSIM conference on Principles of advanced discrete simulation
Coordinator-master-worker model for efficient large scale network simulation
Proceedings of the 6th International ICST Conference on Simulation Tools and Techniques
Hi-index | 0.00 |
Parallel discrete-event simulation on heterogeneous multicore platforms requires innovative redesign of existing algorithms in return for better performance. Based on the Discrete Event System Specification (DEVS) methodology, a technique called Multicore Acceleration of DEVS Systems is proposed for efficient parallel discrete-event simulation on the IBM Cell processor. The technique combines multi-grained parallelism and various optimizations to overcome performance bottlenecks, while hiding the technical details of multicore programming from non-expert users. By explicitly exploiting the data- and event-level parallelism inherent in the simulation, the technique significantly accelerates both memory-bound and compute-bound computational kernels in demanding parallel DEVS simulations, as shown in the experimental results. Several key concepts and methods derived from this research can also be applied to other multicore and shared-memory architectures.