Coordinator-master-worker model for efficient large scale network simulation

Authors:
Bilel Ben Romdhanne; Navid Nikaein; Christian Bonnet
Affiliations:
Eurecom; Eurecom; Eurecom
Venue:
Proceedings of the 6th International ICST Conference on Simulation Tools and Techniques
Year:
2013

Abstract

In this work, we propose a coordinator-master-worker (CMW) model for medium- to extra-large-scale network simulation. The model supports distributed and parallel simulation on a heterogeneous computing node architecture with both multi-core CPUs and GPUs. It aims to maximize the hardware usage rate while reducing the overall management overhead. In the CMW model, the coordinator is the top-level simulation CPU process: it performs an initial partitioning of the simulation into multiple instances and is responsible for load balancing and synchronization among all active masters. The master is also a CPU process and provides event scheduling, synchronization, and communication services to the workers. It manages workers that may run on different computing resources within the same shared-memory context, and it communicates with the coordinator and the other masters through the message passing interface (MPI). The worker is the elementary actor of the CMW model: it performs the simulation routines, interacts with the input and output data, and can be a CPU or a GPU thread. Compared to existing master-worker (MW) models, the CMW model is natively parallel and GPU compliant, and it can be extended to support additional computing resources. The performance gain of the model is evaluated through different benchmarking scenarios using low-cost, publicly available GPU platforms. The results show that a speedup of up to 3000 times can be achieved compared to sequential execution, and of up to 6 times compared to a mono-GPU MW-based simulation. The hardware activity rates of the CMW services on both CPU and GPU are analyzed in detail.
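
The three roles described in the abstract can be illustrated with a minimal single-process sketch. This is not the authors' implementation: the class names `Coordinator`, `Master`, and `Event`, the `worker` function, the round-robin partitioning, and the fixed synchronization window are all assumptions made for illustration. Python threads stand in for CPU/GPU worker threads, plain method calls stand in for MPI messages between processes, and load balancing is not modeled.

```python
import heapq
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass, field


@dataclass(order=True)
class Event:
    time: int                          # simulation timestamp (ordering key)
    node: int = field(compare=False)   # simulated network node (hypothetical)


def worker(event):
    """Worker: runs one simulation routine on one event (placeholder logic)."""
    return (event.time, event.node)


class Master:
    """Master: schedules events by timestamp and dispatches them to workers."""

    def __init__(self, master_id, n_workers=2):
        self.master_id = master_id
        self.n_workers = n_workers
        self.queue = []       # local future-event list (min-heap on time)
        self.processed = []   # results returned by the workers

    def schedule(self, event):
        heapq.heappush(self.queue, event)

    def run_until(self, limit):
        """Process events with time <= limit; same-time events run in parallel."""
        with ThreadPoolExecutor(max_workers=self.n_workers) as pool:
            while self.queue and self.queue[0].time <= limit:
                now = self.queue[0].time
                batch = []
                while self.queue and self.queue[0].time == now:
                    batch.append(heapq.heappop(self.queue))
                self.processed.extend(pool.map(worker, batch))


class Coordinator:
    """Coordinator: partitions the simulation and synchronizes the masters."""

    def __init__(self, n_masters):
        self.masters = [Master(i) for i in range(n_masters)]

    def partition(self, events):
        # Assumed policy: assign each simulated node to a master round-robin.
        for ev in events:
            self.masters[ev.node % len(self.masters)].schedule(ev)

    def run(self, horizon, step):
        # Advance all masters window by window (a simple barrier-style sync).
        for window_end in range(step, horizon + step, step):
            limit = min(window_end, horizon)
            for master in self.masters:
                master.run_until(limit)
        return {m.master_id: m.processed for m in self.masters}


if __name__ == "__main__":
    coordinator = Coordinator(n_masters=2)
    coordinator.partition([Event(t, n) for t in (1, 2, 3) for n in range(4)])
    print(coordinator.run(horizon=3, step=1))
```

In this sketch each master only ever sees the events of its own partition, which mirrors the idea that masters coordinate workers inside one shared-memory context while the coordinator handles cross-master synchronization.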