ACM Transactions on Programming Languages and Systems (TOPLAS)
Fault-tolerant distributed simulation
PADS '98 Proceedings of the twelfth workshop on Parallel and distributed simulation
The resource sharing system: dynamic federate mapping for HLA-based distributed simulation
Proceedings of the fifteenth workshop on Parallel and distributed simulation
IEEE Internet Computing
Concepts for dependable distributed discrete event simulation
Proceedings of the 14th European Simulation Multiconference on Simulation and Modelling: Enablers for a Better Quality of Life
HLA-based Adaptive Distributed Simulation of Wireless Mobile Systems
Proceedings of the seventeenth workshop on Parallel and distributed simulation
A Generic Rollback Manager for Optimistic HLA Simulations
DS-RT '00 Proceedings of the Fourth IEEE International Workshop on Distributed Simulation and Real-Time Applications
A Load Management System for Running HLA-Based Distributed Simulations over the Grid
DS-RT '02 Proceedings of the Sixth IEEE International Workshop on Distributed Simulation and Real-Time Applications
Optimistic synchronization in HLA based distributed simulation
Proceedings of the eighteenth workshop on Parallel and distributed simulation
ANSS '05 Proceedings of the 38th annual Symposium on Simulation
Evaluation of a Fault-Tolerance Mechanism for HLA-Based Distributed Simulations
Proceedings of the 20th Workshop on Principles of Advanced and Distributed Simulation
A replication structure for efficient and fault-tolerant parallel and distributed simulations
SpringSim '10 Proceedings of the 2010 Spring Simulation Multiconference
Federate Fault Tolerance in HLA-Based Simulation
PADS '10 Proceedings of the 2010 IEEE Workshop on Principles of Advanced and Distributed Simulation
Hi-index | 0.00 |
The widespread use of simulation in future military systems depends, among others, on the degree of reuse and availability of simulation models. Simulation support in such systems must also cope with failure in software or hardware. Research in fault-tolerant distributed simulation, especially in the context of the High Level Architecture (HLA), has been quite sparse. Nor does the HLA standard itself cover fault-tolerance extensively. This paper describes a framework, named Distributed Resource Management System (DRMS), for robust execution of federations. The implementation of the framework is based on Web Services and Semantic Web technology, and provides fundamental services and a consistent mechanism for description of resources managed by the environment. To evaluate the proposed framework, a federation has been developed that utilizes time-warp mechanism for synchronization. In this paper, we describe our approach to fault tolerance and give an example to illustrate how DRMS behaves when it faces faulty federates.