ACM Computing Surveys (CSUR)
Fault Tolerant Wide-Area Parallel Computing
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
On the development of a communication-aware task mapping technique
Journal of Systems Architecture: the EUROMICRO Journal
Fault-tolerant grid services using primary-backup: feasibility and performance
CLUSTER '04 Proceedings of the 2004 IEEE International Conference on Cluster Computing
Optical Control Plane for the Grid Community
IEEE Communications Surveys & Tutorials
IP restoration vs. WDM protection: is there an optimal choice?
IEEE Network: The Magazine of Global Internetworking
Hi-index | 0.00 |
This study first reviews how grid-enabled applications can be provided with fault tolerance. Existing methods, implemented either in the grid application/middleware or in a Generalized Multi-Protocol Label Switching (GMPLS)-based network, are outlined. Then, the paper shows the advantages of integrating application/middleware fault-tolerant schemes, such as service replication, with GMPLS network-layer fault-tolerant schemes, such as path restoration. An integrated fault-tolerant scheme is capable of providing flexible QoS-aware fault tolerance while minimizing the necessary computational and network resources. In the end, the implementation of the proposed integrated scheme in a Video-on-Demand (VoD) application is experimentally validated.