The edutain@grid European project [1] is developing a platform that supports the deployment, management, and execution of Real-Time Online Interactive Applications (ROIA) on the Grid. In this paper we present a monitoring system that collects data from all resources in a distributed environment and from the ROIA managed by our platform. We also describe a fault tolerance service that addresses not only the faults commonly encountered in distributed systems, but also service-level faults arising within the platform's management services. Finally, we present a use case in which the platform runs a massively multiplayer online game as a concrete ROIA, demonstrating the roles of the monitoring and fault tolerance services.