Optimistic recovery in distributed systems
ACM Transactions on Computer Systems (TOCS)
Recovery in distributed systems using asynchronous message logging and checkpointing
PODC '88 Proceedings of the seventh annual ACM Symposium on Principles of distributed computing
Transparent fault-tolerance in parallel Orca programs
SEDMS III Papers from the symposium on Experiences with distributed and multiprocessor systems
Agents for the Grid: A Comparison with Web Services (Part I: Transport Layer)
CCGRID '02 Proceedings of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid
The GRID: Computational and Data Resource Sharing in Engineering Optimisation and Design Search
ICPPW '01 Proceedings of the 2001 International Conference on Parallel Processing Workshops
The Anatomy of the Grid: Enabling Scalable Virtual Organizations
International Journal of High Performance Computing Applications
Transparent fault tolerance for parallel applications on networks of workstations
ATEC '96 Proceedings of the 1996 annual conference on USENIX Annual Technical Conference
Real-Time Strategy and Practice in Service Grid
COMPSAC '04 Proceedings of the 28th Annual International Computer Software and Applications Conference - Volume 01
FTWeb: A Fault Tolerant Infrastructure for Web Services
EDOC '05 Proceedings of the Ninth IEEE International EDOC Enterprise Computing Conference
A resource management and fault tolerance services in grid computing
Journal of Parallel and Distributed Computing - Special issue: Design and performance of networks for super-, cluster-, and grid-computing: Part II
UbiSrvInt - a context-aware fault-tolerant approach toward wireless P2P service provision
Expert Systems with Applications: An International Journal
Journal of Systems Architecture: the EUROMICRO Journal
An approach to grid resource selection and fault management based on ECA rules
Future Generation Computer Systems
Towards a framework for web service compositions recovery
Proceedings of the Warm Up Workshop for ACM/IEEE ICSE 2010
Design and implementation of a Byzantine fault tolerance framework for Web services
Journal of Systems and Software
A lightweight fault tolerance framework for Web services
Web Intelligence and Agent Systems
Monitoring and fault tolerance for real-time online interactive applications
Euro-Par'09 Proceedings of the 2009 international conference on Parallel processing
Service research challenges and solutions for the future internet
From session guarantees to contract guarantees for consistency of SOA-compliant processing
ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
A fault-tolerant web services architecture
APWeb'06 Proceedings of the 2006 international conference on Advanced Web and Network Technologies, and Applications
A metabolic approach to protocol resilience
WAC'04 Proceedings of the First international IFIP conference on Autonomic Communication
Providing fault-tolerant execution of web-service-based workflows within clouds
Proceedings of the 2nd International Workshop on Cloud Computing Platforms
Ensuring reliability in B2B services: Fault tolerant inter-organizational workflows
Information Systems Frontiers
D-reserve: distributed reliable service environment
ADBIS'12 Proceedings of the 16th East European conference on Advances in Databases and Information Systems
Consistency guarantees for recovery of service-oriented distributed processing
International Journal of Intelligent Information and Database Systems
A survey on reliability in distributed systems
Journal of Computer and System Sciences
Hi-index | 0.00 |
Service-based architectures enable the development of new classes of Grid and distributed applications. One of the main capabilities provided by such systems is the dynamic and flexible integration of services, according to which services are allowed to be a part of more than one distributed system and simultaneously serve different applications. This increased flexibility in system composition makes it difficult to address classical distributed system issues such as fault-tolerance. While it is relatively easy to make an individual service fault-tolerant, improving fault-tolerance of services collaborating in multiple application scenarios is a challenging task. In this paper, we look at the issue of developing fault-tolerant service-based distributed systems, and propose an infrastructure to implement fault tolerance capabilities transparent to services.