Publish/subscribe services are required in several long-running industrial projects that aim to rethink software systems radically by integrating existing legacy systems into large-scale federated architectures. Such a federation is a constellation of systems that cooperate through the event notification that publish/subscribe services provide over wide-area networks. These services have been widely adopted for implementing large-scale federations because their intrinsic decoupling properties improve scalability. However, a key requirement of such federations is that the adopted publish/subscribe service tolerate faults in the network and/or in the computing nodes composing the federation without degrading event notification. It is therefore crucial that publish/subscribe services be equipped with proper methods to support reliable event notification. In this paper, we present reliable event notification by introducing its definition, a model of the faults that have to be tolerated, the available methods to recover from such faults, and how current publish/subscribe products deal with reliability.