Epidemic algorithms for replicated database maintenance
PODC '87 Proceedings of the sixth annual ACM Symposium on Principles of distributed computing
The design and analysis of algorithms
The design and analysis of algorithms
The weakest failure detector for solving consensus
PODC '92 Proceedings of the eleventh annual ACM symposium on Principles of distributed computing
Impossibility of distributed consensus with one faulty process
Journal of the ACM (JACM)
Horus: a flexible group communication system
Communications of the ACM
On the impossibility of group membership
PODC '96 Proceedings of the fifteenth annual ACM symposium on Principles of distributed computing
Building adaptive systems using ensemble
Software—Practice & Experience - Special issue on multiprocessor operating systems
EW 7 Proceedings of the 7th workshop on ACM SIGOPS European workshop: Systems support for worldwide applications
A reliable ordered delivery protocol for interconnected local area networks
ICNP '95 Proceedings of the 1995 International Conference on Network Protocols
Bimodal Multicast
A Gossip-Style Failure Detection Service
A Gossip-Style Failure Detection Service
GROUP MEMBERSHIP IN THE EPIDEMIC STYLE
GROUP MEMBERSHIP IN THE EPIDEMIC STYLE
Resource discovery in distributed networks
Proceedings of the eighteenth annual ACM symposium on Principles of distributed computing
ACM Transactions on Computer Systems (TOCS)
Connectivity and inference problems for temporal networks
STOC '00 Proceedings of the thirty-second annual ACM symposium on Theory of computing
Spatial gossip and resource location protocols
STOC '01 Proceedings of the thirty-third annual ACM symposium on Theory of computing
On scalable and efficient distributed failure detectors
Proceedings of the twentieth annual ACM symposium on Principles of distributed computing
Proceedings of the fourteenth annual ACM symposium on Parallel algorithms and architectures
Peer-to-Peer Membership Management for Gossip-Based Protocols
IEEE Transactions on Computers
Directional Gossip: Gossip in a Wide Area Network
EDCC-3 Proceedings of the Third European Dependable Computing Conference on Dependable Computing
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
A Probabilistically Correct Leader Election Protocol for Large Groups
DISC '00 Proceedings of the 14th International Conference on Distributed Computing
Gossip versus Deterministically Constrained Flooding on Small Networks
DISC '00 Proceedings of the 14th International Conference on Distributed Computing
Optimistic Replication for Internet Data Services
DISC '00 Proceedings of the 14th International Conference on Distributed Computing
DISC '01 Proceedings of the 15th International Conference on Distributed Computing
Scalable Management and Data Mining Using Astrolabe
IPTPS '01 Revised Papers from the First International Workshop on Peer-to-Peer Systems
Communication Adaptive Self-Stabilizing Group Membership Service
WSS '01 Proceedings of the 5th International Workshop on Self-Stabilizing Systems
Scalable Fault-Tolerant Aggregation in Large Process Groups
DSN '01 Proceedings of the 2001 International Conference on Dependable Systems and Networks (formerly: FTCS)
DISC '00 Proceedings of the 14th International Conference on Distributed Computing
ACM Transactions on Computer Systems (TOCS)
IEEE Transactions on Mobile Computing
The power of epidemics: robust communication for large-scale distributed systems
ACM SIGCOMM Computer Communication Review
Connectivity and inference problems for temporal networks
Journal of Computer and System Sciences - Special issue on STOC 2000
Taming aggressive replication in the Pangaea wide-area file system
ACM SIGOPS Operating Systems Review - OSDI '02: Proceedings of the 5th symposium on Operating systems design and implementation
Achieving Scalable Cluster System Analysis and Management with a Gossip-Based Network Service
LCN '01 Proceedings of the 26th Annual IEEE Conference on Local Computer Networks
On implementing omega with weak reliability and synchrony assumptions
Proceedings of the twenty-second annual symposium on Principles of distributed computing
Anonymous Gossip: Improving Multicast Reliability in Mobile Ad-Hoc Networks
ICDCS '01 Proceedings of the The 21st International Conference on Distributed Computing Systems
Communication Adaptive Self-Stabilizing Group Membership Service
IEEE Transactions on Parallel and Distributed Systems
Scattercast: an adaptable broadcast distribution framework
Multimedia Systems
Collective asynchronous reading with polylogarithmic worst-case overhead
STOC '04 Proceedings of the thirty-sixth annual ACM symposium on Theory of computing
Failure Detection and Membership Management in Grid Environments
GRID '04 Proceedings of the 5th IEEE/ACM International Workshop on Grid Computing
Spatial gossip and resource location protocols
Journal of the ACM (JACM)
An Efficient Topology-Adaptive Membership Protocol for Large-Scale Cluster-Based Services
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
Taming aggressive replication in the Pangaea wide-area file system
OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
Distributed Computing
Gossip-based aggregation in large dynamic networks
ACM Transactions on Computer Systems (TOCS)
HiScamp: self-organizing hierarchical membership protocol
EW 10 Proceedings of the 10th workshop on ACM SIGOPS European workshop
Pangaea: a symbiotic wide-area file system
EW 10 Proceedings of the 10th workshop on ACM SIGOPS European workshop
The notification based approach to implementing failure detectors in distributed systems
InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
Epidemic-based approaches for reliable multicast in mobile ad hoc networks
ACM SIGOPS Operating Systems Review
Efficient and Adaptive Epidemic-Style Protocols for Reliable and Scalable Multicast
IEEE Transactions on Parallel and Distributed Systems
Fireflies: scalable support for intrusion-tolerant network overlays
Proceedings of the 1st ACM SIGOPS/EuroSys European Conference on Computer Systems 2006
Robust gossiping with an application to consensus
Journal of Computer and System Sciences
FUSE: lightweight guaranteed distributed failure notification
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
A Scalable and Efficient Self-Organizing Failure Detector for Grid Applications
GRID '05 Proceedings of the 6th IEEE/ACM International Workshop on Grid Computing
Peer-to-Peer in Metric Space and Semantic Space
IEEE Transactions on Knowledge and Data Engineering
Scalability of the microsoft cluster service
WINSYM'98 Proceedings of the 2nd conference on USENIX Windows NT Symposium - Volume 2
Proceedings of the 16th international symposium on High performance distributed computing
Latency and bandwidth-minimizing failure detectors
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
ACM Transactions on Computer Systems (TOCS)
Journal of Systems and Software
On collaborative content distribution using multi-message gossip
Journal of Parallel and Distributed Computing
Gossiping in distributed systems
ACM SIGOPS Operating Systems Review - Gossip-based computer networking
How robust are gossip-based communication protocols?
ACM SIGOPS Operating Systems Review - Gossip-based computer networking
A generic theoretical framework for modeling gossip-based algorithms
ACM SIGOPS Operating Systems Review - Gossip-based computer networking
Formal analysis techniques for gossiping protocols
ACM SIGOPS Operating Systems Review - Gossip-based computer networking
The "art" of programming gossip-based systems
ACM SIGOPS Operating Systems Review - Gossip-based computer networking
Exploiting the synergy between gossiping and structured overlays
ACM SIGOPS Operating Systems Review - Gossip-based computer networking
Reliable on-demand management operations for large-scale distributed applications
ACM SIGOPS Operating Systems Review - Gossip-based computer networking
Experiences with open overlays: a middleware approach to network heterogeneity
Proceedings of the 3rd ACM SIGOPS/EuroSys European Conference on Computer Systems 2008
SSCM: middleware for structure-based service collaboration
Proceedings of the 2008 ACM symposium on Applied computing
On spreading recommendations via social gossip
Proceedings of the twentieth annual symposium on Parallelism in algorithms and architectures
On the complexity of asynchronous gossip
Proceedings of the twenty-seventh ACM symposium on Principles of distributed computing
Efficient Processing of Continuous Join Queries Using Distributed Hash Tables
Euro-Par '08 Proceedings of the 14th international Euro-Par conference on Parallel Processing
A group membership service for large-scale grids
Proceedings of the 6th international workshop on Middleware for grid computing
Failure Detection Service for Large Scale Systems
KES-AMSTA '07 Proceedings of the 1st KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Semantic partitioning of peer-to-peer search space
Computer Communications
End-to-end epidemic multicast loss recovery: Analysis of scalability and robustness
Computer Communications
Deep middleware for the divergent Grid
Proceedings of the ACM/IFIP/USENIX 2005 International Conference on Middleware
Cross-layer cooperation between membership estimation and routing
Proceedings of the 2009 ACM symposium on Applied Computing
Dr. Multicast: Rx for data center communication scalability
LADIS '08 Proceedings of the 2nd Workshop on Large-Scale Distributed Systems and Middleware
Epidemic-based reliable and adaptive multicast for mobile ad hoc networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
Design of the notification system for failure detectors
International Journal of High Performance Computing and Networking
Global data computation in chordal rings
Journal of Parallel and Distributed Computing
Exploiting Synergies between Coexisting Overlays
DAIS '09 Proceedings of the 9th IFIP WG 6.1 International Conference on Distributed Applications and Interoperable Systems
Counter-based reliability optimization for gossip-based broadcasting
Computer Communications
Epidemic protocols for pervasive computing systems: moving focus from architecture to protocol
M-PAC '09 Proceedings of the International Workshop on Middleware for Pervasive Mobile and Embedded Computing
An analytical framework for self-organizing peer-to-peer anti-entropy algorithms
Performance Evaluation
International Journal of Parallel Programming
Application execution management on the InteGrade opportunistic grid middleware
Journal of Parallel and Distributed Computing
Large-scale behavior of end-to-end epidemic message loss recovery
QofIS'02/ICQT'02 Proceedings of the 3rd international conference on quality of future internet services and internet charging and QoS technologies 2nd international conference on From QoS provisioning to QoS charging
Dr. multicast: Rx for data center communication scalability
Proceedings of the 5th European conference on Computer systems
GPC'07 Proceedings of the 2nd international conference on Advances in grid and pervasive computing
RSM-based gossip on P2P network
ICA3PP'07 Proceedings of the 7th international conference on Algorithms and architectures for parallel processing
Cassandra: a decentralized structured storage system
ACM SIGOPS Operating Systems Review
Skip ring topology in fast failure detection service
PPAM'07 Proceedings of the 7th international conference on Parallel processing and applied mathematics
A platform for cooperative server backups based on virtual machines
ISAS'08 Proceedings of the 5th international conference on Service availability
A gossip-based protocol to reach consensus via uninorm aggregation operator
GPC'08 Proceedings of the 3rd international conference on Advances in grid and pervasive computing
Facilitating gossip programming with the GossipKit framework
DAIS'08 Proceedings of the 8th IFIP WG 6.1 international conference on Distributed applications and interoperable systems
Meeting the deadline: on the complexity of fault-tolerant continuous gossip
Proceedings of the 29th ACM SIGACT-SIGOPS symposium on Principles of distributed computing
Self-organize management of mobile adhoc networks
MILCOM'06 Proceedings of the 2006 IEEE conference on Military communications
On collaborative content distribution using multi-message gossip
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
NN-SA based dynamic failure detector for services composition in distributed environment
ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications - Volume Part II
Modeling gossip-based content dissemination and search in distributed networking
Computer Communications
Journal of Intelligent Manufacturing
Kevlar: a flexible infrastructure for wide-area collaborative applications
Proceedings of the ACM/IFIP/USENIX 11th International Conference on Middleware
Detecting failures in distributed systems with the Falcon spy network
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Exact performance measures for peer-to-peer epidemic information diffusion
ISCIS'06 Proceedings of the 21st international conference on Computer and Information Sciences
Rumor spreading and vertex expansion
Proceedings of the twenty-third annual ACM-SIAM symposium on Discrete Algorithms
Time and communication efficient consensus for crash failures
DISC'06 Proceedings of the 20th international conference on Distributed Computing
A specification-to-deployment architecture for overlay networks
ODBASE'06/OTM'06 Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part II
SecondSite: disaster tolerance as a service
VEE '12 Proceedings of the 8th ACM SIGPLAN/SIGOPS conference on Virtual Execution Environments
Cooperative failure detection in overlay multicast
NETWORKING'05 Proceedings of the 4th IFIP-TC6 international conference on Networking Technologies, Services, and Protocols; Performance of Computer and Communication Networks; Mobile and Wireless Communication Systems
Deep middleware for the divergent grid
Middleware'05 Proceedings of the ACM/IFIP/USENIX 6th international conference on Middleware
Asynchronous failed sensor node detection method for sensor networks
International Journal of Network Management
Cost-effective broadcast for fully decentralized peer-to-peer networks
Computer Communications
To reach consensus using uninorm aggregation operator: A gossip-based protocol
International Journal of Intelligent Systems
Short Survey: A survey of application level multicast techniques
Computer Communications
LossEstimate: Distributed failure estimation in wireless networks
Journal of Systems and Software
Effects of mobility on membership estimation and routing services in ad hoc networks
ISPA'07 Proceedings of the 5th international conference on Parallel and Distributed Processing and Applications
A Failure Detection System for Large Scale Distributed Systems
International Journal of Distributed Systems and Technologies
Journal of the ACM (JACM)
A gossip-based approach to exascale system services
Proceedings of the 3rd International Workshop on Runtime and Operating Systems for Supercomputers
Gossip-based cooperative caching for mobile applications in mobile wireless networks
Journal of Parallel and Distributed Computing
Autonomic cloud resource sharing for intercloud federations
Future Generation Computer Systems
Pico replication: a high availability framework for middleboxes
Proceedings of the 4th annual Symposium on Cloud Computing
Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
Hi-index | 0.00 |
Failure Detection is valuable for system management, replication, load balancing, and other distributed services. To date, Failure Detection Services scale badly in the number of members that are being monitored. This paper describes a new protocol based on gossiping that does scale well and provides timely detection. We analyze the protocol, and then extend it to discover and leverage the underlying network topology for much improved resource utilization. We then combine it with another protocol, based on broadcast, that is used to handle partition failures.