Reaching Agreement in the Presence of Faults
Journal of the ACM (JACM)
Fault Tolerant Operating Systems
ACM Computing Surveys (CSUR)
Reliability Issues in Computing System Design
ACM Computing Surveys (CSUR)
Synchronization in Distributed Programs
ACM Transactions on Programming Languages and Systems (TOPLAS)
The Byzantine Generals Problem
ACM Transactions on Programming Languages and Systems (TOPLAS)
Proving Liveness Properties of Concurrent Programs
ACM Transactions on Programming Languages and Systems (TOPLAS)
Time, clocks, and the ordering of events in a distributed system
Communications of the ACM
An axiomatic basis for computer programming
Communications of the ACM
The Science of Programming
A Discipline of Programming
The Temporal Semantics of Concurrent Programs
Proceedings of the International Sympoisum on Semantics of Concurrent Computation
Notes on Data Base Operating Systems
Operating Systems, An Advanced Course
Distributed Systems - Architecture and Implementation, An Advanced Course
Proving real-time properties of programs with temporal logic
SOSP '81 Proceedings of the eighth ACM symposium on Operating systems principles
Understanding and using asynchronous message passing (Preliminary Version)
PODC '82 Proceedings of the first ACM SIGACT-SIGOPS symposium on Principles of distributed computing
Axiomatic verification to enhance software reliability
Axiomatic verification to enhance software reliability
Applications of Byzantine agreement in database systems
ACM Transactions on Database Systems (TODS)
Low cost management of replicated data in fault-tolerant distributed systems
ACM Transactions on Computer Systems (TOCS)
Reliable communication in the presence of failures
ACM Transactions on Computer Systems (TOCS)
Highly available distributed services and fault-tolerant distributed garbage collection
PODC '86 Proceedings of the fifth annual ACM symposium on Principles of distributed computing
SIGMOD '87 Proceedings of the 1987 ACM SIGMOD international conference on Management of data
Update propagation in Bakunin data networks
PODC '87 Proceedings of the sixth annual ACM Symposium on Principles of distributed computing
Fault tolerant distributed process-control systems
ACM '87 Proceedings of the 1987 Fall Joint Computer Conference on Exploring technology: today and tomorrow
An overview of the SR language and implementation
ACM Transactions on Programming Languages and Systems (TOPLAS)
Fault-Tolerant Distributed Algorithm for Election in Complete Networks
IEEE Transactions on Computers - Fault-Tolerant Computing
A Class of Inherently Fault Tolerant Distributed Programs
IEEE Transactions on Software Engineering
A Pessimistic Consistency Control Algorithm for Replicated Files Which Achieves High Availability
IEEE Transactions on Software Engineering
Resilient Objects in Broadband Networks
IEEE Transactions on Software Engineering
A Dynamic Voting Scheme in Distributed Systems
IEEE Transactions on Software Engineering
Reliable scheduling in a TMR database system
ACM Transactions on Computer Systems (TOCS)
A realistic evaluation of consistency algorithms for replicated files
ANSS '88 Proceedings of the 21st annual symposium on Simulation
PODC '88 Proceedings of the seventh annual ACM Symposium on Principles of distributed computing
Automatically increasing the fault-tolerance of distributed systems
PODC '88 Proceedings of the seventh annual ACM Symposium on Principles of distributed computing
Efficient solution to the distributed mutual exclusion problem
Proceedings of the eighth annual ACM Symposium on Principles of distributed computing
Efficient parallel algorithms can be made robust
Proceedings of the eighth annual ACM Symposium on Principles of distributed computing
Increasing availability under mutual exclusion constraints with dynamic vote reassignment
ACM Transactions on Computer Systems (TOCS)
A loop-free extended Bellman-Ford routing protocol without bouncing effect
SIGCOMM '89 Symposium proceedings on Communications architectures & protocols
Failure Transparency in Remote Procedure Calls
IEEE Transactions on Computers
The Use of Self Checks and Voting in Software Error Detection: An Empirical Study
IEEE Transactions on Software Engineering
Dynamic voting algorithms for maintaining the consistency of a replicated database
ACM Transactions on Database Systems (TODS)
Recoverable Distributed Shared Virtual Memory
IEEE Transactions on Computers
Replicated architectures for shared window systems: a critique
COCS '90 Proceedings of the ACM SIGOIS and IEEE CS TC-OA conference on Office information systems
Lazy replication: exploiting the semantics of distributed services
PODC '90 Proceedings of the ninth annual ACM symposium on Principles of distributed computing
Implementing fault-tolerant services using the state machine approach: a tutorial
ACM Computing Surveys (CSUR)
Efficient at-most-once messages based on synchronized clocks
SIGCOMM '90 Proceedings of the ACM symposium on Communications architectures & protocols
Efficient robust parallel computations
STOC '90 Proceedings of the twenty-second annual ACM symposium on Theory of computing
Paradigms for process interaction in distributed programs
ACM Computing Surveys (CSUR)
Management of a remote backup copy for disaster recovery
ACM Transactions on Database Systems (TODS)
Efficient at-most-once messages based on synchronized clocks
ACM Transactions on Computer Systems (TOCS)
Efficient parallel algorithms on restartable fail-stop processors
PODC '91 Proceedings of the tenth annual ACM symposium on Principles of distributed computing
Unreliable failure detectors for asynchronous systems (preliminary version)
PODC '91 Proceedings of the tenth annual ACM symposium on Principles of distributed computing
A fault-tolerant commit protocol for replicated databases
PODS '92 Proceedings of the eleventh ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
The generalized tree quorum protocol: an efficient approach for managing replicated data
ACM Transactions on Database Systems (TODS)
Manetho: Transparent Roll Back-Recovery with Low Overhead, Limited Rollback, and Fast Output Commit
IEEE Transactions on Computers - Special issue on fault-tolerant computing
Open commit protocols tolerating commission failures
ACM Transactions on Database Systems (TODS)
The consensus problem in fault-tolerant computing
ACM Computing Surveys (CSUR)
A non-blocking transaction data flow graph based approach for replicated data
ACM SIGOPS Operating Systems Review
The process group approach to reliable distributed computing
Communications of the ACM
Closure and Convergence: A Foundation of Fault-Tolerant Computing
IEEE Transactions on Software Engineering - Special issue on software reliability
A checkpoint protocol for an entry consistent shared memory system
PODC '94 Proceedings of the thirteenth annual ACM symposium on Principles of distributed computing
Coding-Based Replication Schemes for Distributed Systems
IEEE Transactions on Parallel and Distributed Systems
Supporting Fault-Tolerant Parallel Programming in Linda
IEEE Transactions on Parallel and Distributed Systems
Programming Language Support for Writing Fault-Tolerant Distributed Software
IEEE Transactions on Computers - Special issue on fault-tolerant computing
A Distributed System-Level Diagnosis Algorithm for Arbitrary Network Topologies
IEEE Transactions on Computers - Special issue on fault-tolerant computing
Checkpoint Space Reclamation for Uncoordinated Checkpointing in Message-Passing Systems.
IEEE Transactions on Parallel and Distributed Systems
Asynchronous consensus and broadcast protocols
Journal of the ACM (JACM)
A Fault-Tolerant Algorithm for Replicated Data Management
IEEE Transactions on Parallel and Distributed Systems
Hypervisor-based fault tolerance
SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
Hypervisor-based fault tolerance
ACM Transactions on Computer Systems (TOCS) - Special issue on operating system principles
The Performance of Two Phase Commit Protocols in the Presence of Site Failures
Distributed and Parallel Databases
Crowds: anonymity for Web transactions
ACM Transactions on Information and System Security (TISSEC)
Efficient Fault-Tolerant Multicast Scheme for Hypercube Multicomputers
IEEE Transactions on Parallel and Distributed Systems
A Gracefully Degrading Massively Parallel System Using the BSP Model, and Its Evaluation
IEEE Transactions on Computers
An Index-Based Checkpointing Algorithm for Autonomous Distributed Systems
IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Computers
A simulation study of replication control protocols using volatile witnesses
ANSS '92 Proceedings of the 25th annual symposium on Simulation
Fundamentals of fault-tolerant distributed computing in asynchronous environments
ACM Computing Surveys (CSUR)
Specification and verification of fault-tolerance, timing, and scheduling
ACM Transactions on Programming Languages and Systems (TOPLAS)
The effect of failure and repair distributions on consistency protocols for replicated data objects
ANSS '89 Proceedings of the 22nd annual symposium on Simulation
Replicated distributed programs
Proceedings of the tenth ACM symposium on Operating systems principles
Replication and fault-tolerance in the ISIS system
Proceedings of the tenth ACM symposium on Operating systems principles
A lighweight causal logging scheme for recoverable distributed shared memory
SAC '00 Proceedings of the 2000 ACM symposium on Applied computing - Volume 2
The Journal of Supercomputing
A Low Overhead Logging Scheme for Fast Recovery in Distributed Shared Memory Systems
The Journal of Supercomputing
Byzantine generals in action: implementing fail-stop processors
ACM Transactions on Computer Systems (TOCS)
Fault-tolerance support in distributed systems
EW 4 Proceedings of the 4th workshop on ACM SIGOPS European workshop
Concurrent logic programming of correct and robust coordinated reactive computations
EW 4 Proceedings of the 4th workshop on ACM SIGOPS European workshop
A formal approach to fault-tolerance in distributed real-time systems
EW 4 Proceedings of the 4th workshop on ACM SIGOPS European workshop
Towards a formal framework for fault-tolerance
EW 4 Proceedings of the 4th workshop on ACM SIGOPS European workshop
A Distributed Parallel Programming Framework
IEEE Transactions on Software Engineering
A survey of rollback-recovery protocols in message-passing systems
ACM Computing Surveys (CSUR)
Efficient Garbage Collection Schemes for Causal Message Logging with Independent Checkpointing
The Journal of Supercomputing
Is byzantine agreement useful in a distributed database?
PODS '84 Proceedings of the 3rd ACM SIGACT-SIGMOD symposium on Principles of database systems
A Distributed Fault-Tolerant Design for Multiple-Server VOD Systems
Multimedia Tools and Applications
Recovering from Multiple Process Failures in the Time Warp Mechanism
IEEE Transactions on Computers
A Hybrid Replica Control Algorithm Combining Static and Dynamic Voting
IEEE Transactions on Knowledge and Data Engineering
Storage Efficient Replicated Databases
IEEE Transactions on Knowledge and Data Engineering
The Grid Protocol: A High Performance Scheme for Maintaining Replicated Data
IEEE Transactions on Knowledge and Data Engineering
A Nonblocking Transaction Data Flow Graph Based Protocol For Replicated Databases
IEEE Transactions on Knowledge and Data Engineering
Using Reconfiguration for Efficient Management of Replicated Data
IEEE Transactions on Knowledge and Data Engineering
A Nonblocking Quorum Consensus Protocol for Replicated Data
IEEE Transactions on Parallel and Distributed Systems
Efficient Rollback-Recovery Technique in Distributed Computing Systems
IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Software Engineering
Warm Standby in Hierarchically Structured Process-Control Programs
IEEE Transactions on Software Engineering
Controlling memory access concurrency in efficient fault-tolerant parallel algorithms
Nordic Journal of Computing
Garbage Collection in a Distributed Object-Oriented System
IEEE Transactions on Knowledge and Data Engineering
Efficient Execution of Read-Only Transactions in Replicated Multiversion Databases
IEEE Transactions on Knowledge and Data Engineering
Design and Verification of Distributed Recovery Blocks with CSP
Formal Methods in System Design
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Efficient Fault-Tolerant Protocol for Mobility Agents in Mobile IP
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Reducing Storage for Quorum Consensus Algorithms
VLDB '88 Proceedings of the 14th International Conference on Very Large Data Bases
The Tree Quorum Protocol: An Efficient Approach for Managing Replicated Data
VLDB '90 Proceedings of the 16th International Conference on Very Large Data Bases
Resilient Logical Structures for Efficient Management of Replicated Data
VLDB '92 Proceedings of the 18th International Conference on Very Large Data Bases
The Complexity of Synchronous Iterative Do-All with Crashes
DISC '01 Proceedings of the 15th International Conference on Distributed Computing
Garbage Collection for Modile and Replicated Objects
SOFSEM '99 Proceedings of the 26th Conference on Current Trends in Theory and Practice of Informatics on Theory and Practice of Informatics
A Recovery Technique Using Multi-agent in Distributed Computing Systems
COORDINATION '02 Proceedings of the 5th International Conference on Coordination Models and Languages
Thinking Tools for the Future of Computing Science
Informatics - 10 Years Back. 10 Years Ahead.
Improving Scalability of Replicated Services in Mobile Agent Systems
MA '02 Proceedings of the 6th International Conference on Mobile Agents
An Efficient Optimistic Message Logging Scheme for Recoverable Mobile Computing Systems
IEEE Transactions on Mobile Computing
distributed cooperation and adversity: complexity trade-offs
PCK50 Proceedings of the Paris C. Kanellakis memorial workshop on Principles of computing & knowledge: Paris C. Kanellakis memorial workshop on the occasion of his 50th birthday
A survey of permission-based distributed mutual exclusion algorithms
Computer Standards & Interfaces
Garbage collection in message passing distributed systems
PAS '95 Proceedings of the First Aizu International Symposium on Parallel Algorithms/Architecture Synthesis
Gracefully Degrading Systems Using the Bulk-Synchronous Parallel Model with Randomised Shared Memory
FTCS '95 Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing
An Efficient Logging Scheme for Lazy Release Consistent Distributed Shared Memory Systems
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
Log Compaction and Garbage Collection: What could they Bring to Each Other?
IWOOOS '95 Proceedings of the 4th International Workshop on Object-Orientation in Operating Systems
(h, k)-Arbiters for h-out-of-k mutual exclusion problem
Theoretical Computer Science
Distributed Diagnosis in Dynamic Fault Environments
IEEE Transactions on Parallel and Distributed Systems
Recovery in the Mobile Wireless Environment Using Mobile Agents
IEEE Transactions on Mobile Computing
On the implementation and use of Ada on fault-tolerant distributed systems
ACM SIGAda Ada Letters
A causal message logging protocol for mobile nodes in mobile computing systems
Future Generation Computer Systems - Special issue: Advanced services for clusters and internet computing
The complexity of synchronous iterative Do-All with crashes
Distributed Computing
Checkpointing for Peta-Scale Systems: A Look into the Future of Practical Rollback-Recovery
IEEE Transactions on Dependable and Secure Computing
Fault-Tolerant Parallel Applications with Dynamic Parallel Schedules
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 16 - Volume 17
Current Practice and a Direction Forward in Checkpoint/Restart Implementations for Fault Tolerance
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 18 - Volume 19
Performing tasks on synchronous restartable message-passing processors
Distributed Computing
Common knowledge and consistent simultaneous coordination
Distributed Computing
Wireless Personal Communications: An International Journal
Replication for Adaptive Responsiveness in Service-Oriented Systems
QSIC '05 Proceedings of the Fifth International Conference on Quality Software
A resource management and fault tolerance services in grid computing
Journal of Parallel and Distributed Computing - Special issue: Design and performance of networks for super-, cluster-, and grid-computing: Part II
Efficient gossip and robust distributed computation
Theoretical Computer Science
Efficient parallel algorithms can be made robust
Distributed Computing
Dependability through Assured Reconfiguration in Embedded System Software
IEEE Transactions on Dependable and Secure Computing
System Call Monitoring Using Authenticated System Calls
IEEE Transactions on Dependable and Secure Computing
The Do-All problem with Byzantine processor failures
Theoretical Computer Science - Foundations of software science and computation structures
Deriving distribution of thread service time in layered queueing networks
WOSP '07 Proceedings of the 6th international workshop on Software and performance
Declarative failure recovery for sensor networks
Proceedings of the 6th international conference on Aspect-oriented software development
UbiSrvInt - a context-aware fault-tolerant approach toward wireless P2P service provision
Expert Systems with Applications: An International Journal
On modeling and tolerating incorrect software
Journal of High Speed Networks - Self-Stabilizing Systems, Part 2
FC-ORB: A robust distributed real-time embedded middleware with end-to-end utilization control
Journal of Systems and Software
Network subsystems reloaded: a high-performance, defensible network subsystem
ATEC '04 Proceedings of the annual conference on USENIX Annual Technical Conference
Efficient Routing and Broadcasting in Recursive Interconnection Networks
ICPP '94 Proceedings of the 1994 International Conference on Parallel Processing - Volume 01
Adding group communication and fault-tolerance to CORBA
COOTS'95 Proceedings of the USENIX Conference on Object-Oriented Technologies on USENIX Conference on Object-Oriented Technologies (COOTS)
Large-scale byzantine fault tolerance: safe but not always live
HotDep'07 Proceedings of the 3rd workshop on on Hot Topics in System Dependability
A solution for fault tolerance in replicated database systems
International Journal of Computational Science and Engineering
2-step algorithm for enhancing effectiveness of sender-based message logging
SpringSim '07 Proceedings of the 2007 spring simulation multiconference - Volume 2
Effective service replication mechanisms exploiting agent mobility
SEPADS'08 Proceedings of the 7th WSEAS International Conference on Software Engineering, Parallel and Distributed Systems
Development of Fault-Tolerant Software Systems Based on Architectural Abstractions
ECSA '08 Proceedings of the 2nd European conference on Software Architecture
WSEAS Transactions on Computers
Journal of Parallel and Distributed Computing
Message fragment based causal message logging
Journal of Parallel and Distributed Computing
Rapid P2P overlay network programming on a distributed reduction machine
CCNC'09 Proceedings of the 6th IEEE Conference on Consumer Communications and Networking Conference
A solution for fault-tolerance in replicated database systems
ISPA'03 Proceedings of the 2003 international conference on Parallel and distributed processing and applications
On verifying fault tolerance of distributed protocols
TACAS'08/ETAPS'08 Proceedings of the Theory and practice of software, 14th international conference on Tools and algorithms for the construction and analysis of systems
Practical impact of group communication theory
Future directions in distributed computing
Atomic mobile agent group communication
CCNC'10 Proceedings of the 7th IEEE conference on Consumer communications and networking conference
The failure detector abstraction
ACM Computing Surveys (CSUR)
A replica control method for improving availability for read-only transactions
IDEAS'97 Proceedings of the 1997 international conference on International database engineering and applications symposium
Dependable computing: concepts, limits, challenges
FTCS'95 Proceedings of the Twenty-Fifth international conference on Fault-tolerant computing
Agent based dynamic recovery protocol in distributed databases
ISPDC'03 Proceedings of the Second international conference on Parallel and distributed computing
The design of a practical system for fault-tolerant virtual machines
ACM SIGOPS Operating Systems Review
Architecture and protocol support for providing consensus as a fault-tolerant virtualised service
Proceedings of the 8th International Conference on Frontiers of Information Technology
Algorithm-based recovery for iterative methods without checkpointing
Proceedings of the 20th international symposium on High performance distributed computing
Rectifying orphan components using group-failover in distributed real-time and embedded systems
Proceedings of the 14th international ACM Sigsoft symposium on Component based software engineering
Analyzing fault aware collective performance in a process fault tolerant MPI
Parallel Computing
Checkpointing and communication pattern-neutral algorithm for removing messages logged by senders
HPCC'06 Proceedings of the Second international conference on High Performance Computing and Communications
A theory for observational fault tolerance
FOSSACS'06 Proceedings of the 9th European joint conference on Foundations of Software Science and Computation Structures
A quorum based group k-mutual exclusion algorithm for open distributed environments
ISPA'05 Proceedings of the Third international conference on Parallel and Distributed Processing and Applications
A fault-tolerant multi-agent development framework
ISPA'04 Proceedings of the Second international conference on Parallel and Distributed Processing and Applications
A hybrid message Logging-CIC protocol for constrained checkpointability
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Implementing rollback-recovery coordinated checkpoints
ISSADS'05 Proceedings of the 5th international conference on Advanced Distributed Systems
Mobile agent based fault-tolerance support for the reliable mobile computing systems
COORDINATION'05 Proceedings of the 7th international conference on Coordination Models and Languages
Architecting web services applications for improving availability
Architecting Dependable Systems III
Architecting Dependable Systems III
Replication techniques for availability
Replication
A quorum-based replication framework for distributed software transactional memory
OPODIS'11 Proceedings of the 15th international conference on Principles of Distributed Systems
Research: Design of broadcast programming primitives for distributed systems
Computer Communications
Automated model repair for distributed programs
ACM SIGACT News
Future Generation Computer Systems
Fault tolerant preconditioned conjugate gradient for sparse linear system solution
Proceedings of the 26th ACM international conference on Supercomputing
A multi-cycle checkpointing protocol that ensures strict 1-rollback
Information Processing Letters
Fault-tolerant fault tolerance for component-based automation systems
Proceedings of the 4th international ACM Sigsoft symposium on Architecting critical systems
Cooperative security in distributed networks
Computer Communications
Leveraging sharding in the design of scalable replication protocols
Proceedings of the 4th annual Symposium on Cloud Computing
Hi-index | 0.03 |