Goodness-of-fit techniques
Scale and performance in a distributed file system
ACM Transactions on Computer Systems (TOCS)
Coda: A Highly Available File System for a Distributed Workstation Environment
IEEE Transactions on Computers
Disconnected operation in the Coda File System
ACM Transactions on Computer Systems (TOCS)
Rover: a toolkit for mobile information access
SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
Managing update conflicts in Bayou, a weakly connected replicated storage system
SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
Using predictive prefetching to improve World Wide Web latency
ACM SIGCOMM Computer Communication Review
Agile application-aware adaptation for mobility
Proceedings of the sixteenth ACM symposium on Operating systems principles
Computer architecture (2nd ed.): a quantitative approach
Computer architecture (2nd ed.): a quantitative approach
Measurements and analysis of end-to-end Internet dynamics
Measurements and analysis of end-to-end Internet dynamics
The end-to-end effects of Internet path selection
Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication
On the scale and performance of cooperative Web proxy caching
Proceedings of the seventeenth ACM symposium on Operating systems principles
Resource management for scalable disconnected access to Web services
Proceedings of the 10th international conference on World Wide Web
Probability and statistics with reliability, queuing and computer science applications
Probability and statistics with reliability, queuing and computer science applications
SOSP '01 Proceedings of the eighteenth ACM symposium on Operating systems principles
Experimental Study of Internet Stability and Backbone Failures
FTCS '99 Proceedings of the Twenty-Ninth Annual International Symposium on Fault-Tolerant Computing
The case for geographical push-caching
HOTOS '95 Proceedings of the Fifth Workshop on Hot Topics in Operating Systems (HotOS-V)
Coordinated Placement and Replacement for Large-Scale Distributed Caches
WIAPP '99 Proceedings of the 1999 IEEE Workshop on Internet Applications
INFOCOM '97 Proceedings of the INFOCOM '97. Sixteenth Annual Joint Conference of the IEEE Computer and Communications Societies. Driving the Information Revolution
Characteristics of WWW Client-based Traces
Characteristics of WWW Client-based Traces
Design Considerations for Distributed Caching on the Internet
ICDCS '99 Proceedings of the 19th IEEE International Conference on Distributed Computing Systems
Using Mobile Extensions to Support Disconnected Services
Using Mobile Extensions to Support Disconnected Services
TCP Nice: a mechanism for background transfers
OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
End-to-end WAN service availability
USITS'01 Proceedings of the 3rd conference on USENIX Symposium on Internet Technologies and Systems - Volume 3
Scalable web caching of frequently updated objects using reliable multicast
USITS'99 Proceedings of the 2nd conference on USENIX Symposium on Internet Technologies and Systems - Volume 2
Organization-based analysis of web-object sharing and caching
USITS'99 Proceedings of the 2nd conference on USENIX Symposium on Internet Technologies and Systems - Volume 2
USITS'99 Proceedings of the 2nd conference on USENIX Symposium on Internet Technologies and Systems - Volume 2
Mining longest repeating subsequences to predict world wide web surfing
USITS'99 Proceedings of the 2nd conference on USENIX Symposium on Internet Technologies and Systems - Volume 2
Active names: flexible location and transport of wide-area resources
USITS'99 Proceedings of the 2nd conference on USENIX Symposium on Internet Technologies and Systems - Volume 2
Exploring the bounds of web latency reduction from caching and prefetching
USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems
The measured access characteristics of world-wide-web client proxy caches
USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems
Using smart clients to build scalable services
ATEC '97 Proceedings of the annual conference on USENIX Annual Technical Conference
Active cache: caching dynamic contents on the Web
Middleware '98 Proceedings of the IFIP International Conference on Distributed Systems Platforms and Open Distributed Processing
The potential costs and benefits of long-term prefetching for content distribution
Computer Communications
Improved survivability with multi-layer dynamic routing
IEEE Communications Magazine
Application specific data replication for edge services
WWW '03 Proceedings of the 12th international conference on World Wide Web
Improving Availability and Performance with Application-Specific Data Replication
IEEE Transactions on Knowledge and Data Engineering
Consistent and automatic replica regeneration
ACM Transactions on Storage (TOS)
The costs and limits of availability for replicated services
ACM Transactions on Computer Systems (TOCS)
Improving web availability for clients with MONET
NSDI'05 Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation - Volume 2
PlanetSeer: internet path failure monitoring and characterization in wide-area services
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Improving the reliability of internet paths with one-hop source routing
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Availability of multi-object operations
NSDI'06 Proceedings of the 3rd conference on Networked Systems Design & Implementation - Volume 3
On the analysis of overlay failure detection and recovery
Computer Networks: The International Journal of Computer and Telecommunications Networking
Replication degree customization for high availability
Proceedings of the 3rd ACM SIGOPS/EuroSys European Conference on Computer Systems 2008
A study of end-to-end web access failures
CoNEXT '06 Proceedings of the 2006 ACM CoNEXT conference
Characterization of failures in an operational IP backbone network
IEEE/ACM Transactions on Networking (TON)
An investigation of the Internet's IP-layer connectivity
Computer Communications
Dynamic cost-efficient replication in data clouds
ACDC '09 Proceedings of the 1st workshop on Automated control for datacenters and clouds
iPlane Nano: path prediction for peer-to-peer applications
NSDI'09 Proceedings of the 6th USENIX symposium on Networked systems design and implementation
Zeno: eventually consistent Byzantine-fault tolerance
NSDI'09 Proceedings of the 6th USENIX symposium on Networked systems design and implementation
Concurrent Multipath Transfer during path failure
Computer Communications
Reliable publish/subscribe middleware for time-sensitive internet-scale applications
Proceedings of the Third ACM International Conference on Distributed Event-Based Systems
Efficient locally trackable deduplication in replicated systems
Proceedings of the 10th ACM/IFIP/USENIX International Conference on Middleware
Informed detour selection helps reliability
INFOCOM'09 Proceedings of the 28th IEEE international conference on Computer Communications Workshops
Resilience analysis of packet-witched communication networks
IEEE/ACM Transactions on Networking (TON)
CORS: A cooperative overlay routing service to enhance interactive multimedia communications
Journal of Visual Communication and Image Representation
Analytical availability assessment of IT services
ISAS'08 Proceedings of the 5th international conference on Service availability
A self-organized, fault-tolerant and scalable replication scheme for cloud storage
Proceedings of the 1st ACM symposium on Cloud computing
Evaluating the adaptivity of computing systems
Performance Evaluation
Efficient locally trackable deduplication in replicated systems
Middleware'09 Proceedings of the ACM/IFIP/USENIX 10th international conference on Middleware
Webprofiler: cooperative diagnosis of web failures
COMSNETS'10 Proceedings of the 2nd international conference on COMmunication systems and NETworks
California fault lines: understanding the causes and impact of network failures
Proceedings of the ACM SIGCOMM 2010 conference
Detouring and replication for fast and reliable internet-scale stream processing
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Network imprecision: a new consistency metric for scalable monitoring
OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation
Weak consistency as a last resort
Proceedings of the 4th International Workshop on Large Scale Distributed Systems and Middleware
Modeling user perceived unavailability due to long response times
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Proceedings of the VLDB Endowment
Prediction models for long-term Internet prefix availability
Computer Networks: The International Journal of Computer and Telecommunications Networking
Differentiated Availability in Cloud Computing SLAs
GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
End-user perspectives of Internet connectivity problems
Computer Networks: The International Journal of Computer and Telecommunications Networking
Analysis of a service degradation model with preventive rejuvenation
ISAS'06 Proceedings of the Third international conference on Service Availability
Survey On reliability in publish/subscribe services
Computer Networks: The International Journal of Computer and Telecommunications Networking
DepSky: Dependable and Secure Storage in a Cloud-of-Clouds
ACM Transactions on Storage (TOS)
Hi-index | 0.00 |
This paper seeks to understand how network failures affect the availability of service delivery across wide-area networks (WANs) and to evaluate classes of techniques for improving end-to-end service availability. Using several large-scale connectivity traces, we develop a model of network unavailability that includes key parameters such as failure location and failure duration. We then use trace-based simulation to evaluate several classes of techniques for coping with network unavailability. We find that caching alone is seldom effective at insulating services from failures but that the combination of mobile extension code and prefetching can improve average unavailability by as much as an order of magnitude for classes of service whose semantics support disconnected operation. We find that routing-based techniques may provide significant improvements but that the improvements of many individual techniques are limited because they do not address all significant categories of network failures. By combining the techniques we examine, some systems may be able to reduce average unavailability by as much as one or two orders of magnitude.