Recent work has focused on increasing availability in the face of Internet path failures. To date, proposed solutions have relied on complex routing and path-monitoring schemes, trading scalability for availability among a relatively small set of hosts. This paper proposes a simple, scalable approach to recover from Internet path failures. Our contributions are threefold. First, we conduct a broad measurement study of Internet path failures on a collection of 3,153 Internet destinations consisting of popular Web servers, broadband hosts, and randomly selected nodes. We monitored these destinations from 67 PlanetLab vantage points over a period of seven days, and found availabilities ranging from 99.6% for servers to 94.4% for broadband hosts. When failures do occur, many appear too close to the destination (e.g., last-hop and end-host failures) to be mitigated through alternative routing techniques of any kind. Second, we show that for the failures that can be addressed through routing, a simple, scalable technique, called one-hop source routing, can achieve close to the maximum benefit available with very low overhead. When a path failure occurs, our scheme attempts to recover from it by routing indirectly through a small set of randomly chosen intermediaries. Third, we implemented and deployed a prototype one-hop source routing infrastructure on PlanetLab. Over a three-day period, we repeatedly fetched documents from 982 popular Internet Web servers and used one-hop source routing to attempt to route around the failures we observed. Our results show that our prototype successfully recovered from 56% of network failures. However, we also found a large number of server failures that cannot be addressed through alternative routing.
Our research demonstrates that one-hop source routing is easy to implement, adds negligible overhead, and achieves close to the maximum benefit available to indirect routing schemes, without the need for path monitoring, history, or a priori knowledge of any kind.
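
The recovery step described above (try the direct path, and on failure route through a small set of randomly chosen intermediaries) can be illustrated with a minimal Python sketch. It is not the prototype's implementation. The function name `one_hop_route`, the `reachable` callback, the default `k = 4`, and the choice to try intermediaries one at a time are all assumptions made for illustration; the abstract specifies only that the set is small and randomly chosen.

```python
import random
from typing import Callable, List, Optional, Sequence


def one_hop_route(
    dest: str,
    intermediaries: Sequence[str],
    reachable: Callable[[Optional[str], str], bool],
    k: int = 4,  # assumed size of the "small set"; not taken from the source
    rng: Optional[random.Random] = None,
) -> Optional[List[str]]:
    """Return the path that worked ([dest] or [hop, dest]), or None.

    `reachable(via, dest)` is a hypothetical probe supplied by the caller:
    via=None means the direct path, otherwise the path through `via`.
    """
    rng = rng or random.Random()

    # Normal case: use the direct path. No monitoring state or history
    # is consulted before or after this attempt.
    if reachable(None, dest):
        return [dest]

    # Direct path failed: pick up to k intermediaries uniformly at random.
    pool = list(intermediaries)
    candidates = rng.sample(pool, min(k, len(pool)))

    # Try each candidate in turn (sequential order is an assumption).
    for hop in candidates:
        if reachable(hop, dest):
            return [hop, dest]

    # Every attempt failed, e.g. a last-hop or end-host failure that no
    # alternative route can bypass.
    return None
```

Because the candidates are drawn at random only after a failure is observed, the sender keeps no per-path state, which is the property the abstract credits for the scheme's scalability.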