Congestion avoidance and control
SIGCOMM '88 Symposium proceedings on Communications architectures and protocols
Gigabit networking
Random early detection gateways for congestion avoidance
IEEE/ACM Transactions on Networking (TON)
TCP Vegas: new techniques for congestion detection and avoidance
SIGCOMM '94 Proceedings of the conference on Communications architectures, protocols and applications
A comparison of mechanisms for improving TCP performance over wireless links
Conference proceedings on Applications, technologies, architectures, and protocols for computer communications
The effects of asymmetry on TCP performance
MobiCom '97 Proceedings of the 3rd annual ACM/IEEE international conference on Mobile computing and networking
On estimating end-to-end network path properties
Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication
Soft timers: efficient microsecond software timer support for network processing
ACM Transactions on Computer Systems (TOCS)
On the constancy of internet path properties
IMW '01 Proceedings of the 1st ACM SIGCOMM Workshop on Internet Measurement
An image transport protocol for the Internet
ICNP '00 Proceedings of the 2000 International Conference on Network Protocols
SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
The Panasas ActiveScale Storage Cluster: Delivering Scalable High Bandwidth Storage
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Designing DCCP: congestion control without reliability
Proceedings of the 2006 conference on Applications, technologies, architectures, and protocols for computer communications
MapReduce: simplified data processing on large clusters
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Structured streams: a new transport abstraction
Proceedings of the 2007 conference on Applications, technologies, architectures, and protocols for computer communications
Scalable performance of the Panasas parallel file system
FAST'08 Proceedings of the 6th USENIX Conference on File and Storage Technologies
Measurement and analysis of TCP throughput collapse in cluster-based storage systems
FAST'08 Proceedings of the 6th USENIX Conference on File and Storage Technologies
Understanding TCP incast throughput collapse in datacenter networks
Proceedings of the 1st ACM workshop on Research on enterprise networking
NETWORKING'07 Proceedings of the 6th international IFIP-TC6 conference on Ad Hoc and sensor networks, wireless networks, next generation internet
Understanding TCP incast throughput collapse in datacenter networks
Proceedings of the 1st ACM workshop on Research on enterprise networking
The nature of data center traffic: measurements & analysis
Proceedings of the 9th ACM SIGCOMM conference on Internet measurement conference
The impact of virtualization on network performance of amazon EC2 data center
INFOCOM'10 Proceedings of the 29th conference on Information communications
Proceedings of the ACM SIGCOMM 2010 conference
Proceedings of the ACM SIGCOMM 2010 conference
XCo: explicit coordination to prevent network fabric congestion in cloud computing cluster platforms
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Data center networking with multipath TCP
Hotnets-IX Proceedings of the 9th ACM SIGCOMM Workshop on Hot Topics in Networks
ICTCP: Incast Congestion Control for TCP in data center networks
Proceedings of the 6th International COnference
Exploring potential vulnerabilities in data center network
Proceedings of the ACM CoNEXT Student Workshop
On TCP-based SIP server overload control
Principles, Systems and Applications of IP Telecommunications
Profiling network performance for multi-tier data center applications
Proceedings of the 8th USENIX conference on Networked systems design and implementation
Efficiently measuring bandwidth at all time scales
Proceedings of the 8th USENIX conference on Networked systems design and implementation
Analysis of DCTCP: stability, convergence, and fairness
Proceedings of the ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Fine-grained latency and loss measurements in the presence of reordering
Proceedings of the ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
A cost-intelligent application-specific data layout scheme for parallel file systems
Proceedings of the 20th international symposium on High performance distributed computing
Curbing energy cravings in networks: a cross-sectional view across the micro-macro boundary
NOCS '11 Proceedings of the Fifth ACM/IEEE International Symposium on Networks-on-Chip
Analysis of DCTCP: stability, convergence, and fairness
ACM SIGMETRICS Performance Evaluation Review - Performance evaluation review
Fine-grained latency and loss measurements in the presence of reordering
ACM SIGMETRICS Performance Evaluation Review - Performance evaluation review
Better never than late: meeting deadlines in datacenter networks
Proceedings of the ACM SIGCOMM 2011 conference
Managing data transfers in computer clusters with orchestra
Proceedings of the ACM SIGCOMM 2011 conference
Opportunistic flooding to improve TCP transmit performance in virtualized clouds
Proceedings of the 2nd ACM Symposium on Cloud Computing
Advanced FCoE: extension of fibre channel over ethernet
Proceedings of the 3rd Workshop on Data Center - Converged and Virtual Ethernet Switching
Cross-layer flow and congestion control for datacenter networks
Proceedings of the 3rd Workshop on Data Center - Converged and Virtual Ethernet Switching
Purlieus: locality-aware resource allocation for MapReduce in a cloud
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Pangolin: speeding up concurrent messaging for cloud-based social gaming
Proceedings of the Seventh COnference on emerging Networking EXperiments and Technologies
Practical TDMA for datacenter ethernet
Proceedings of the 7th ACM european conference on Computer Systems
Cutting MapReduce cost with spot market
HotCloud'11 Proceedings of the 3rd USENIX conference on Hot topics in cloud computing
Camdoop: exploiting in-network aggregation for big data applications
NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
OSA: an optical switching architecture for data center networks with unprecedented flexibility
NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
Less is more: trading a little bandwidth for ultra-low latency in the data center
NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
The TCP outcast problem: exposing unfairness in data center networks
NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
Preventing TCP incast throughput collapse at the initiation, continuation, and termination
Proceedings of the 2012 IEEE 20th International Workshop on Quality of Service
Explicit coordination to prevent congestion in data center networks
Cluster Computing
Deadline-aware datacenter tcp (D2TCP)
Proceedings of the ACM SIGCOMM 2012 conference on Applications, technologies, architectures, and protocols for computer communication
Finishing flows quickly with preemptive scheduling
Proceedings of the ACM SIGCOMM 2012 conference on Applications, technologies, architectures, and protocols for computer communication
DeTail: reducing the flow completion time tail in datacenter networks
Proceedings of the ACM SIGCOMM 2012 conference on Applications, technologies, architectures, and protocols for computer communication
Using CPU as a traffic co-processing unit in commodity switches
Proceedings of the first workshop on Hot topics in software defined networks
A switch-based approach to throughput collapse and starvation in data centers
Computer Networks: The International Journal of Computer and Telecommunications Networking
Deadline-aware datacenter tcp (D2TCP)
ACM SIGCOMM Computer Communication Review - Special october issue SIGCOMM '12
Finishing flows quickly with preemptive scheduling
ACM SIGCOMM Computer Communication Review - Special october issue SIGCOMM '12
DeTail: reducing the flow completion time tail in datacenter networks
ACM SIGCOMM Computer Communication Review - Special october issue SIGCOMM '12
Rethinking end-to-end congestion control in software-defined networks
Proceedings of the 11th ACM Workshop on Hot Topics in Networks
Deconstructing datacenter packet transport
Proceedings of the 11th ACM Workshop on Hot Topics in Networks
Tuning ECN for data center networks
Proceedings of the 8th international conference on Emerging networking experiments and technologies
Reproducible network experiments using container-based emulation
Proceedings of the 8th international conference on Emerging networking experiments and technologies
Zone-based data striping for cloud storage
IBM Journal of Research and Development
Bobtail: avoiding long tails in the cloud
nsdi'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
Proceedings of the ACM SIGCOMM 2013 conference on SIGCOMM
pFabric: minimal near-optimal datacenter transport
Proceedings of the ACM SIGCOMM 2013 conference on SIGCOMM
Resource/accuracy tradeoffs in software-defined measurement
Proceedings of the second ACM SIGCOMM workshop on Hot topics in software defined networking
Protocol Responsibility Offloading to Improve TCP Throughput in Virtualized Environments
ACM Transactions on Computer Systems (TOCS)
Proceedings of the ACM SIGCOMM 2013 conference on SIGCOMM
ICTCP: incast congestion control for TCP in data-center networks
IEEE/ACM Transactions on Networking (TON)
Data-Intensive Cloud Computing: Requirements, Expectations, Challenges, and Solutions
Journal of Grid Computing
Efficient new delayed ACK for TCP: old problem, new insight
Proceedings of the 16th ACM international conference on Modeling, analysis & simulation of wireless and mobile systems
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
ACM SIGOPS 24th Symposium on Operating Systems Principles
Timecard: controlling user-perceived delays in server-based mobile applications
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
Explicit multipath congestion control for data center networks
Proceedings of the ninth ACM conference on Emerging networking experiments and technologies
Trevi: watering down storage hotspots with cool fountain codes
Proceedings of the Twelfth ACM Workshop on Hot Topics in Networks
High-fidelity per-flow delay measurements with reference latency interpolation
IEEE/ACM Transactions on Networking (TON)
Review: A survey on architectures and energy efficiency in Data Center Networks
Computer Communications
IDTCP: An effective approach to mitigating the TCP incast problem in data center networks
Information Systems Frontiers
Catch the whole lot in an action: rapid precise packet loss notification in data centers
NSDI'14 Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation
Hi-index | 0.00 |
This paper presents a practical solution to a problem facing high-fan-in, high-bandwidth synchronized TCP workloads in datacenter Ethernets---the TCP incast problem. In these networks, receivers can experience a drastic reduction in application throughput when simultaneously requesting data from many servers using TCP. Inbound data overfills small switch buffers, leading to TCP timeouts lasting hundreds of milliseconds. For many datacenter workloads that have a barrier synchronization requirement (e.g., filesystem reads and parallel data-intensive queries), throughput is reduced by up to 90%. For latency-sensitive applications, TCP timeouts in the datacenter impose delays of hundreds of milliseconds in networks with round-trip-times in microseconds. Our practical solution uses high-resolution timers to enable microsecond-granularity TCP timeouts. We demonstrate that this technique is effective in avoiding TCP incast collapse in simulation and in real-world experiments. We show that eliminating the minimum retransmission timeout bound is safe for all environments, including the wide-area.