A distributed memory LAPSE: parallel simulation of message-passing programs
PADS '94 Proceedings of the eighth workshop on Parallel and distributed simulation
TCP Vegas: new techniques for congestion detection and avoidance
SIGCOMM '94 Proceedings of the conference on Communications architectures, protocols and applications
Analysis of bounded time warp and comparison with YAWNS
ACM Transactions on Modeling and Computer Simulation (TOMACS)
Parallelized Direct Execution Simulation of Message-Passing Parallel Programs
IEEE Transactions on Parallel and Distributed Systems
A workstation-based parallel direct-execution simulator
Proceedings of the eleventh workshop on Parallel and distributed simulation
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
Using loss pairs to discover network properties
IMW '01 Proceedings of the 1st ACM SIGCOMM Workshop on Internet Measurement
High Performance Wide Area Data Transfers over High Performance Networks
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Parallelized Network Simulators for Message-Passing Parallel Programs
MASCOTS '95 Proceedings of the 3rd International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems
Discriminating Congestion Losses from Wireless Losses using Inter-Arrival Times at the Receiver
ASSET '99 Proceedings of the 1999 IEEE Symposium on Application - Specific Systems and Software Engineering and Technology
Reliable Blast UDP: Predictable High Performance Bulk Data Transfer
CLUSTER '02 Proceedings of the IEEE International Conference on Cluster Computing
An Evaluation of Object-Based Data Transfers on High Performance Networks
HPDC '02 Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing
MSS '01 Proceedings of the Eighteenth IEEE Symposium on Mass Storage Systems and Technologies
Distinguishing Congestion Losses from Wireless Transmission Losses: A Negative Result
IC3N '98 Proceedings of the International Conference on Computer Communications and Networks
End-to-end differentiation of congestion and wireless losses
IEEE/ACM Transactions on Networking (TON)
Scalable TCP: improving performance in highspeed wide area networks
ACM SIGCOMM Computer Communication Review
Web100: extended TCP instrumentation for research, education and diagnosis
ACM SIGCOMM Computer Communication Review
Experiences in Design and Implementation of a High Performance Transport Protocol
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
The Globus eXtensible Input/Output System (XIO): A Protocol Independent IO System for the Grid
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 4 - Volume 05
"De-randomizing" congestion losses to improve TCP performance over wired-wireless networks
IEEE/ACM Transactions on Networking (TON)
Supporting Configurable Congestion Control in Data Transport Services
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Classifiers for the causes of data loss using packet-loss signatures
CCGRID '04 Proceedings of the 2004 IEEE International Symposium on Cluster Computing and the Grid
UDT: UDP-based data transfer for high-speed wide area networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
Globus GridFTP: what's new in 2007
Proceedings of the first international conference on Networks for grid applications
TCP westwoodVT: a novel technique for discriminating the cause of packet loss in wireless networks
NETWORKING'07 Proceedings of the 6th international IFIP-TC6 conference on Ad Hoc and sensor networks, wireless networks, next generation internet
FAST TCP: from theory to experiments
IEEE Network: The Magazine of Global Internetworking
Hi-index | 0.00 |
This paper describes a lightweight, high-performance communication protocol for the high-bandwidth, high-delay networks typical of computational Grids. One unique feature of this protocol is that it incorporates an extremely accurate classification mechanism that is efficient enough to diagnose the cause of data loss in real time, providing to the controller the opportunity to respond to different causes of data loss in different ways. The simplest adaptive response, and the one discussed in this paper, is to trigger aggressive congestion control measures only when the data loss is diagnosed as network related. However, even this very simple adaptation can have a tremendous impact on performance in a Grid setting where the resources allocated to a long-running, data-intensive application can fluctuate significantly during the course of its execution. In fact, we provide results showing that the utilization of the information provided by the classifier increased performance by over two orders of magnitude depending on the dominant cause of data loss. In this paper, we discuss the Bayesian statistical framework upon which the classifier is based and the classification metrics that make this approach highly successful. We discuss the integration of the classifier into the congestion control structures of an existing high-performance communication protocol, and provide empirical results showing that it correctly diagnosed the cause of data loss in over 98% of the experimental trials.