Limits to low-latency communication on high-speed networks

Authors:
Chandramohan A. Thekkath;Henry M. Levy
Affiliations:
Univ. of Washington, Seattle;Univ. of Washington, Seattle
Venue:
ACM Transactions on Computer Systems (TOCS)
Year:
1993

Citing 19
Cited 50

Firefly: A Multiprocessor Workstation

IEEE Transactions on Computers - Special issue on architectural support for programming languages and operating systems
The VMP network adapter board (NAB): high-performance network communication for multiprocessors

SIGCOMM '88 Symposium proceedings on Communications architectures and protocols
The performance of the Amoeba distributed operating system

Software—Practice & Experience
Translation lookaside buffer consistency: a software approach

ASPLOS III Proceedings of the third international conference on Architectural support for programming languages and operating systems
The Amber system: parallel programming on a network of multiprocessors

SOSP '89 Proceedings of the twelfth ACM symposium on Operating systems principles
Threads and input/output in the synthesis kernal

SOSP '89 Proceedings of the twelfth ACM symposium on Operating systems principles
Memory coherence in shared virtual memory systems

ACM Transactions on Computer Systems (TOCS)
Lightweight remote procedure call

ACM Transactions on Computer Systems (TOCS)
Performance of the Firefly RPC

ACM Transactions on Computer Systems (TOCS)
Architectural considerations for a new generation of protocols

SIGCOMM '90 Proceedings of the ACM symposium on Communications architectures & protocols
The effect of context switches on cache performance

ASPLOS IV Proceedings of the fourth international conference on Architectural support for programming languages and operating systems
A portable interface for on-the-fly instruction space modification

ASPLOS IV Proceedings of the fourth international conference on Architectural support for programming languages and operating systems
The interaction of architecture and operating system design

ASPLOS IV Proceedings of the fourth international conference on Architectural support for programming languages and operating systems
A host-network interface architecture for ATM

SIGCOMM '91 Proceedings of the conference on Communications architecture & protocols
A high-performance host interface for ATM networks

SIGCOMM '91 Proceedings of the conference on Communications architecture & protocols
Alpha architecture reference manual

Alpha architecture reference manual
Implementing remote procedure calls

ACM Transactions on Computer Systems (TOCS)
Communicating sequential processes

Communications of the ACM
Ethernet: distributed packet switching for local computer networks

Communications of the ACM

Processor allocation policies for message-passing parallel computers

SIGMETRICS '94 Proceedings of the 1994 ACM SIGMETRICS conference on Measurement and modeling of computer systems
USC: a universal stub compiler

SIGCOMM '94 Proceedings of the conference on Communications architectures, protocols and applications
Quickly generating billion-record synthetic databases

SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
Techniques for reducing consistency-related communication in distributed shared-memory systems

ACM Transactions on Computer Systems (TOCS)
CRL: high-performance all-software distributed shared memory

SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
Exokernel: an operating system architecture for application-level resource management

SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
Extensibility safety and performance in the SPIN operating system

SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
Hypervisor-based fault tolerance

ACM Transactions on Computer Systems (TOCS) - Special issue on operating system principles
Petal: distributed virtual disks

Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
C: a language for high-level, efficient, and machine-independent dynamic code generation

POPL '96 Proceedings of the 23rd ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Profiling and reducing processing overheads in TCP/IP

IEEE/ACM Transactions on Networking (TON)
ASHs: Application-specific handlers for high-performance messaging

Conference proceedings on Applications, technologies, architectures, and protocols for computer communications
DPF: fast, flexible message demultiplexing using dynamic code generation

Conference proceedings on Applications, technologies, architectures, and protocols for computer communications
Analysis of techniques to improve protocol processing latency

Conference proceedings on Applications, technologies, architectures, and protocols for computer communications
tcc: a system for fast, flexible, and high-level dynamic code generation

Proceedings of the ACM SIGPLAN 1997 conference on Programming language design and implementation
Scaling up partial evaluation for optimizing the Sun commercial RPC protocol

PEPM '97 Proceedings of the 1997 ACM SIGPLAN symposium on Partial evaluation and semantics-based program manipulation
ASHs: application-specific handlers for high-performance messaging

IEEE/ACM Transactions on Networking (TON)
Predicting the performance of distributed virtual shared-memory applications

IBM Systems Journal
Compact and efficient presentation conversion code

IEEE/ACM Transactions on Networking (TON)
Performance-Based Path Determination for Interprocessor Communication in Distributed Computing Systems

IEEE Transactions on Parallel and Distributed Systems
C and tcc: a language and compiler for dynamic code generation

ACM Transactions on Programming Languages and Systems (TOPLAS)
An efficient processor partitioning and thread mapping strategy for mesh-connected multiprocessor systems

SAC '97 Proceedings of the 1997 ACM symposium on Applied computing
Dynamic computation migration in DSM systems

Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
Simulation of the 3 dimensional cascade flow with numerical wind tunnel (NWT)

Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
Optimistic active messages: structuring systems for high-performance communication

EW 6 Proceedings of the 6th workshop on ACM SIGOPS European workshop: Matching operating systems to application needs
Towards safe and efficient customization in distributed systems

EW 6 Proceedings of the 6th workshop on ACM SIGOPS European workshop: Matching operating systems to application needs
Using active messages to support shared objects

EW 6 Proceedings of the 6th workshop on ACM SIGOPS European workshop: Matching operating systems to application needs
Efficient Java RMI for parallel programming

ACM Transactions on Programming Languages and Systems (TOPLAS)
Ibis: an efficient Java-based grid programming environment

JGI '02 Proceedings of the 2002 joint ACM-ISCOPE conference on Java Grande
Supporting parallel applications on clusters of workstations: The Virtual Communication Machine-based architecture

Cluster Computing
Distributed network computing over local ATM networks

Proceedings of the 1994 ACM/IEEE conference on Supercomputing
Performance evaluation of three distributed computing environments for scientific applications

Proceedings of the 1994 ACM/IEEE conference on Supercomputing
Models for Asynchronous Message Handling

IEEE Parallel & Distributed Technology: Systems & Technology
Client-Server Computing on Shrimp

IEEE Micro
Software Support for Virtual Memory-Mapped Communication

IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
Experience with Parallel Computing on the AN2 Network

IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
Think: A Software Framework for Component-based Operating System Kernels

ATEC '02 Proceedings of the General Track of the annual conference on USENIX Annual Technical Conference
Exploiting multiple heterogeneous networks to reduce communication costs in parallel programs

HCW '97 Proceedings of the 6th Heterogeneous Computing Workshop (HCW '97)
A Hybrid Analysis of an Optimization Approach for Cluster Applications

The Journal of Supercomputing
Making the Most Out of Direct-Access Network Attached Storage

FAST '03 Proceedings of the 2nd USENIX Conference on File and Storage Technologies
Design and Evaluation of Dynamic Key Message Algorithms for Cluster Computing

HPCASIA '05 Proceedings of the Eighth International Conference on High-Performance Computing in Asia-Pacific Region
A portable kernel abstraction for low-overhead ephemeral mapping management

ATEC '05 Proceedings of the annual conference on USENIX Annual Technical Conference
Devil: an IDL for hardware programming

OSDI'00 Proceedings of the 4th conference on Symposium on Operating System Design & Implementation - Volume 4
Latency analysis of TCP on an ATM network

WTEC'94 Proceedings of the USENIX Winter 1994 Technical Conference on USENIX Winter 1994 Technical Conference
Distributed filaments: efficient fine-grain parallelism on a cluster of workstations

OSDI '94 Proceedings of the 1st USENIX conference on Operating Systems Design and Implementation
High-performance distributed objects over system area networks

WINSYM'99 Proceedings of the 3rd conference on USENIX Windows NT Symposium - Volume 3
Enabling semantic communications for virtual machines via iConnect

VTDC '07 Proceedings of the 2nd international workshop on Virtualization technology in distributed computing
High-speed networks: definition and fundamental attributes

Computer Communications
Making the most out of direct-access network attached storage

FAST'03 Proceedings of the 2nd USENIX conference on File and storage technologies
Network interface design for low latency request-response protocols

USENIX ATC'13 Proceedings of the 2013 USENIX conference on Annual Technical Conference

Quantified Score

Hi-index	0.00

Visualization

Abstract

The throughput of local area networks is rapidly increasing. For example, the bandwidth of new ATM networks and FDDI token rings is an order of magnitude greater than that of Ethernets. Other network technologies promise a bandwidth increase of yet another order of magnitude in several years. However, in distributed systems, lowered latency rather than increased throughput is often of primary concern. This paper examines the system-level effects of newer high-speed network technologies on low-latency, cross-machine communications.To evaluate a number of influences, both hardware and software, we designed and implemented a new remote procedure call system targeted at providing low latency. We then ported this system to several hardware platforms (DECstation and SPARCstation) with several different networks and controllers (ATM, FDDI, and Ethernet). Comparing these systems allows us to explore the performance impact of alternative designs in the communication system with respect to achieving low latency, e.g., the network, the network controller, the hose architecture and cache system, and the kernel and user-level runtime software.Our RPC system, which achieves substantially reduced call times (170 &mgr;seconds on an ATM network using DECstation 5000/200 hosts), allows us to isolate those components of next-generation networks and controllers that still stand in the way of low-latency communication. We demonstrate that new-generation processor technology and software design can reduce small-packet RPC times to near network-imposed limits, making network and controller design more crucial than ever to achieving truly low-latency communication.