Active messages: a mechanism for integrated communication and computation
ISCA '92 Proceedings of the 19th annual international symposium on Computer architecture
The Stanford FLASH multiprocessor
ISCA '94 Proceedings of the 21st annual international symposium on Computer architecture
Tempest and typhoon: user-level shared memory
ISCA '94 Proceedings of the 21st annual international symposium on Computer architecture
Meiko CS-2 interconnect Elan-Elite design
Parallel Computing - Special double issue: SUPRENUM and GENESIS
U-Net: a user-level network interface for parallel and distributed computing
SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
High performance messaging on workstations: Illinois fast messages (FM) for Myrinet
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Virtual-Memory-Mapped Network Interfaces
IEEE Micro
Shrimp Project Update: Myrinet Communication
IEEE Micro
Native Data Representation: An Efficient Wire Format for High-Performance Distributed Computing
IEEE Transactions on Parallel and Distributed Systems
Microprocessors & Microsystems
Network interfaces for programmable NICs and multicore platforms
Computer Networks: The International Journal of Computer and Telecommunications Networking
Hi-index | 0.00 |
With user-level network interfaces applications can access the network directly without operating system intervention on every send and receive. Messages are transferred to and from user-space by the network interface while observing the traditional protection boundaries between processes. First generation user-level network interfaces limit this message transfer to a per-process region of permanently-pinned physical memory to allow safe DMA. This approach is inflexible and does not scale to a large number of processes. A new memory management extension to the U-Net user-level network architecture allows messages to be transferred directly to and from any part of an application's address space. This is achieved by integrating a translation look-aside buffer into the network interface and coordinating its operation with the operating system's virtual memory subsystem. Two implementations demonstrate that existing commodity hardware and commercial operating systems can efficiently support the architecture.