An integration of network communication with workstation architecture
ACM SIGCOMM Computer Communication Review
The design of the Caltech Mosaic C multicomputer
Proceedings of the 1993 symposium on Research on integrated systems
A family of routing and communication chips based on the Mosaic
Proceedings of the 1993 symposium on Research on integrated systems
ROMM routing on mesh and torus networks
Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures
The interaction of parallel and sequential workloads on a network of workstations
Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Serverless network file systems
SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
CRL: high-performance all-software distributed shared memory
SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
High performance messaging on workstations: Illinois fast messages (FM) for Myrinet
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Serverless network file systems
ACM Transactions on Computer Systems (TOCS) - Special issue on operating system principles
1995 observations on supercomputing alternatives: did the MPP bandwagon lead to a cul-de-sac?
Communications of the ACM
Decoupled hardware support for distributed shared memory
ISCA '96 Proceedings of the 23rd annual international symposium on Computer architecture
Integrating performance monitoring and communication in parallel computers
Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
Synchronization hardware for networks of workstations: performance vs. cost
ICS '96 Proceedings of the 10th international conference on Supercomputing
Parallel simulation of a high-speed wormhole routing network
PADS '96 Proceedings of the tenth workshop on Parallel and distributed simulation
The impact of a zero-scan Internet checksumming mechanism
ACM SIGCOMM Computer Communication Review
Multicasting protocols for high-speed, wormhole-routing local area networks
Conference proceedings on Applications, technologies, architectures, and protocols for computer communications
IEEE Transactions on Parallel and Distributed Systems
High-performance sorting on networks of workstations
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures
Design and evaluation of a DRAM-based shared memory ATM switch
SIGMETRICS '97 Proceedings of the 1997 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
File server scaling with network-attached secure disks
SIGMETRICS '97 Proceedings of the 1997 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Flick: a flexible, optimizing IDL compiler
Proceedings of the ACM SIGPLAN 1997 conference on Programming language design and implementation
pSNOW: a tool to evaluate architectural issues for NOW environments
ICS '97 Proceedings of the 11th international conference on Supercomputing
Relaxed consistency and coherence granularity in DSM systems: a performance evaluation
PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
Effects of communication latency, overhead, and bandwidth in a cluster architecture
Proceedings of the 24th annual international symposium on Computer architecture
VM-based shared memory on low-latency, remote-memory-access networks
Proceedings of the 24th annual international symposium on Computer architecture
Efficient synchronization: let them eat QOLB
Proceedings of the 24th annual international symposium on Computer architecture
Exploiting local data in parallel array I/O on a practical network of workstations
Proceedings of the fifth workshop on I/O in parallel and distributed systems
Cluster-based scalable network services
Proceedings of the sixteenth ACM symposium on Operating systems principles
Performance evaluation of the Orca shared-object system
ACM Transactions on Computer Systems (TOCS)
FPGA '98 Proceedings of the 1998 ACM/SIGDA sixth international symposium on Field programmable gate arrays
IEEE Transactions on Computers
Per-Node Multithreading and Remote Latency
IEEE Transactions on Computers
Deadlock-free routing in arbitrary networks via the flattest common supersequence method
Proceedings of the tenth annual ACM symposium on Parallel algorithms and architectures
Modeling communication pipeline latency
SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Implementing cooperative prefetching and caching in a globally-managed memory system
SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
ICS '98 Proceedings of the 12th international conference on Supercomputing
Monitoring shared virtual memory performance on a Myrinet-based PC cluster
ICS '98 Proceedings of the 12th international conference on Supercomputing
Evaluation of hardware write propagation support for next-generation shared virtual memory clusters
ICS '98 Proceedings of the 12th international conference on Supercomputing
Performance measurements for multithreaded programs
SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Scheduling with implicit information in distributed systems
SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Implementation of reductions in support of PDES on a network of workstations
PADS '98 Proceedings of the twelfth workshop on Parallel and distributed simulation
Switcherland: a QoS communication architecture for workstation clusters
Proceedings of the 25th annual international symposium on Computer architecture
Design choices in the SHRIMP system: an empirical study
Proceedings of the 25th annual international symposium on Computer architecture
A High Performance Message-Passing System for Network of Workstations
The Journal of Supercomputing - Special issue: high performance distributed computing
ACM SIGOPS Operating Systems Review
Performance monitoring in a Myrinet-connected SHRIMP cluster
SPDT '98 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
Searching for the sorting record: experiences in tuning NOW-Sort
SPDT '98 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
A Competitive Analysis of Load Balancing Strategiesfor Parallel Ray Tracing
The Journal of Supercomputing
A Performance Evaluation of the Convex SPP-1000 Scalable Shared Memory Parallel Computer
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Techniques for energy minimization of communication pipelines
Proceedings of the 1998 IEEE/ACM international conference on Computer-aided design
Hardware Support for Flexible Distributed Shared Memory
IEEE Transactions on Computers
A cost-effective, high-bandwidth storage architecture
Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
UTLB: a mechanism for address translation on network interfaces
Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
A task- and data-parallel programming language based on shared objects
ACM Transactions on Programming Languages and Systems (TOPLAS)
MultiView and Millipage — fine-grain sharing in page-based DSMs
OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
ISCA '99 Proceedings of the 26th annual international symposium on Computer architecture
Design challenges of virtual networks: fast, general-purpose communication
Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming
MagPIe: MPI's collective communication operations for clustered wide area systems
Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming
An efficient implementation of Java's remote method invocation
Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming
LOTEC: a simple DSM consistency protocol for nested object transactions
Proceedings of the eighteenth annual ACM symposium on Principles of distributed computing
Exploiting temporal uncertainty in parallel and distributed simulations
PADS '99 Proceedings of the thirteenth workshop on Parallel and distributed simulation
NFS sensitivity to high performance networks
SIGMETRICS '99 Proceedings of the 1999 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Potentials and limitations of fault-based Markov prefetching for virtual memory pages
SIGMETRICS '99 Proceedings of the 1999 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Cluster I/O with River: making the fast case common
Proceedings of the sixth workshop on I/O in parallel and distributed systems
Multiple Multicast with Minimized Node Contention on Wormhole k-ary n-cube Networks
IEEE Transactions on Parallel and Distributed Systems
Wire-area parallel computing in Java
JAVA '99 Proceedings of the ACM 1999 conference on Java Grande
JAVA '99 Proceedings of the ACM 1999 conference on Java Grande
Responsiveness without interrupts
ICS '99 Proceedings of the 13th international conference on Supercomputing
Application scaling under shared virtual memory on a cluster of SMPs
ICS '99 Proceedings of the 13th international conference on Supercomputing
Fast cluster failover using virtual memory-mapped communication
ICS '99 Proceedings of the 13th international conference on Supercomputing
A closer look at coscheduling approaches for a network of workstations
Proceedings of the eleventh annual ACM symposium on Parallel algorithms and architectures
Experience with an adaptive globally-synchronizing clock algorithm
Proceedings of the eleventh annual ACM symposium on Parallel algorithms and architectures
Load balancing for multi-projector rendering systems
HWWS '99 Proceedings of the ACM SIGGRAPH/EUROGRAPHICS workshop on Graphics hardware
Transposition table driven work scheduling in distributed search
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Adaptive-Trail Routing and Performance Evaluation in Irregular Networks Using Cut-Through Switches
IEEE Transactions on Parallel and Distributed Systems
Hierarchical Simulation Approach to Accurate Fault Modeling for System Dependability Evaluation
IEEE Transactions on Software Engineering
An efficient communication architecture for commodity supercomputers
SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
BIP-SMP: high performance message passing over a cluster of commodity SMPs
SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
Architectural requirements and scalability of the NAS parallel benchmarks
SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
A personal supercomputer for climate research
SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
Software-Based Rerouting for Fault-Tolerant Pipelined Communication
IEEE Transactions on Parallel and Distributed Systems
Efficient Execution of Time Warp Programs on Heterogeneous, NOW Platforms
IEEE Transactions on Parallel and Distributed Systems
Performance evaluation of a new routing strategy for irregular networks with source routing
Proceedings of the 14th international conference on Supercomputing
Characterizing processor architectures for programmable network interfaces
Proceedings of the 14th international conference on Supercomputing
Pre-sampling as an approach for exploiting temporal uncertainty
PADS '00 Proceedings of the fourteenth workshop on Parallel and distributed simulation
Efficient replicated method invocation in Java
Proceedings of the ACM 2000 conference on Java Grande
A Distributed Shared-Memory System on a Workstation Cluster Using Fast Serial Links
International Journal of Parallel Programming - Special issue on international symposium on high performance computing 1997, part I
Hybrid sort-first and sort-last parallel rendering with a cluster of PCs
HWWS '00 Proceedings of the ACM SIGGRAPH/EUROGRAPHICS workshop on Graphics hardware
Design of a performance technology infrastructure to support the construction of responsive software
Proceedings of the 2nd international workshop on Software and performance
High-Performance Routing in Networks of Workstations with Irregular Topology
IEEE Transactions on Parallel and Distributed Systems
On the Use of Virtual Channels in Networks of Workstations with Irregular Topology
IEEE Transactions on Parallel and Distributed Systems
Evaluating design alternatives for reliable communication on high-speed networks
ACM SIGPLAN Notices
Performance Metrics for Embedded Parallel Pipelines
IEEE Transactions on Parallel and Distributed Systems
Routing in the bidirectional shufflenet
IEEE/ACM Transactions on Networking (TON)
ACM Computing Surveys (CSUR)
Accelerating shared virtual memory via general-purpose network interface support
ACM Transactions on Computer Systems (TOCS)
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
Scalable fault-tolerant distributed shared memory
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
Distributed rendering for scalable displays
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
Dynamic Access Ordering for Streamed Computations
IEEE Transactions on Computers
Computing in the RAIN: A Reliable Array of Independent Nodes
IEEE Transactions on Parallel and Distributed Systems
A Protocol for Deadlock-Free Dynamic Reconfiguration in High-Speed Local Area Networks
IEEE Transactions on Parallel and Distributed Systems
Architectural Support for Efficient Multicasting in Irregular Networks
IEEE Transactions on Parallel and Distributed Systems
Object-based collective communication in Java
Proceedings of the 2001 joint ACM-ISCOPE conference on Java Grande
Runtime optimizations for a Java DSM implementation
Proceedings of the 2001 joint ACM-ISCOPE conference on Java Grande
ICS '01 Proceedings of the 15th international conference on Supercomputing
Evaluating design alternatives for reliable communication on high-speed networks
ASPLOS IX Proceedings of the ninth international conference on Architectural support for programming languages and operating systems
QoS provisioning in clusters: an investigation of Router and NIC design
ISCA '01 Proceedings of the 28th annual international symposium on Computer architecture
Source-level global optimizations for fine-grain distributed shared memory systems
PPoPP '01 Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming
LogGPS: a parallel computational model for synchronization analysis
PPoPP '01 Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming
Implicit coscheduling: coordinated scheduling with implicit information in distributed systems
ACM Transactions on Computer Systems (TOCS)
WireGL: a scalable graphics system for clusters
Proceedings of the 28th annual conference on Computer graphics and interactive techniques
Efficient Multicast on Irregular Switch-Based Cut-Through Networks with Up-Down Routing
IEEE Transactions on Parallel and Distributed Systems
Parallel rendering with k-way replication
PVG '01 Proceedings of the IEEE 2001 symposium on parallel and large-data visualization and graphics
Parallel implementation of self-organizing maps
Self-Organizing neural networks
A new fast message passing communication system for multiprocessor workstation clusters
Progress in computer research
A General Theory for Deadlock-Free Adaptive Routing Using a Mixed Set of Resources
IEEE Transactions on Parallel and Distributed Systems
Efficient Java RMI for parallel programming
ACM Transactions on Programming Languages and Systems (TOPLAS)
Four-Ary Tree-Based Barrier Synchronization for 2D Meshes without Nonmember Involvement
IEEE Transactions on Computers - Special issue on the parallel architecture and compilation techniques conference
Deadlock-Free Oblivious Wormhole Routing with Cyclic Dependencies
IEEE Transactions on Computers
A Cost-Effective Approach to Deadlock Handling in Wormhole Networks
IEEE Transactions on Parallel and Distributed Systems
An implementation and analysis of the virtual interface architecture
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
MPI-StarT: delivering network performance to numerical applications
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
User-space communication: a quantitative study
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
StarT-Voyager: a flexible platform for exploring scalable SMP issues
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Highly efficient gang scheduling implementation
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
The effects of communication parameters on end performance of shared virtual memory clusters
SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
FM-QoS: real-time communication using self-synchronizing schedules
SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
Multi-protocol active messages on a cluster of SMP's
SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
Modeling and analysis of dynamic coscheduling in parallel and distributed environments
SIGMETRICS '02 Proceedings of the 2002 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Dynamic memory management for programmable devices
Proceedings of the 3rd international symposium on Memory management
Parallelization and performance of 3D ultrasound imaging beamforming algorithms on modern clusters
ICS '02 Proceedings of the 16th international conference on Supercomputing
Early cancellation: an active NIC optimization for time-warp
Proceedings of the sixteenth workshop on Parallel and distributed simulation
HIPIQS: A High-Performance Switch Architecture Using Input Queuing
IEEE Transactions on Parallel and Distributed Systems
Fair and Efficient Packet Scheduling Using Elastic Round Robin
IEEE Transactions on Parallel and Distributed Systems
A Performance Analysis of Transposition-Table-Driven Work Scheduling in Distributed Search
IEEE Transactions on Parallel and Distributed Systems
A new fast message passing communication system for multiprocessor workstation clusters
Progress in computer research
Lazy Garbage Collection of Recovery State for Fault-Tolerant Distributed Shared Memory
IEEE Transactions on Parallel and Distributed Systems
Boosting the Performance of Myrinet Networks
IEEE Transactions on Parallel and Distributed Systems
Approach for software development of parallel real-time VE systems on heterogenous clusters
EGPGV '02 Proceedings of the Fourth Eurographics Workshop on Parallel Graphics and Visualization
Design and implementation of a large-scale hybrid distributed graphics system
EGPGV '02 Proceedings of the Fourth Eurographics Workshop on Parallel Graphics and Visualization
A new routing mechanism for networks with irregular topology
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Scalable parallel application launch on Cplant™
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Next-generation visual supercomputing using PC clusters with volume graphics hardware devices
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Cost effectiveness of an adaptable computing cluster
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
EMP: zero-copy OS-bypass NIC-driven gigabit ethernet message passing
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
ACM Transactions on Computer Systems (TOCS)
Prediction and adaptation in Active Harmony
Cluster Computing
Storing spatial data on a network of workstations
Cluster Computing
The Network RamDisk: Using remote memory on heterogeneous NOWs
Cluster Computing
High Performance Network of PC Cluster Maestro
Cluster Computing
Software Architecture for Processing Clusters Based on I2O
Cluster Computing
A Software Suite for High-Performance Communications on Clusters of SMPs
Cluster Computing
Dual-tree-based multicasting on wormhole-routed irregular switch-based networks
Journal of Systems Architecture: the EUROMICRO Journal
Studies on striping and buffer caching issues for the software RAID file system
Journal of Systems Architecture: the EUROMICRO Journal
The Journal of Supercomputing
Information Retrieval on an SCI-Based PC Cluster
The Journal of Supercomputing
Evolving RPC for active storage
Proceedings of the 10th international conference on Architectural support for programming languages and operating systems
Models for Asynchronous Message Handling
IEEE Parallel & Distributed Technology: Systems & Technology
Fast Messages: Efficient, Portable Communication for Workstation Clusters and MPPs
IEEE Parallel & Distributed Technology: Systems & Technology
COMPaS: A PC-Based SMP Cluster
IEEE Concurrency
IEEE Concurrency
A Case for NOW (Networks of Workstations)
IEEE Micro
Assessing Fast Network Interfaces
IEEE Micro
Client-Server Computing on Shrimp
IEEE Micro
A Delay Model for Router Microarchitectures
IEEE Micro
Lazy Garbage Collection of Recovery State for Fault-Tolerant Distributed Shared Memory
IEEE Transactions on Parallel and Distributed Systems
Boosting the Performance of Myrinet Networks
IEEE Transactions on Parallel and Distributed Systems
MediaWorm: A QoS Capable Router Architecture for Clusters
IEEE Transactions on Parallel and Distributed Systems
Portable and scalable algorithm for irregular all-to-all communication
Journal of Parallel and Distributed Computing
A Pipeline-Based Approach for Scheduling Video Processing Algorithms on NOW
IEEE Transactions on Parallel and Distributed Systems
Parallel simulation of chip-multiprocessor architectures
ACM Transactions on Modeling and Computer Simulation (TOMACS)
Performance Evaluation of Real-Time Communication Services on High-Speed LANs under Topology Changes
HiPC '01 Proceedings of the 8th International Conference on High Performance Computing
A Fast Tree-Based Barrier Synchroization on Switch-Based Irregular Networks
HiPC '00 Proceedings of the 7th International Conference on High Performance Computing
Protocols and Software for Exploiting Myrinet Clusters
ICCS '01 Proceedings of the International Conference on Computational Sciences-Part I
Optimal Multicast with Packetization and Network Interface Support
ICPP '97 Proceedings of the international Conference on Parallel Processing
Design of Scalable and Multicast Capable Cut-Through Switches for High-Speed LANs
ICPP '97 Proceedings of the international Conference on Parallel Processing
Software-Based Deadlock Recovery Technique for True Fully Adaptive Routing in Wormhole Networks
ICPP '97 Proceedings of the international Conference on Parallel Processing
An Architecture for Using Multiple Communication Devices in a MPI Library
HPCN Europe 2000 Proceedings of the 8th International Conference on High-Performance Computing and Networking
Evaluation of Alternative Arbitration Policies for Myrinet Switches
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Using Programmable NICs for Time-Warp Optimization
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Analyzing the Influence of Virtual Lanes on the Performance of InfiniBand Networks
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Survivable Computer Networks in the Presence of Partitioning
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
A Parallel Ultra-High Resolution MPEG-2 Video Decoder for PC Cluster Based Tiled Display Systems
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Experience with Parallel Computing on the AN2 Network
IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
Design and Implementation of Virtual Memory-Mapped Communication on Myrinet
IPPS '97 Proceedings of the 11th International Symposium on Parallel Processing
Reducing Waiting Costs in User-Level Communication
IPPS '97 Proceedings of the 11th International Symposium on Parallel Processing
Portals 3.0: Protocol Building Blocks for Low Overhead Communication
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Layered Shortest Path (LASH) Routing in Irregular System Area Networks
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Platform-Independent Runtime Optimizations Using OpenThreads
IPPS '97 Proceedings of the 11th International Symposium on Parallel Processing
Exploiting Transparent Remote Memory Access for Non-Contiguous- and One-Sided-Communication
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Can User-Level Protocols Take Advantage of Multi-CPU NICs?
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Tuning Buffer Size in the Multimedia Router (MMR)
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
User-Level Communication in a System with Gang Scheduling
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Message Passing Vs. Shared Address Space on a Clusters of SMPs
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Mixing High Performance and Portability for the Design of Active Network Framework with Java
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
A New Approach to Provide Real-Time Services on High-Speed Local Area Networks
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Performance Improvement for Applications on Parallel Computers
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
An Adaptive Communication System for Heterogeneous Network Computing
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
A First Implementation of In-Transit Buffers on Myrinet GM Software
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Implementation of Finite Lattices in VLSI for Fault-State Encoding in High-Speed Networks
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
ClusterNet: An Object-Oriented Cluster Network
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
Parallel Information Retrieval on an SCI-Based PC-NOW
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
Priority Based Messaging for Software Distributed Shared Memory
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Dynamically Scaling Computer Networks
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
On the Interconnection Topology for Storage Area Networks
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Automatic Scheduler for Real-Time Vision Applications
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Performance Benefits of NIC-Based Barrier on Myrinet/GM
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
The MultiCluster Model to the Integrated Use of Multiple Workstation Clusters
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Performance Evaluation of the Quadrics Interconnection Network
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
An Approach to Asynchronous Object-Oriented Parallel and Distributed Computing on Wide-Area Systems
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
A PC-NOW Based Parallel Extension for a Sequential DBMS
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
A Simple Incremental Network Topology for Wormhole Switch-Based Networks
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
VIBe: A Micro-benchmark Suite for Evaluating Virtual Interface Architecture (VIA) Implementations
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Investigating Switch Scheduling Algorithms to Support QoS in the Multimedia Router
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Optimization of Parallel Algorithms on Cluster of SMP's
PARA '02 Proceedings of the 6th International Conference on Applied Parallel Computing Advanced Scientific Computing
A Cluster-Based Solution for a High Performance Air Quality Simulation
PARA '02 Proceedings of the 6th International Conference on Applied Parallel Computing Advanced Scientific Computing
SCI-Based LINUX PC-Clusters as a Platform for Electromagnetic Field Calculations
PaCT '01 Proceedings of the 6th International Conference on Parallel Computing Technologies
Parallel Unstructured AMR and Gigabit Networking for Beowulf-Class Clusters
PPAM '01 Proceedings of the th International Conference on Parallel Processing and Applied Mathematics-Revised Papers
A Parallel System Architecture Based on Dynamically Configurable Shared Memory Clusters
PPAM '01 Proceedings of the th International Conference on Parallel Processing and Applied Mathematics-Revised Papers
Bottleneck Analysis of a Gigabit Network Interface Card: Formal Verification Approach
Proceedings of the 9th International SPIN Workshop on Model Checking of Software
An Efficient and Scalable Coscheduling Technique for Large Symmetric Multiprocessor Clusters
JSSPP '01 Revised Papers from the 7th International Workshop on Job Scheduling Strategies for Parallel Processing
Performance Sensitivity of Routing Algorithms to Failures in Networks of Worksations
ISHPC '00 Proceedings of the Third International Symposium on High Performance Computing
ISHPC '00 Proceedings of the Third International Symposium on High Performance Computing
On the Influence of the Selection Function on the Performance of Networks of Workstations
ISHPC '00 Proceedings of the Third International Symposium on High Performance Computing
A Flexible Routing Scheme for Networks of Workstations
ISHPC '00 Proceedings of the Third International Symposium on High Performance Computing
Improving InfiniBand Routing through Multiple Virtual Networks
ISHPC '02 Proceedings of the 4th International Symposium on High Performance Computing
Multicasting on Switch-Based Irregular Networks Using Multi-drop Path-Based Multidestination Worms
PCRCW '97 Proceedings of the Second International Workshop on Parallel Computer Routing and Communication
PCI-DDC Application Programming Interface: Performance in User-Level Messaging (Research Note)
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
Improving the Up*/Down* Routing Scheme for Networks of Workstations
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
Request Sequencing: Optimizing Communication for the Grid
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
Building Distributed Applications Using Multiple, Heterogeneous Environments
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
On Deadlock Frequency during Dynamic Reconfiguration in NOWs
Euro-Par '01 Proceedings of the 7th International Euro-Par Conference Manchester on Parallel Processing
VIA Communication Performance on a Gigabit Ethernet Cluster
Euro-Par '01 Proceedings of the 7th International Euro-Par Conference Manchester on Parallel Processing
Euro-Par '01 Proceedings of the 7th International Euro-Par Conference Manchester on Parallel Processing
Prioritizing Network Event Handling in Clusters of Workstations
Euro-Par '01 Proceedings of the 7th International Euro-Par Conference Manchester on Parallel Processing
PAPI Message Passing Library: Comparison of Performance in User and Kernel Level Messaging
Euro-Par '01 Proceedings of the 7th International Euro-Par Conference Manchester on Parallel Processing
Characterizing the Scalability of Decision-Support Workloads on Clusters and SMP Systems
Euro-Par '02 Proceedings of the 8th International Euro-Par Conference on Parallel Processing
Stepwise Optimizations of UDP/IP on a Gigabit Network (Research Note)
Euro-Par '02 Proceedings of the 8th International Euro-Par Conference on Parallel Processing
Multi-protocol Communications and High Speed Networks
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
High-Speed LANs: New Environments for Parallel and Distributed Applications
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
Deadlock Avoidance for Wormhole Based Switches
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
Dynamic Reconfiguration and Virtual Machine Management in the Harness Metacomputing System
ISCOPE '98 Proceedings of the Second International Symposium on Computing in Object-Oriented Parallel Environments
CableS: Thread Control and Memory System Extensions for Shared Virtual Memory Clusters
WOMPAT '01 Proceedings of the International Workshop on OpenMP Applications and Tools: OpenMP Shared Memory Parallel Programming
VECPAR '00 Selected Papers and Invited Talks from the 4th International Conference on Vector and Parallel Processing
Parallel Cluster Computing with IEEE1394-1995
ParNum '99 Proceedings of the 4th International ACPC Conference Including Special Tracks on Parallel Numerics and Parallel Computing in Image Processing, Video Processing, and Multimedia: Parallel Computation
Multilayer Online-Monitoring for Hybrid DSM Systems on Top of PC Clusters with a SMiLE
TOOLS '00 Proceedings of the 11th International Conference on Computer Performance Evaluation: Modelling Techniques and Tools
Comparing Reference Counting and Global Mark-and-Sweep on Parallel Computers
LCR '98 Selected Papers from the 4th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Thread Migration and Load-Balancing in Heterogeneous Environments
LCR '00 Selected Papers from the 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
System Area Network Extensions to the Parallel Virtual Machine
Proceedings of the 8th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
An MPI Implementation on the Top of the Virtual Interface Architecture
Proceedings of the 6th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Building MPI for Multi-Programming Systems Using Implicit Information
Proceedings of the 6th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Ready-Mode Receive: An Optimized Receive Function for MPI
Proceedings of the 9th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Design and Implementation of MPI on Portals 3.0
Proceedings of the 9th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Design of DMPI on DAWNING-3000
Proceedings of the 9th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
IWCC '01 Proceedings of the NATO Advanced Research Workshop on Advanced Environments, Tools, and Applications for Cluster Computing-Revised Papers
A Customizable Simulator for Workstation Networks
IPPS '97 Proceedings of the 11th International Symposium on Parallel Processing
Ultra-high performance communication with MPI and the Sun fire™ link interconnect
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Separated high-bandwidth and low-latency communication in the cluster interconnect Clint
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
STORM: lightning-fast resource management
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Message passing and shared address space parallelism on an SMP cluster
Parallel Computing
Programming environments for high-performance grid computing: the Albatross project
Future Generation Computer Systems - Grid computing: Towards a new computing infrastructure
High-performance thread migration on clusters of SMPs
Cluster computing
PadicoTM: an open integration framework for communication middleware and runtimes
Future Generation Computer Systems - Selected papers from CCGRID 2002
Exploiting task-level concurrency in a programmable network interface
Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming
Placement of I/O servers to improve parallel I/O performance on switch-based clusters
ICS '03 Proceedings of the 17th annual international conference on Supercomputing
miNI: reducing network interface memory requirements with dynamic handle lookup
ICS '03 Proceedings of the 17th annual international conference on Supercomputing
Communication performance issues for two cluster computers
ACSC '03 Proceedings of the 26th Australasian computer science conference - Volume 16
A Codesign Environment Supporting Hardware/Software Modeling at Different Levels of Detail
CODES '97 Proceedings of the 5th International Workshop on Hardware/Software Co-Design
Sepia: Scalable 3D Compositing Using PCI Pamette
FCCM '99 Proceedings of the Seventh Annual IEEE Symposium on Field-Programmable Custom Computing Machines
Integrating polling, interrupts, and thread management
FRONTIERS '96 Proceedings of the 6th Symposium on the Frontiers of Massively Parallel Computation
An Efficient Group Communication Architecture over ATM Networks
HCW '98 Proceedings of the Seventh Heterogeneous Computing Workshop
Heterogeneous Distributed Virtual Machines in the Harness Metacomputing Framework
HCW '99 Proceedings of the Eighth Heterogeneous Computing Workshop
HCW '99 Proceedings of the Eighth Heterogeneous Computing Workshop
Firmware-Level Latency Analysis on a Gigabit Network
The Journal of Supercomputing
Dynamic Data Replication: An Approach to Providing Fault-Tolerant Shared Memory Clusters
HPCA '03 Proceedings of the 9th International Symposium on High-Performance Computer Architecture
Dynamic Voltage Scaling with Links for Power Optimization of Interconnection Networks
HPCA '03 Proceedings of the 9th International Symposium on High-Performance Computer Architecture
Evaluating the Impact of Communication Architecture on the Performability of Cluster-Based Services
HPCA '03 Proceedings of the 9th International Symposium on High-Performance Computer Architecture
Automatic exploitation of dual level parallelism on a network of multiprocessors
HPDC '96 Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing
A Scalable Architecture for Clustered Network Attached Storage
MSS '03 Proceedings of the 20 th IEEE/11 th NASA Goddard Conference on Mass Storage Systems and Technologies (MSS'03)
Issues in using heterogeneous HPC systems for embedded real time signal processing applications
RTCSA '95 Proceedings of the 2nd International Workshop on Real-Time Computing Systems and Applications
Performance of Congestion Control Mechanisms in Wormhole Routing Networks
INFOCOM '97 Proceedings of the INFOCOM '97. Sixteenth Annual Joint Conference of the IEEE Computer and Communications Societies. Driving the Information Revolution
Dynamic load balancing for switch-based networks
Journal of Parallel and Distributed Computing
Journal of Parallel and Distributed Computing
Fast Dynamic Reconfiguration in Irregular Networks
ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
Improving the Performance of Regular Networks with Source Routing
ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
A Network Co-processor-Based Approach to Scalable Media Streaming in Servers
ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
On the Design of Communication-Aware Task Scheduling Strategies for Heterogeneous Systems
ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
A queueing model for wormhole routing with timeout
ICCCN '95 Proceedings of the 4th International Conference on Computer Communications and Networks
Optimizing Parallel Applications for Wide-Area Clusters
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
Optimal Contention-Free Unicast-Based Multicasting in Switch-Based Networks of Workstations
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
Tolerant Switched Local Area Networks
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
HIPIQS: A High-Performance Switch Architecture using Input Queuing
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
Efficient Fine-Grain Thread Migration with Active Threads
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
Tree-Based Multicasting in Wormhole-Routed Irregular Topologies
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
On Network CoProcessors for Scalable, Predictable Media Services
IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Parallel and Distributed Systems
Deadlock-Free Dynamic Reconfiguration Schemes for Increased Network Dependability
IEEE Transactions on Parallel and Distributed Systems
Performance Analysis of a Myrinet-Based Cluster
Cluster Computing
Applying In-Transit Buffers to Boost the Performance of Networks with Source Routing
IEEE Transactions on Computers
Engineering a user-level TCP for the CLAN network
NICELI '03 Proceedings of the ACM SIGCOMM workshop on Network-I/O convergence: experience, lessons, implications
An improvement on binary-swap compositing for sort-last parallel rendering
Proceedings of the 2003 ACM symposium on Applied computing
Efficient implementation of reduce-scatter in MPI
Journal of Systems Architecture: the EUROMICRO Journal - Special issue: Parallel, distributed and network-based processing
Journal of Parallel and Distributed Computing
Anchored opportunity queueing: a low-latency scheduler for fair arbitration among virtual channels
Journal of Parallel and Distributed Computing
Stateful distributed interposition
ACM Transactions on Computer Systems (TOCS)
Supporting adaptive routing in IBA switches
Journal of Systems Architecture: the EUROMICRO Journal - Special issue: Evolutions in parallel distributed and network-based processing
Content-aware cooperative caching for cluster-based web servers
Journal of Systems and Software
Parallel Computing - Special issue: Parallel and distributed scientific and engineering computing
Collective communication patterns on the quadrics network
Performance analysis and grid computing
Efficient Multiple Multicast on Heterogeneous Network of Workstations
The Journal of Supercomputing
On the development of a communication-aware task mapping technique
Journal of Systems Architecture: the EUROMICRO Journal
An analysis of the impact of MPI overlap and independent progress
Proceedings of the 18th annual international conference on Supercomputing
The Journal of Supercomputing
An Effective Methodology to Improve the Performance of the Up*/Down* Routing Algorithm
IEEE Transactions on Parallel and Distributed Systems
Cluster communication protocols for parallel-programming systems
ACM Transactions on Computer Systems (TOCS)
An Analysis of the Cost Effectiveness of an Adaptable Computing Cluster
Cluster Computing
Fast Paths in Concurrent Programs
Proceedings of the 13th International Conference on Parallel Architectures and Compilation Techniques
Key Messaging on SOME-Bus clusters
Parallel Computing
Coscheduling in Clusters: Is It a Viable Alternative?
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Assessing Fault Sensitivity in MPI Applications
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
The Panasas ActiveScale Storage Cluster: Delivering Scalable High Bandwidth Storage
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Realistic Modeling and Svnthesis of Resources for Computational Grids
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Enabling the Efficient Use of SMP Clusters: The GAMESS/DDI Model
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Optimizing 10-Gigabit Ethernet for Networks of Workstations, Clusters, and Grids: A Case Study
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Performance Comparison of MPI Implementations over InfiniBand, Myrinet and Quadrics
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Scalable NIC-based Reduction on Large-scale Clusters
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
An Efficient Parallel Algorithm to Solve Block-Toeplitz Systems
The Journal of Supercomputing
IEEE Transactions on Computers
PRESS: A Clustered Server Based on User-Level Communication
IEEE Transactions on Parallel and Distributed Systems
Power Saving in Regular Interconnection Networks Built with High-Degree Switches
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
Fault-Tolerance, Malleability and Migration for Divide-and-Conquer Applications on the Grid
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
A Memory-Effective Routing Strategy for Regular Interconnection Networks
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
PROC: Process ReOrdering-Based Coscheduling on Workstation Clusters
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
Design and Implementation of Open MPI over Quadrics/Elan4
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
Using Message-Driven Objects to Mask Latency in Grid Computing Applications
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
Enhancing NIC Performance for MPI using Processing-in-Memory
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 9 - Volume 10
Message Passing for Linux Clusters with Gigabit Ethernet Mesh Connections
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 9 - Volume 10
Modeling Particle Systems Animations for Heterogeneous Clusters
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 13 - Volume 14
Hyperplane Grouping and Pipelined Schedules: How to Execute Tiled Loops Fast on Clusters of SMPs
The Journal of Supercomputing
International Journal of High Performance Computing Applications
Deadlock-free multicasting in irregular networks using prefix routing
The Journal of Supercomputing
Comparing Ethernet and Myrinet for MPI communication
LCR '04 Proceedings of the 7th workshop on Workshop on languages, compilers, and run-time support for scalable systems
Performance Evaluation of Deterministic Routings, Multicasts, and Topologies on RHiNET-2 Cluster
IEEE Transactions on Parallel and Distributed Systems
A data distributed parallel algorithm for nonrigid image registration
Parallel Computing
Performance analysis of a QoS capable cluster interconnect
Performance Evaluation - Performance modelling and evaluation of high-performance parallel and distributed systems
From Toys to Teraflops: Bridging the Beowulf Gap
International Journal of High Performance Computing Applications
Design and Evaluation of an HPVM-Based Windows NT Supercomputer
International Journal of High Performance Computing Applications
International Journal of High Performance Computing Applications
High performance support of parallel virtual file system (PVFS2) over Quadrics
Proceedings of the 19th annual international conference on Supercomputing
FAST '03 Proceedings of the 2nd USENIX Conference on File and Storage Technologies
Network Interface Data Caching
IEEE Transactions on Computers
Feedback-Based Synchronization in System Area Networks for Cluster Computing
IEEE Transactions on Parallel and Distributed Systems
Traffic Scheduling Solutions with QoS Support for an Input-Buffered MultiMedia Router
IEEE Transactions on Parallel and Distributed Systems
Application Resource Requirement Estimation in a Parallel-Pipeline Model of Execution
IEEE Transactions on Parallel and Distributed Systems
Optimizing All-to-All Collective Communication by Exploiting Concurrency in Modern Networks
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Transformations to Parallel Codes for Communication-Computation Overlap
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Layered Routing in Irregular Networks
IEEE Transactions on Parallel and Distributed Systems
Enforcing in-order packet delivery in system area networks with adaptive routing
Journal of Parallel and Distributed Computing - Special issue: Design and performance of networks for super-, cluster-, and grid-computing: Part I
Research note: Anatomy of UDP and M-VIA for cluster communication
Journal of Parallel and Distributed Computing - Special issue: Design and performance of networks for super-, cluster-, and grid-computing: Part I
Fast Routing Computation on InfiniBand Networks
IEEE Transactions on Parallel and Distributed Systems
High Performance Sockets over Kernel Level Virtual Interface Architecture
HPCASIA '05 Proceedings of the Eighth International Conference on High-Performance Computing in Asia-Pacific Region
Efficient broadcast in heterogeneous networks of workstations using two sub-networks
International Journal of Parallel Programming
Fast and transparent recovery for continuous availability of cluster-based servers
Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming
Lazy direct-to-cache transfer during receive operations in a message passing environment
Proceedings of the 3rd conference on Computing frontiers
A Routing Methodology for Achieving Fault Tolerance in Direct Networks
IEEE Transactions on Computers
ICWall: a calibrated stereo tiled display from commodity components
Proceedings of the 2006 ACM international conference on Virtual reality continuum and its applications
Exploiting NIC architectural support for enhancing IP-based protocols on high-performance networks
Journal of Parallel and Distributed Computing - Special issue: Design and performance of networks for super-, cluster-, and grid-computing: Part II
Reliability challenges in large systems
Future Generation Computer Systems
Optimizing I/O server placement for parallel I/O on switch-based irregular networks
The Journal of Supercomputing
Implementation and performance study of a hardware-VIA-based network adapter on gigabit ethernet
Journal of Systems Architecture: the EUROMICRO Journal
CEFT: A cost-effective, fault-tolerant parallel virtual file system
Journal of Parallel and Distributed Computing
MMR: A MultiMedia Router architecture to support hybrid workloads
Journal of Parallel and Distributed Computing
MEDEA '05 Proceedings of the 2005 workshop on MEmory performance: DEaling with Applications , systems and architecture
High-performance adaptive routing for networks with arbitrary topology
Journal of Systems Architecture: the EUROMICRO Journal
FIR: an efficient routing strategy for tori and meshes
Journal of Parallel and Distributed Computing - 19th International parallel and distributed processing symposium
Throughput fairness in k-ary n-cube networks
ACSC '06 Proceedings of the 29th Australasian Computer Science Conference - Volume 48
Improving the flexibility of active grids through web services
ACSW Frontiers '06 Proceedings of the 2006 Australasian workshops on Grid computing and e-research - Volume 54
Software-Based Adaptive and Concurrent Self-Testing in Programmable Network Interfaces
ICPADS '06 Proceedings of the 12th International Conference on Parallel and Distributed Systems - Volume 1
Destination-Based HoL Blocking Elimination
ICPADS '06 Proceedings of the 12th International Conference on Parallel and Distributed Systems - Volume 1
Memory and Network Bandwidth Aware Scheduling of Multiprogrammed Workloads on Clusters of SMPs
ICPADS '06 Proceedings of the 12th International Conference on Parallel and Distributed Systems - Volume 1
FRoots: A Fault Tolerant and Topology-Flexible Routing Technique
IEEE Transactions on Parallel and Distributed Systems
RAMS: a RDMA-enabled I/O cache architecture for clustered network servers
SNAPI '04 Proceedings of the international workshop on Storage network architecture and parallel I/Os
Efficient remote block-level I/O over an RDMA-capable NIC
Proceedings of the 20th annual international conference on Supercomputing
Proceedings of the 20th annual international conference on Supercomputing
STORM: Scalable Resource Management for Large-Scale Parallel Computers
IEEE Transactions on Computers
Locality and parallelism optimization for dynamic programming algorithm in bioinformatics
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Deadlock-free connection-based adaptive routing with dynamic virtual circuits
Journal of Parallel and Distributed Computing
International Journal of High Performance Computing Applications
A runtime resolution scheme for priority boost conflict in implicit coscheduling
The Journal of Supercomputing
10Gb/s Ethernet performance and retrospective
ACM SIGCOMM Computer Communication Review
A comprehensive performance and energy consumption analysis of scheduling alternatives in clusters
The Journal of Supercomputing
Handling Topology Changes in InfiniBand
IEEE Transactions on Parallel and Distributed Systems
Throughput Region of Finite-Buffered Networks
IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Parallel and Distributed Systems
Exploring the Design Space of Self-Regulating Power-Aware On/Off Interconnection Networks
IEEE Transactions on Parallel and Distributed Systems
The Journal of Supercomputing
WINSYM'98 Proceedings of the 2nd conference on USENIX Windows NT Symposium - Volume 2
Coordinated thread scheduling for workstation clusters under windows NT
NT'97 Proceedings of the USENIX Windows NT Workshop on The USENIX Windows NT Workshop 1997
Extending a traditional OS using object-oriented techniques
COOTS'96 Proceedings of the 2nd conference on USENIX Conference on Object-Oriented Technologies (COOTS) - Volume 2
Cheating the I/O bottleneck: network storage with Trapeze/Myrinet
ATEC '98 Proceedings of the annual conference on USENIX Annual Technical Conference
Solaris MC: a multi computer OS
ATEC '96 Proceedings of the 1996 annual conference on USENIX Annual Technical Conference
FLIPC: a low latency messaging system for distributed real time environments
ATEC '96 Proceedings of the 1996 annual conference on USENIX Annual Technical Conference
WINSYM'99 Proceedings of the 3rd conference on USENIX Windows NT Symposium - Volume 3
Trapeze/IP: TCP/IP at near-gigabit speeds
ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Smartsockets: solving the connectivity problems in grid computing
Proceedings of the 16th international symposium on High performance distributed computing
Optimization and bottleneck analysis of network block I/O in commodity storage systems
Proceedings of the 21st annual international conference on Supercomputing
Performance evaluation of offloading software modules to cluster network
PDCN'07 Proceedings of the 25th conference on Proceedings of the 25th IASTED International Multi-Conference: parallel and distributed computing and networks
Performance evaluation on low-latency Communication mechanism of DIMMnet-2
PDCN'07 Proceedings of the 25th conference on Proceedings of the 25th IASTED International Multi-Conference: parallel and distributed computing and networks
An optimal scheduling algorithm for an agent-based multicast strategy on irregular networks
The Journal of Supercomputing
A Survey and Taxonomy of GALS Design Styles
IEEE Design & Test
A Comprehensive Framework for Enhancing Security in InfiniBand Architecture
IEEE Transactions on Parallel and Distributed Systems
Martini: A Network Interface Controller Chip for High Performance Computing with Distributed PCs
IEEE Transactions on Parallel and Distributed Systems
Software-Based Failure Detection and Recovery in Programmable Network Interfaces
IEEE Transactions on Parallel and Distributed Systems
A parallel implementation of 2-D/3-D image registration for computer-assisted surgery
International Journal of Bioinformatics Research and Applications
Implications of application usage characteristics for collective communication offload
International Journal of High Performance Computing and Networking
NIC-based reduction algorithms for large-scale clusters
International Journal of High Performance Computing and Networking
International Journal of High Performance Computing and Networking
HPM: a hierarchical model for parallel computations
International Journal of High Performance Computing and Networking
Application-bypass reduction for large-scale clusters
International Journal of High Performance Computing and Networking
Efficient parallel out-of-core matrix transposition
International Journal of High Performance Computing and Networking
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
High-performance ethernet-based communications for future multi-core processors
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
RISC: A resilient interconnection network for scalable cluster storage systems
Journal of Systems Architecture: the EUROMICRO Journal
Fast performance prediction of master-slave programs by partial task execution
SEPADS'05 Proceedings of the 4th WSEAS International Conference on Software Engineering, Parallel & Distributed Systems
Designing efficient irregular networks for heterogeneous systems-on-chip
Journal of Systems Architecture: the EUROMICRO Journal
Proceedings of the 22nd annual international conference on Supercomputing
Coscheduled distributed-Web servers on system area network
Journal of Parallel and Distributed Computing
A scalable, commodity data center network architecture
Proceedings of the ACM SIGCOMM 2008 conference on Data communication
Deadlock-Free Dynamic Network Reconfiguration Based on Close Up*/Down* Graphs
Euro-Par '08 Proceedings of the 14th international Euro-Par conference on Parallel Processing
Software techniques to improve virtualized I/O performance on multi-core systems
Proceedings of the 4th ACM/IEEE Symposium on Architectures for Networking and Communications Systems
An efficient design for fast memory registration in RDMA
Journal of Network and Computer Applications
Performance Modeling and Analysis of a Massively Parallel Direct - Part 2
International Journal of High Performance Computing Applications
Trace-driven co-simulation of high-performance computing systems using OMNeT++
Proceedings of the 2nd International Conference on Simulation Tools and Techniques
Effective admission and congestion control for interconnection networks of cluster computing systems
International Journal of High Performance Computing and Networking
Towards 100 gbit/s ethernet: multicore-based parallel communication protocol design
Proceedings of the 23rd international conference on Supercomputing
High performance wide-area overlay using deadlock-free routing
Proceedings of the 18th ACM international symposium on High performance distributed computing
Implementing a Change Assimilation Mechanism for Source Routing Interconnects
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
RecTOR: A New and Efficient Method for Dynamic Network Reconfiguration
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Full-system simulation of distributed memory multicomputers
Cluster Computing
Microprocessors & Microsystems
Experience with Top Gun Wingman: a proxy-based graphical web browser for the 3Com PalmPilot
Middleware '98 Proceedings of the IFIP International Conference on Distributed Systems Platforms and Open Distributed Processing
A new ultra-low latency message transfer mechanism
CSN '07 Proceedings of the Sixth IASTED International Conference on Communication Systems and Networks
Application-aware prioritization mechanisms for on-chip networks
Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture
Separated high-bandwidth and low-latency communication in the cluster interconnect clint
Separated high-bandwidth and low-latency communication in the cluster interconnect clint
Reliability challenges in large systems
Future Generation Computer Systems
The ParaStation project: Using workstations as building blocks for parallel computing
Information Sciences: an International Journal
A general methodology for direction-based irregular routing algorithms
Journal of Parallel and Distributed Computing
An improved model for predicting HPL performance
GPC'07 Proceedings of the 2nd international conference on Advances in grid and pervasive computing
A scalable methodology for computing fault-free paths in InfiniBand torus networks
ISHPC'05/ALPS'06 Proceedings of the 6th international symposium on high-performance computing and 1st international conference on Advanced low power systems
Implementation and evaluation of the mechanisms for low latency communication on DIMMnet-2
ISHPC'05/ALPS'06 Proceedings of the 6th international symposium on high-performance computing and 1st international conference on Advanced low power systems
Sockets direct protocol for hybrid network stacks: a case study with iWARP over 10G Ethernet
HiPC'08 Proceedings of the 15th international conference on High performance computing
Aérgia: exploiting packet latency slack in on-chip networks
Proceedings of the 37th annual international symposium on Computer architecture
Efficient On-Demand Connection Management Mechanisms with PGAS Models over InfiniBand
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Motivating future interconnects: a differential measurement analysis of PCI latency
Proceedings of the 5th ACM/IEEE Symposium on Architectures for Networking and Communications Systems
MyriXen: message passing in Xen virtual machines over Myrinet and Ethernet
Euro-Par'09 Proceedings of the 2009 international conference on Parallel processing
An introduction to Balder: an OpenMP run-time library for clusters of SMPs
IWOMP'05/IWOMP'06 Proceedings of the 2005 and 2006 international conference on OpenMP shared memory parallel programming
Routing to support communication in dependable networks
EUROMICRO-PDP'02 Proceedings of the 10th Euromicro conference on Parallel, distributed and network-based processing
EUROMICRO-PDP'02 Proceedings of the 10th Euromicro conference on Parallel, distributed and network-based processing
Efficient implementation of reduce-scatter in MPI
EUROMICRO-PDP'02 Proceedings of the 10th Euromicro conference on Parallel, distributed and network-based processing
Increasing the adaptivity of routing algorithms for k-ary n-cubes
EUROMICRO-PDP'02 Proceedings of the 10th Euromicro conference on Parallel, distributed and network-based processing
Removing the latency overhead of the ITB mechanism in COWs with source routing
EUROMICRO-PDP'02 Proceedings of the 10th Euromicro conference on Parallel, distributed and network-based processing
Seekable sockets: a mechanism to reduce copy overheads in TCP-based messaging
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Benefits of high speed interconnects to cluster file systems: a case study with lustre
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Designing next generation data-centers with advanced communication protocols and systems services
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Segment-based routing: an efficient fault-tolerant routing algorithm for meshes and Tori
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Dynamic SMP clusters with communication on the fly
ISPDC'03 Proceedings of the Second international conference on Parallel and distributed computing
Designing Energy Efficient Communication Runtime Systems for Data Centric Programming Models
GREENCOM-CPSCOM '10 Proceedings of the 2010 IEEE/ACM Int'l Conference on Green Computing and Communications & Int'l Conference on Cyber, Physical and Social Computing
Efficient network management applied to source routed networks
Parallel Computing
FAST'03 Proceedings of the 2nd USENIX conference on File and storage technologies
Scalable memory registration for high performance networks using helper threads
Proceedings of the 8th ACM International Conference on Computing Frontiers
Asynchronous PGAS runtime for Myrinet networks
Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model
Minimizing data size for efficient data reuse in grid-enabled medical applications
ISBMDA'06 Proceedings of the 7th international conference on Biological and Medical Data Analysis
Tree-turn routing: an efficient deadlock-free routing algorithm for irregular networks
The Journal of Supercomputing
Ethernet as a lossless deadlock free system area network
ISPA'05 Proceedings of the Third international conference on Parallel and Distributed Processing and Applications
Users matter: a multi-agent systems model of high performance computing cluster users
MABS'04 Proceedings of the 2004 international conference on Multi-Agent and Multi-Agent-Based Simulation
Designing a common communication subsystem
PVM/MPI'05 Proceedings of the 12th European PVM/MPI users' group conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Assessing MPI performance on QsNetIIt
PVM/MPI'05 Proceedings of the 12th European PVM/MPI users' group conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface
A commodity cluster using IEEE 1394 network for parallel applications
PDCAT'04 Proceedings of the 5th international conference on Parallel and Distributed Computing: applications and Technologies
Parallelizing power systems simulation for multi-core clusters: design for an SME
HPCS'09 Proceedings of the 23rd international conference on High Performance Computing Systems and Applications
PerWiz: a what-if prediction tool for tuning message passing programs
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
An integrated architecture for qos-enable router and grid-oriented supercomputer
ICCNMC'05 Proceedings of the Third international conference on Networking and Mobile Computing
Performance evaluation of MM5 on clusters with modern interconnects: scalability and impact
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
On the correct sizing on meshes through an effective congestion management strategy
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Paradis-Net: a network interface for parallel and distributed
ICN'05 Proceedings of the 4th international conference on Networking - Volume Part II
An optimal scheduling algorithm for an agent-based multicast strategy on irregular networks
GPC'06 Proceedings of the First international conference on Advances in Grid and Pervasive Computing
Architecture and performance of dynamic offloader for cluster network
ISPA'06 Proceedings of the 4th international conference on Parallel and Distributed Processing and Applications
A GPGPU approach for accelerating 2-d/3-d rigid registration of medical images
ISPA'06 Proceedings of the 4th international conference on Parallel and Distributed Processing and Applications
Indirect cube: A power-efficient topology for compute clusters
Optical Switching and Networking
On paged distributed virtual memory algorithms in a broadcasting environment
Computer Communications
Invited Performance of the communication layers of TCP/IP with the Myrinet gigabit LAN
Computer Communications
Partitioning and scheduling loops on NOWs
Computer Communications
Supporting TCP connections in wormhole routing and ATM networks
Computer Communications
Understanding and improving the cost of scaling distributed event processing
Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems
uvNIC: rapid prototyping network interface controller device drivers
Proceedings of the ACM SIGCOMM 2012 conference on Applications, technologies, architectures, and protocols for computer communication
Analyzing performance and power efficiency of network processing over 10 GbE
Journal of Parallel and Distributed Computing
uvNIC: rapid prototyping network interface controller device drivers
ACM SIGCOMM Computer Communication Review - Special october issue SIGCOMM '12
Parallel particle rendering: a performance comparison between Chromium and Aura
EG PGV'06 Proceedings of the 6th Eurographics conference on Parallel Graphics and Visualization
Chronos: predictable low latency for data center applications
Proceedings of the Third ACM Symposium on Cloud Computing
ISPA'07 Proceedings of the 5th international conference on Parallel and Distributed Processing and Applications
Performance evaluation of distributed computing over heterogeneous networks
HPCC'07 Proceedings of the Third international conference on High Performance Computing and Communications
KNEM: A generic and scalable kernel-assisted intra-node MPI communication framework
Journal of Parallel and Distributed Computing
Designing energy efficient communication runtime systems: a view from PGAS models
The Journal of Supercomputing
Circuit switching under the radar with REACToR
NSDI'14 Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation
Hi-index | 0.05 |
Myrinet is a new type of local-area network (LAN) based on the technology used for packet communication and switching within "massively-parallel processors" (MPPs). Think of Myrinet as an MPP message-passing network that can span campus dimensions, rather than as a wide-area telecommunications network that is operating in close quarters. The technical steps toward making Myrinet a reality included the development of (1) robust, 25m communication channels with flow control, packet framing, and error control; (2) self-initializing, low-latency, cut-through switches; (3) host interfaces that can map the network, select routes, and translate from network addresses to routes, as well as handle packet traffic; and (4) streamlined host software that allows direct communication between user processes and the network.