Concrete mathematics: a foundation for computer science
Concrete mathematics: a foundation for computer science
The interaction of architecture and operating system design
ASPLOS IV Proceedings of the fourth international conference on Architectural support for programming languages and operating systems
Modeling and validation of the real-time Mach scheduler
SIGMETRICS '93 Proceedings of the 1993 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Some measurements of timeline gaps in VAX/VMS
ACM SIGOPS Operating Systems Review
Architecture and Implementation of Vulcan
Proceedings of the 8th International Symposium on Parallel Processing
Reducing the Variance of Point-to-Point Transfers for Parallel Real-Time Programs
IEEE Parallel & Distributed Technology: Systems & Technology
Modeling Communication Overhead: MPI and MPL Performance on the IBM SP2
IEEE Parallel & Distributed Technology: Systems & Technology
A high performance get-put interface for ATM communications
CASCON '97 Proceedings of the 1997 conference of the Centre for Advanced Studies on Collaborative research
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
System noise, OS clock ticks, and fine-grained parallel applications
Proceedings of the 19th annual international conference on Supercomputing
Right-weight kernels: an off-the-shelf alternative to custom light-weight kernels
ACM SIGOPS Operating Systems Review
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Network offloaded hierarchical collectives using ConnectX-2's CORE-Direct capabilities
EuroMPI'10 Proceedings of the 17th European MPI users' group meeting conference on Recent advances in the message passing interface
Performance analysis of parallel programs via message-passing graph traversal
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Thread Tranquilizer: Dynamically reducing performance variation
ACM Transactions on Architecture and Code Optimization (TACO) - HIPEAC Papers
Hi-index | 0.00 |
Commodity workstations have adapted to standard UNIX like environments to allow scientists to efficiently develop and port applications across systems. UNIX based environments, such as IBM's AIX, furnishes such an operating environment while providing efficient uni-processor utilization for user code execution. When these machines are interconnected with a low latency (user space) communication mechanism, large variances in point to point communication times for identical parallel programs are typically found. It is our contention that a large part of this variance is introduced by operating system support functionality that can delay point to point user space communications. We are able to experimentally measure this effect by monitoring the change in time of circulating a token through parallel processors connected in a virtual ring configuration. This paper proposes some solutions and then experimentally validates their ability to reduce point to point message passing variance for the IBM 9076 (SP1) machines.