Although Linux is a popular operating system for high-performance computing, it does not ensure maximum performance for compute-intensive workloads. In our previous work we presented "Big Memory", an alternative, transparent memory space that successfully removes the memory performance bottleneck on Blue Gene/P Linux. The initial Big Memory worked only as a per-node resource. In this work we extend it to a per-core resource and describe the details of the implementation. We evaluate our new implementation by running various benchmarks and the Nek5000 application. Compared with IBM's Compute Node Kernel, which is noise-free and lightweight, the Nek5000 application runs with only 1.2% performance loss on Linux with Big Memory at 32K cores. Our benchmark results show no significant performance degradation from OS noise caused by Linux at a scale of up to 32K cores, although irregular OS events are still present.