A case for NUMA-aware contention management on multicore systems

Authors:
Sergey Blagodurov; Sergey Zhuravlev; Alexandra Fedorova; Ali Kamali
Affiliations:
Simon Fraser University, Vancouver, BC, Canada; Simon Fraser University, Vancouver, BC, Canada; Simon Fraser University, Vancouver, BC, Canada; Simon Fraser University, Vancouver, BC, Canada
Venue:
Proceedings of the 19th international conference on Parallel architectures and compilation techniques
Year:
2010

Abstract

On multicore systems, contention for shared resources occurs when memory-intensive threads are co-scheduled on cores that share parts of the memory hierarchy, such as last-level caches and memory controllers. Previous work investigated how contention could be addressed via scheduling: a contention-aware scheduler places competing threads in separate memory hierarchy domains to eliminate resource sharing and, as a consequence, mitigate contention. However, all previous work on contention-aware scheduling assumed that the underlying system is UMA (uniform memory access latencies, single memory controller). Modern multicore systems are instead NUMA: they feature non-uniform memory access latencies and multiple memory controllers. We discovered that contention management is considerably more difficult on NUMA systems, because the scheduler must consider not only the placement of threads but also the placement of their memory. Contrary to the popular belief that remote access latency is the dominant concern, this is required mostly to eliminate contention for memory controllers. In this work, we quantify the performance effects of resource contention and remote access latency. This analysis inspires the design of a contention-aware scheduling algorithm for NUMA systems, which significantly outperforms both a previously proposed NUMA-unaware algorithm and the default Linux scheduler. We also investigate memory migration strategies, which are a necessary part of the NUMA contention-aware scheduling algorithm. Finally, we propose and evaluate a new contention management algorithm that is priority-aware.
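
The idea of placing both threads and their memory can be illustrated with a minimal sketch. This is not the algorithm from the paper; it is a simple greedy heuristic written under stated assumptions. The names `Thread`, `miss_rate`, and `place_threads` are hypothetical, and `miss_rate` stands in for whatever memory-intensity metric a real scheduler would measure. The sketch spreads the most memory-intensive threads across NUMA nodes and reports which threads would need their memory migrated to follow them.

```python
from dataclasses import dataclass


@dataclass
class Thread:
    name: str
    miss_rate: float  # hypothetical memory-intensity metric (higher = more intensive)
    node: int         # NUMA node that currently holds the thread's memory


def place_threads(threads, num_nodes):
    """Greedy sketch: balance memory intensity across NUMA nodes.

    Returns (placement, migrations). placement maps thread name -> node;
    migrations lists threads whose memory would have to move with them.
    """
    load = [0.0] * num_nodes  # accumulated miss rate per node
    placement = {}
    migrations = []
    # Place the most memory-intensive threads first.
    for t in sorted(threads, key=lambda t: t.miss_rate, reverse=True):
        # Choose the least loaded node; on ties, prefer the node that
        # already holds the thread's memory to avoid a migration.
        target = min(range(num_nodes), key=lambda n: (load[n], n != t.node))
        load[target] += t.miss_rate
        placement[t.name] = target
        if target != t.node:
            migrations.append(t.name)
    return placement, migrations


threads = [
    Thread("a", 30.0, 0),
    Thread("b", 25.0, 0),
    Thread("c", 1.0, 1),
    Thread("d", 0.5, 1),
]
placement, migrations = place_threads(threads, num_nodes=2)
print(placement, migrations)
```

In this example the two memory-intensive threads `a` and `b` start on the same node, so the sketch moves `b` to the other node and marks its memory for migration. Leaving the memory behind would keep both threads competing for the same memory controller, which is the effect the abstract identifies as the main concern.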