The introduction of multicore microprocessors has enabled smaller organizations to invest in high-performance shared-memory parallel systems. These systems ship with standard operating systems that use preset thresholds for task imbalance assessment to activate load balancing. Unfortunately, a preset threshold triggers unnecessary task migrations when the number of tasks is a small multiple of the number of processing cores. We illustrate this unnecessary task migration behavior through simulation and introduce a dynamic threshold for task imbalance assessment that depends on the number of tasks and the number of processing cores, as a replacement for the static threshold used by standard operating systems. With the dynamic threshold method, we demonstrate a performance gain of up to 17% on a synthetic benchmark and up to 25% on the Integer Sort benchmark from the National Aeronautics and Space Administration (NASA) Advanced Supercomputing Parallel Benchmark Suite.
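
To make the contrast between a static and a dynamic threshold concrete, the following is a minimal sketch and not the authors' method: the abstract does not state their formula. Both function names, the 125 percent static threshold, and the ceiling/floor rule used for the dynamic check are assumptions chosen only for illustration. The idea shown is that when 10 tasks run on 4 cores, a distribution of 3, 3, 2, 2 is already the best achievable, yet a fixed percentage test still flags it as imbalanced.

```python
def static_imbalanced(loads, pct=125):
    """Hypothetical static rule (assumed, not from the paper):
    report imbalance when the busiest core holds more than
    pct percent of the tasks on the least loaded core."""
    busiest, idlest = max(loads), min(loads)
    return busiest * 100 > idlest * pct


def dynamic_imbalanced(loads):
    """Hypothetical dynamic rule (assumed, not from the paper):
    derive the bounds from the task count n and core count p.
    A perfectly balanced system has every core between
    floor(n / p) and ceil(n / p) tasks, so anything inside
    that band cannot be improved by migrating a task."""
    n, p = sum(loads), len(loads)
    lower = n // p           # floor(n / p)
    upper = -(-n // p)       # ceil(n / p) using integer arithmetic
    return max(loads) > upper or min(loads) < lower


if __name__ == "__main__":
    best_possible = [3, 3, 2, 2]   # 10 tasks on 4 cores, already optimal
    skewed = [4, 2, 2, 2]          # one migration would help
    for loads in (best_possible, skewed):
        print(loads, static_imbalanced(loads), dynamic_imbalanced(loads))
```

Under these assumptions the static rule reports an imbalance for both distributions, while the dynamic rule reports it only for the skewed one, which is the kind of unnecessary migration trigger the abstract describes.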