A comprehensive scheduler for asymmetric multicore systems

Authors:
Juan Carlos Saez;Manuel Prieto;Alexandra Fedorova;Sergey Blagodurov
Affiliations:
Complutense University of Madrid, Madrid, Spain;Complutense University of Madrid, Madrid, Spain;Simon Fraser University, Vancouver, BC, Canada;Simon Fraser University, Vancouver, BC, Canada
Venue:
Proceedings of the 5th European conference on Computer systems
Year:
2010

Citing 12
Cited 31

Single-ISA Heterogeneous Multi-Core Architectures: The Potential for Processor Power Reduction

Proceedings of the 36th annual IEEE/ACM International Symposium on Microarchitecture
Single-ISA Heterogeneous Multi-Core Architectures for Multithreaded Workload Performance

Proceedings of the 31st annual international symposium on Computer architecture
Mitigating Amdahl's Law through EPI Throttling

Proceedings of the 32nd annual international symposium on Computer Architecture
The Impact of Performance Asymmetry in Emerging Multicore Architectures

Proceedings of the 32nd annual international symposium on Computer Architecture
Dynamic thread assignment on heterogeneous multiprocessor architectures

Proceedings of the 3rd conference on Computing frontiers
Efficient operating system scheduling for performance-asymmetric multi-core architectures

Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Using Asymmetric Single-ISA CMPs to Save Energy on Operating Systems

IEEE Micro
Amdahl's Law in the Multicore Era

Computer
HASS: a scheduler for heterogeneous multicore systems

ACM SIGOPS Operating Systems Review
Maximizing power efficiency with asymmetric multicore systems

Communications of the ACM - Finding the Fun in Computer Science Education
Towards better performance per watt in virtual environments on asymmetric single-ISA multi-core systems

ACM SIGOPS Operating Systems Review
Bias scheduling in heterogeneous multi-core architectures

Proceedings of the 5th European conference on Computer systems

AASH: an asymmetry-aware scheduler for hypervisors

Proceedings of the 6th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
Operating system support for mitigating software scalability bottlenecks on asymmetric multicore processors

Proceedings of the 7th ACM international conference on Computing frontiers
AKULA: a toolset for experimenting and developing thread placement algorithms on multicore systems

Proceedings of the 19th international conference on Parallel architectures and compilation techniques
Hera-JVM: a runtime system for heterogeneous multi-core architectures

Proceedings of the ACM international conference on Object oriented programming systems languages and applications
Leveraging workload diversity through OS scheduling to maximize performance on single-ISA heterogeneous multicore systems

Journal of Parallel and Distributed Computing
Analyzing performance asymmetric multicore processors for latency sensitive datacenter applications

HotPower'10 Proceedings of the 2010 international conference on Power aware computing and systems
Attaining system performance points: revisiting the end-to-end argument in system design for heterogeneous many-core systems

ACM SIGOPS Operating Systems Review
Bridging functional heterogeneity in multicore architectures

ACM SIGOPS Operating Systems Review
Memory management in NUMA multicore systems: trapped between cache contention and interconnect overhead

Proceedings of the international symposium on Memory management
Virtualizing performance asymmetric multi-core systems

Proceedings of the 38th annual international symposium on Computer architecture
Parallel pattern detection for architectural improvements

HotPar'11 Proceedings of the 3rd USENIX conference on Hot topic in parallelism
Efficiently exploiting memory level parallelism on asymmetric coupled cores in the dark silicon era

ACM Transactions on Architecture and Code Optimization (TACO) - HIPEAC Papers
Chameleon: operating system support for dynamic processors

ASPLOS XVII Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems
Bottleneck identification and scheduling in multithreaded applications

ASPLOS XVII Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems
Thread scheduling for heterogeneous multicore processors using phase identification

ACM SIGMETRICS Performance Evaluation Review
Leveraging Core Specialization via OS Scheduling to Improve Performance on Asymmetric Multicore Systems

ACM Transactions on Computer Systems (TOCS)
CRQ-based fair scheduling on composable multicore architectures

Proceedings of the 26th ACM international conference on Supercomputing
Improving performance per watt of asymmetric multi-core processors via online program phase classification and adaptive core morphing

ACM Transactions on Design Automation of Electronic Systems (TODAES) - Special section on adaptive power management for energy and temperature-aware computing systems
More for your money: exploiting performance heterogeneity in public clouds

Proceedings of the Third ACM Symposium on Cloud Computing
Efficient task scheduling for hard real-time tasks in asymmetric multicore processors

ICA3PP'12 Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part II
Kinship: efficient resource management for performance and functionally asymmetric platforms

Proceedings of the ACM International Conference on Computing Frontiers
Utility-based acceleration of multithreaded applications on asymmetric CMPs

Proceedings of the 40th Annual International Symposium on Computer Architecture
Hierarchical power management for asymmetric multi-core in dark silicon era

Proceedings of the 50th Annual Design Automation Conference
Energy-efficient virtual machine scheduling in performance-asymmetric multi-core architectures

Proceedings of the 8th International Conference on Network and Service Management
Low-latency adaptive mode transitions and hierarchical power management in asymmetric clustered cores

ACM Transactions on Architecture and Code Optimization (TACO)
An opportunistic prediction-based thread scheduling to maximize throughput/watt in AMPs

PACT '13 Proceedings of the 22nd international conference on Parallel architectures and compilation techniques
A unified view of non-monotonic core selection and application steering in heterogeneous chip multiprocessors

PACT '13 Proceedings of the 22nd international conference on Parallel architectures and compilation techniques
Power-performance modeling on asymmetric multi-cores

Proceedings of the 2013 International Conference on Compilers, Architectures and Synthesis for Embedded Systems
Virtual asymmetric multiprocessor for interactive performance of consolidated desktops

Proceedings of the 10th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
Adaptive workload-aware task scheduling for single-ISA asymmetric multicore architectures

ACM Transactions on Architecture and Code Optimization (TACO)
An efficient and comprehensive scheduler on Asymmetric Multicore Architecture systems

Journal of Systems Architecture: the EUROMICRO Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Symmetric-ISA (instruction set architecture) asymmetric-performance multicore processors were shown to deliver higher performance per watt and area for applications with diverse architectural requirements, and so it is likely that future multicore processors will combine a few fast cores characterized by complex pipelines, high clock frequency, high area requirements and power consumption, and many slow ones, characterized by simple pipelines, low clock frequency, low area requirements and power consumption. Asymmetric multicore processors (AMP) derive their efficiency from core specialization. Efficiency specialization ensures that fast cores are used for "CPU-intensive" applications, which efficiently utilize these cores' "expensive" features, while slow cores would be used for "memory-intensive" applications, which utilize fast cores inefficiently. TLP (thread-level parallelism) specialization ensures that fast cores are used to accelerate sequential phases of parallel applications, while leaving slow cores for energy-efficient execution of parallel phases. Specialization is effected by an asymmetry-aware thread scheduler, which maps threads to cores in consideration of the properties of both. Previous asymmetry-aware schedulers employed one type of specialization (either efficiency or TLP), but not both. As a result, they were effective only for limited workload scenarios. We propose, implement, and evaluate CAMP, a Comprehensive AMP scheduler, which delivers both efficiency and TLP specialization. Furthermore, we propose a new light-weight technique for discovering which threads utilize fast cores most efficiently. Our evaluation in the OpenSolaris operating system demonstrates that CAMP accomplishes an efficient use of an AMP system for a variety of workloads, while existing asymmetry-aware schedulers were effective only in limited scenarios.