Processor scheduling in shared memory multiprocessors

Authors:
John Zahorjan;Cathy McCann
Affiliations:
Department of Computer Science and Engineering, University of Washington, Seattle, WA;Department of Computer Science and Engineering, University of Washington, Seattle, WA
Venue:
SIGMETRICS '90 Proceedings of the 1990 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Year:
1990

Citing 11
Cited 44

Allocating Independent Subtasks on Parallel Processors

IEEE Transactions on Software Engineering
The duality of memory and communication in the implementation of a multiprocessor operating system

SOSP '87 Proceedings of the eleventh ACM Symposium on Operating systems principles
Guided self-scheduling: A practical scheduling scheme for parallel supercomputers

IEEE Transactions on Computers
Firefly: A Multiprocessor Workstation

IEEE Transactions on Computers - Special issue on architectural support for programming languages and operating systems
Scheduling in multiprogrammed parallel systems

SIGMETRICS '88 Proceedings of the 1988 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Speedup Versus Efficiency in Parallel Systems

IEEE Transactions on Computers
Process control and scheduling issues for multiprogrammed shared-memory multiprocessors

SOSP '89 Proceedings of the twelfth ACM symposium on Operating systems principles
Characterizations of parallelism in applications and their use in scheduling

SIGMETRICS '89 Proceedings of the 1989 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
The performance of multiprogrammed multiprocessor scheduling algorithms

SIGMETRICS '90 Proceedings of the 1990 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Mean-Value Analysis of Closed Multichain Queuing Networks

Journal of the ACM (JACM)
The impact of distributions and disciplines on multiple processor systems

Communications of the ACM

The impact of operating system scheduling policies and synchronization methods of performance of parallel applications

SIGMETRICS '91 Proceedings of the 1991 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Processor-pool-based scheduling for large-scale NUMA multiprocessors

SIGMETRICS '91 Proceedings of the 1991 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Parallel programs and background load: efficiency studies with the PAR-Bench system

ICS '91 Proceedings of the 5th international conference on Supercomputing
Scheduler activations: effective kernel support for the user-level management of parallelism

SOSP '91 Proceedings of the thirteenth ACM symposium on Operating systems principles
Scheduling in parallel systems with a hierarchical organization of tasks

ICS '92 Proceedings of the 6th international conference on Supercomputing
Scheduler activations: effective kernel support for the user-level management of parallelism

ACM Transactions on Computer Systems (TOCS)
Scheduling a mixed interactive and batch workload on a parallel, shared memory supercomputer

Proceedings of the 1992 ACM/IEEE conference on Supercomputing
A dynamic processor allocation policy for multiprogrammed shared-memory multiprocessors

ACM Transactions on Computer Systems (TOCS)
Using scheduler information to achieve optimal barrier synchronization performance

PPOPP '93 Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming
A machine independent interface for lightweight threads

ACM SIGOPS Operating Systems Review
The influence of random delays on parallel execution times

SIGMETRICS '93 Proceedings of the 1993 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Processor scheduling on multiprogrammed, distributed memory parallel computers

SIGMETRICS '93 Proceedings of the 1993 ACM SIGMETRICS conference on Measurement and modeling of computer systems
A performance evaluation of several priority policies for parallel processing systems

Journal of the ACM (JACM)
Analysis of the impact of memory in distributed parallel processing systems

SIGMETRICS '94 Proceedings of the 1994 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Processor allocation policies for message-passing parallel computers

SIGMETRICS '94 Proceedings of the 1994 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Use of application characteristics and limited preemption for run-to-completion parallel processor scheduling policies

SIGMETRICS '94 Proceedings of the 1994 ACM SIGMETRICS conference on Measurement and modeling of computer systems
A Hierarchical Task Queue Organization for Shared-Memory Multiprocessor Systems

IEEE Transactions on Parallel and Distributed Systems
A Measurement-Based Model to Predict the Performance Impact of System Modifications: A Case Study

IEEE Transactions on Parallel and Distributed Systems
Distributed Hardwired Barrier Synchronization for Scalable Multiprocessor Clusters

IEEE Transactions on Parallel and Distributed Systems
High performance synchronization algorithms for multiprogrammed multiprocessors

PPOPP '95 Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
Scheduling memory constrained jobs on distributed memory parallel computers

Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
On multiprocessor system scheduling

Proceedings of the eighth annual ACM symposium on Parallel algorithms and architectures
An analysis of gang scheduling for multiprogrammed parallel computing environments

Proceedings of the eighth annual ACM symposium on Parallel algorithms and architectures
Scheduler-conscious synchronization

ACM Transactions on Computer Systems (TOCS)
Non-clairvoyant multiprocessor scheduling of jobs with changing execution characteristics (extended abstract)

STOC '97 Proceedings of the twenty-ninth annual ACM symposium on Theory of computing
Processor Saving Scheduling Policies for Multiprocessor Systems

IEEE Transactions on Computers
Compile/run-time support for threaded MPI execution on multiprogrammed shared memory machines

Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming
A closer look at coscheduling approaches for a network of workstations

Proceedings of the eleventh annual ACM symposium on Parallel algorithms and architectures
Preemptive scheduling of parallel jobs on multiprocessors

Proceedings of the seventh annual ACM-SIAM symposium on Discrete algorithms
Performance of Hierarchical Processor Scheduling in Shared-Memory Multiprocessor Systems

IEEE Transactions on Computers
Adaptive two-level thread management for fast MPI execution on shared memory machines

SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
An efficient and effective performance evaluation method for multiprogrammed multiprocessor systems

SAC '97 Proceedings of the 1997 ACM symposium on Applied computing
Memory Conscious Scheduling for Cluster-based NUMA Multiprocessors

The Journal of Supercomputing
Program transformation and runtime support for threaded MPI execution on shared-memory machines

ACM Transactions on Programming Languages and Systems (TOPLAS)
Analysis of Processor Allocation in Multiprogrammed, Distributed-Memory Parallel Processing Systems

IEEE Transactions on Parallel and Distributed Systems
Parallel Models and Job Characterization for System Scheduling

ICCS '01 Proceedings of the International Conference on Computational Science-Part II
Parallel Job Scheduling: A Performance Perspective

Performance Evaluation: Origins and Directions
Petri net model of a dynamically partitioned multiprocessor system

PNPM '95 Proceedings of the Sixth International Workshop on Petri Nets and Performance Models
Non-clair voy ant multiprocessor scheduling of jobs with changing execution characteristics

Journal of Scheduling - Special issue: On-line scheduling
Selective preemption strategies for parallel job scheduling

International Journal of High Performance Computing and Networking
Using application information to drive adaptive grid middleware scheduling decisions

Proceedings of the 2nd workshop on Middleware-application interaction: affiliated with the DisCoTec federated conferences 2008
Elyze: enabling safe parallelism in event-driven servers

Proceedings of the 8th ACM SIGPLAN-SIGSOFT workshop on Program analysis for software tools and engineering
Provably efficient two-level adaptive scheduling

JSSPP'06 Proceedings of the 12th international conference on Job scheduling strategies for parallel processing
Safe at any speed: fast, safe parallelism in servers

HotDep'06 Proceedings of the Second conference on Hot topics in system dependability

Quantified Score

Hi-index	0.01

Visualization

Abstract

Existing work indicates that the commonly used “single queue of runnable tasks” approach to scheduling shared memory multiprocessors can perform very poorly in a multiprogrammed parallel processing environment. A more promising approach is the class of “two-level schedulers” in which the operating system deals solely with allocating processors to jobs while the individual jobs themselves perform task dispatching on those processors.In this paper we compare two basic varieties of two-level schedulers. Those of the first type, static, make a single decision per job regarding the number of processors to allocate to it. Once the job has received its allocation, it is guaranteed to have exactly that number of processors available to it whenever it is active. The other class of two-level scheduler, dynamic, allows each job to acquire and release processors during its execution. By responding to the varying parallelism of the jobs, the dynamic scheduler promises higher processor utilizations at the cost of potentially greater scheduling overhead and more complicated application level task control policies.Our results, obtained via simulation, highlight the tradeoffs between the static and dynamic approaches. We investigate how the choice of policy is affected by the cost of switching a processor from one job to another. We show that for a wide range of plausible overhead values, dynamic scheduling is superior to static scheduling. Within the class of static schedulers, we show that, in most cases, a simple “run to completion” scheme is preferable to a round-robin approach. Finally, we investigate different techniques for tuning the allocation decisions required by the dynamic policies and quantify their effects on performance.We believe our results are directly applicable to many existing shared memory parallel computers, which for the most part currently employ a simple “single queue of tasks” extension of basic sequential machine schedulers. We plan to validate our results in future work through implementation and experimentation on such a system.