Scheduling best-effort and real-time pipelined applications on time-shared clusters

Authors:
Yanyong Zhang;Anand Sivasubramaniam
Affiliations:
Department of Computer Science & Engineering, The Pennsylvania State University, University Park, PA;Department of Computer Science & Engineering, The Pennsylvania State University, University Park, PA
Venue:
Proceedings of the thirteenth annual ACM symposium on Parallel algorithms and architectures
Year:
2001

Abstract

Two important emerging trends are influencing the design, implementation and deployment of high performance parallel systems. The first is on the architectural end, where both economic and technological factors are compelling the use of off-the-shelf computing elements (workstations/PCs and networks) to put together high performance systems called clusters. The second comes from the user community, which is finding an increasing number of applications that benefit from such high performance systems. Apart from the scientific applications that have traditionally needed supercomputing power, a large number of graphics, visualization, database, web service and e-commerce applications have started using clusters because of their high processing and storage requirements. These applications have diverse characteristics and can place different Quality-of-Service (QoS) requirements on the underlying system (low response time, high throughput, high I/O demands, guaranteed response time or throughput, etc.). Further, clusters running such applications need to cater to a potentially large number of users (or other applications) in a time-shared manner. The underlying system needs to accommodate the requirements of each application while ensuring that applications do not interfere with each other.

This paper focuses on the CPU resources of a cluster and investigates scheduling mechanisms to meet the responsiveness, throughput and guaranteed service requirements of different applications. Specifically, we propose and evaluate three different scheduling mechanisms. These mechanisms are drawn from traditional solutions on parallel systems (gang scheduling and dynamic coscheduling) and are extended to accommodate the new criteria under consideration. They are investigated using detailed simulation and workload models to show their pros and cons for different performance metrics.