Linger Longer: fine-grain cycle stealing for networks of workstations

Authors:
Kyung Dong Ryu;Jeffrey K. Hollingsworth
Affiliations:
University of Maryland, College Park, MD;University of Maryland, College Park, MD
Venue:
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Year:
1998

Abstract

Studies have shown that workstations are idle a significant fraction of the time. In this paper we present a new scheduling policy called Linger-Longer that exploits the fine-grained availability of workstations to run sequential and parallel jobs. We present a two-level workload characterization study and use it to simulate a cluster of workstations running our new policy. We compare two variations of our policy to two previous policies: Immediate-Eviction and Pause-and-Migrate. Our study shows that the Linger-Longer policy can improve the throughput of foreign jobs on a cluster by 60% with only a 0.5% slowdown of foreground jobs. For parallel computing, we show that the Linger-Longer policy outperforms reconfiguration strategies when the processor utilization by the local process is 20% or less, in both synthetic bulk synchronous and real data-parallel applications.