Exploiting Fine-Grained Idle Periods in Networks of Workstations

Authors:
Kyung Dong Ryu; Jeffrey K. Hollingsworth
Affiliations:
Univ. of Maryland, College Park; Univ. of Maryland, College Park
Venue:
IEEE Transactions on Parallel and Distributed Systems
Year:
2000

Abstract

Studies have shown that workstations are idle for a significant fraction of the time. In this paper, we present a new scheduling policy called Linger-Longer that exploits the fine-grained availability of workstations to run sequential and parallel jobs. We present a two-level workload characterization study and use it to simulate a cluster of workstations running our new policy. We compare two variations of our policy to two previous policies: Immediate-Eviction and Pause-and-Migrate. Our study shows that the Linger-Longer policy can improve the throughput of foreign jobs on a cluster by 60 percent with only a 0.5 percent slowdown of local jobs. For parallel computing, we show that the Linger-Longer policy outperforms reconfiguration strategies when local processes use 20 percent or less of the processor, for both synthetic bulk-synchronous and real data-parallel applications.