A Butler process for resource sharing on Spice machines
ACM Transactions on Information Systems (TOIS)
Attacking the process migration bottleneck
SOSP '87 Proceedings of the eleventh ACM Symposium on Operating systems principles
The effect of context switches on cache performance
ASPLOS IV Proceedings of the fourth international conference on Architectural support for programming languages and operating systems
The available capacity of a privately owned workstation environment
Performance Evaluation
LOCUS operating system, a transparent system
Computer Communications
Distributed computing feasibility in a non-dedicated homogeneous distributed system
Proceedings of the 1993 ACM/IEEE conference on Supercomputing
Utopia: a load sharing facility for large, heterogeneous distributed computer systems
Software—Practice & Experience
PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing
PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing
The interaction of parallel and sequential workloads on a network of workstations
Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
The SPLASH-2 programs: characterization and methodological considerations
ISCA '95 Proceedings of the 22nd annual international symposium on Computer architecture
Exploiting process lifetime distributions for dynamic load balancing
Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Effective distributed scheduling of parallel workloads
Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
The utility of exploiting idle workstations for parallel computation
SIGMETRICS '97 Proceedings of the 1997 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
The Hector Distributed Run-Time Environment
IEEE Transactions on Parallel and Distributed Systems
Availability and utility of idle memory in workstation clusters
SIGMETRICS '99 Proceedings of the 1999 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Mechanisms and policies for supporting fine-grained cycle stealing
ICS '99 Proceedings of the 13th international conference on Supercomputing
Load-balancing heuristics and process behavior
SIGMETRICS '86/PERFORMANCE '86 Proceedings of the 1986 ACM SIGMETRICS joint international conference on Computer performance modelling, measurement and evaluation
Preemptable remote execution facilities for the V-system
Proceedings of the tenth ACM symposium on Operating systems principles
Probability and Statistics with Reliability, Queuing and Computer Science Applications
Probability and Statistics with Reliability, Queuing and Computer Science Applications
A Case for NOW (Networks of Workstations)
IEEE Micro
On Optimal Strategies for Cycle-Stealing in Networks of Workstations
IEEE Transactions on Computers
CoCheck: Checkpointing and Process Migration for MPI
IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
SOSP '83 Proceedings of the ninth ACM symposium on Operating systems principles
The relative importance of concurrent writers and weak consistency models
ICDCS '96 Proceedings of the 16th International Conference on Distributed Computing Systems (ICDCS '96)
Efficient network and I/O throttling for fine-grain cycle stealing
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Resource Policing to Support Fine-Grain Cycle Stealing in Networks of Workstations
IEEE Transactions on Parallel and Distributed Systems
G2-P2P: a fully decentralised fault-tolerant cycle-stealing framework
ACSW Frontiers '05 Proceedings of the 2005 Australasian workshop on Grid computing and e-research - Volume 44
Dyn-MPI: Supporting MPI on medium-scale, non-dedicated clusters
Journal of Parallel and Distributed Computing
Adaptive hierarchical scheduling policy for enterprise grid computing systems
Journal of Network and Computer Applications
Improving cluster utilization through intelligent processor sharing
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Improving energy efficiency for mobile platforms by exploiting low-power sleep states
Proceedings of the 9th conference on Computing Frontiers
Hi-index | 0.01 |
Studies have shown that for a significant fraction of the time, workstations are idle. In this paper, we present a new scheduling policy called Linger-Longer that exploits the fine-grained availability of workstations to run sequential and parallel jobs. We present a two-level workload characterization study and use it to simulate a cluster of workstations running our new policy. We compare two variations of our policy to two previous policies: Immediate-Eviction and Pause-and-Migrate. Our study shows that the Linger-Longer policy can improve the throughput of foreign jobs on a cluster by 60 percent with only a 0.5 percent slowdown of local jobs. For parallel computing, we show that the Linger-Longer policy outperforms reconfiguration strategies when the processor utilization by the local process is 20 percent or less in both synthetic bulk synchronous and real data-parallel applications.