Combining batch execution and leasing using virtual machines

Authors:
Borja Sotomayor;Kate Keahey;Ian Foster
Affiliations:
University of Chicago, Chicago, IL, USA;Argonne National Laboratory, Argonne, IL, USA;Argonne National Laboratory, Argonne, IL, USA
Venue:
HPDC '08 Proceedings of the 17th international symposium on High performance distributed computing
Year:
2008

Citing 24
Cited 49

Utilization, Predictability, Workloads, and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling

IEEE Transactions on Parallel and Distributed Systems
Condor-G: A Computation Management Agent for Multi-Institutional Grids

Cluster Computing
The ANL/IBM SP Scheduling System

IPPS '95 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Parallel Job Scheduling: Issues and Approaches

IPPS '95 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Metrics and Benchmarking for Parallel Job Scheduling

IPPS/SPDP '98 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Resource Co-Allocation in Computational Grids

HPDC '99 Proceedings of the 8th IEEE International Symposium on High Performance Distributed Computing
Scheduling with Advanced Reservations

IPDPS '00 Proceedings of the 14th International Symposium on Parallel and Distributed Processing
Handbook of Scheduling: Algorithms, Models, and Performance Analysis

Handbook of Scheduling: Algorithms, Models, and Performance Analysis
VMPlants: Providing and Managing Virtual Machine Execution Environments for Grid Computing

Proceedings of the 2004 ACM/IEEE conference on Supercomputing
From virtualized resources to virtual computing grids: the In-VIGO system

Future Generation Computer Systems - Special section: Complex problem-solving environments for grid computing
Maestro-VC: A Paravirtualized Execution Environment for Secure On-Demand Cluster Computing

CCGRID '06 Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid
Virtual Clusters for Grid Communities

CCGRID '06 Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid
Virtual workspaces: Achieving quality of service and quality of life in the Grid

Scientific Programming - Dynamic Grids and Worldwide Computing
Virtual Clusters on the Fly - Fast, Scalable, and Flexible Installation

CCGRID '07 Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid
Sharing networked resources with brokered leases

ATEC '06 Proceedings of the annual conference on USENIX '06 Annual Technical Conference
Grid'5000: A Large Scale And Highly Reconfigurable Experimental Grid Testbed

International Journal of High Performance Computing Applications
Xen and the Art of Cluster Scheduling

VTDC '06 Proceedings of the 2nd International Workshop on Virtualization Technology in Distributed Computing
Overhead Matters: A Model for Virtual Resource Management

VTDC '06 Proceedings of the 2nd International Workshop on Virtualization Technology in Distributed Computing
Falkon: a Fast and Light-weight tasK executiON framework

Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Model-based resource selection for efficient virtual cluster deployment

VTDC '07 Proceedings of the 2nd international workshop on Virtualization technology in distributed computing
Autonomic Live Adaptation of Virtual Computational Environments in a Multi-Domain Infrastructure

ICAC '06 Proceedings of the 2006 IEEE International Conference on Autonomic Computing
Advance reservation policies for workflows

JSSPP'06 Proceedings of the 12th international conference on Job scheduling strategies for parallel processing
Impact of reservations on production job scheduling

JSSPP'07 Proceedings of the 13th international conference on Job scheduling strategies for parallel processing
Division of labor: tools for growing and scaling grids

ICSOC'06 Proceedings of the 4th international conference on Service-Oriented Computing

Feedback-controlled resource sharing for predictable eScience

Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Evaluating the cost-benefit of using cloud computing to extend the capacity of clusters

Proceedings of the 18th ACM international symposium on High performance distributed computing
Experiencing with the Cloud over gLite

CLOUD '09 Proceedings of the 2009 ICSE Workshop on Software Engineering Challenges of Cloud Computing
Self-Tuning Virtual Machines for Predictable eScience

CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
Dynamic Provisioning of Virtual Organization Clusters

CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
VGrADS: enabling e-Science workflows on grids and clouds with fault tolerance

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
CARE Resource Broker: A framework for scheduling and supporting virtual resource management

Future Generation Computer Systems
Self-provisioned hybrid clouds

Proceedings of the 7th international conference on Autonomic computing
Virtual Organization Clusters: Self-provisioned clouds on the grid

Future Generation Computer Systems
A cost-benefit analysis of using cloud computing to extend the capacity of clusters

Cluster Computing
Cluster-wide context switch of virtualized jobs

Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Elastic Cloud Caches for Accelerating Service-Oriented Computations

Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
SLA enabled CARE resource broker

Future Generation Computer Systems
Multiplexing low and high QoS workloads in virtual environments

JSSPP'10 Proceedings of the 15th international conference on Job scheduling strategies for parallel processing
Hybrid Computing-Where HPC meets grid and Cloud Computing

Future Generation Computer Systems
Negotiation for resource allocation in IaaS cloud

COMPUTE '11 Proceedings of the Fourth Annual ACM Bangalore Conference
Performance and energy modeling for live migration of virtual machines

Proceedings of the 20th international symposium on High performance distributed computing
Design of a new cloud computing simulation platform

ICCSA'11 Proceedings of the 2011 international conference on Computational science and its applications - Volume Part III
SLA-based resource provisioning for heterogeneous workloads in a virtualized cloud datacenter

ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part I
Performance analysis of preemption-aware scheduling in multi-cluster grid environments

ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part I
Optimizing Resource Consumptions in Clouds

GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
Evaluation of gang scheduling performance and cost in a cloud computing system

The Journal of Supercomputing
QoS and preemption aware scheduling in federated and virtualized Grid computing environments

Journal of Parallel and Distributed Computing
Scheduling and planning job execution of loosely coupled applications

The Journal of Supercomputing
Adapting market-oriented scheduling policies for cloud computing

ICA3PP'10 Proceedings of the 10th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
Service deactivation aware placement and defragmentation in enterprise clouds

Proceedings of the 7th International Conference on Network and Services Management
Energy-efficient and multifaceted resource management for profit-driven virtualized data centers

Future Generation Computer Systems
A reconfiguration language for virtualized grid infrastructures

DAIS'10 Proceedings of the 10th IFIP WG 6.1 international conference on Distributed Applications and Interoperable Systems
Online optimization for scheduling preemptable tasks on IaaS cloud systems

Journal of Parallel and Distributed Computing
Allocation of slotted deadline sensitive leases in infrastructure cloud

ICDCIT'12 Proceedings of the 8th international conference on Distributed Computing and Internet Technology
Enhancing an autonomic cloud architecture with mobile agents

Euro-Par'11 Proceedings of the 2011 international conference on Parallel Processing
Cloud federation in a layered service model

Journal of Computer and System Sciences
Virtualized HPC: a contradiction in terms?

Software—Practice & Experience
Policy based resource allocation in IaaS cloud

Future Generation Computer Systems
Toward scalable Web systems on multicore clusters: making use of virtual machines

The Journal of Supercomputing
Towards realistic benchmarks for virtual infrastructure resource allocators

Proceedings of the Asia-Pacific Workshop on Systems
A general-purpose virtualization service for HPC on cloud computing: an application to GPUs

PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Cost-Aware and SLO-Fulfilling Software as a Service

Journal of Grid Computing
Towards realistic benchmarks for virtual infrastructure resource allocators

APSys'12 Proceedings of the Third ACM SIGOPS Asia-Pacific conference on Systems
Caching VM instances for fast VM provisioning: a comparative evaluation

Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
Cranduler: a dynamic and reusable scheduler for cloud infrastructure service

ICPCA/SWS'12 Proceedings of the 2012 international conference on Pervasive Computing and the Networked World
A genetic algorithm for power-aware virtual machine allocation in private cloud

ICT-EurAsia'13 Proceedings of the 2013 international conference on Information and Communication Technology
Resource provisioning based on lease preemption in InterGrid

ACSC '11 Proceedings of the Thirty-Fourth Australasian Computer Science Conference - Volume 113
Performance and energy modeling for live migration of virtual machines

Cluster Computing
Developing an optimized application hosting framework in Clouds

Journal of Computer and System Sciences
SLA achievement by negotiation in a cloud

Proceedings of the 6th ACM India Computing Convention
Economy Based Resource Allocation in IaaS Cloud

International Journal of Cloud Applications and Computing
Developing resource consolidation frameworks for moldable virtual machines in clouds

Future Generation Computer Systems
Adapting grid computing environments dependable with virtual machines: design, implementation, and evaluations

The Journal of Supercomputing

Quantified Score

Hi-index	0.00

Visualization

Abstract

As cluster computers are used for a wider range of applications, we encounter the need to deliver resources at particular times, to meet particular deadlines, and/or at the same time as other resources are provided elsewhere. To address such requirements, we describe a scheduling approach in which users request resource leases, where leases can request either as-soon-as-possible ("best-effort") or reservation start times. We present the design of a lease management architecture, Haizea, that implements leases as virtual machines (VMs), leveraging their ability to suspend, migrate, and resume computations and to provide leased resources with customized application environments. We discuss methods to minimize the overhead introduced by having to deploy VM images before the start of a lease. We also present the results of simulation studies that compare alternative approaches. Using workloads with various mixes of best-effort and advance reservation requests, we compare the performance of our VM-based approach with that of non-VM-based schedulers. We find that a VM-based approach can provide better performance (measured in terms of both total execution time and average delay incurred by best-effort requests) than a scheduler that does not support task pre-emption, and only slightly worse performance than a scheduler that does support task pre-emption. We also compare the impact of different VM image popularity distributions and VM image caching strategies on performance. These results emphasize the importance of VM image caching for the workloads studied and quantify the sensitivity of scheduling performance to VM image popularity distribution.