User-Centric Performance Analysis of Market-Based Cluster Batch Schedulers
CCGRID '02 Proceedings of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid
On Recent Advances in Time/Utility Function Real-Time Scheduling and Resource Management
ISORC '05 Proceedings of the Eighth IEEE International Symposium on Object-Oriented Real-Time Distributed Computing
Resource overbooking and application profiling in shared hosting platforms
OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
VSched: Mixing Batch And Interactive Virtual Machines Using Periodic Real-time Scheduling
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Virtual Clusters for Grid Communities
CCGRID '06 Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid
Dynamic placement for clustered web applications
Proceedings of the 15th international conference on World Wide Web
Resource management for clusters of virtual machines
CCGRID '05 Proceedings of the Fifth IEEE International Symposium on Cluster Computing and the Grid - Volume 01
Dynamic estimation of CPU demand of web traffic
valuetools '06 Proceedings of the 1st international conference on Performance evaluation methodolgies and tools
Utility Accrual Real-Time Scheduling under Variable Cost Functions
IEEE Transactions on Computers
On the Use of Fuzzy Modeling in Virtualized Data Center Management
ICAC '07 Proceedings of the Fourth International Conference on Autonomic Computing
Appliance-Based Autonomic Provisioning Framework for Virtualized Outsourcing Data Center
ICAC '07 Proceedings of the Fourth International Conference on Autonomic Computing
Precise and realistic utility functions for user-centric performance analysis of schedulers
Proceedings of the 16th international symposium on High performance distributed computing
Adaptive control of virtualized resources in utility computing environments
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Adaptive data-aware utility-based scheduling in resource-constrained systems
Adaptive data-aware utility-based scheduling in resource-constrained systems
Dynamic application placement under service and memory constraints
WEA'05 Proceedings of the 4th international conference on Experimental and Efficient Algorithms
Performance management for cluster-based web services
IEEE Journal on Selected Areas in Communications
Resource pool management: Reactive versus proactive or let's be friends
Computer Networks: The International Journal of Computer and Telecommunications Networking
Consumer sharing policy constrained resource allocation method for grid system
ISCIT'09 Proceedings of the 9th international conference on Communications and information technologies
SLA-based resource provisioning for heterogeneous workloads in a virtualized cloud datacenter
ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part I
Hi-index | 0.00 |
We present a technique that enables existing middleware to fairly manage mixed workloads: batch jobs and transactional applications. The technique leverages a generic application placement controller, which dynamically allocates compute resources to application instances. The controller works towards a fairness goal while also trying to maximize individual workload performance. We use relative performance functions to drive the application placement controller. Such functions are derived from workload-specific performance models---in the case of transactional workloads, we use queuing theory to build the performance model. For batch workloads, we evaluate a candidate placement by calculating long-term estimates of the completion times that are achievable with that placement according to a scheduling policy. In this paper, we propose a lowest relative performance first scheduling policy as a way to also achieve fair resource allocation among batch jobs. Our technique permits collocation of the workload types on the same physical hardware, and leverages control mechanisms such as suspension and migration to perform online system reconfiguration. In our experiments we demonstrate that our technique maximizes mixed workload performance while providing service differentiation based on high-level performance goals.