Proactive management of service instance pools for meeting service level agreements

Authors:
Kavitha Ranganathan;Asit Dan
Affiliations:
IBM T J Watson Research Center, Hawthorne, NY;IBM T J Watson Research Center, Hawthorne, NY
Venue:
ICSOC'05 Proceedings of the Third international conference on Service-Oriented Computing
Year:
2005

Citing 6
Cited 2

The Legion vision of a worldwide virtual computer

Communications of the ACM
Matchmaking: Distributed Resource Management for High Throughput Computing

HPDC '98 Proceedings of the 7th IEEE International Symposium on High Performance Distributed Computing
Design and Evaluation of a Resource Selection Framework for Grid Applications

HPDC '02 Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing
A Service Level Agreement Language for Dynamic Electronic Services

WECWIS '02 Proceedings of the Fourth IEEE International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems (WECWIS'02)
Connecting client objectives with resource capabilities: an essential component for grid service managent infrastructures

Proceedings of the 2nd international conference on Service oriented computing
The Cactus Worm: Experiments with Dynamic Resource Discovery and Allocation in a Grid Environment

International Journal of High Performance Computing Applications

Resource management with stateful support for analytic applications

IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
A case study on optimizing web service monitoring configurations

ICSOC'10 Proceedings of the 2010 international conference on Service-oriented computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Existing Grid schedulers focus on allocating resources to jobs as per the resource requirements expressed by end-users. This demands detailed knowledge of application behavior for different resource configurations on the part of end-users. Additionally, this model incurs significant delay in terms of the provisioning overhead for each request. In contrast, for interactive workloads, services are commonly pre-configured by an application server according to long-term steady-state requirements. In this paper, we propose a framework for bridging the gap between these two extremes. We target application services beyond simple interactive workloads, such as a parallel numeric application. In our approach, end users are shielded from lower-level resource configuration details and deal only with service metrics like average response time, expressed as SLAs. These SLAs are then translated into concrete resource allocation decisions. Since demand for a service fluctuates over time, static pre-configurations may not maximize utility of the common pool of resources. Our approach involves dynamic re-provisioning to achieve maximum utility, while accounting for overheads incurred during re-provisioning. We find that it is not always beneficial to re-provision resources according to perceived benefits and propose a model for calculating the optimal amount of re-provisioning for a particular scenario.