Quantitative system performance: computer system analysis using queueing network models
Quantitative system performance: computer system analysis using queueing network models
Falkon: a Fast and Light-weight tasK executiON framework
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Definition and Evaluation of Penalty Functions in SLA Management Framework
ICNS '08 Proceedings of the Fourth International Conference on Networking and Services
Elastic management of cluster-based services in the cloud
ACDC '09 Proceedings of the 1st workshop on Automated control for datacenters and clouds
The Eucalyptus Open-Source Cloud-Computing System
CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
Virtual Infrastructure Management in Private and Hybrid Clouds
IEEE Internet Computing
Introducing Windows Azure
Communications of the ACM
Automated control for elastic storage
Proceedings of the 7th international conference on Autonomic computing
Runtime measurements in the cloud: observing, analyzing, and reducing variance
Proceedings of the VLDB Endowment
Predicting completion times of batch query workloads using interaction-aware models and simulation
Proceedings of the 14th International Conference on Extending Database Technology
Flexible use of cloud resources through profit maximization and price discrimination
ICDE '11 Proceedings of the 2011 IEEE 27th International Conference on Data Engineering
PARA'10 Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume Part I
Executing Data-Intensive Workloads in a Cloud
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Estimating resource costs of data-intensive workloads in public clouds
Proceedings of the 10th International Workshop on Middleware for Grids, Clouds and e-Science
Towards building performance models for data-intensive workloads in public clouds
Proceedings of the 4th ACM/SPEC International Conference on Performance Engineering
Survey Cloud monitoring: A survey
Computer Networks: The International Journal of Computer and Telecommunications Networking
Hi-index | 0.00 |
Data analytics applications are well-suited for a cloud environment. In this paper we examine the problem of provisioning resources in a public cloud to execute data analytic workloads. The goal of our provisioning method is to determine the most cost-effective configuration for a given data analytic workload. Provisioning a workload in a public cloud environment faces several challenges: it is difficult to develop accurate performance prediction models using standard methods; the space of possible configurations is very large so exact solutions cannot be efficiently determined, and the mix and intensity of query classes in a workload vary dynamically over time. We provide a formulation of the provisioning problem and then define a framework to solve the problem. Our framework contains a cost model to predict the cost of executing a workload on a configuration and a method of selecting configurations. The cost model balances resource costs and penalties from SLAs. The specific resource demands and frequencies are accounted for by queueing network models of the Virtual Machines (VMs), which are used to predict performance. We evaluate our approach experimentally using sample data analytic workloads on Amazon EC2.