Runtime measurements in the cloud: observing, analyzing, and reducing variance

Authors:
Jörg Schad;Jens Dittrich;Jorge-Arnulfo Quiané-Ruiz
Affiliations:
Saarland University;Saarland University;Saarland University
Venue:
Proceedings of the VLDB Endowment
Year:
2010

Citing 10
Cited 69

MapReduce: simplified data processing on large clusters

OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Criteria to Compare Cloud Computing with Current Database Technology

IWSM/Metrikon/Mensura '08 Proceedings of the International Conferences on Software Process and Product Measurement
Can cloud computing reach the top500?

Proceedings of the combined workshops on UnConventional high performance computing workshop plus memory access workshop
A comparison of approaches to large-scale data analysis

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Performance model driven QoS guarantees and optimization in clouds

CLOUD '09 Proceedings of the 2009 ICSE Workshop on Software Engineering Challenges of Cloud Computing
What's inside the Cloud? An architectural map of the Cloud landscape

CLOUD '09 Proceedings of the 2009 ICSE Workshop on Software Engineering Challenges of Cloud Computing
How is the weather tomorrow?: towards a benchmark for the cloud

Proceedings of the Second International Workshop on Testing Database Systems
Hey, you, get off of my cloud: exploring information leakage in third-party compute clouds

Proceedings of the 16th ACM conference on Computer and communications security
An evaluation of alternative architectures for transaction processing in the cloud

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
EC2 performance analysis for resource provisioning of service-oriented applications

ICSOC/ServiceWave'09 Proceedings of the 2009 international conference on Service-oriented computing

Hadoop++: making a yellow elephant run like a cheetah (without it even noticing)

Proceedings of the VLDB Endowment
Efficient data transmission between multimedia web services via aspect-oriented programming

MMSys '11 Proceedings of the second annual ACM conference on Multimedia systems
RAFT at work: speeding-up mapreduce applications under task and node failures

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Tradeoffs Between Profit and Customer Satisfaction for Service Provisioning in the Cloud

Proceedings of the 20th international symposium on High performance distributed computing
Towards predictable datacenter networks

Proceedings of the ACM SIGCOMM 2011 conference
Making time-stepped applications tick in the cloud

Proceedings of the 2nd ACM Symposium on Cloud Computing
Trojan data layouts: right shoes for a running elephant

Proceedings of the 2nd ACM Symposium on Cloud Computing
Cuanta: quantifying effects of shared on-chip resource interference for consolidated virtual machines

Proceedings of the 2nd ACM Symposium on Cloud Computing
The price is right: towards location-independent costs in datacenters

Proceedings of the 10th ACM Workshop on Hot Topics in Networks
Migration of Multi-tier Applications to Infrastructure-as-a-Service Clouds: An Investigation Using Kernel-Based Virtual Machines

GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
PerfXplain: debugging MapReduce job performance

Proceedings of the VLDB Endowment
How a consumer can measure elasticity for cloud platforms

ICPE '12 Proceedings of the 3rd ACM/SPEC International Conference on Performance Engineering
Editorial

Concurrency and Computation: Practice & Experience
Pricing cloud bandwidth reservations under demand uncertainty

Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE joint international conference on Measurement and Modeling of Computer Systems
Halt or continue: estimating progress of queries in the cloud

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part II
Understanding the effects and implications of compute node related failures in hadoop

Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing
Self-managing SLA compliance in cloud architectures: a market-based approach

Proceedings of the 3rd international ACM SIGSOFT symposium on Architecting Critical Systems
RDF data management in the Amazon cloud

Proceedings of the 2012 Joint EDBT/ICDT Workshops
Cloud SLAs: present and future

ACM SIGOPS Operating Systems Review
Walk the line: consistent network updates with bandwidth guarantees

Proceedings of the first workshop on Hot topics in software defined networks
The seven deadly sins of cloud computing research

HotCloud'12 Proceedings of the 4th USENIX conference on Hot Topics in Cloud Ccomputing
Towards fair sharing of block storage in a multi-tenant cloud

HotCloud'12 Proceedings of the 4th USENIX conference on Hot Topics in Cloud Ccomputing
Only aggressive elephants are fast elephants

Proceedings of the VLDB Endowment
An architecture framework for application-managed scaling of cloud-hosted relational databases

Proceedings of the WICSA/ECSA 2012 Companion Volume
Optimization of analytic data flows for next generation business intelligence applications

TPCTC'11 Proceedings of the Third TPC Technology conference on Topics in Performance Evaluation, Measurement and Characterization
When free is not really free: what does it cost to run a database workload in the cloud?

TPCTC'11 Proceedings of the Third TPC Technology conference on Topics in Performance Evaluation, Measurement and Characterization
Detecting co-residency with active traffic analysis techniques

Proceedings of the 2012 ACM Workshop on Cloud computing security workshop
Resource-freeing attacks: improve your cloud performance (at your neighbor's expense)

Proceedings of the 2012 ACM conference on Computer and communications security
Towards non-intrusive elastic query processing in the cloud

Proceedings of the fourth international workshop on Cloud data management
Benchmarking OLTP/web databases in the cloud: the OLTP-bench framework

Proceedings of the fourth international workshop on Cloud data management
Bridging the tenant-provider gap in cloud services

Proceedings of the Third ACM Symposium on Cloud Computing
More for your money: exploiting performance heterogeneity in public clouds

Proceedings of the Third ACM Symposium on Cloud Computing
Estimating resource costs of data-intensive workloads in public clouds

Proceedings of the 10th International Workshop on Middleware for Grids, Clouds and e-Science
Data-Intensive Workload Consolidation for the Hadoop Distributed File System

GRID '12 Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid Computing
On a Catalogue of Metrics for Evaluating Commercial Cloud Services

GRID '12 Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid Computing
Towards Elastic Multi-Tenant Database Replication with Quality of Service

UCC '12 Proceedings of the 2012 IEEE/ACM Fifth International Conference on Utility and Cloud Computing
A Representation Model for Virtual Machine Allocation

UCC '12 Proceedings of the 2012 IEEE/ACM Fifth International Conference on Utility and Cloud Computing
An experimental study of open-source cloud platforms for dust storm forecasting

Proceedings of the 20th International Conference on Advances in Geographic Information Systems
Trade-Off analysis of elasticity approaches for cloud-based business applications

WISE'12 Proceedings of the 13th international conference on Web Information Systems Engineering
A framework for ranking of cloud computing services

Future Generation Computer Systems
MAScloud: a framework based on multi-agent systems for optimizing cost in cloud computing

ICCCI'12 Proceedings of the 4th international conference on Computational Collective Intelligence: technologies and applications - Volume Part I
ClouDiA: a deployment advisor for public clouds

Proceedings of the VLDB Endowment
Web data indexing in the cloud: efficiency and cost reductions

Proceedings of the 16th International Conference on Extending Database Technology
Embedded systems for global e-Social Science: Moving computation rather than data

Future Generation Computer Systems
Editorial: Model-driven provisioning of application services in hybrid computing environments

Future Generation Computer Systems
Performance implications of multi-tier application deployments on Infrastructure-as-a-Service clouds: Towards performance modeling

Future Generation Computer Systems
Position paper: cloud-based performance testing: issues and challenges

Proceedings of the 2013 international workshop on Hot topics in cloud services
Towards building performance models for data-intensive workloads in public clouds

Proceedings of the 4th ACM/SPEC International Conference on Performance Engineering
A study of pricing for cloud resources

ACM SIGMETRICS Performance Evaluation Review
Modelling exogenous variability in cloud deployments

ACM SIGMETRICS Performance Evaluation Review
Provisioning data analytic workloads in a cloud

Future Generation Computer Systems
Chatty tenants and the cloud network sharing problem

nsdi'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
Bobtail: avoiding long tails in the cloud

nsdi'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
A vision for personalized service level agreements in the cloud

Proceedings of the Second Workshop on Data Analytics in the Cloud
SPGM: an efficient algorithm for mapping MapReduce-like data-intensive applications in data centre network

International Journal of Web and Grid Services
Survey Cloud monitoring: A survey

Computer Networks: The International Journal of Computer and Telecommunications Networking
DynamicCloudSim: simulating heterogeneity in computational clouds

Proceedings of the 2nd ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies
Guide-copy: fast and silent migration of virtual machine for datacenters

SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Choreo: network-aware task placement for cloud applications

Proceedings of the 2013 conference on Internet measurement conference
Adaptive Online Compression in Clouds--Making Informed Decisions in Virtual Machine Environments

Journal of Grid Computing
Consumer-centric SLA manager for cloud-hosted databases

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Small is better: avoiding latency traps in virtualized data centers

Proceedings of the 4th annual Symposium on Cloud Computing
A framework for analyzing monetary cost of database systems in the cloud

WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management
Developing self managing software systems using agile modeling

ACM SIGSOFT Software Engineering Notes
Toward software-defined SLAs

Communications of the ACM
Toward Software-defined SLAs

Queue - Distributed Computing
A Framework and Middleware for Application-Level Cloud Bursting on Top of Infrastructure-as-a-Service Clouds

UCC '13 Proceedings of the 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing
Report from the second workshop on scalable workflow enactment engines and technology (SWEET'13)

ACM SIGMOD Record
A Speculative Parallel DFA Membership Test for Multicore, SIMD and Cloud Computing Environments

International Journal of Parallel Programming

Quantified Score

Hi-index	0.03

Visualization

Abstract

One of the main reasons why cloud computing has gained so much popularity is due to its ease of use and its ability to scale computing resources on demand. As a result, users can now rent computing nodes on large commercial clusters through several vendors, such as Amazon and rackspace. However, despite the attention paid by Cloud providers, performance unpredictability is a major issue in Cloud computing for (1) database researchers performing wall clock experiments, and (2) database applications providing service-level agreements. In this paper, we carry out a study of the performance variance of the most widely used Cloud infrastructure (Amazon EC2) from different perspectives. We use established microbenchmarks to measure performance variance in CPU, I/O, and network. And, we use a multi-node MapReduce application to quantify the impact on real dataintensive applications. We collected data for an entire month and compare it with the results obtained on a local cluster. Our results show that EC2 performance varies a lot and often falls into two bands having a large performance gap in-between --- which is somewhat surprising. We observe in our experiments that these two bands correspond to the different virtual system types provided by Amazon. Moreover, we analyze results considering different availability zones, points in time, and locations. This analysis indicates that, among others, the choice of availability zone also influences the performance variability. A major conclusion of our work is that the variance on EC2 is currently so high that wall clock experiments may only be performed with considerable care. To this end, we provide some hints to users.