Service-oriented systems, which consist of atomic services and their compositions hosted in service composition execution engines (CEEs), are commonly deployed to deliver web applications. Because application workloads fluctuate over time, it is economical to adjust system capacity autonomously and dynamically, i.e., to adjust the number of replicas of atomic services and CEEs. In this paper, we propose a novel replica provisioning policy, Resos, which periodically adjusts the number of CEE and service replicas based on predicted workloads so that all replicas operate at their target utilization. In particular, Resos models the workload balance and dependency between CEE and service replicas by estimating the probability that threads of CEE replicas are not blocked by I/O. Moreover, we derive analytical bounds on the effective utilization of CEEs and illustrate the cause of low nominal utilization at CEE replicas. We evaluate Resos on a simulated service-oriented system that hosts CEE and service replicas on multi-threaded servers, using a workload derived from utilization traces collected from production systems. Through simulation, we demonstrate that Resos effectively reduces the number of required replicas while maintaining target utilization and lowering request response times.
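To make the idea of provisioning against a target utilization concrete, the following is a minimal sketch of a generic sizing rule, not the Resos policy itself. It assumes that the predicted workload and the capacity of one replica are expressed in the same unit (e.g., requests per second); the function name and all parameters are hypothetical and do not come from the paper. The sketch does not model the dependency between CEE and service replicas or the probability that CEE threads are blocked by I/O.

```python
import math


def replicas_for_target(predicted_demand: float,
                        per_replica_capacity: float,
                        target_utilization: float) -> int:
    """Return the smallest replica count whose average utilization
    stays at or below the target for the predicted demand.

    All names here are illustrative assumptions, not part of Resos.
    """
    if not 0.0 < target_utilization <= 1.0:
        raise ValueError("target_utilization must be in (0, 1]")
    if per_replica_capacity <= 0.0:
        raise ValueError("per_replica_capacity must be positive")
    if predicted_demand <= 0.0:
        # Assumption: keep one replica alive even when no load is predicted.
        return 1
    # Each replica should carry at most capacity * target of the demand.
    usable_capacity = per_replica_capacity * target_utilization
    return max(1, math.ceil(predicted_demand / usable_capacity))


# One provisioning decision per period, driven by a workload forecast.
forecast = [120.0, 100.0, 0.0, 260.0]  # hypothetical predicted demand per period
plan = [replicas_for_target(d, per_replica_capacity=100.0, target_utilization=0.5)
        for d in forecast]
```

With a capacity of 100 and a target of 0.5, each replica is planned to carry at most 50 units of demand, so the replica count follows the forecast up and down from one period to the next.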