Network flows: theory, algorithms, and applications
Network flows: theory, algorithms, and applications
Cluster-based scalable network services
Proceedings of the sixteenth ACM symposium on Operating systems principles
The Coign automatic distributed partitioning system
OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
Integrated resource management for cluster-based internet services
OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
Resource overbooking and application profiling in shared hosting platforms
OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
Dynamic placement for clustered web applications
Proceedings of the 15th international conference on World Wide Web
SCC '06 Proceedings of the IEEE International Conference on Services Computing
Dynamic estimation of CPU demand of web traffic
valuetools '06 Proceedings of the 1st international conference on Performance evaluation methodolgies and tools
Performance modeling and system management for multi-component online services
NSDI'05 Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation - Volume 2
Dynamic application placement under service and memory constraints
WEA'05 Proceedings of the 4th international conference on Experimental and Efficient Algorithms
Automatic virtual machine configuration for database workloads
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Dynamic quorums for DHT-based enterprise infrastructures
Journal of Parallel and Distributed Computing
Doloto: code splitting for network-bound web 2.0 applications
Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of software engineering
Active Energy-Aware Management of Business-Process Based Applications
ServiceWave '08 Proceedings of the 1st European Conference on Towards a Service-Based Internet
Efficient application placement in a dynamic hosting platform
Proceedings of the 18th international conference on World wide web
Harnessing the power of DHTs to build dynamic quorums in large-scale enterprise infrastructures
LADIS '08 Proceedings of the 2nd Workshop on Large-Scale Distributed Systems and Middleware
ICAC '09 Proceedings of the 6th international conference on Autonomic computing
Performance model driven QoS guarantees and optimization in clouds
CLOUD '09 Proceedings of the 2009 ICSE Workshop on Software Engineering Challenges of Cloud Computing
Run-time resource management in SOA virtualized environments
Proceedings of the 1st international workshop on Quality of service-oriented software systems
DSF: a common platform for distributed systems research and development
Proceedings of the 10th ACM/IFIP/USENIX International Conference on Middleware
Automatic virtual machine configuration for database workloads
ACM Transactions on Database Systems (TODS)
Agile resource management in a virtualized data center
Proceedings of the first joint WOSP/SIPEW international conference on Performance engineering
Joint admission control and resource allocation in virtualized servers
Journal of Parallel and Distributed Computing
DSF: a common platform for distributed systems research and development
Middleware'09 Proceedings of the ACM/IFIP/USENIX 10th international conference on Middleware
USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
An offer generation approach to SLA negotiation support in service oriented computing
Service Oriented Computing and Applications
Self-adaptive resource management for large-scale shared clusters
Journal of Computer Science and Technology
Multi-Objective Virtual Machine Placement in Virtualized Data Center Environments
GREENCOM-CPSCOM '10 Proceedings of the 2010 IEEE/ACM Int'l Conference on Green Computing and Communications & Int'l Conference on Cyber, Physical and Social Computing
A capacity planning process for performance assurance of component-based distributed systems
Proceedings of the 2nd ACM/SPEC International Conference on Performance engineering
Inter-domain stealthy port scan detection through complex event processing
EWDC '11 Proceedings of the 13th European Workshop on Dependable Computing
Incremental placement of interactive perception applications
Proceedings of the 20th international symposium on High performance distributed computing
SAFECOMP'11 Proceedings of the 30th international conference on Computer safety, reliability, and security
CloudOpt: multi-goal optimization of application deployments across a cloud
Proceedings of the 7th International Conference on Network and Services Management
Gossip-based resource allocation for green computing in large clouds
Proceedings of the 7th International Conference on Network and Services Management
Distributed workload and response time management for web applications
Proceedings of the 7th International Conference on Network and Services Management
Dynamic service placement in shared service hosting infrastructures
NETWORKING'10 Proceedings of the 9th IFIP TC 6 international conference on Networking
Resource-aware adaptive scheduling for mapreduce clusters
Middleware'11 Proceedings of the 12th ACM/IFIP/USENIX international conference on Middleware
Dual time-scale distributed capacity allocation and load redirect algorithms for cloud systems
Journal of Parallel and Distributed Computing
Towards an understanding of oversubscription in cloud
Hot-ICE'12 Proceedings of the 2nd USENIX conference on Hot Topics in Management of Internet, Cloud, and Enterprise Networks and Services
Virtual machine placement for predictable and time-constrained peak loads
GECON'11 Proceedings of the 8th international conference on Economics of Grids, Clouds, Systems, and Services
SLA-based Optimization of Power and Migration Cost in Cloud Computing
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Resource-aware adaptive scheduling for MapReduce clusters
Proceedings of the 12th International Middleware Conference
Towards transparent and distributed workload management for large scale web servers
Future Generation Computer Systems
Automatic virtual machine clustering based on bhattacharyya distance for multi-cloud systems
Proceedings of the 2013 international workshop on Multi-cloud applications and federated clouds
Availability analysis for deployment of in-cloud applications
Proceedings of the 4th international ACM Sigsoft symposium on Architecting critical systems
Autonomic resource provisioning in cloud systems with availability goals
Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference
Network-aware impact determination algorithms for service workflow deployment in hybrid clouds
Proceedings of the 8th International Conference on Network and Service Management
QoS-aware placement of stream processing service
The Journal of Supercomputing
Developing an optimized application hosting framework in Clouds
Journal of Computer and System Sciences
International Journal of Web Services Research
Network-aware virtual machine consolidation for large data centers
NDM '13 Proceedings of the Third International Workshop on Network-Aware Data Management
Hi-index | 0.00 |
Given a set of machines and a set of Web applications with dynamically changing demands, an online application placement controller decides how many instances to run for each application and where to put them, while observing all kinds of resource constraints. This NP hard problem has real usage in commercial middleware products. Existing approximation algorithms for this problem can scale to at most a few hundred machines, and may produce placement solutions that are far from optimal when system resources are tight. In this paper, we propose a new algorithm that can produce within 30seconds high-quality solutions for hard placement problems with thousands of machines and thousands of applications. This scalability is crucial for dynamic resource provisioning in large-scale enterprise data centers. Our algorithm allows multiple applications to share a single machine, and strivesto maximize the total satisfied application demand, to minimize the number of application starts and stops, and to balance the load across machines. Compared with existing state-of-the-art algorithms, for systems with 100 machines or less, our algorithm is up to 134 times faster, reduces application starts and stops by up to 97%, and produces placement solutions that satisfy up to 25% more application demands. Our algorithm has been implemented and adopted in a leading commercial middleware product for managing the performance of Web applications.