A scalable HTTP server: the NCSA prototype
Selected papers of the first conference on World-Wide Web
Dynamic Load Balancing on Web-Server Systems
IEEE Internet Computing
WebOS: Operating System Services for Wide Area Applications
HPDC '98 Proceedings of the 7th IEEE International Symposium on High Performance Distributed Computing
FLEX: Load Balancing and Management Strategy for Scalable Web Hosting Service
ISCC '00 Proceedings of the Fifth IEEE Symposium on Computers and Communications (ISCC 2000)
Using smart clients to build scalable services
ATEC '97 Proceedings of the annual conference on USENIX Annual Technical Conference
Dynamically scaling applications in the cloud
ACM SIGCOMM Computer Communication Review
Globe'11 Proceedings of the 4th international conference on Data management in grid and peer-to-peer systems
GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
A study on scalability of services and privacy issues in cloud computing
ICDCIT'12 Proceedings of the 8th international conference on Distributed Computing and Internet Technology
Towards Runtime Reconfiguration of Application Control Policies in the Cloud
Journal of Network and Systems Management
Future Generation Computer Systems
Hi-index | 0.00 |
Web applications' traffic demand fluctuates widely and unpredictably. The common practice of provisioning a fixed capacity would either result in unsatisfied customers (underprovision) or waste valuable capital investment (overprovision). By leveraging an infrastructure cloud's on-demand, pay-per-use capabilities, we finally can match the capacity with the demand in real time. This paper investigates how we can build a large-scale web server farm in the cloud. Our performance study shows that using existing cloud components and optimization techniques, we cannot achieve high scalability. Instead, we propose a client-side load balancing architecture, which can scale and handle failure on a milli-second time scale. We experimentally show that our architecture achieves high throughput in a cloud environment while meeting QoS requirements.