Online response time optimization of Apache web server

  • Authors:
  • Xue Liu;Lui Sha;Yixin Diao;Steven Froehlich;Joseph L. Hellerstein;Sujay Parekh

  • Affiliations:
  • Department of Computer Science, Univ. of Illinois at Urbana-Champaign, Urbana, IL;Department of Computer Science, Univ. of Illinois at Urbana-Champaign, Urbana, IL;IBM T. J. Watson Research Center, Hawthorne, NY;IBM T. J. Watson Research Center, Hawthorne, NY;IBM T. J. Watson Research Center, Hawthorne, NY;IBM T. J. Watson Research Center, Hawthorne, NY

  • Venue:
  • IWQoS'03 Proceedings of the 11th international conference on Quality of service
  • Year:
  • 2003

Abstract

Properly optimizing the setting of configuration parameters can greatly improve performance, especially in the presence of changing workloads. This paper explores approaches to online optimization of the Apache web server, focusing on the MaxClients parameter (which controls the maximum number of workers). Using both empirical and analytic techniques, we show that MaxClients has a concave upward effect on response time and hence hill climbing techniques can be used to find the optimal value of MaxClients. We investigate two optimizers that employ hill climbing: one based on Newton's method and the other on fuzzy control. A third technique is a heuristic that exploits relationships between bottleneck utilizations and response time minimization. In all cases, online optimization reduces response times by a factor of 10 or more compared to using a static, default value. The trade-offs between the online schemes are as follows. Newton's method is well known but does not produce consistent results for highly variable data such as response times. Fuzzy control is more robust, but converges slowly. The heuristic works well in our prototype system, but it may be difficult to generalize because it requires knowledge of bottleneck resources and an ability to measure their utilizations.
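
As an illustration of why a concave-upward response-time curve makes hill climbing applicable, the following minimal sketch runs a generic step-halving hill climber over an integer MaxClients range. It is not the paper's Newton-based or fuzzy-control optimizer. The function names, the step schedule, the search bounds, and the noise-free `toy_response_time` curve are all assumptions made for this example; a real server would return noisy measurements, which is the difficulty the abstract attributes to Newton's method.

```python
def toy_response_time(max_clients):
    """Hypothetical, noise-free response-time curve (assumption, not measured data).

    The first term stands in for queueing delay when there are too few workers;
    the second stands in for contention overhead when there are too many.
    The sum is concave upward for max_clients > 0.
    """
    return 2000.0 / max_clients + 0.01 * max_clients


def hill_climb_max_clients(measure_rt, start, lo, hi, step=64):
    """Search [lo, hi] for the integer setting with the lowest response time.

    Tries one step below and one step above the current setting, moves to the
    first neighbor that improves response time, and halves the step when
    neither neighbor improves. Stops once the step size drops below 1.
    """
    def clamp(value):
        return max(lo, min(hi, value))

    best = clamp(start)
    best_rt = measure_rt(best)
    while step >= 1:
        moved = False
        for candidate in (clamp(best - step), clamp(best + step)):
            if candidate == best:
                continue  # already at a boundary of the search range
            rt = measure_rt(candidate)
            if rt < best_rt:
                best, best_rt = candidate, rt
                moved = True
                break
        if not moved:
            step //= 2  # refine the search around the current setting
    return best, best_rt


# Starting point and bounds are arbitrary choices for the illustration.
best, best_rt = hill_climb_max_clients(toy_response_time, start=150, lo=1, hi=1024)
print(best, round(best_rt, 3))
```

Because the toy curve has a single minimum, the search cannot get trapped away from it. With measured response times, each `measure_rt` call would likely need to average several samples before comparing settings.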