Using Reinforcement Learning for Multi-policy Optimization in Decentralized Autonomic Systems --- An Experimental Evaluation

  • Authors:
  • Ivana Dusparic;Vinny Cahill

  • Affiliations:
  • Lero --- The Irish Software Engineering Research Centre Distributed Systems Group School of Computer Science and Statistics, Trinity College Dublin, Ireland;Lero --- The Irish Software Engineering Research Centre Distributed Systems Group School of Computer Science and Statistics, Trinity College Dublin, Ireland

  • Venue:
  • ATC '09 Proceedings of the 6th International Conference on Autonomic and Trusted Computing
  • Year:
  • 2009

Abstract

Large-scale autonomic systems are required to self-optimize with respect to high-level policies that can differ in terms of their priority, as well as their spatial and temporal scope. Decentralized multi-agent systems represent one approach to implementing the required self-optimization capabilities. However, the presence of multiple heterogeneous policies leads to heterogeneity of the agents that implement them. In this paper, we evaluate the use of Reinforcement Learning techniques to support the self-optimization of heterogeneous agents towards multiple policies in decentralized systems. We evaluate these techniques in an Urban Traffic Control simulation and compare two approaches to supporting multiple policies. Our results suggest that approaches based on W-learning, which learn separately for each policy and then select between nominated actions based on current action importance, perform better than combining policies into a single learning process over a single state space. The results also indicate that explicitly supporting multiple policies simultaneously can improve waiting times over policies dedicated to optimizing for a single vehicle type.
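
The W-learning idea described in the abstract (one learner per policy, each nominating an action, with the most important nomination executed) can be illustrated with a minimal sketch. It is not the authors' implementation. The class and function names, the tabular representation, the learning parameters, and the exact form of the W update are assumptions; the W update follows the commonly cited formulation in which an agent that was not obeyed moves its W-value towards the loss it suffered.

```python
from collections import defaultdict


class PolicyAgent:
    """One tabular Q-learner per policy, plus a W-value per state.

    All names and default parameters here are hypothetical.
    """

    def __init__(self, actions, alpha=0.1, gamma=0.9):
        self.actions = list(actions)
        self.alpha = alpha
        self.gamma = gamma
        self.q = defaultdict(float)  # (state, action) -> estimated return
        self.w = defaultdict(float)  # state -> importance of being obeyed

    def nominate(self, state):
        # Greedy nomination: the action this policy would prefer to execute.
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def update_q(self, state, action, reward, next_state):
        # Standard Q-learning update on this policy's own reward signal.
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])

    def update_w(self, state, reward, next_state):
        # Assumed W update, applied only when this agent's nomination was
        # not executed: move W towards the difference between what the
        # agent expected from its own action and what it actually got.
        expected = self.q[(state, self.nominate(state))]
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        loss = expected - (reward + self.gamma * best_next)
        self.w[state] += self.alpha * (loss - self.w[state])


def select_action(agents, state):
    """Return the winning agent and the action it nominated."""
    winner = max(agents, key=lambda agent: agent.w[state])
    return winner, winner.nominate(state)
```

In a traffic-signal setting, each policy (for example, one per vehicle type) would own a `PolicyAgent`, and the junction would execute the action returned by `select_action` at every step. In this sketch, `update_w` should be called for the agents that were not obeyed before `update_q`, so that the loss is measured against the pre-update Q-values.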