Reinforcement learning as a means of dynamic aggregate QoS provisioning

Authors:
Nail Akar;Cem Sahin
Affiliations:
Electrical and Electronics Engineering Dept., Bilkent University, Bilkent, Ankara, Turkey;Electrical and Electronics Engineering Dept., Bilkent University, Bilkent, Ankara, Turkey
Venue:
Art-QoS'03 Proceedings of the 2003 international conference on Architectures for quality of service in the internet
Year:
2003

Citing 5
Cited 1

Supporting real-time applications in an Integrated Services Packet Network: architecture and mechanism

SIGCOMM '92 Conference proceedings on Communications architectures & protocols
Average reward reinforcement learning: foundations, algorithms, and empirical results

Machine Learning - Special issue on reinforcement learning
MPLS: technology and applications

MPLS: technology and applications
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Neuro-Dynamic Programming

Neuro-Dynamic Programming

Reinforcement Learning: A Tutorial Survey and Recent Advances

INFORMS Journal on Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Dynamic capacity management (or dynamic provisioning) is the process of dynamically changing the capacity allocation (reservation) of a virtual path (or a pseudo-wire) established between two network end points. This process is based on certain criteria including instantaneous traffic load for the pseudo-wire, network utilization, hour of day, or day of week. Frequent adjustment of the capacity yields a scalability issue in the form of a significant amount of message distribution and processing (i.e., signaling) in the network elements involved in the capacity update process. We therefore use the term "signaling rate" for the number of capacity updates per unit time. On the other hand, if the capacity is adjusted once and for the highest loaded traffic conditions, a significant amount of bandwidth may be wasted depending on the actual traffic load. There is then a need for dynamic capacity management that takes into account the tradeoff between signaling scalability and bandwidth efficiency. In this paper, we introduce a Markov decision framework for an optimal capacity management scheme. Moreover, for problems with large sizes and for which the desired signaling rate is imposed as a constraint, we provide suboptimal schemes using reinforcement learning. Our numerical results demonstrate that the reinforcement learning schemes that we propose provide significantly better bandwidth efficiencies than the static allocation policy without violating the signaling rate requirements of the underlying network.