Decentralized Q-learning for aggregated interference control in completely and partially observable cognitive radio networks

Authors:
Ana Galindo-Serrano;L. Giupponi
Affiliations:
Centre Tecnològic de Telecomunicacions de Catalunya, Parc Mediterrani de la Tecnologia, Barcelona, Spain;Centre Tecnològic de Telecomunicacions de Catalunya, Parc Mediterrani de la Tecnologia, Barcelona, Spain
Venue:
CCNC'10 Proceedings of the 7th IEEE conference on Consumer communications and networking conference
Year:
2010

Citing 3
Cited 3

Technical Note: \cal Q-Learning

Machine Learning
Performance of power detector sensors of DTV signals in IEEE 802.22 WRANs

TAPAS '06 Proceedings of the first international workshop on Technology and policy for accessing spectrum
Cognitive radio: brain-empowered wireless communications

IEEE Journal on Selected Areas in Communications

Docitive networks: an emerging paradigm for dynamic spectrum management

IEEE Wireless Communications
Review: Reinforcement learning for context awareness and intelligence in wireless networks: Review, new features and open issues

Journal of Network and Computer Applications
A reinforcement learning based solution for cognitive network cooperation between co-located, heterogeneous wireless sensor networks

Ad Hoc Networks

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper deals with the problem of aggregated interference generated by multiple cognitive radios (CR) at the receivers of primary (licensed) users. In particular, we consider a secondary CR system based on the mEE 802.22 standard for wireless regional area networks (WRAN), and we model it as a multi-agent system where the multiple agents are the different secondary base stations in charge of controlling the secondary cells. We propose a form of real-time multi-agent reinforcement learning, known as decentralized Q-leaming, to manage the aggregated interference generated by multiple WRAN cells. We consider both situations of complete and partial information about the environment. By directly interacting with the surrounding environment in a distributed fashion, the multi-agent system is able to learn, in the first case, an optimal policy to solve the problem and, in the second case, a reasonably good suboptimal policy. Simulation results reveal that the proposed approach is able to fulfill the primary users interference constraints, without introducing signaling overhead in the system.