Technical Note: \cal Q-Learning
Machine Learning
Performance of power detector sensors of DTV signals in IEEE 802.22 WRANs
TAPAS '06 Proceedings of the first international workshop on Technology and policy for accessing spectrum
Cognitive radio: brain-empowered wireless communications
IEEE Journal on Selected Areas in Communications
Docitive networks: an emerging paradigm for dynamic spectrum management
IEEE Wireless Communications
Journal of Network and Computer Applications
Hi-index | 0.00 |
This paper deals with the problem of aggregated interference generated by multiple cognitive radios (CR) at the receivers of primary (licensed) users. In particular, we consider a secondary CR system based on the mEE 802.22 standard for wireless regional area networks (WRAN), and we model it as a multi-agent system where the multiple agents are the different secondary base stations in charge of controlling the secondary cells. We propose a form of real-time multi-agent reinforcement learning, known as decentralized Q-leaming, to manage the aggregated interference generated by multiple WRAN cells. We consider both situations of complete and partial information about the environment. By directly interacting with the surrounding environment in a distributed fashion, the multi-agent system is able to learn, in the first case, an optimal policy to solve the problem and, in the second case, a reasonably good suboptimal policy. Simulation results reveal that the proposed approach is able to fulfill the primary users interference constraints, without introducing signaling overhead in the system.