Using feedback in collaborative reinforcement learning to adaptively optimize MANET routing

Authors:
J. Dowling;E. Curran;R. Cunningham;V. Cahill
Affiliations:
Dept. of Comput. Sci., Trinity Coll., Dublin, Ireland;-;-;-
Venue:
IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Year:
2005

Citing 0
Cited 19

Building autonomic systems using collaborative reinforcement learning

The Knowledge Engineering Review
Properties and mechanisms of self-organizing MANET and P2P systems

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Exploiting emergent behavior for inter-vehicle communication

HotAC II Hot Topics in Autonomic Computing on Hot Topics in Autonomic Computing
Adaptive audio streaming in mobile ad hoc networks using neural networks

Ad Hoc Networks
Multi-Agent Reinforcement Learning for Intrusion Detection: A Case Study and Evaluation

MATES '08 Proceedings of the 6th German conference on Multiagent System Technologies
Achieving Self-management in a Distributed System of Autonomic BUT Social Entities

MACE '08 Proceedings of the 3rd IEEE international workshop on Modelling Autonomic Communications Environments
Multi-Agent Reinforcement Learning for Intrusion Detection: A case study and evaluation

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
A trustworthiness-based QoS routing protocol for wireless ad hoc networks

IEEE Transactions on Wireless Communications
Solving multiconstraint assignment problems using learning automata

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Online learning in autonomic multi-hop wireless networks for transmitting mission-critical applications

IEEE Journal on Selected Areas in Communications
A dynamic route change mechanism for mobile ad hoc networks

International Journal of Communication Networks and Distributed Systems
Review: Reinforcement learning for context awareness and intelligence in wireless networks: Review, new features and open issues

Journal of Network and Computer Applications
A framework and model for soft routing: the Markovian termite and other curious creatures

ANTS'06 Proceedings of the 5th international conference on Ant Colony Optimization and Swarm Intelligence
MoCoA: customisable middleware for context-aware mobile applications

ODBASE'06/OTM'06 Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part II
Learning-Based spectrum selection in cognitive radio ad hoc networks

WWIC'10 Proceedings of the 8th international conference on Wired/Wireless Internet Communications
Unstable path routing in urban-scale WSN

ACM SIGBED Review - Special Issue on the 3rd International Workshop on Networks of Cooperating Objects (CONET 2012)
Security aspects in the cognition cycle of distributed cognitive radio networks: a survey from a multi-agent perspective

International Journal of Ad Hoc and Ubiquitous Computing
A reinforcement learning-based routing for delay tolerant networks

Engineering Applications of Artificial Intelligence
Ant colony system with characterization-based heuristics for a bottled-products distribution logistics system

Journal of Computational and Applied Mathematics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Designers face many system optimization problems when building distributed systems. Traditionally, designers have relied on optimization techniques that require either prior knowledge or centrally managed runtime knowledge of the system's environment, but such techniques are not viable in dynamic networks where topology, resource, and node availability are subject to frequent and unpredictable change. To address this problem, we propose collaborative reinforcement learning (CRL) as a technique that enables groups of reinforcement learning agents to solve system optimization problems online in dynamic, decentralized networks. We evaluate an implementation of CRL in a routing protocol for mobile ad hoc networks, called SAMPLE. Simulation results show how feedback in the selection of links by routing agents enables SAMPLE to adapt and optimize its routing behavior to varying network conditions and properties, resulting in optimization of network throughput. In the experiments, SAMPLE displays emergent properties such as traffic flows that exploit stable routes and reroute around areas of wireless interference or congestion. SAMPLE is an example of a complex adaptive distributed system.