Technical Note: \cal Q-Learning
Machine Learning
Learning to solve multiple goals
Learning to solve multiple goals
The dynamics of reinforcement learning in cooperative multiagent systems
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
The ant colony optimization meta-heuristic
New ideas in optimization
Gradient descent for general reinforcement learning
Proceedings of the 1998 conference on Advances in neural information processing systems II
Swarm intelligence
Particle swarm optimization method in multiobjective problems
Proceedings of the 2002 ACM symposium on Applied computing
A particle swarm model for swarm-based networked sensor systems
Proceedings of the 2002 ACM symposium on Applied computing
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Optimal Selection of Uncertain Actions by Maximizing Expected Utility
Autonomous Robots
Robot learning driven by emotions
Adaptive Behavior
Multi-criteria Reinforcement Learning
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
A New Distributed Reinforcement Learning Algorithm for Multiple Objective Optimization Problems
IBERAMIA-SBIA '00 Proceedings of the International Joint Conference, 7th Ibero-American Conference on AI: Advances in Artificial Intelligence
Learning to Cooperate via Policy Search
UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Backpropagation Networks for Grapheme-Phoneme Conversion: a Non-Technical Introduction
Artificial Neural Networks: An Introduction to ANN Theory and Practice
Coordinated Reinforcement Learning
ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Artificial Intelligence: A Modern Approach
Artificial Intelligence: A Modern Approach
Anthill: A Framework for the Development of Agent-Based Peer-to-Peer Systems
ICDCS '02 Proceedings of the 22 nd International Conference on Distributed Computing Systems (ICDCS'02)
Optimizing information exchange in cooperative multi-agent systems
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Achieving Artificial Intelligence through Building Robots
Achieving Artificial Intelligence through Building Robots
Introduction to Evolutionary Computing
Introduction to Evolutionary Computing
An Artificial Intelligence Perspective on Autonomic Computing Policies
POLICY '04 Proceedings of the Fifth IEEE International Workshop on Policies for Distributed Systems and Networks
A Multi-Agent Systems Approach to Autonomic Computing
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 1
Multiobjective Evolutionary Algorithms and Applications (Advanced Information and Knowledge Processing)
Reinforcement Learning for Autonomic Network Repair
ICAC '04 Proceedings of the First International Conference on Autonomic Computing
Utility-Function-Driven Resource Allocation in Autonomic Systems
ICAC '05 Proceedings of the Second International Conference on Automatic Computing
Dynamic preferences in multi-criteria reinforcement learning
ICML '05 Proceedings of the 22nd international conference on Machine learning
Multiobjective Evolutionary Algorithms: Analyzing the State-of-the-Art
Evolutionary Computation
Self-organization in multi-agent systems
The Knowledge Engineering Review
Building autonomic systems using collaborative reinforcement learning
The Knowledge Engineering Review
Collaborative Multiagent Reinforcement Learning by Payoff Propagation
The Journal of Machine Learning Research
Evolutionary swarm traffic: if ant roads had traffic lights
CEC '02 Proceedings of the Evolutionary Computation on 2002. CEC '02. Proceedings of the 2002 Congress - Volume 02
Reinforcement Learning in Autonomic Computing: A Manifesto and Case Studies
IEEE Internet Computing
Grid Differentiated Services: A Reinforcement Learning Approach
CCGRID '08 Proceedings of the 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid
Learning all optimal policies with multiple criteria
Proceedings of the 25th international conference on Machine learning
Parallel Reinforcement Learning for Weighted Multi-criteria Model with Adaptive Margin
Neural Information Processing
Swarm Intelligence: Introduction and Applications
Swarm Intelligence: Introduction and Applications
A Collaborative Reinforcement Learning Approach to Urban Traffic Control Optimization
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
A Concise Introduction to Multiagent Systems and Distributed Artificial Intelligence
A Concise Introduction to Multiagent Systems and Distributed Artificial Intelligence
Learning of coordination: exploiting sparse interactions in multiagent systems
Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Ants and reinforcement learning: a case study in routing in dynamic networks
IJCAI'97 Proceedings of the Fifteenth international joint conference on Artifical intelligence - Volume 2
Decentralized control of cooperative systems: categorization and complexity analysis
Journal of Artificial Intelligence Research
AntNet: distributed stigmergetic control for communications networks
Journal of Artificial Intelligence Research
Multiple-goal reinforcement learning with modular Sarsa(O)
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Planning and acting in partially observable stochastic domains
Artificial Intelligence
ABLE: a toolkit for building multiagent autonomic systems
IBM Systems Journal
Distributed W-Learning: Multi-Policy Optimization in Self-Organizing Systems
SASO '09 Proceedings of the 2009 Third IEEE International Conference on Self-Adaptive and Self-Organizing Systems
Evolutionary computing and autonomic computing: shared problems, shared solutions?
Self-star Properties in Complex Information Systems
Neural Networks for Real-Time Traffic Signal Control
IEEE Transactions on Intelligent Transportation Systems
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
A survey of multi-objective sequential decision-making
Journal of Artificial Intelligence Research
Hi-index | 0.00 |
Self-organizing systems are often implemented as collections of collaborating agents. Such agents may need to optimize their own performance according to multiple policies as well as contribute to the optimization of overall system performance towards a potentially different set of policies. These policies can be heterogeneous, i.e., be implemented on different sets of agents, be active at different times and have different levels of priority, leading to the heterogeneity of the agents of which the system is composed. Numerous biologically-inspired techniques as well as techniques from artificial intelligence have been used to implement such self-organizing systems. In this paper we review the most commonly used techniques for multi-policy optimization in such systems, specifically, those based on ant colony optimization, evolutionary algorithms, particle swarm optimization and reinforcement learning (RL). We analyze the characteristics and existing applications of the reviewed algorithms, assessing their suitability for particular types of optimization problems, based on the environment and policy characteristics. We focus on RL, as it is considered particularly suitable for large-scale self-organizing systems due to its ability to take into account the long-term consequences of the actions executed. Therefore, RL enables the system to learn not only the immediate payoffs of its actions, but also the best actions for the long-term performance of the system. Existing RL implementations mostly focus on optimization towards a single system policy, while most multi-policy RL-based optimization techniques have so far been implemented only on a single agent. We argue that, in order to be more widely utilized as a technique for self-optimization, RL needs to address both multiple policies and multiple agents simultaneously, and analyze the challenges associated with extending existing or developing new RL optimization techniques.