Introduction to algorithms
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Risk-Sensitive Reinforcement Learning
Machine Learning
Learning to Predict by the Methods of Temporal Differences
Machine Learning
Dynamic Programming
Reconfiguration planning for modular self-reconfigurable robots
Reconfiguration planning for modular self-reconfigurable robots
R-max - a general polynomial time algorithm for near-optimal reinforcement learning
The Journal of Machine Learning Research
Least-squares policy iteration
The Journal of Machine Learning Research
Exploration and apprenticeship learning in reinforcement learning
ICML '05 Proceedings of the 22nd international conference on Machine learning
Extending Scope of Robust Optimization: Comprehensive Robust Counterparts of Uncertain Problems
Mathematical Programming: Series A and B
Robust Control of Markov Decision Processes with Uncertain Transition Matrices
Operations Research
An analysis of reinforcement learning with function approximation
Proceedings of the 25th international conference on Machine learning
Risk-sensitive reinforcement learning applied to control under constraints
Journal of Artificial Intelligence Research
Consensus-based decentralized auctions for robusttask allocation
IEEE Transactions on Robotics
Optimized stochastic policies for task allocationin swarms of robots
IEEE Transactions on Robotics
Nonmyopic adaptive informative path planning for multiple robots
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Combining manual feedback with subsequent MDP reward signals for reinforcement learning
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Theory and Applications of Robust Optimization
SIAM Review
IEEE Transactions on Neural Networks
Hi-index | 0.00 |
Planning for multi-agent systems such as task assignment for teams of limited-fuel unmanned aerial vehicles (UAVs) is challenging due to uncertainties in the assumed models and the very large size of the planning space. Researchers have developed fast cooperative planners based on simple models (e.g.,聽linear and deterministic dynamics), yet inaccuracies in assumed models will impact the resulting performance. Learning techniques are capable of adapting the model and providing better policies asymptotically compared to cooperative planners, yet they often violate the safety conditions of the system due to their exploratory nature. Moreover they frequently require an impractically large number of interactions to perform well. This paper introduces the intelligent Cooperative Control Architecture (iCCA) as a framework for combining cooperative planners and reinforcement learning techniques. iCCA improves the policy of the cooperative planner, while reduces the risk and sample complexity of the learner. Empirical results in gridworld and task assignment for fuel-limited UAV domains with problem sizes up to 9 billion state-action pairs verify the advantage of iCCA over pure learning and planning strategies.