Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning

Authors:
Alborz Geramifard;Joshua Redding;Jonathan P. How
Affiliations:
Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, USA;VTOL Embedded Systems, Lockheed Martin Procerus Technologies, Vineyard, USA 84058;Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, USA
Venue:
Journal of Intelligent and Robotic Systems
Year:
2013

Abstract

Planning for multi-agent systems, such as task assignment for teams of limited-fuel unmanned aerial vehicles (UAVs), is challenging due to uncertainties in the assumed models and the very large size of the planning space. Researchers have developed fast cooperative planners based on simple models (e.g., linear and deterministic dynamics), yet inaccuracies in the assumed models affect the resulting performance. Learning techniques can adapt the model and asymptotically provide better policies than cooperative planners, yet they often violate the safety conditions of the system due to their exploratory nature. Moreover, they frequently require an impractically large number of interactions to perform well. This paper introduces the intelligent Cooperative Control Architecture (iCCA) as a framework for combining cooperative planners and reinforcement learning techniques. iCCA improves the policy of the cooperative planner while reducing the risk and sample complexity of the learner. Empirical results in gridworld and fuel-limited UAV task assignment domains, with problem sizes up to 9 billion state-action pairs, verify the advantage of iCCA over pure learning and pure planning strategies.