Distributed Learning for Planning Under Uncertainty Problems with Heterogeneous Teams

  • Authors:
  • N. Kemal Ure;Girish Chowdhary;Yu Fan Chen;Jonathan P. How;John Vian

  • Affiliations:
  • Massachusetts Institute of Technology, Cambridge, USA;Oklahoma State University, Stillwater, USA 74074;Massachusetts Institute of Technology, Cambridge, USA;Massachusetts Institute of Technology, Cambridge, USA;Boeing Research and Technology, Seattle, USA

  • Venue:
  • Journal of Intelligent and Robotic Systems
  • Year:
  • 2014

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper considers the problem of multiagent sequential decision making under uncertainty and incomplete knowledge of the state transition model. A distributed learning framework, where each agent learns an individual model and shares the results with the team, is proposed. The challenges associated with this approach include choosing the model representation for each agent and how to effectively share these representations under limited communication. A decentralized extension of the model learning scheme based on the Incremental Feature Dependency Discovery (Dec-iFDD) is presented to address the distributed learning problem. The representation selection problem is solved by leveraging iFDD's property of adjusting the model complexity based on the observed data. The model sharing problem is addressed by having each agent rank the features of their representation based on the model reduction error and broadcast the most relevant features to their teammates. The algorithm is tested on the multi-agent block building and the persistent search and track missions. The results show that the proposed distributed learning scheme is particularly useful in heterogeneous learning setting, where each agent learns significantly different models. We show through large-scale planning under uncertainty simulations and flight experiments with state-dependent actuator and fuel-burn- rate uncertainty that our planning approach can outperform planners that do not account for heterogeneity between agents.