An Ensemble Architecture for Learning Complex Problem-Solving Techniques from Demonstration

  • Authors:
  • Xiaoqin Shelley Zhang;Bhavesh Shrestha;Sungwook Yoon;Subbarao Kambhampati;Phillip DiBona;Jinhong K. Guo;Daniel McFarlane;Martin O. Hofmann;Kenneth Whitebread;Darren Scott Appling;Elizabeth T. Whitaker;Ethan B. Trewhitt;Li Ding;James R. Michaelis;Deborah L. McGuinness;James A. Hendler;Janardhan Rao Doppa;Charles Parker;Thomas G. Dietterich;Prasad Tadepalli;Weng-Keen Wong;Derek Green;Anton Rebguns;Diana Spears;Ugur Kuter;Geoff Levine;Gerald DeJong;Reid L. MacTavish;Santiago Ontañón;Jainarayan Radhakrishnan;Ashwin Ram;Hala Mostafa;Huzaifa Zafar;Chongjie Zhang;Daniel Corkill;Victor Lesser;Zhexuan Song

  • Affiliations:
  • University of Massachusetts at Dartmouth;University of Massachusetts at Dartmouth;Arizona State University;Arizona State University;Lockheed Martin Advanced Technology Laboratories;Lockheed Martin Advanced Technology Laboratories;Lockheed Martin Advanced Technology Laboratories;Lockheed Martin Advanced Technology Laboratories;Lockheed Martin Advanced Technology Laboratories;Georgia Tech Research Institute;Georgia Tech Research Institute;Georgia Tech Research Institute;Rensselaer Polytechnic Institute;Rensselaer Polytechnic Institute;Rensselaer Polytechnic Institute;Rensselaer Polytechnic Institute;Oregon State University;Oregon State University;Oregon State University;Oregon State University;Oregon State University;University of Wyoming;University of Wyoming;University of Wyoming;University of Maryland;University of Illinois at Urbana;University of Illinois at Urbana;Georgia Institute of Technology;Georgia Institute of Technology;Georgia Institute of Technology;Georgia Institute of Technology;University of Massachusetts, Amherst;University of Massachusetts, Amherst;University of Massachusetts, Amherst;University of Massachusetts, Amherst;University of Massachusetts, Amherst;Fujitsu Laboratories of America

  • Venue:
  • ACM Transactions on Intelligent Systems and Technology (TIST)
  • Year:
  • 2012

Abstract

We present a novel ensemble architecture for learning problem-solving techniques from a very small number of expert solutions and demonstrate its effectiveness in a complex real-world domain. The key feature of our “Generalized Integrated Learning Architecture” (GILA) is a set of heterogeneous independent learning and reasoning (ILR) components, coordinated by a central meta-reasoning executive (MRE). The ILRs are weakly coupled in the sense that all coordination during learning and performance happens through the MRE. Each ILR learns independently from a small number of expert demonstrations of a complex task. During performance, each ILR proposes partial solutions to subproblems posed by the MRE; the MRE then selects among these proposals and pieces them together into a complete solution. The heterogeneity of the learner-reasoners makes both learning and problem solving more effective because their abilities and biases are complementary and synergistic. We describe the application of this learning and problem-solving architecture to the domain of airspace management, where multiple requests for the use of airspaces need to be deconflicted, reconciled, and managed automatically. Formal evaluations show that our system performs as well as or better than humans after learning from the same training data. Furthermore, GILA outperforms any individual ILR run in isolation, thus demonstrating the power of the ensemble architecture for learning and problem solving.
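
The coordination pattern described in the abstract (the MRE poses subproblems, each ILR proposes partial solutions, and the MRE selects and assembles them) can be illustrated with a minimal Python sketch. This is not GILA's implementation: the names `Proposal`, `make_ilr`, and `mre_solve`, the numeric `cost` score, the "lowest cost wins" selection rule, and the toy conflict data are all assumptions made for illustration. The abstract does not say how the MRE actually ranks proposals.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple


@dataclass
class Proposal:
    """A partial solution offered by one ILR (hypothetical structure)."""
    ilr_name: str
    subproblem: str
    fix: str
    cost: float  # assumed score; lower is better


# An ILR is modeled as a function: subproblem -> proposal, or None if it has no answer.
ILR = Callable[[str], Optional[Proposal]]


def make_ilr(name: str, known: Dict[str, Tuple[str, float]]) -> ILR:
    """Build a toy ILR that only answers the subproblems it has 'learned'."""
    def propose(subproblem: str) -> Optional[Proposal]:
        if subproblem not in known:
            return None
        fix, cost = known[subproblem]
        return Proposal(name, subproblem, fix, cost)
    return propose


def mre_solve(subproblems: List[str], ilrs: List[ILR]) -> List[Proposal]:
    """Toy MRE: pose each subproblem to every ILR, keep the cheapest proposal."""
    solution: List[Proposal] = []
    for subproblem in subproblems:
        candidates = [ilr(subproblem) for ilr in ilrs]
        candidates = [p for p in candidates if p is not None]
        if candidates:  # subproblems with no proposal are left unresolved
            solution.append(min(candidates, key=lambda p: p.cost))
    return solution


# Two hypothetical ILRs with partly overlapping competence.
ilr_a = make_ilr("ilr_a", {"conflict_1": ("shift start time", 2.0),
                           "conflict_2": ("raise altitude", 5.0)})
ilr_b = make_ilr("ilr_b", {"conflict_2": ("move region east", 3.0),
                           "conflict_3": ("shorten duration", 1.0)})

problems = ["conflict_1", "conflict_2", "conflict_3"]
solution = mre_solve(problems, [ilr_a, ilr_b])
for p in solution:
    print(f"{p.subproblem}: {p.fix} (from {p.ilr_name}, cost {p.cost})")
```

In this toy setup the ensemble resolves all three conflicts, while either ILR alone resolves only two, which mirrors in miniature the abstract's claim that complementary learner-reasoners do better together than in isolation.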