An Ensemble Architecture for Learning Complex Problem-Solving Techniques from Demonstration

Authors:
Xiaoqin Shelley Zhang;Bhavesh Shrestha;Sungwook Yoon;Subbarao Kambhampati;Phillip DiBona;Jinhong K. Guo;Daniel McFarlane;Martin O. Hofmann;Kenneth Whitebread;Darren Scott Appling;Elizabeth T. Whitaker;Ethan B. Trewhitt;Li Ding;James R. Michaelis;Deborah L. McGuinness;James A. Hendler;Janardhan Rao Doppa;Charles Parker;Thomas G. Dietterich;Prasad Tadepalli;Weng-Keen Wong;Derek Green;Anton Rebguns;Diana Spears;Ugur Kuter;Geoff Levine;Gerald DeJong;Reid L. MacTavish;Santiago Ontañón;Jainarayan Radhakrishnan;Ashwin Ram;Hala Mostafa;Huzaifa Zafar;Chongjie Zhang;Daniel Corkill;Victor Lesser;Zhexuan Song
Affiliations:
University of Massachusetts at Dartmouth;University of Massachusetts at Dartmouth;Arizona State University;Arizona State University;Lockheed Martin Advanced Technology Laboratories;Lockheed Martin Advanced Technology Laboratories;Lockheed Martin Advanced Technology Laboratories;Lockheed Martin Advanced Technology Laboratories;Lockheed Martin Advanced Technology Laboratories;Georgia Tech Research Institute;Georgia Tech Research Institute;Georgia Tech Research Institute;Rensselaer Polytechnic Institute;Rensselaer Polytechnic Institute;Rensselaer Polytechnic Institute;Rensselaer Polytechnic Institute;Oregon State University;Oregon State University;Oregon State University;Oregon State University;Oregon State University;University of Wyoming;University of Wyoming;University of Wyoming;University of Maryland;University of Illinois at Urbana;University of Illinois at Urbana;Georgia Institute of Technology;Georgia Institute of Technology;Georgia Institute of Technology;Georgia Institute of Technology;University of Massachusetts, Amherst;University of Massachusetts, Amherst;University of Massachusetts, Amherst;University of Massachusetts, Amherst;University of Massachusetts, Amherst;Fujitsu Laboratories of America
Venue:
ACM Transactions on Intelligent Systems and Technology (TIST)
Year:
2012

Citing 30
Cited 1

Taxonomic syntax for first order inference

Journal of the ACM (JACM)
Case-based reasoning: foundational issues, methodological variations, and system approaches

AI Communications
Bagging predictors

Machine Learning
A Review and Empirical Evaluation of Feature Weighting Methods for aClass of Lazy Learning Algorithms

Artificial Intelligence Review - Special issue on lazy learning
Combining labeled and unlabeled data with co-training

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Learning action strategies for planning domains

Artificial Intelligence
Model checking

Model checking
An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization

Machine Learning
The Hearsay-II Speech-Understanding System: Integrating Knowledge to Resolve Uncertainty

ACM Computing Surveys (CSUR)
Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence

Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence
Multiagent Systems: A Survey from a Machine Learning Perspective

Autonomous Robots
Learning Decision Lists

Machine Learning
Learning Declarative Control Rules for Constraint-BAsed Planning

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Ensemble Methods in Machine Learning

MCS '00 Proceedings of the First International Workshop on Multiple Classifier Systems
Programming by demonstration: a machine learning approach

Programming by demonstration: a machine learning approach
Learning as search optimization: approximate large margin methods for structured prediction

ICML '05 Proceedings of the 22nd international conference on Machine learning
Learning goal hierarchies from structured observations and expert annotations

Machine Learning
Learning and joint deliberation through argumentation in multiagent systems

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Analyzing Co-training Style Algorithms

ECML '07 Proceedings of the 18th European conference on Machine Learning
Search-based structured prediction

Machine Learning
MABLE: a framework for learning from natural instruction

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Gradient boosting for sequence alignment

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
HTN-MAKER: learning HTNs with minimal additional knowledge engineering required

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
POIROT: integrated learning of web service procedures

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Responsibility and blame: a structural-model approach

Journal of Artificial Intelligence Research
Unifying SAT-based and graph-based planning

IJCAI'99 Proceedings of the 16th international joint conference on Artifical intelligence - Volume 1
Strategies for learning search control rules: an explanation-based approach

IJCAI'87 Proceedings of the 10th international joint conference on Artificial intelligence - Volume 1
Discriminative learning of beam-search heuristics for planning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Goal-driven learning in the GILA integrated intelligence architecture

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Inductive policy selection for first-order MDPs

UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence

LEARNING AND VERIFYING SAFETY CONSTRAINTS FOR PLANNERS IN A KNOWLEDGE-IMPOVERISHED SYSTEM

Computational Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present a novel ensemble architecture for learning problem-solving techniques from a very small number of expert solutions and demonstrate its effectiveness in a complex real-world domain. The key feature of our “Generalized Integrated Learning Architecture” (GILA) is a set of heterogeneous independent learning and reasoning (ILR) components, coordinated by a central meta-reasoning executive (MRE). The ILRs are weakly coupled in the sense that all coordination during learning and performance happens through the MRE. Each ILR learns independently from a small number of expert demonstrations of a complex task. During performance, each ILR proposes partial solutions to subproblems posed by the MRE, which are then selected from and pieced together by the MRE to produce a complete solution. The heterogeneity of the learner-reasoners allows both learning and problem solving to be more effective because their abilities and biases are complementary and synergistic. We describe the application of this novel learning and problem solving architecture to the domain of airspace management, where multiple requests for the use of airspaces need to be deconflicted, reconciled, and managed automatically. Formal evaluations show that our system performs as well as or better than humans after learning from the same training data. Furthermore, GILA outperforms any individual ILR run in isolation, thus demonstrating the power of the ensemble architecture for learning and problem solving.