Problem Decomposition for Behavioural Cloning

Authors:
Dorian Suc;Ivan Bratko
Affiliations:
-;-
Venue:
ECML '00 Proceedings of the 11th European Conference on Machine Learning
Year:
2000

Citing 8
Cited 1

Dynamic programming: deterministic and stochastic models

Dynamic programming: deterministic and stochastic models
Integrated architecture for learning, planning, and reacting based on approximating dynamic programming

Proceedings of the seventh international conference (1990) on Machine learning
Instance-Based Learning Algorithms

Machine Learning
Learning to fly

ML92 Proceedings of the ninth international workshop on Machine learning
Locally Weighted Learning for Control

Artificial Intelligence Review - Special issue on lazy learning
A Framework for Behavioural Cloning

Machine Intelligence 15, Intelligent Agents [St. Catherine's College, Oxford, July 1995]
Skill reconstruction as induction of LQ controllers with subgoals

IJCAI'97 Proceedings of the Fifteenth international joint conference on Artifical intelligence - Volume 2
Reinforcement learning: a survey

Journal of Artificial Intelligence Research

Using Machine Learning to Understand Operator's Skill

IEA/AIE '02 Proceedings of the 15th international conference on Industrial and engineering applications of artificial intelligence and expert systems: developments in applied artificial intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

In behavioural cloning of the human operator's skill, a controller is usually induced directly as a classifier from system's states into actions. Experience shows that this often results in brittle controllers. In this paper we explore a decomposition of the cloning problem into two learning problems: the learning of operator's control trajectories and the learning of the system's dynamics separately. We analyse advantages of such indirect controllers. We give characterization of the learner's error that is plausible explanation of why this decomposition approach has empirically proved to be usually superior to direct cloning.