Option Discovery in Reinforcement Learning using Frequent Common Subsequences of Actions

Authors:
Sertan Girgin;Faruk Polat
Affiliations:
Middle East Technical University, Turkey;Middle East Technical University, Turkey
Venue:
CIMCA '05 Proceedings of the International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce Vol-1 (CIMCA-IAWTIC'06) - Volume 01
Year:
2005

Citing 0
Cited 1

A layered approach to learning coordination knowledge in multiagent environments

Applied Intelligence

Abstract

Temporally abstract actions, or options, facilitate learning in large and complex domains by exploiting sub-tasks and the hierarchical structure that these sub-tasks give to the problem. In this paper, we study the automatic generation of options using common sub-sequences derived from the state transition histories collected as learning progresses. The standard Q-learning algorithm is extended to use the generated options transparently, and the effectiveness of the method is demonstrated in Dietterich's Taxi domain.
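
As a rough illustration of the idea of mining action histories for option candidates, the sketch below counts contiguous runs of actions that recur across episodes. It is not the authors' algorithm: the function name `frequent_action_subsequences`, the length bounds, the `min_support` threshold, the restriction to contiguous runs, and the sample episodes are all assumptions made for this example.

```python
from collections import Counter


def frequent_action_subsequences(episodes, min_len=2, max_len=4, min_support=2):
    """Return contiguous action runs seen in at least `min_support` episodes.

    Hypothetical helper for illustration only; the paper may define
    "common sub-sequences" and their frequency differently.
    """
    support = Counter()
    for actions in episodes:
        seen = set()
        for n in range(min_len, max_len + 1):
            for i in range(len(actions) - n + 1):
                seen.add(tuple(actions[i:i + n]))
        # Count each run at most once per episode.
        support.update(seen)
    return {seq: count for seq, count in support.items() if count >= min_support}


# Illustrative action histories (made up, loosely Taxi-like action names).
episodes = [
    ["north", "north", "pickup", "south"],
    ["east", "north", "north", "pickup"],
    ["north", "pickup", "west"],
]

# Runs that appear in every episode are the strongest option candidates.
candidates = frequent_action_subsequences(episodes, min_support=3)
print(candidates)
```

Under these assumptions, each surviving run could be wrapped as an option and offered to a Q-learning agent alongside the primitive actions; how initiation sets and termination conditions are derived is left open here.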