Positive Impact of State Similarity on Reinforcement Learning Performance

Authors:
S. Girgin;F. Polat;R. Alhajj
Affiliations:
Middle East Tech. Univ., Ankara;-;-
Venue:
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Year:
2007

Citing 0
Cited 2

Effective monitoring by efficient fingerprint matching using a forest of NAQ-trees

Journal of Intelligent Information Systems
Concept learning games

Information Systems Frontiers

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we propose a novel approach to identify states with similar subpolicies and show how they can be integrated into the reinforcement learning framework to improve learning performance. The method utilizes a specialized tree structure to identify common action sequences of states, which are derived from possible optimal policies, and defines a similarity function between two states based on the number of such sequences. Using this similarity function, updates on the action-value function of a state are reflected onto all similar states. This allows experience that is acquired during learning to be applied to a broader context. The effectiveness of the method is demonstrated empirically.