Reducing the memory footprint of temporal difference learning over finitely many states by using case-based generalization

  • Authors:
  • Matt Dilts; Héctor Muñoz-Avila

  • Affiliations:
  • Department of Computer Science and Engineering, Lehigh University, Bethlehem, PA (both authors)

  • Venue:
  • ICCBR'10: Proceedings of the 18th International Conference on Case-Based Reasoning Research and Development
  • Year:
  • 2010

Abstract

In this paper we present an approach for reducing the memory footprint of temporal difference methods in which the set of states is finite. We use case-based generalization to group the states visited during the reinforcement learning (RL) process. We follow a lazy learning approach: cases are grouped in the order in which the states are visited. Any newly visited state is assigned to an existing entry in the Q-table provided that a similar state has been visited before; otherwise, a new entry is added to the Q-table. We performed experiments on a turn-based game in which actions have non-deterministic effects and might have long-term repercussions on the outcome of the game. The main conclusion from our experiments is that, by using case-based generalization, the size of the Q-table can be substantially reduced while maintaining the quality of the RL estimates.
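
The grouping scheme described in the abstract can be sketched in code. The following is a minimal illustration, not the authors' implementation: the class name CaseBasedQTable, the similarity function, the threshold tau, and the use of a one-step Q-learning update (shown here as a representative temporal difference method) are all assumptions introduced for this sketch.

    from typing import Callable, Hashable, List

    class CaseBasedQTable:
        """Q-table whose rows are representative states ("cases").
        New states are mapped to an existing row when a sufficiently
        similar state has been seen before; otherwise a new row is
        added lazily, in visitation order."""

        def __init__(self, actions: List[Hashable],
                     similarity: Callable[[Hashable, Hashable], float],
                     tau: float):
            self.actions = actions        # finite action set
            self.similarity = similarity  # sim(s1, s2) in [0, 1] (assumed)
            self.tau = tau                # similarity threshold (assumed)
            self.cases = []               # one representative state per row
            self.q = {}                   # (row_index, action) -> Q-value

        def resolve(self, state) -> int:
            """Return the row for `state`, creating a new row only if no
            existing case is similar enough."""
            for i, case in enumerate(self.cases):
                if self.similarity(state, case) >= self.tau:
                    return i
            self.cases.append(state)
            i = len(self.cases) - 1
            for a in self.actions:
                self.q[(i, a)] = 0.0      # initialize new row
            return i

        def update(self, s, a, r, s_next, alpha=0.1, gamma=0.9):
            """One-step Q-learning update applied to the generalized rows."""
            i, j = self.resolve(s), self.resolve(s_next)
            best_next = max(self.q[(j, b)] for b in self.actions)
            self.q[(i, a)] += alpha * (r + gamma * best_next - self.q[(i, a)])

        def best_action(self, s):
            """Greedy action for the row that `s` resolves to."""
            i = self.resolve(s)
            return max(self.actions, key=lambda a: self.q[(i, a)])

The memory saving comes from `resolve`: every state within distance tau of an earlier case shares that case's row, so the table grows with the number of distinct cases rather than the number of distinct states visited.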