A reinforcement learning algorithm using temporal difference error in ant model

  • Authors:
  • SeungGwan Lee;TaeChoong Chung

  • Affiliations:
  • School of Computer Science and Information Engineering, Catholic University, Gyeonggi-Do, Korea;School of Electronics and Information, KyungHee University, Gyeonggi-Do, Korea

  • Venue:
  • IWANN'05 Proceedings of the 8th international conference on Artificial Neural Networks: computational Intelligence and Bioinspired Systems
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

When agent chooses some action and does state transition in present state in reinforcement learning, it is important subject to decide how will reward for conduct that agent chooses. In this paper, we suggest multi colony interaction ant reinforcement learning model using TD-error to original Ant-Q learning. This method is a hybrid of multi colony interaction by elite strategy and reinforcement learning applying TD-error to Ant-Q. We could know through an experiment that proposed reinforcement learning method converges faster to optimal solution than original ACS and Ant-Q.