Reinforcement learning: a survey
Journal of Artificial Intelligence Research
Hi-index | 0.00 |
Our proposed cognitive distance learning agent generates sequence of actions from a start state to goal state in problem state space. This agent learns cognitive distance (path cost) of arbitrary combination of two states. The action generation at each state is selection of next state that has minimum cognitive distance to the goal.In this paper, we investigate a learning process of the agent by a computer simulation in a tile world state space. An average search cost is more reduced more the prior learning term is long and our problem solver is familiar to the environment. After enough learning process, an average search cost of proposed method is reduced to 1/20 from that of conventional search method.