ICANNGA'11 Proceedings of the 10th international conference on Adaptive and natural computing algorithms - Volume Part II
Hi-index | 0.00 |
A novel learning policy in multi-agent reinforcement learning is presented, trying to find another tradeoff of exploration and exploitation efficiently, It use the output of the classical quantum computer as an input for chaotic dynamics amplifier, The novel amplifier consider the chaotic effect, it can amplify the initial value in polynomial time. It considers the action selection problem and argues that the problem, in principle, can be solved in polynomial time if it combines the quantum computer with the chaotic dynamics amplifier based on the logistic map.