An Extremely Simple Reinforcement Learning Rule for Neural Networks
ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
Design and defect tolerance beyond CMOS
CODES+ISSS '08 Proceedings of the 6th IEEE/ACM/IFIP international conference on Hardware/Software codesign and system synthesis
Immediate Reward Reinforcement Learning for Clustering and Topology Preserving Mappings
Similarity-Based Clustering
Q-Learning Based on Dynamical Structure Neural Network for Robot Navigation in Unknown Environment
ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part III
Stochastic weights reinforcement learning for exploratory data analysis
ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Scaling-efficient in-situ training of CMOL CrossNet classifiers
Neural Networks
Hi-index | 0.00 |
In this letter, we have found a more general formulation of the REward Increment = Nonnegative Factor times Offset Reinforcement times Characteristic Eligibility (REINFORCE) learning principle first suggested by Williams. The new formulation has enabled us to apply the principle to global reinforcement learning in networks with various sources of randomness, and to suggest several simple local rules for such networks. Numerical simulations have shown that for simple classification and reinforcement learning tasks, at least one family of the new learning rules gives results comparable to those provided by the famous Rules Ar-i and Ar-p for the Boltzmann machines