Global Reinforcement Learning in Neural Networks

Authors:
X. Ma;K. K. Likharev
Affiliations:
Stony Brook Univ., NY;-
Venue:
IEEE Transactions on Neural Networks
Year:
2007

Citing 0
Cited 7

2008 Special Issue: Two forms of immediate reward reinforcement learning for exploratory data analysis

Neural Networks
An Extremely Simple Reinforcement Learning Rule for Neural Networks

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
Design and defect tolerance beyond CMOS

CODES+ISSS '08 Proceedings of the 6th IEEE/ACM/IFIP international conference on Hardware/Software codesign and system synthesis
Immediate Reward Reinforcement Learning for Clustering and Topology Preserving Mappings

Similarity-Based Clustering
Q-Learning Based on Dynamical Structure Neural Network for Robot Navigation in Unknown Environment

ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part III
Stochastic weights reinforcement learning for exploratory data analysis

ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Scaling-efficient in-situ training of CMOL CrossNet classifiers

Neural Networks

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this letter, we have found a more general formulation of the REward Increment = Nonnegative Factor times Offset Reinforcement times Characteristic Eligibility (REINFORCE) learning principle first suggested by Williams. The new formulation has enabled us to apply the principle to global reinforcement learning in networks with various sources of randomness, and to suggest several simple local rules for such networks. Numerical simulations have shown that for simple classification and reinforcement learning tasks, at least one family of the new learning rules gives results comparable to those provided by the famous Rules Ar-i and Ar-p for the Boltzmann machines