Journal of Network and Computer Applications
Hi-index | 0.00 |
In this paper, non deterministic Direct Reinforcement Learning (RL) for controlling the transmission times and power of a Wireless Sensor Network (WSN) node is presented. RL allows for truly autonomous optimal behaviour of agents by requiring no models or supervision to learn. Optimal actions are learnt by repeated interactions with the environment. Performance results are presented for Monte Carlo, TD0 and TDλ. The resultant optimal learned policies are shown to out perform static power control in a stochastic environment.