Hardware architecture of reinforcement learning scheme for dynamic power management in embedded systems

  • Authors:
  • Viswanathan Lakshmi Prabha; Elwin Chandra Monie

  • Affiliations:
  • Department of Electronics and Communication Engineering, Government College of Technology, Coimbatore, Tamil Nadu, India; Thanthai Periyar Government Institute of Technology, Vellore, Tamil Nadu, India

  • Venue:
  • EURASIP Journal on Embedded Systems
  • Year:
  • 2007

Abstract

Dynamic power management (DPM) is a technique for reducing the power consumption of electronic systems by selectively shutting down idle components. In this paper, a novel and nontrivial enhancement of conventional reinforcement learning (RL) is adopted to choose the optimal policy from among existing DPM policies. A hardware architecture derived from a VHDL model of the temporal difference (TD) RL algorithm is proposed, which can suggest the winner policy to adopt for any given workload to achieve power savings. The effectiveness of this approach is also demonstrated with an event-driven simulator, written in Java, for power-manageable embedded devices. The results show that RL applied to DPM can yield power savings of up to 28%.
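
The idea of learning a winner policy from a workload can be illustrated with a minimal Python sketch. It is not the paper's algorithm or its hardware architecture: the three candidate policies, the device parameters (`P_ON`, `P_SLEEP`, `E_WAKE`, `TIMEOUT`), and the function names are all hypothetical, and the learner uses only the simplest single-state form of a temporal-difference update, in which each idle period is treated as a one-step episode and the reward is the negative of the energy spent.

```python
import random

# Hypothetical device parameters (assumptions, not taken from the paper).
P_ON = 1.0      # power drawn while idle but still powered on
P_SLEEP = 0.1   # power drawn while asleep
E_WAKE = 2.0    # energy cost of one sleep/wake transition
TIMEOUT = 3.0   # waiting time used by the fixed-timeout policy

# Hypothetical candidate DPM policies.
POLICIES = ("always_on", "greedy_shutdown", "fixed_timeout")


def idle_energy(policy, idle):
    """Energy spent over one idle period of length `idle` under `policy`."""
    if policy == "always_on":
        return P_ON * idle
    if policy == "greedy_shutdown":
        # Sleep immediately, pay the wake-up cost once.
        return P_SLEEP * idle + E_WAKE
    if policy == "fixed_timeout":
        if idle <= TIMEOUT:
            return P_ON * idle  # the period ends before the timeout fires
        return P_ON * TIMEOUT + P_SLEEP * (idle - TIMEOUT) + E_WAKE
    raise ValueError(f"unknown policy: {policy}")


def select_winner(idle_periods, alpha=0.1, epsilon=0.1, seed=0):
    """Learn a value per policy and return (winner, values).

    Each idle period is a one-step episode, so the TD error reduces to
    (reward - value): there is no successor state to bootstrap from.
    """
    rng = random.Random(seed)
    value = {p: 0.0 for p in POLICIES}
    for idle in idle_periods:
        # Epsilon-greedy choice between exploring and exploiting.
        if rng.random() < epsilon:
            policy = rng.choice(POLICIES)
        else:
            policy = max(POLICIES, key=value.get)
        reward = -idle_energy(policy, idle)  # less energy, higher reward
        value[policy] += alpha * (reward - value[policy])
    return max(POLICIES, key=value.get), value


if __name__ == "__main__":
    winner, values = select_winner([20.0] * 500)
    print(winner, values)
```

Under these assumed parameters, a workload of long idle periods favors `greedy_shutdown`, because the wake-up cost is small relative to the energy saved while asleep, whereas a workload of very short idle periods penalizes it. A fuller treatment would add workload states and a discounted bootstrap term to the update.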