Online learning of timeout policies for dynamic power management

  • Authors: Umair Ali Khan; Bernhard Rinner
  • Affiliation: Alpen-Adria Universität Klagenfurt, Austria
  • Venue: ACM Transactions on Embedded Computing Systems (TECS)
  • Year: 2014

Abstract

Dynamic power management (DPM) refers to strategies that selectively change the operational states of a device at runtime to reduce power consumption, based on the past usage pattern, the current workload, and a given performance constraint. The power management problem becomes more challenging when the workload exhibits nonstationary behavior, which may degrade the performance of any single or static DPM policy. This article presents a reinforcement learning (RL)-based DPM technique for optimal selection of timeout values in the different device states. Each timeout period determines how long the device remains in a particular state before a transition decision is taken. The timeout selection is based on workload estimates derived from a Multilayer Artificial Neural Network (ML-ANN) and an objective function given by weighted performance and power parameters. Our DPM approach is further able to adapt the power-performance weights online to meet user-specified power or performance constraints. We have fully implemented our DPM algorithm on our embedded traffic-surveillance platform and performed long-term experiments with real traffic data to demonstrate its effectiveness. Our results show that the proposed learning algorithm not only adequately explores the power-performance trade-off under nonstationary workloads but also successfully adjusts the trade-off parameter online to meet user-specified constraints.
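
The abstract describes timeout selection driven by a learned value over workload estimates and a weighted power-performance objective, with the weight adapted online toward a user constraint. The sketch below is a minimal, simplified illustration of that idea under stated assumptions, not the authors' implementation: it uses a bandit-style epsilon-greedy update over a discrete set of candidate timeouts, and all names (`TimeoutDPMAgent`, `select_timeout`, `adapt_weight`), the timeout grid, and the workload binning are hypothetical, introduced only for illustration.

```python
import random
from collections import defaultdict

# Hypothetical discretization: candidate timeout values (seconds) the agent
# may choose in a given device state, and coarse bins for the workload
# estimate (assumed normalized to [0, 1], e.g. from an ML-ANN predictor).
TIMEOUTS = [0.5, 1.0, 2.0, 5.0, 10.0]
WORKLOAD_BINS = 5


class TimeoutDPMAgent:
    """Sketch of an epsilon-greedy learner that picks a timeout per
    (device state, workload bin) and is scored by a weighted
    power/performance cost."""

    def __init__(self, weight=0.5, alpha=0.1, epsilon=0.1):
        self.weight = weight         # power-performance trade-off weight
        self.alpha = alpha           # learning rate
        self.epsilon = epsilon       # exploration rate
        self.q = defaultdict(float)  # value of (state, workload_bin, timeout)

    def _bin(self, workload_estimate):
        # Map a normalized workload estimate onto a discrete bin index.
        return min(int(workload_estimate * WORKLOAD_BINS), WORKLOAD_BINS - 1)

    def select_timeout(self, device_state, workload_estimate):
        # Epsilon-greedy choice among the candidate timeouts.
        b = self._bin(workload_estimate)
        if random.random() < self.epsilon:
            return random.choice(TIMEOUTS)
        return max(TIMEOUTS, key=lambda t: self.q[(device_state, b, t)])

    def update(self, device_state, workload_estimate, timeout,
               energy_cost, latency_cost):
        # Scalar cost: weighted sum of normalized power and performance
        # penalties observed over the elapsed timeout period.
        b = self._bin(workload_estimate)
        cost = self.weight * energy_cost + (1 - self.weight) * latency_cost
        key = (device_state, b, timeout)
        # Bandit-style incremental update toward the negative cost.
        self.q[key] += self.alpha * (-cost - self.q[key])

    def adapt_weight(self, measured, constraint, step=0.05):
        # Nudge the trade-off weight so a measured quantity (e.g. average
        # response latency) approaches a user-specified constraint.
        if measured > constraint:
            self.weight = max(0.0, self.weight - step)  # favor performance
        else:
            self.weight = min(1.0, self.weight + step)  # favor power saving
```

In the paper, the workload estimate would come from the ML-ANN predictor and the cost terms from measured energy and latency over the timeout period; here they are plain arguments so the update logic stays self-contained.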