Compiler optimization on VLIW instruction scheduling for low power

Authors:
Chingren Lee;Jenq Kuen Lee;Tingting Hwang;Shi-Chun Tsai
Affiliations:
National Tsing-Hua University, Taiwan;National Tsing-Hua University, Taiwan;National Tsing-Hua University, Taiwan;National Chi-Nan University, Taiwan
Venue:
ACM Transactions on Design Automation of Electronic Systems (TODAES)
Year:
2003

Citing 19
Cited 18

Fibonacci heaps and their uses in improved network optimization algorithms

Journal of the ACM (JACM)
Global instruction scheduling for superscalar machines

PLDI '91 Proceedings of the ACM SIGPLAN 1991 conference on Programming language design and implementation
Technology decomposition and mapping targeting low power dissipation

DAC '93 Proceedings of the 30th international Design Automation Conference
Re-encoding sequential circuits to reduce power dissipation

ICCAD '94 Proceedings of the 1994 IEEE/ACM international conference on Computer-aided design
Precomputation-based sequential logic optimization for low power

IEEE Transactions on Very Large Scale Integration (VLSI) Systems - Special issue on low-power design
Power analysis of embedded software: a first step towards software power minimization

IEEE Transactions on Very Large Scale Integration (VLSI) Systems - Special issue on low-power design
Register allocation and binding for low power

DAC '95 Proceedings of the 32nd annual ACM/IEEE Design Automation Conference
Instruction level power analysis and optimization of software

Journal of VLSI Signal Processing Systems - Special issue on technologies for wireless computing
Algorithms for address assignment in DSP code generation

Proceedings of the 1996 IEEE/ACM international conference on Computer-aided design
Power analysis and minimization techniques for embedded DSP software

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Computer architecture (2nd ed.): a quantitative approach

Computer architecture (2nd ed.): a quantitative approach
The design and use of simplepower: a cycle-accurate energy estimation tool

Proceedings of the 37th Annual Design Automation Conference
Architectural and compiler techniques for energy reduction in high-performance microprocessors

IEEE Transactions on Very Large Scale Integration (VLSI) Systems - Special section on low-power electronics and design
Local Microcode Compaction Techniques

ACM Computing Surveys (CSUR)
Compiler optimization on instruction scheduling for low power

ISSS '00 Proceedings of the 13th international symposium on System synthesis
Parallel processing: a smart compiler and a dumb machine

SIGPLAN '84 Proceedings of the 1984 SIGPLAN symposium on Compiler construction
SYCLOP: Synthesis of CMOS Logic for Low Power Applications

ICCD '92 Proceedings of the 1991 IEEE International Conference on Computer Design on VLSI in Computer & Processors
Very Long Instruction Word architectures and the ELI-512

ISCA '83 Proceedings of the 10th annual international symposium on Computer architecture
Power optimization of variable-voltage core-based systems

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

Transition aware scheduling: increasing continuous idle-periods in resource units

Proceedings of the 2nd conference on Computing frontiers
A sink-n-hoist framework for leakage power reduction

Proceedings of the 5th ACM international conference on Embedded software
Instruction scheduling of VLIW architectures for balanced power consumption

Proceedings of the 2005 Asia and South Pacific Design Automation Conference
Compilers for leakage power reduction

ACM Transactions on Design Automation of Electronic Systems (TODAES)
Loop scheduling with timing and switching-activity minimization for VLIW DSP

ACM Transactions on Design Automation of Electronic Systems (TODAES)
System-level scheduling on instruction cell based reconfigurable systems

Proceedings of the conference on Design, automation and test in Europe: Proceedings
Compilation for compact power-gating controls

ACM Transactions on Design Automation of Electronic Systems (TODAES)
INTACTE: an interconnect area, delay, and energy estimation tool for microarchitectural explorations

CASES '07 Proceedings of the 2007 international conference on Compilers, architecture, and synthesis for embedded systems
Energy-aware scheduling and simulation methodologies for parallel security processors with multiple voltage domains

The Journal of Supercomputing
Algorithms and analysis of scheduling for loops with minimum switching

International Journal of Computational Science and Engineering
Effective Code Generation for Distributed and Ping-Pong Register Files: A Case Study on PAC VLIW DSP Cores

Journal of Signal Processing Systems
Enhancing Microkernel Performance on VLIW DSP Processors via Multiset Context Switch

Journal of Signal Processing Systems
Opposite-phase register switching for peak current minimization

ACM Transactions on Design Automation of Electronic Systems (TODAES)
Energy-Aware Loop Scheduling and Assignment for Multi-Core, Multi-Functional-Unit Architecture

Journal of Signal Processing Systems
Compiler-assisted leakage-aware loop scheduling for embedded VLIW DSP processors

Journal of Systems and Software
Power aware SID-based simulator for embedded multicore DSP subsystems

CODES/ISSS '10 Proceedings of the eighth IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
Power-Aware scheduling for parallel security processors with analytical models

LCPC'04 Proceedings of the 17th international conference on Languages and Compilers for High Performance Computing
Power devil: tool for power gating strategy selection

Proceedings of the 10th Workshop on Optimizations for DSP and Embedded Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this article, we investigate compiler transformation techniques regarding the problem of scheduling VLIW instructions aimed at reducing power consumption of VLIW architectures in the instruction bus. The problem can be categorized into two types: horizontal scheduling and vertical scheduling. For the case of horizontal scheduling, we propose a bipartite-matching scheme for instruction scheduling. We prove that our greedy bipartite-matching scheme always gives the optimal switching activities of the instruction bus for given VLIW instruction scheduling policies. For the case of vertical scheduling, we prove that the problem is NP-hard, and we further propose a heuristic algorithm to solve the problem. Our experiment is performed on Alpha-based VLIW architectures and an ATOM simulator, and the compiler incorporated in our proposed schemes is implemented based on SUIF and MachSUIF. Experimental results of horizontal scheduling optimization show an average 13.30% reduction with four-way issue architecture and an average 20.15% reduction with eight-way issue architecture for transitional activities of the instruction bus as compared with conventional list scheduling for an extensive set of benchmarks. The additional reduction for transitional activities of the instruction bus from horizontal to vertical scheduling with window size four is around 4.57 to 10.42%, and the average is 7.66%. Similarly, the additional reduction with window size eight is from 6.99 to 15.25%, and the average is 10.55%.