Optimizing Address Code Generation for Array-Intensive DSP Applications

Authors:
Guilin Chen;Mahmut Kandemir
Affiliations:
The Pennsylvania State University, University Park;The Pennsylvania State University, University Park
Venue:
Proceedings of the international symposium on Code generation and optimization
Year:
2005

Abstract

Application code size is a critical design factor for many embedded systems. Unfortunately, most available compilers optimize primarily for execution speed rather than code density, so compiler-generated code can be much larger than necessary. In the DSP domain in particular, past research has found that optimizing address code generation can be very important, since address code can account for over 50% of all program bits. This paper presents a compiler-directed scheme that minimizes the number of instructions generated to manipulate the address registers found in DSP architectures. Whereas most prior techniques attempt to reduce the number of such instructions through careful address register assignment, this paper proposes modifying loop access patterns in array-intensive signal processing applications. It also demonstrates how the proposed scheme can cooperate with a data layout optimizer to increase its benefits further, and discusses how optimizations that target effective address code generation can conflict with data locality-enhancing transformations. We evaluate the proposed approach using twelve array-intensive embedded applications. Our experimental results indicate that the proposed approach not only leads to significant reductions in code size but also outperforms prior efforts to reduce the code size of array-intensive DSP applications.
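
To make the idea of modifying loop access patterns concrete, the following is a minimal sketch under a toy cost model, not the paper's algorithm. It assumes a single address register walking a row-major 2D array, and assumes that a step of at most `auto_range` elements is covered for free by auto-increment/decrement while any larger step needs an explicit address-register instruction. The function name `explicit_address_updates`, its parameters, and the cost model itself are hypothetical, introduced only for illustration. The count is taken over the dynamic access trace as a rough proxy for the address instructions a compiler would have to emit.

```python
def explicit_address_updates(rows, cols, order, auto_range=1):
    """Count address steps that exceed the assumed auto-increment range.

    Toy model (an assumption, not taken from the paper): one address
    register traverses a row-major rows x cols array, and only steps
    with |delta| <= auto_range are free.
    """
    if order == "ij":
        # i is the outer loop, j the inner loop: matches row-major layout.
        indices = [(i, j) for i in range(rows) for j in range(cols)]
    elif order == "ji":
        # j is the outer loop, i the inner loop: strides across rows.
        indices = [(i, j) for j in range(cols) for i in range(rows)]
    else:
        raise ValueError("order must be 'ij' or 'ji'")

    # Linearized row-major address of each accessed element.
    addresses = [i * cols + j for i, j in indices]

    # Every consecutive pair whose distance is out of range costs one
    # explicit address-register update in this model.
    return sum(
        1
        for prev, cur in zip(addresses, addresses[1:])
        if abs(cur - prev) > auto_range
    )


if __name__ == "__main__":
    print(explicit_address_updates(4, 8, "ij"))
    print(explicit_address_updates(4, 8, "ji"))
```

In this model, the loop order that follows the memory layout needs no explicit updates, while the interchanged order needs one on every step. The same model also hints at the interaction with data layout mentioned in the abstract: changing the array layout instead of the loop order would shift the cost in the opposite direction.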