Simple offset assignment in presence of subword data

Authors:
Bengu Li;Rajiv Gupta
Affiliations:
The University of Arizona, Tucson, AZ;The University of Arizona, Tucson, AZ
Venue:
Proceedings of the 2003 international conference on Compilers, architecture and synthesis for embedded systems
Year:
2003

Citing 24
Cited 8

Optimizing stack frame accesses for processors with restricted addressing modes

Software—Practice & Experience
Storage assignment to decrease code size

PLDI '95 Proceedings of the ACM SIGPLAN 1995 conference on Programming language design and implementation
Storage assignment to decrease code size

ACM Transactions on Programming Languages and Systems (TOPLAS)
Algorithms for address assignment in DSP code generation

Proceedings of the 1996 IEEE/ACM international conference on Computer-aided design
Analysis and evaluation of address arithmetic capabilities in custom DSP architectures

DAC '97 Proceedings of the 34th annual Design Automation Conference
MediaBench: a tool for evaluating and synthesizing multimedia and communicatons systems

MICRO 30 Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
A uniform optimization technique for offset assignment problems

Proceedings of the 11th international symposium on System synthesis
Storage assignment optimizations to generate compact and efficient code on embedded DSPs

Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation
Bidwidth analysis with application to silicon compilation

PLDI '00 Proceedings of the ACM SIGPLAN 2000 conference on Programming language design and implementation
Address code generation for digital signal processors

Proceedings of the 38th annual Design Automation Conference
C Compiler Design for an Industrial Network Processor

OM '01 Proceedings of the 2001 ACM SIGPLAN workshop on Optimization of middleware and distributed systems
Analyzing and compressing assembly code

SIGPLAN '84 Proceedings of the 1984 SIGPLAN symposium on Compiler construction
Profile guided selection of ARM and thumb instructions

Proceedings of the joint conference on Languages, compilers and tools for embedded systems: software and compilers for embedded systems
Address assignment combined with scheduling in DSP code generation

Proceedings of the 39th annual Design Automation Conference
Bit section instruction set extension of ARM for embedded applications

CASES '02 Proceedings of the 2002 international conference on Compilers, architecture, and synthesis for embedded systems
NetBench: a benchmarking suite for network processors

Proceedings of the 2001 IEEE/ACM international conference on Computer-aided design
Bitwidth aware global register allocation

POPL '03 Proceedings of the 30th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
A Representation for Bit Section Based Analysis and Optimization

CC '02 Proceedings of the 11th International Conference on Compiler Construction
Storage assignment optimizations through variable coalescence for embedded processors

Proceedings of the 2003 ACM SIGPLAN conference on Language, compiler, and tool for embedded systems
Enhancing the performance of 16-bit code using augmenting instructions

Proceedings of the 2003 ACM SIGPLAN conference on Language, compiler, and tool for embedded systems
Mixed-width instruction sets

Communications of the ACM - Program compaction
CommBench-a telecommunications benchmark for network processors

ISPASS '00 Proceedings of the 2000 IEEE International Symposium on Performance Analysis of Systems and Software
Code compaction of matching single-entry multiple-exit regions

SAS'03 Proceedings of the 10th international conference on Static analysis
Address register assignment for reducing code size

CC'03 Proceedings of the 12th international conference on Compiler construction

Shangri-La: achieving high performance from compiled network applications while enabling ease of programming

Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation
Offset assignment using simultaneous variable coalescing

ACM Transactions on Embedded Computing Systems (TECS)
Usability evaluation of Korean e-government portal

UAHCI'07 Proceedings of the 4th international conference on Universal access in human-computer interaction: applications and services
Bitstream processing for embedded systems using C++ metaprogramming

Proceedings of the Conference on Design, Automation and Test in Europe
Speculative subword register allocation in embedded processors

LCPC'04 Proceedings of the 17th international conference on Languages and Compilers for High Performance Computing
Storage Optimization through Offset Assignment with Variable Coalescing

ACM Transactions on Embedded Computing Systems (TECS)
An ILP solution to address code generation for embedded applications on digital signal processors

ACM Transactions on Design Automation of Electronic Systems (TODAES) - Special section on verification challenges in the concurrent world
Minimizing address arithmetic instructions in embedded applications on DSPs

Computers and Electrical Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many embedded architectures support indirect addressing mode with autoincrement/autodecrement. By maximizing the use of this mode, generation of explicit instructions for performing address arithmetic can be avoided and thus reductions in code size and improvements in performance are achieved. Bartley [2] and Liao et al. [16] developed a method for finding a storage layout for program variables so that the use of autoincrement/autodecrement could be maximized. They introduced the Simple Offset Assignment (SOA) problem and solved it using a Path Cover (PC) formulation.We observe that many media and network processing applications make extensive use of subword data. Therefore, for such applications, by packing multiple subword variables into a single word, we can generate storage layouts that further reduce the cost of address arithmetic in two ways. First the need for address arithmetic is reduced as variables that are packed together share the same address. Second opportunities for using autoincrement andautodecrement instructions are increased as layouts are now possible which place a variable adjacent to more than two variables. This approach has become feasible because of the recent trend in embedded processor design which allows subword variables that are packed together to be accessed and manipulated without incurring performance penalty. We introduce the SubWord Offset Assignment (SWOA) problem and solve it using a Path Cover with Node Coalescing (PCwNC) formulation. Node coalescing corresponds to packing of multiple subword variables into a single word while path covering corresponds to placement of variables in adjacent memory locations to enable the use of autoincrement/autodecrement. We present three heuristics to solve the PCwNC problem. Experiments show that when the program is optimized for code size, the three proposed algorithms achieve 26%, 26.9% and 32% reduction in the number of static explicit address arithmetic instructions over Liao et al.'s algorithm. The algorithms also achieve 14.5%, 22.1%and 22.7% reduction in stack frame size. If the program is optimized for performance, the algorithms achieve 24.3%, 24.7% and 30.2% reduction in the dynamic instruction count of explicit address arithmetic instructions.