Code optimizations for a VLIW-style network processing unit

Authors:
Jinhwan Kim;Yunheung Paek;Gangryung Uh
Affiliations:
Department of Electrical Engineering, KAIST, Daejon 350-701, South Korea;School of Electrical Engineering, Seoul National University, Seoul 151-744, South Korea and Department of Electrical Engineering, Seoul National University, Seoul 151-744, South Korea;Department of Computer Science, Boise State University, Boise, ID
Venue:
Software—Practice & Experience
Year:
2004

Citing 10
Cited 0

A simple interprocedural register allocation algorithm and its effectiveness for LISP

ACM Transactions on Programming Languages and Systems (TOPLAS)
Unexpected side effects of inline substitution: a case study

ACM Letters on Programming Languages and Systems (LOPLAS)
Network processors: a perspective on market requirements, processor architectures and embedded S/W tools

Proceedings of the conference on Design, automation and test in Europe
C Compiler Design for an Industrial Network Processor

OM '01 Proceedings of the 2001 ACM SIGPLAN workshop on Optimization of middleware and distributed systems
The very portable optimizer for digital signal processors

CASES '01 Proceedings of the 2001 international conference on Compilers, architecture, and synthesis for embedded systems
Code Optimization Techniques for Embedded Processors: Methods, Algorithms, and Tools

Code Optimization Techniques for Embedded Processors: Methods, Algorithms, and Tools
Experience with a retargetable compiler for a commercial network processor

CASES '02 Proceedings of the 2002 international conference on Compilers, architecture, and synthesis for embedded systems
Machine Descriptions to Build Tools for Embedded Systems

LCTES '98 Proceedings of the ACM SIGPLAN Workshop on Languages, Compilers, and Tools for Embedded Systems
Decreasing Process Memory Requirements by Overlapping Program Portions

HICSS '98 Proceedings of the Thirty-First Annual Hawaii International Conference on System Sciences-Volume 7 - Volume 7
Code optimization libraries for retargetable compilation for embedded digital signal processors

Code optimization libraries for retargetable compilation for embedded digital signal processors

Quantified Score

Hi-index	0.00

Visualization

Abstract

The explosive growth in network bandwidth and Internet services such as QoS (quality of service) and SLA (service level agreement) monitoring have created the need for new networking hardware called a Network Processing Unit (NPU). In order to rapidly reconfigure the NPU for frequently varying Internet services and technologies, a high-performance C compiler is urgently needed. Several code generation techniques, which are intended to meet the high code quality demands of other types of application specific instructionset processors (ASIPs) like digital signal processors (DSPs), have already been developed. However, these techniques are insufficient for NPUs due to striking architectural differences such as asymmetric data paths. The main purpose of this paper is to discuss our recent experience with the development of a commercial compiler for a new NPU called the Paion PPII, which is basically a packet engine for NPU to meet the growing need for new high-bandwidth communication equipment targeted for Internet routers and ethernet adapters. For this purpose, we will first show the architectural challenges posed by the target NPU. Then, we will describe several compiler techniques that we found to be effective for the target NPU with various unorthogonal architectural features. The current implementations of the PPII use a VLIW (Very Long Instruction Word) architecture. So, we handled this VLIW-style architecture by employing a simple code compaction scheme which packs multiple parallel instructions into one long instruction word. The experimental results show that our techniques are effective for significantly reducing the dynamic instruction count.