Fast enumeration of maximal valid subgraphs for custom-instruction identification

Authors:
Tao Li;Zhigang Sun;Wu Jigang;Xicheng Lu
Affiliations:
National University of Defense Technology, Changsha, Hunan, China;National University of Defense Technology, Changsha, Hunan, China;Nanyang Technological University, Singapore, Singapore;National University of Defense Technology, Changsha, Hunan, China
Venue:
CASES '09 Proceedings of the 2009 international conference on Compilers, architecture, and synthesis for embedded systems
Year:
2009

Citing 18
Cited 4

MediaBench: a tool for evaluating and synthesizing multimedia and communicatons systems

MICRO 30 Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
Algorithm 457: finding all cliques of an undirected graph

Communications of the ACM
Xtensa: A Configurable and Extensible Processor

IEEE Micro
ConCISe: A Compiler-Driven CPLD-Based Instruction Set Accelerator

FCCM '99 Proceedings of the Seventh Annual IEEE Symposium on Field-Programmable Custom Computing Machines
Processor Acceleration Through Automated Instruction Set Customization

Proceedings of the 36th annual IEEE/ACM International Symposium on Microarchitecture
Scalable custom instructions identification for instruction-set extensible processors

Proceedings of the 2004 international conference on Compilers, architecture, and synthesis for embedded systems
Instruction set extension with shadow registers for configurable processors

Proceedings of the 2005 ACM/SIGDA 13th international symposium on Field-programmable gate arrays
Exploiting pipelining to relax register-file port constraints of instruction-set extensions

Proceedings of the 2005 international conference on Compilers, architectures and synthesis for embedded systems
MiBench: A free, commercially representative embedded benchmark suite

WWC '01 Proceedings of the Workload Characterization, 2001. WWC-4. 2001 IEEE International Workshop
Scalable subgraph mapping for acyclic computation accelerators

CASES '06 Proceedings of the 2006 international conference on Compilers, architecture and synthesis for embedded systems
Customizable Embedded Processors: Design Technologies and Applications

Customizable Embedded Processors: Design Technologies and Applications
Application Specific Datapath Extension with Distributed I/O Functional Units

VLSID '07 Proceedings of the 20th International Conference on VLSI Design held jointly with 6th International Conference: Embedded Systems
Optimizing instruction-set extensible processors under data bandwidth constraints

Proceedings of the conference on Design, automation and test in Europe
Rethinking custom ISE identification: a new processor-agnostic method

CASES '07 Proceedings of the 2007 international conference on Compilers, architecture, and synthesis for embedded systems
Fast custom instruction identification by convex subgraph enumeration

ASAP '08 Proceedings of the 2008 International Conference on Application-Specific Systems, Architectures and Processors
Trimaran: an infrastructure for research in instruction-level parallelism

LCPC'04 Proceedings of the 17th international conference on Languages and Compilers for High Performance Computing
Exact and approximate algorithms for the extension of embedded processor instruction sets

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Fast Identification of Custom Instructions for Extensible Processors

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

Instruction selection by graph transformation

CASES '10 Proceedings of the 2010 international conference on Compilers, architectures and synthesis for embedded systems
Practical and effective domain-specific function unit design for CGRA

ICCSA'11 Proceedings of the 2011 international conference on Computational science and Its applications - Volume Part V
Considering the effect of process variations during the ISA extension design flow

Microprocessors & Microsystems
Accelerating an application domain with specialized functional units

ACM Transactions on Architecture and Code Optimization (TACO)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Extensible processors are increasingly becoming popular as they allow for incorporating custom instructions to meet design constraints. However, identifying custom instructions under architectural input/output ports constraint is a time consuming process particularly when large applications are considered. To rapidly identify the most profitable custom instructions with large inputs and outputs, this paper proposes a novel identification algorithm for enumerating maximal convex subgraphs containing no invalid node (i.e., maximal valid subgraphs). The proposed enumerating strategy is based on divide-and-conquer with a top-down manner, rather than the bottom-up manner utilized in the state-of-the-art. The division operation only considers invalid inner nodes of the given DFG, rather than taking all the invalid nodes into account, and thus accelerates enumeration of the maximal valid subgraphs. Experimental results show that, the improvement over the latest work is more than 90% for 60% DFG instances of the acknowledged benchmarks.