RECONNECT: A NoC for polymorphic ASICs using a low overhead single cycle router

  • Authors:
  • Joseph Nimmy;C. Ramesh Reddy;Keshavan Varadarajan;Mythri Alle;Alexander Fell;S. K. Nandy;Ranjani Narayan

  • Affiliations:
  • CAD Lab, SERC, Indian Institute of Science, Bangalore, India;CAD Lab, SERC, Indian Institute of Science, Bangalore, India;CAD Lab, SERC, Indian Institute of Science, Bangalore, India;CAD Lab, SERC, Indian Institute of Science, Bangalore, India;CAD Lab, SERC, Indian Institute of Science, Bangalore, India;CAD Lab, SERC, Indian Institute of Science, Bangalore, India;Morphing Machines, Bangalore, India

  • Venue:
  • ASAP '08 Proceedings of the 2008 International Conference on Application-Specific Systems, Architectures and Processors
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

A polymorphic ASIC is a runtime reconfigurable hardware substrate comprising compute and communication elements. It is a ldquofuture proofrdquo custom hardware solution for multiple applications and their derivatives in a domain. Interoperability between application derivatives at runtime is achieved through hardware reconfiguration. In this paper we present the design of a single cycle Network on Chip (NoC) router that is responsible for effecting runtime reconfiguration of the hardware substrate. The router design is optimized to avoid FIFO buffers at the input port and loop back at output crossbar. It provides virtual channels to emulate a non-blocking network and supports a simple X-Y relative addressing scheme to limit the control overhead to 9 bits per packet. The 8times8 honeycomb NoC (RECONNECT) implemented in 130 nm UMC CMOS standard cell library operates at 500 MHz and has a bisection bandwidth of 28.5 GBps. The network is characterized for random, self-similar and application specific traffic patterns that model the execution of multimedia and DSP kernels with varying network loads and virtual channels. Our implementation with 4 virtual channels has an average network latency of 24 clock cycles and throughput of 62.5% of the network capacity for random traffic. For application specific traffic the latency is 6 clock cycles and throughput is 87% of the network capacity.