McRouter: multicast within a router for high performance network-on-chips

  • Authors:
  • Yuan He;Hiroshi Sasaki;Shinobu Miwa;Hiroshi Nakamura

  • Affiliations:
  • The University of Tokyo, Tokyo, Japan;Kyushu University, Fukuoka, Japan;The University of Tokyo, Tokyo, Japan;The University of Tokyo, Tokyo, Japan

  • Venue:
  • PACT '13 Proceedings of the 22nd international conference on Parallel architectures and compilation techniques
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

The inevitable advent of the multi-core era has driven an increasing demand for low latency on-chip interconnection networks~(or NoCs). Being a critical part of the memory hierarchy for modern chip multi-processors~(CMPs), these networks face stringent design constraints to provide fast communication with tight power budget. Modern NoC's first-order concern is clearly its latency, while we also find that internal bandwidth of its routers is relatively plentiful; thus, we present a low latency router design utilizing a technique we call "multicast within a router" or McRouter, which allows productive utilization of remaining bandwidth inside a NoC router. McRouter allows a single cycle transfer of flits which shortens the communication latency when there is enough remaining bandwidth within the router. The key idea is to transmit a header flit to all possible output ports (multicast) so that it is always transmitted to the correct output port without relying on route computation. In addition, we find it is affordable with marginal power overhead while still being a stand-alone design by maintaining portability and modularity (unlike look-ahead routing based designs). Our evaluation with application traffic shows that McRouter helps achieving system speed-ups of 1.28, 1.17 and 1.05 over the conventional router~(CR), the VSA router~(VSAR) and the prediction router~(PR), respectively.