On an efficient NoC multicasting scheme in support of multiple applications running on irregular sub-networks

Authors:
Xiaohang Wang;Mei Yang;Yingtao Jiang;Peng Liu
Affiliations:
Department of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, Zhejiang 310027, PR China and Department of Electrical and Computer Engineering, University of Nevada, ...;Department of Electrical and Computer Engineering, University of Nevada, Las Vegas 89154, USA;Department of Electrical and Computer Engineering, University of Nevada, Las Vegas 89154, USA;Department of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, Zhejiang 310027, PR China
Venue:
Microprocessors & Microsystems
Year:
2011

Citing 17
Cited 5

SPLASH: Stanford parallel applications for shared-memory

ACM SIGARCH Computer Architecture News
Asynchronous Tree-Based Multicasting in Wormhole-Switched MINs

IEEE Transactions on Parallel and Distributed Systems
Interconnection Networks: An Engineering Approach

Interconnection Networks: An Engineering Approach
The Raw Microprocessor: A Computational Fabric for Software Circuits and General-Purpose Programs

IEEE Micro
Connection-oriented Multicasting in Wormhole-switched Networks on Chip

ISVLSI '06 Proceedings of the IEEE Computer Society Annual Symposium on Emerging VLSI Technologies and Architectures
Thousand core chips: a technology perspective

Proceedings of the 44th annual Design Automation Conference
On-Chip Interconnection Architecture of the Tile Processor

IEEE Micro
A 5-GHz Mesh Interconnect for a Teraflops Processor

IEEE Micro
Virtual Circuit Tree Multicasting: A Case for On-Chip Hardware Multicast Support

ISCA '08 Proceedings of the 35th Annual International Symposium on Computer Architecture
User-aware dynamic task allocation in networks-on-chip

Proceedings of the conference on Design, automation and test in Europe
Application Specific Routing Algorithms for Networks on Chip

IEEE Transactions on Parallel and Distributed Systems
Efficient unicast and multicast support for CMPs

Proceedings of the 41st annual IEEE/ACM International Symposium on Microarchitecture
Recursive partitioning multicast: A bandwidth-efficient routing for Networks-on-Chip

NOCS '09 Proceedings of the 2009 3rd ACM/IEEE International Symposium on Networks-on-Chip
Designing Reliable and Efficient Networks on Chips

Designing Reliable and Efficient Networks on Chips
Efficient multicast support in buffered crossbars using networks on chip

GLOBECOM'09 Proceedings of the 28th IEEE conference on Global telecommunications
Energy- and performance-aware mapping for regular NoC architectures

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Energy- and Performance-Aware Incremental Mapping for Networks on Chip With Multiple Voltage Levels

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

Quest for the ultimate network-on-chip: the NaNoC project

Proceedings of the 2012 Interconnection Network Architecture: On-Chip, Multi-Chip Workshop
An efficient, low-cost routing framework for convex mesh partitions to support virtualization

ACM Transactions on Embedded Computing Systems (TECS) - Special Section on Wireless Health Systems, On-Chip and Off-Chip Network Architectures
Virtual networks -- distributed communication resource management

ACM Transactions on Reconfigurable Technology and Systems (TRETS) - Special Section on 19th Reconfigurable Architectures Workshop (RAW 2012)
Efficient multicast schemes for 3-D Networks-on-Chip

Journal of Systems Architecture: the EUROMICRO Journal
On self-tuning networks-on-chip for dynamic network-flow dominance adaptation

ACM Transactions on Embedded Computing Systems (TECS) - Special Section ESFH'12, ESTIMedia'11 and Regular Papers

Quantified Score

Hi-index	0.00

Visualization

Abstract

When a number of applications simultaneously running on a many-core chip multiprocessor (CMP) chip connected through network-on-chip (NoC), significant amount of on-chip traffic is one-to-many (multicast) in nature. As a matter of fact, when multiple applications are mapped onto an NoC architecture with applicable traffic isolation constraints, the corresponding sub-networks of these applications are mapped onto actually tend to be irregular. In the literature, multicasting for irregular topologies is supported through either multiple unicasting or broadcasting, which, unfortunately, results in overly high power consumption and/or long network latency. To address this problem, a simple, yet efficient hardware-based multicasting scheme is proposed in this paper. First, an irregular oriented multicast strategy is proposed. Literally, following this strategy, an irregular oriented multicast routing algorithm can be designed based on any regular mesh based multicast routing algorithm. One such algorithm, namely, Alternative Recursive Partitioning Multicasting (AL+RPM), is proposed based on RPM, which was designed for regular mesh topology originally. The basic idea of AL+RPM is to find the output directions following the basic RPM algorithm and then decide to replicate the packets to the original output directions or the alternative (AL) output directions based on the shape of the sub-network. The experiment results show that the proposed multicast AL+RPM algorithm can consume, on average, 14% and 20% less power than bLBDR (a broadcasting-based routing algorithm) and the multiple unicast scheme, respectively. In addition, AL+RPM has much lower network latency than the above two approaches. To incorporate AL+RPM into a baseline router to support multicasting, the area overhead is fairly modest, less than 5.5%.