Ordering circuit establishment in multiplane NoCs

Authors:
Ahmed Abousamra;Alex K. Jones;Rami Melhem
Affiliations:
University of Pittsburgh;University of Pittsburgh;University of Pittsburgh
Venue:
ACM Transactions on Design Automation of Electronic Systems (TODAES) - Special Section on Networks on Chip: Architecture, Tools, and Methodologies
Year:
2013

Citing 27
Cited 0

The SPLASH-2 programs: characterization and methodological considerations

ISCA '95 Proceedings of the 22nd annual international symposium on Computer architecture
Simics: A Full System Simulation Platform

Computer
Dynamic Voltage Scaling with Links for Power Optimization of Interconnection Networks

HPCA '03 Proceedings of the 9th International Symposium on High-Performance Computer Architecture
A Delay Model and Speculative Architecture for Pipelined Routers

HPCA '01 Proceedings of the 7th International Symposium on High-Performance Computer Architecture
Low-Latency Virtual-Channel Routers for On-Chip Networks

Proceedings of the 31st annual international symposium on Computer architecture
Interconnect-Aware Coherence Protocols for Chip Multiprocessors

Proceedings of the 33rd annual international symposium on Computer Architecture
Express virtual channels: towards the ideal interconnection fabric

Proceedings of the 34th annual international symposium on Computer architecture
Exploring the Design Space of Self-Regulating Power-Aware On/Off Interconnection Networks

IEEE Transactions on Parallel and Distributed Systems
Analysis of dynamic voltage/frequency scaling in chip-multiprocessors

ISLPED '07 Proceedings of the 2007 international symposium on Low power electronics and design
Design of a Dynamic Priority-Based Fast Path Architecture for On-Chip Interconnects

HOTI '07 Proceedings of the 15th Annual IEEE Symposium on High-Performance Interconnects
A 5-GHz Mesh Interconnect for a Teraflops Processor

IEEE Micro
Research Challenges for On-Chip Interconnection Networks

IEEE Micro
Optimizing NUCA Organizations and Wiring Alternatives for Large Caches with CACTI 6.0

Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture
The PARSEC benchmark suite: characterization and architectural implications

Proceedings of the 17th international conference on Parallel architectures and compilation techniques
Transaction-Aware Network-on-Chip Resource Reservation

IEEE Computer Architecture Letters
Token flow control

Proceedings of the 41st annual IEEE/ACM International Symposium on Microarchitecture
A variable frequency link for a power-aware network-on-chip (NoC)

Integration, the VLSI Journal
Winning with Pinning in NoC

HOTI '09 Proceedings of the 2009 17th IEEE Symposium on High Performance Interconnects
Heterogeneous Interconnects for Energy-Efficient Message Management in CMPs

IEEE Transactions on Computers
Cache Hierarchy and Memory Subsystem of the AMD Opteron Processor

IEEE Micro
The aethereal network on chip after ten years: goals, evolution, lessons, and future

Proceedings of the 47th Design Automation Conference
aelite: a flit-synchronous network on chip with composable and predictable services

Proceedings of the Conference on Design, Automation and Test in Europe
ORION 2.0: a fast and accurate NoC power and area model for early-stage design space exploration

Proceedings of the Conference on Design, Automation and Test in Europe
Pseudo-Circuit: Accelerating Communication for On-Chip Interconnection Networks

MICRO '43 Proceedings of the 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture
Codesign of NoC and Cache Organization for Reducing Access Latency in Chip Multiprocessors

IEEE Transactions on Parallel and Distributed Systems
A Statically Scheduled Time-Division-Multiplexed Network-on-Chip for Real-Time Systems

NOCS '12 Proceedings of the 2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip
Compiler-Assisted Data Distribution and Network Configuration for Chip Multiprocessors

IEEE Transactions on Parallel and Distributed Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Segregating networks-on-chips (NoCs) into data and control planes yields several opportunities for improving power and performance in chip-multiprocessor systems (CMPs). This article describes a hybrid packet/circuit switched multiplane network optimized to reduce latency in order to improve system performance and/or reduce system energy. Unlike traditional circuit preallocation techniques which require timestamps to reserve circuit resources, this article proposes an order-based preallocation scheme. By enforcing the order in which resources are scheduled and utilized rather than a fixed time, the NoC can take advantage of messages that arrive early while naturally tolerating message delays due to contention. Ordered circuit establishment is presented using two techniques. First, Déjà Vu switching preestablishes circuits for data messages once a cache hit is detected and prior to the requested data becoming available. Second, using Red Carpet Routing, circuits are proactively reserved for a return data message as a request message traverses the NoC. The reduced communication latency over configured circuits enable system performance improvement or saving NoC energy by reducing voltage and frequency without sacrificing performance. In simulations of 16 and 64 core CMPs, Déjà Vu switching enabled average NoC energy savings of 43% and 53% respectively. On the other hand, simulations of communication sensitive benchmarks using Red Carpet Routing show speedup in execution time of up to 16%, with an average of 10% over a purely packet switched NoC and an average of 8% over preconfiguring circuits using Déjà Vu switching.