Design and implementation of high-speed buffered crossbars with efficient load balancing for multi-core SoCs

  • Authors:
  • George Kornaros;Theofanis Orphanoudakis

  • Affiliations:
  • Technical University of Crete, Electronics & Computer Engineering Department, Kounoupidiana, Chania, Crete, Greece and Department of Applied Informatics & Multimedia, Technological Educational Ins ...;University of Peloponnese, Telecommunications Science and Technology Department, Karaiskaki Str., 22100 Tripoli, Greece

  • Venue:
  • Microprocessors & Microsystems
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

A large increase of the number of devices integrated in a single chip in conjunction with the significant demands of modern applications for performance has led the designers to a system development methodology based on integrating multiple pre-verified intellectual property cores. Yet, design productivity requirements push designers to focus on key micro-architectural solutions to manage more efficiently the scaling of multi-core SoCs as well as to increase the degree of design automation, particularly as rapid prototyping using reconfigurable computing is becoming mainstream. In this paper we present a novel interconnect architecture based on optimized components to efficiently manage SoCs that follow either a multi-core based approach or are built to support SIMD-style applications that can exploit the processing power of a pool of hardware resources; first we analyze the design of a crossbar featuring shared-memory combined input-crosspoint buffering as a solution for efficient implementation of on-chip interconnection; second we describe the design of a load-balancer featuring configurable proportional allocation of on-chip resources and in-order delivery as a solution for efficient scheduling and execution of processing tasks. The main focus of the paper is to describe and evaluate the mechanisms designed to distribute and manage data transfers so as to implement an efficient interconnection of the integrated cores and control access to available (either on-chip or off-chip) resources for the implementation of a number of embedded systems and applications. Each of these challenges is handled by the proposed architecture in an efficient way in terms of performance, cost in silicon and flexibility.