Efficient VLSI architectures for matrix factorizations
Efficient VLSI architectures for matrix factorizations
Microcontrollers and microcomputers: principles of software and hardware engineering
Microcontrollers and microcomputers: principles of software and hardware engineering
On Limits of Wireless Communications in a Fading Environment when UsingMultiple Antennas
Wireless Personal Communications: An International Journal
BEE2: A High-End Reconfigurable Computing System
IEEE Design & Test
A practical, hardware friendly MMSE detector for MIMO-OFDM-based systems
EURASIP Journal on Advances in Signal Processing
On the sphere-decoding algorithm I. Expected complexity
IEEE Transactions on Signal Processing - Part I
The software radio architecture
IEEE Communications Magazine
Cognitive radio: brain-empowered wireless communications
IEEE Journal on Selected Areas in Communications
MIMO accelerator: a design flow for a programmable MIMO decoder architecture
Asilomar'09 Proceedings of the 43rd Asilomar conference on Signals, systems and computers
Journal of Signal Processing Systems
Hi-index | 0.00 |
In this paper, we present a multi-input-multi-output (MIMO) decoder accelerator architecture that offers versatility and reprogrammability while maintaining a very high performance-cost metric. The accelerator is meant to address the MIMO decoding bottlenecks associated with the convergence of multiple high-speed wireless standards onto a single device. It is scalable in the number of antennas, bandwidth, modulation format, and most importantly, present and emerging decoder algorithms. It features a Harvard-like architecture with complex vector operands and a deeply pipelined fixed-point complex arithmetic processing unit. When implemented on a Xilinx Virtex-4 LX200FF1513 field-programmable gate array (FPGA), the design occupied 43% of overall FPGA resources. The accelerator shows an advantage of up to three orders of magnitude (1000 times) in power-delay product for typical MIMO decoding operations relative to a general purpose DSP. When compared to dedicated application-specific IC (ASIC) implementations of mmse MIMO decoders, the accelerator showed a degradation of 340%-17%, depending on the actual ASIC being considered. In order to optimize the design for both speed and area, specific challenges had to be overcome. These include: definition of the processing units and their interconnection; proper dynamic scaling of the signal; and memory partitioning and parallelism.