A 90mW/GFlop 3.4GHz Reconfigurable Fused/Continuous Multiply-Accumulator for Floating-Point and Integer Operands in 65nm

  • Authors:
  • Shailendra Jain;Vasantha Erraguntla;Sriram R. Vangal;Yatin Hoskote;Nitin Borkar;Tulasi Mandepudi;Karthik VP

  • Affiliations:
  • -;-;-;-;-;-;-

  • Venue:
  • VLSID '10 Proceedings of the 2010 23rd International Conference on VLSI Design
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes energy efficient and reconfigurable fused/continuous Multiply-Accumulator (MAC) architecture for single-precision Floating-point and 16-bit signed integer operands. This eight-stage pipelined and single-cycle throughput MAC design contains a bit level pipelined multiplier, followed by fast sparse-tree adder and single cycle accumulator loop with delayed normalization logic. Operation driven energy control is achieved using dynamic clock and fine grained power gating techniques. Power gating is employed in 98% of design to save 79% of leakage power in idle mode, at 1.2V supply and 110C. The use of fully shared logic in the multiplier, accumulator and normalization blocks for different operations enables a compact design of 0.54mm2 containing 117K transistors in eight-metal 65nm CMOS technology. The 15-FO4 design provides 6.8GFLOPS of performance with total energy efficiency of 90mW/GFLOP at 1.2V and 3.4GHz operation.