A 186-Mvertices/s 161-mW floating-point vertex processor with optimized datapath and vertex caches

  • Authors:
  • Chang-Hyo Yu;Kyusik Chung;Donghyun Kim;Seok-Hoon Kim;Lee-Sup Kim

  • Affiliations:
  • Mobile Next Generation Technology Group of System, LSI Division, Semiconductor Business, Samsung Electronics Co., Ltd., Yongin-city, Gyeonggi-do, Korea;Department of Electrical Engineering and Computer Science, KAIST, Daejeon, Korea;Qualcomm, San Diego, CA;Department of Electrical Engineering and Computer Science, KAIST, Daejeon, Korea;Department of Electrical Engineering and Computer Science, KAIST, Daejeon, Korea

  • Venue:
  • IEEE Transactions on Very Large Scale Integration (VLSI) Systems
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, a power efficient vertex processor for mobile graphics applications is presented. A four-threaded and four-issue expanded VLIW datapath with a quad-float vertex texture fetcher is proposed by exploiting graphics specific characteristics after evaluation of several candidate architectures. Instruction-level power control methods such as operand sharing and writeback re-allocation along with operand isolations and gated clocks result in 40.4% and 82% reduction in energy dissipation and energy delay product compared to the most widely used single threaded SIMD. The proposed processor with the optimized datapath and vertex caches implemented in a 0.18-µm 1P4M CMOS process achieves 186-Mvertices/s geometry performance which is the best result among the processors that are IEEE-754 compliant.