Exploiting ILP in page-based intelligent memory

  • Authors:
  • Mark Oskin;Justin Hensley;Diana Keen;Frederic T. Chong;Matthew Farrens;Aneet Chopra

  • Affiliations:
  • Department of Computer Science, University of California, Davis;Department of Computer Science, University of California, Davis;Department of Computer Science, University of California, Davis;Department of Computer Science, University of California, Davis;Department of Computer Science, University of California, Davis;Department of Computer Science, University of California, Davis

  • Venue:
  • Proceedings of the 32nd annual ACM/IEEE international symposium on Microarchitecture
  • Year:
  • 1999

Quantified Score

Hi-index 0.00

Visualization

Abstract

This study compares the speed, area, and power of different implementations of Active Pages [OCS98], an intelligent memory system which helps bridge the growing gap between processor and memory performance by associating simple functions with each page of data. Previous investigations have shown up to 1000X speedups using a block of reconfigurable logic to implement these functions next to each sub-array on a DRAM chip.In this study, we show that instruction-level parallelism, not hardware specialization, is the key to the previous success with reconfigurable logic. In order to demonstrate this fact, an Active Page implementation based upon a simplified VLIW processor was developed. Unlike conventional VLIW processors, power and area constraints lead to a design which has a small number of pipeline stages. Our results demonstrate that a four-wide VLIW processor attains comparable performance to that of pure FPGA logic but requires significantly less area and power.