Proceedings of the 7th ACM international conference on Computing frontiers
Journal of Parallel and Distributed Computing
Hi-index | 0.00 |
Virtually all of the discussion on "commodity" vs "custom" architectures, especially for highly parallel systems, has focused on the high-glamor, high complexity processor core. This paper takes a different tack - it explores the potential for directly attacking the memory wall by programming the classically "dumb" memory interface. Several related but separable techniques are involved: converting the data that makes up a memory request into the machine state of a "traveling thread," and developing an ISA that can manipulate this state via extremely short instructions that allow complete programs to be stored within the access packet. The results are interesting: a wide spectrum of functions applications exist for which this approach provides significant improvement in bandwidth and latency.