Memory System Support for Image Processing

Authors:
Lixin Zhang;John B. Carter;Wilson C. Hsieh;Sally A. McKee
Affiliations:
-;-;-;-
Venue:
PACT '99 Proceedings of the 1999 International Conference on Parallel Architectures and Compilation Techniques
Year:
1999

Citing 0
Cited 6

Algorithmic foundations for a parallel vector access memory system

Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architectures
Dynamic Access Ordering for Streamed Computations

IEEE Transactions on Computers
The Impulse Memory Controller

IEEE Transactions on Computers
Measuring the Performance of Multimedia Instruction Sets

IEEE Transactions on Computers
Memory System Support for Dynamic Cache Line Assembly

IMS '00 Revised Papers from the Second International Workshop on Intelligent Memory Systems
Three-dimensional memory vectorization for high bandwidth media memory systems

Proceedings of the 35th annual ACM/IEEE international symposium on Microarchitecture

Quantified Score

Hi-index	0.01

Visualization

Abstract

Image processing applications tend to access their data non-sequentially and reuse that data infrequently. As a result, they tend to perform poorly on conventional memory systems due to high cache and TLB miss rates and are particularly sensitive to the growing latency of main memory. In this paper, we analyze the memory performance of three image processing algorithms (volume rendering, image warping, and image filtering) on both a conventional memory system and on the Impulse memory system. The Impulse memory system allows application software to control how, when, and where data are loaded into a conventional processor cache. It does this by letting software configure how the memory controller interprets the physical addresses exported by the processor, which enables an application to dynamically change how data are fetched. Sparse data can be accessed densely, which improves both cache and TLB utilization, and memory latency is hidden by prefetching data within the memory controller. We find that for these image processing codes, using an Impulse memory system yields speedups of 40% to 226% over an otherwise identical machine with a conventional memory system.