A Multidimensional Software Cache for Scratchpad-Based Systems

Authors:
Arnaldo Azevedo;Ben Juurlink
Affiliations:
Delft University of Technology, The Netherlands;Technische Universität Berlin, Germany
Venue:
International Journal of Embedded and Real-Time Communication Systems
Year:
2010

Citing 8
Cited 1

Scratchpad memory: design alternative for cache on-chip memory in embedded systems

Proceedings of the tenth international symposium on Hardware/software codesign
Synergistic Processing in Cell's Multicore Architecture

IEEE Micro
Introduction to the cell multiprocessor

IBM Journal of Research and Development - POWER5 and packaging
Memory Hierarchy Targeting Bi-Predictive Motion Compensation for H.264/AVC Decoder

ISVLSI '07 Proceedings of the IEEE Computer Society Annual Symposium on VLSI
Prefetching irregular references for software cache on cell

Proceedings of the 6th annual IEEE/ACM international symposium on Code generation and optimization
Hybrid access-specific software cache techniques for the cell BE architecture

Proceedings of the 17th international conference on Parallel architectures and compilation techniques
COMIC: a coherent shared memory interface for cell be

Proceedings of the 17th international conference on Parallel architectures and compilation techniques
HD-VideoBench. A Benchmark for Evaluating High Definition Digital Video Applications

IISWC '07 Proceedings of the 2007 IEEE 10th International Symposium on Workload Characterization

An instruction to accelerate software caches

ARCS'11 Proceedings of the 24th international conference on Architecture of computing systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

In many kernels of multimedia applications, the working set is predictable, making it possible to schedule the data transfers before the computation. Many other kernels, however, process data that is known just before it is needed or have working sets that do not fit in the scratchpad memory. Furthermore, multimedia kernels often access two or higher dimensional data structures and conventional software caches have difficulties to exploit the data locality exhibited by these kernels. For such kernels, the authors present a Multidimensional Software Cache MDSC, which stores 1-4 dimensional blocks to mimic in cache the organization of the data structure. Furthermore, it indexes the cache using the matrix indices rather than linear memory addresses. MDSC also makes use of the lower overhead of Direct Memory Access DMA list transfers and allows exploiting known data access patterns to reduce the number of accesses to the cache. The MDSC is evaluated using GLCM, providing an 8% performance improvement compared to the IBM software cache. For MC, several optimizations are presented that reduce the number of accesses to the MDSC.