The effectiveness of caches for vector processors

Authors:
Jeffrey D. Gee;Alan Jay Smith
Affiliations:
Sun Microsystems, Inc, 2550 Garcia Avenue, Mt. View, CA;Computer Science Division, University of California, Berkeley, CA
Venue:
ICS '94 Proceedings of the 8th international conference on Supercomputing
Year:
1994

Citing 24
Cited 0

Supercomputers and their use

Supercomputers and their use
Line (block) size choice for CPU cache memories

IEEE Transactions on Computers
The Titan Graphics Supercomputer Architecture

Computer
Cache performance of vector processors

ISCA '88 Proceedings of the 15th Annual International Symposium on Computer architecture
Performance tradeoffs in cache design

ISCA '88 Proceedings of the 15th Annual International Symposium on Computer architecture
Performance evaluation of static and dynamic memory systems on the Cray-2

ICS '88 Proceedings of the 2nd international conference on Supercomputing
Program locality of vectorized applications running on the IBM 3090 with vector facility

IBM Systems Journal
The birth of the second generation: the Hitachi S-820/80

Proceedings of the 1988 ACM/IEEE conference on Supercomputing
Evaluating Associativity in CPU Caches

IEEE Transactions on Computers
Cache performance of the integer SPEC benchmarks on a RISC

ACM SIGARCH Computer Architecture News
Data cache performance of supercomputer applications

Proceedings of the 1990 ACM/IEEE conference on Supercomputing
Data prefetching in multiprocessor vector cache memories

ISCA '91 Proceedings of the 18th annual international symposium on Computer architecture
A novel cache design for vector processing

ISCA '92 Proceedings of the 19th annual international symposium on Computer architecture
Analysis of cache performance in vector processors and multiprocessors

Analysis of cache performance in vector processors and multiprocessors
Characterizing the Storage Process and Its Effect on the Update of Main Memory by Write Through

Journal of the ACM (JACM)
Generation and analysis of very long address traces

ISCA '90 Proceedings of the 17th annual international symposium on Computer Architecture
Cache evaluation and the impact of workload choice

ISCA '85 Proceedings of the 12th annual international symposium on Computer architecture
Cache Memories

ACM Computing Surveys (CSUR)
Organizing matrices and matrix operations for paged memory systems

Communications of the ACM
A study of instruction cache organizations and replacement policies

ISCA '83 Proceedings of the 10th annual international symposium on Computer architecture
Program optimization for a pipelined machine a case study

SIGMETRICS '84 Proceedings of the 1984 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Cache Performance of the SPEC Benchmark Suite

Cache Performance of the SPEC Benchmark Suite
Vector Processor Caches

Vector Processor Caches
Aspects of cache memory and instruction buffer performance

Aspects of cache memory and instruction buffer performance

Quantified Score

Hi-index	0.00

Visualization

Abstract

Vector processors have typically used vector registers, interleaved memory, and pipelined access to data to provide sufficient memory system performance. Caches have been used mainly for instructions and scalar data, while vectors are usually uncached, presumably partially because of the belief that there is insufficient vector locality in these workloads. In this study we use memory address traces from an Ardent Titan to examine both reference locality and cache performance in a vector processing environment. Many of the Titan traces are from real vectorized applications which reference large amounts of data. We have found that vector references contain somewhat less temporal locality, but large amounts of spatial locality compared to instruction and scalar references. Cache miss ratios are found to be comparable to those measured and published previously for various non-vectorized workloads. We provide analyses of trace behavior with regard to parameters of interest to cache designers. Calculations based on our measured miss ratios indicate that caches will improve average access times, which in turn can be expected to translate into significant improvements in machine performance. Arguments suggesting otherwise are discussed and considered.