The address traces used in computer architecture research are commonly generated using software techniques that introduce time dilations of an order of magnitude or more. Such techniques may also omit classes of memory references that are important for accurate models of computer systems, such as instruction prefetches, operating system references, and interrupt activity.

We describe a technique for capturing all classes of references in real time. RATCHET employs trace filtering hardware to reduce the bandwidth and storage requirements that have previously limited the usefulness of hardware-based tracing. In evaluating this technique using the ten SPEC89 benchmark programs running on a Sun-3/60 workstation, we found that a small filter cache achieves compression ratios in the 10-30 range during the startup section of the programs. Traces from the middle sections of the C programs achieved compression ratios of 20-30, while the FORTRAN codes produced ratios of 45-84. Traces from a smaller ionospheric simulator program were compressed by factors of 100.

These filtered traces typically represent 36 million contiguous references.
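
The idea of filtering a trace through a small cache can be illustrated in software. The following is a minimal sketch, not the RATCHET hardware design: it assumes a direct-mapped filter cache, and the names `filter_trace` and `compression_ratio`, the parameters `num_lines` and `line_size`, and the synthetic trace are all hypothetical choices made for illustration. Only references that miss in the filter cache are kept, and the compression ratio is taken here as the number of original references divided by the number of retained ones.

```python
def filter_trace(addresses, num_lines=256, line_size=16):
    """Pass an address trace through a direct-mapped filter cache.

    Returns only the addresses that miss in the cache. The cache
    organization and sizes are illustrative assumptions, not the
    configuration used in the paper.
    """
    tags = [None] * num_lines          # one stored block number per cache line
    misses = []
    for addr in addresses:
        block = addr // line_size      # block (line-sized chunk) containing addr
        index = block % num_lines      # direct-mapped: one candidate slot
        if tags[index] != block:       # miss: record the reference, fill the line
            tags[index] = block
            misses.append(addr)
    return misses


def compression_ratio(original_len, filtered_len):
    """Original reference count divided by the count kept after filtering."""
    if filtered_len == 0:
        return float("inf")
    return original_len / filtered_len


# Synthetic trace (an assumption for demonstration): a loop that reads
# sixteen 4-byte words starting at 0x1000, repeated 100 times.
trace = [0x1000 + 4 * i for _ in range(100) for i in range(16)]
filtered = filter_trace(trace)
print(len(trace), len(filtered), compression_ratio(len(trace), len(filtered)))
```

A tight loop like this synthetic one filters far more aggressively than the ratios reported above for real programs. Real traces mix loops with less local behavior, so their ratios depend on both the workload and the filter cache geometry.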