Coherency for multiprocessor virtual address caches

Authors:
James R. Goodman
Affiliations:
Univ. of Wisconsin, Madison
Venue:
ASPLOS II Proceedings of the second international conference on Architectural support for programming languages and operating systems
Year:
1987

Citing 5
Cited 44

An in-cache address translation mechanism

ISCA '86 Proceedings of the 13th annual international symposium on Computer architecture
A class of compatible cache consistency protocols and their support by the IEEE futurebus

ISCA '86 Proceedings of the 13th annual international symposium on Computer architecture
Cache memory optimization to reduce processor/memory traffic

Advances in VLSI and Computer Systems
Cache Memories

ACM Computing Surveys (CSUR)
Using cache memory to reduce processor-memory traffic

ISCA '83 Proceedings of the 10th annual international symposium on Computer architecture

Comments on "Coherency for multiprocessor virtual addresses caches" by James R. Goodman

ACM SIGARCH Computer Architecture News
On the inclusion properties for multi-level cache hierarchies

ISCA '88 Proceedings of the 15th Annual International Symposium on Computer architecture
The Wisconsin multicube: a new large-scale cache-coherent multiprocessor

ISCA '88 Proceedings of the 15th Annual International Symposium on Computer architecture
A Case for Direct-Mapped Caches

Computer
Organization and performance of a two-level virtual-real cache hierarchy

ISCA '89 Proceedings of the 16th annual international symposium on Computer architecture
Translation-Lookaside Buffer Consistency

Computer
The Design of a Microsupercomputer

Computer - Special issue on experimental research in computer architecture
Simplicity Versus Accuracy in a Model of Cache Coherency Overhead

IEEE Transactions on Computers
Page placement algorithms for large real-indexed caches

ACM Transactions on Computer Systems (TOCS)
The effects of virtually addressed caches on virtual memory design and performance

ACM SIGOPS Operating Systems Review
Life span strategy—a compiler-based approach to cache coherence

ICS '92 Proceedings of the 6th international conference on Supercomputing
Architecture support for single address space operating systems

ASPLOS V Proceedings of the fifth international conference on Architectural support for programming languages and operating systems
Lazy caching

ACM Transactions on Programming Languages and Systems (TOPLAS)
The Wisconsin Wind Tunnel: virtual prototyping of parallel computers

SIGMETRICS '93 Proceedings of the 1993 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Improving cache performance with balanced tag and data paths

Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
Tradeoffs between false sharing and aggregation in software distributed shared memory

PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
A Performance Study on Bounteous Transfer in Multiprocessor Sectored Caches

The Journal of Supercomputing - Special issue: high performance computing systems
Options for dynamic address translation in COMAs

Proceedings of the 25th annual international symposium on Computer architecture
On the inclusion properties for multi-level cache hierarchies

25 years of the international symposia on Computer architecture (selected papers)
Functional Implementation Techniques for CPU Cache Memories

IEEE Transactions on Computers - Special issue on cache memory and related problems
The Kyushu University reconfigurable parallel processor: design of memory and intercommunication architectures

ICS '89 Proceedings of the 3rd international conference on Supercomputing
Trace-driven simulations for a two-level cache design in open bus systems

ISCA '90 Proceedings of the 17th annual international symposium on Computer Architecture
The TLB slice—a low-cost high-speed address translation mechanism

ISCA '90 Proceedings of the 17th annual international symposium on Computer Architecture
Uniprocessor Virtual Memory without TLBs

IEEE Transactions on Computers
TLB and snoop energy-reduction using virtual caches in low-power chip-multiprocessors

Proceedings of the 2002 international symposium on Low power electronics and design
Cool-Mem: combining statically speculative memory accessing with selective address translation for energy efficiency

Proceedings of the 10th international conference on Architectural support for programming languages and operating systems
Hardware Approaches to Cache Coherence in Shared-Memory Multiprocessors Part 2

IEEE Micro
Virtual-Address Caches Part 1: Problems and Solutions in Uniprocessors

IEEE Micro
Virtual-Address Caches, Part 2: Multiprocessor Issues

IEEE Micro
Designing High-Performance Processors Using Real Address Prediction

IEEE Transactions on Computers
Minerva: An Adaptive Subblock Coherence Protocol for Improved SMP Performance

ISHPC '02 Proceedings of the 4th International Symposium on High Performance Computing
U-cache: a cost-effective solution to synonym problem

HPCA '95 Proceedings of the 1st IEEE Symposium on High-Performance Computer Architecture
Two techniques for improving performance on bus-based multiprocessors

HPCA '95 Proceedings of the 1st IEEE Symposium on High-Performance Computer Architecture
Analysis of Shared Memory Misses and Reference Patterns

ICCD '00 Proceedings of the 2000 IEEE International Conference on Computer Design: VLSI in Computers & Processors
Coupling compiler-enabled and conventional memory accessing for energy efficiency

ACM Transactions on Computer Systems (TOCS)
Moving Address Translation Closer to Memory in Distributed Shared-Memory Multiprocessors

IEEE Transactions on Parallel and Distributed Systems
Shared memory computing on clusters with symmetric multiprocessors and system area networks

ACM Transactions on Computer Systems (TOCS)
Reducing energy of virtual cache synonym lookup using bloom filters

CASES '06 Proceedings of the 2006 international conference on Compilers, architecture and synthesis for embedded systems
Enigma: architectural and operating system support for reducing the impact of address translation

Proceedings of the 24th ACM International Conference on Supercomputing
Sentry: light-weight auxiliary memory access control

Proceedings of the 37th annual international symposium on Computer architecture
Implementation tradeoffs in the design of flexible transactional memory support

Journal of Parallel and Distributed Computing
DeFT: Design space exploration for on-the-fly detection of coherence misses

ACM Transactions on Architecture and Code Optimization (TACO)
Reducing memory reference energy with opportunistic virtual caching

Proceedings of the 39th Annual International Symposium on Computer Architecture
A new perspective for efficient virtual-cache coherence

Proceedings of the 40th Annual International Symposium on Computer Architecture

Quantified Score

Hi-index	0.01

Abstract

A multiprocessor cache memory system is described that supplies data to the processor based on virtual addresses, but maintains consistency in the main memory, both across caches and across virtual address spaces. Pages in the same or different address spaces may be mapped to share a single physical page. The same hardware is used for maintaining consistency both among caches and among virtual addresses. Three different notions of a cache "block" are defined: (1) the unit for transferring data to/from main storage, (2) the unit over which tag information is maintained, and (3) the unit over which consistency is maintained. The relation among these block sizes is explored, and it is shown that they can be optimized independently. It is shown that the use of large address blocks results in low overhead for the virtual address cache.
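
The three block notions in the abstract, and the claim that large address blocks keep overhead low, can be illustrated with a rough back-of-envelope model. This is only a sketch under stated assumptions, not the analysis from the paper: the names `BlockSizes`, `tag_entries`, and `tag_overhead`, the 64 KiB cache size, and the 32-bit tag width are all hypothetical, and per-block status bits are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BlockSizes:
    """Three independent block sizes (all values here are illustrative)."""
    transfer_bytes: int   # (1) unit moved to/from main storage
    address_bytes: int    # (2) unit covered by a single tag
    coherence_bytes: int  # (3) unit over which consistency is maintained


def tag_entries(cache_bytes: int, sizes: BlockSizes) -> int:
    """Number of tags needed: one per address block held in the cache."""
    return cache_bytes // sizes.address_bytes


def tag_overhead(cache_bytes: int, sizes: BlockSizes, tag_bits: int) -> float:
    """Tag storage as a fraction of data storage (assumed simple model)."""
    return tag_entries(cache_bytes, sizes) * tag_bits / (cache_bytes * 8)


CACHE_BYTES = 64 * 1024  # assumed cache size
TAG_BITS = 32            # assumed tag width per address block

# Same transfer and coherence units, different address (tag) block sizes.
small = BlockSizes(transfer_bytes=16, address_bytes=16, coherence_bytes=16)
large = BlockSizes(transfer_bytes=16, address_bytes=256, coherence_bytes=16)

print(tag_entries(CACHE_BYTES, small), tag_overhead(CACHE_BYTES, small, TAG_BITS))
print(tag_entries(CACHE_BYTES, large), tag_overhead(CACHE_BYTES, large, TAG_BITS))
```

In this toy model only `address_bytes` affects tag storage, so growing the address block from 16 to 256 bytes cuts the number of tags by a factor of 16 while the transfer and coherence units stay unchanged, which is one way to read the abstract's statement that the three sizes can be chosen independently.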