Virtual memory on data diffusion architectures

Authors:
Jorge Buenabad-Chávez;Henk L. Muller;Paul W. A. Stallard;David H. D. Warren
Affiliations:
Sección de Computación, CINVESTAV, Ap. Postal 14-740, México DF 07360, Mexico;Department of Computer Science, University of Bristol, Bristol BS8 1UB, UK;Department of Computer Science, University of Bristol, Bristol BS8 1UB, UK;Department of Computer Science, University of Bristol, Bristol BS8 1UB, UK
Venue:
Parallel Computing
Year:
2003

Citing 32
Cited 3

A Survey of Microprocessor Architectures for Memory Management

Computer
Machine-independent virtual memory management for paged uniprocessor and multiprocessor architectures

ASPLOS II Proceedings of the second international conference on Architectual support for programming languages and operating systems
801 storage: architecture and programming

ACM Transactions on Computer Systems (TOCS)
Generic virtual memory management for operating system kernels

SOSP '89 Proceedings of the twelfth ACM symposium on Operating systems principles
Low-synchronization translation lookaside buffer consistency in large-scale shared-memory multiprocessors

SOSP '89 Proceedings of the twelfth ACM symposium on Operating systems principles
Translation-Lookaside Buffer Consistency

Computer
The Aurora or-parallel Prolog system

New Generation Computing - Selected papers on parallel logic programming from the International Conference on Fifth Generation Computer Systems, 1988
SPLASH: Stanford parallel applications for shared-memory

ACM SIGARCH Computer Architecture News
DDM: A Cache-Only Memory Architecture

Computer
Architecture support for single address space operating systems

ASPLOS V Proceedings of the fifth international conference on Architectural support for programming languages and operating systems
Wide-address spaces: exploring the design space

ACM SIGOPS Operating Systems Review
Architectural support for translation table management in large address space machines

ISCA '93 Proceedings of the 20th annual international symposium on computer architecture
Scheduling and page migration for multiprocessor compute servers

ASPLOS VI Proceedings of the sixth international conference on Architectural support for programming languages and operating systems
Surpassing the TLB performance of superpages with less operating system support

ASPLOS VI Proceedings of the sixth international conference on Architectural support for programming languages and operating systems
Sharing and protection in a single-address-space operating system

ACM Transactions on Computer Systems (TOCS) - Special issue on computer architecture
Address space sparsity and fine granularity

ACM SIGOPS Operating Systems Review
Load balancing and data locality in adaptive hierarchical N-body methods: Barnes-Hut, fast multipole, and radiosity

Journal of Parallel and Distributed Computing
COMA-F: a non-hierarchical cache only memory architecture

COMA-F: a non-hierarchical cache only memory architecture
A new page table for 64-bit address spaces

SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
On micro-kernel construction

SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
Reducing TLB and memory overhead using online superpage promotion

ISCA '95 Proceedings of the 22nd annual international symposium on Computer architecture
Application and architectural bottlenecks in large scale distributed shared memory machines

ISCA '96 Proceedings of the 23rd annual international symposium on Computer architecture
Operating system support for improving data locality on CC-NUMA compute servers

Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
Increasing TLB reach using superpages backed by shadow memory

Proceedings of the 25th annual international symposium on Computer architecture
Options for dynamic address translation in COMAs

Proceedings of the 25th annual international symposium on Computer architecture
Computer organization and design (2nd ed.): the hardware/software interface

Computer organization and design (2nd ed.): the hardware/software interface
Dynamic storage allocation in the Atlas computer, including an automatic use of a backing store

Communications of the ACM
Microprocessor Memory Management Units

IEEE Micro
Parallel Evaluation of a Parallel Architecture by Means of Calibrated Emulation

Proceedings of the 8th International Symposium on Parallel Processing
Bus-based COMA-reducing traffic in shared-bus multiprocessors

HPCA '96 Proceedings of the 2nd IEEE Symposium on High-Performance Computer Architecture
The Data Diffusion Machine with a Scalable Point-to-Point Network

The Data Diffusion Machine with a Scalable Point-to-Point Network
Shared virtual memory on loosely coupled multiprocessors

Shared virtual memory on loosely coupled multiprocessors

The diffusion space of data diffusion architectures

Parallel Computing
Massively parallel implementation of a fast multipole method for distributed memory machines

Journal of Parallel and Distributed Computing
The data diffusion space for parallel computing in clusters

Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Data diffusion architectures (also known as cache only memory architectures) provide, a shared address space on top of distributed memory. Their distinctive feature is that data diffuses, or migrates and replicates, in main memory according to whichever processors are using the data. This requires an associative organisation of main memory, which decouples each address and its data item from any physical location. A data item can thus be placed and replicated where it is needed. Also, the physical address space does not have to be fixed and contiguous. It can be any set of addresses within the address range of the processors, possibly varying over time, provided it is smaller than the size of main memory. This flexibility is similar to that of a virtual address space, and offers new possibilities to organise a virtual memory system.We present an analysis of possible organisations of virtual memory on such architectures, and propose two main alternatives: traditional virtual memory (TVM) is organised around a fixed and contiguous physical address space using a traditional mapping; associative memory virtual memory (AMVM) is organised around a variable and non-contiguous physical address space using a simpler mapping.To evaluate TVM and AMVM, we extended a multiprocessor emulation of a data diffusion architecture to include part of the Mach operating system virtual memory. This extension implements TVM; a slightly modified version implements AMVM. On applications tested, AMVM shows a marginal performance gain over TVM. We argue that AMVM will offer greater advantages with higher degrees of parallelism or larger data sets.