A model for hierarchical memory

Authors:
A. Aggarwal;B. Alpern;A. Chandra;M. Snir
Affiliations:
IBM T. J. Watson Research Center, P. O. Box 218, Yorktown Heights, New York;IBM T. J. Watson Research Center, P. O. Box 218, Yorktown Heights, New York;IBM T. J. Watson Research Center, P. O. Box 218, Yorktown Heights, New York;IBM T. J. Watson Research Center, P. O. Box 218, Yorktown Heights, New York
Venue:
STOC '87 Proceedings of the nineteenth annual ACM symposium on Theory of computing
Year:
1987

Citing 8
Cited 73

Amortized efficiency of list update and paging rules

Communications of the ACM
Tradeoffs for VLSI models with subpolynomial delay

STOC '85 Proceedings of the seventeenth annual ACM symposium on Theory of computing
The I/O complexity of sorting and related problems

14th International Colloquium on Automata, languages and programming
Virtual Memory

ACM Computing Surveys (CSUR)
Cache Memories

ACM Computing Surveys (CSUR)
The working set model for program behavior

Communications of the ACM
The Design and Analysis of Computer Algorithms

The Design and Analysis of Computer Algorithms
I/O complexity: The red-blue pebble game

STOC '81 Proceedings of the thirteenth annual ACM symposium on Theory of computing

Virtual memory algorithms

STOC '88 Proceedings of the twentieth annual ACM symposium on Theory of computing
The APRAM: incorporating asynchrony into the PRAM model

SPAA '89 Proceedings of the first annual ACM symposium on Parallel algorithms and architectures
Optimal disk I/O with parallel block transfer

STOC '90 Proceedings of the twenty-second annual ACM symposium on Theory of computing
Searching on a Tape

IEEE Transactions on Computers
Large-scale sorting in parallel memories (extended abstract)

SPAA '91 Proceedings of the third annual ACM symposium on Parallel algorithms and architectures
On pointers versus addresses

Journal of the ACM (JACM)
Parallel algorithms column 1: models of computation

ACM SIGACT News
Deterministic distribution sort in shared and distributed memory multiprocessors

SPAA '93 Proceedings of the fifth annual ACM symposium on Parallel algorithms and architectures
Machine models and linear time complexity

ACM SIGACT News
Upper bounds to processor-time tradeoffs under bounded-speed message propagation

Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures
The influence of caches on the performance of heaps

Journal of Experimental Algorithmics (JEA)
&mgr;Database: parallelism in a memory-mapped environment (research summary)

Proceedings of the eighth annual ACM symposium on Parallel algorithms and architectures
Competitive algorithms for multilevel caching and relaxed list update

Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Optimal parallel sorting in multi-level storage

SODA '94 Proceedings of the fifth annual ACM-SIAM symposium on Discrete algorithms
Cache performance analysis of traversals and random accesses

Proceedings of the tenth annual ACM-SIAM symposium on Discrete algorithms
Empirical investigation of the Markov reference model

Proceedings of the tenth annual ACM-SIAM symposium on Discrete algorithms
Towards a theory of cache-efficient algorithms

SODA '00 Proceedings of the eleventh annual ACM-SIAM symposium on Discrete algorithms
Computational power of pipelined memory hierarchies

Proceedings of the thirteenth annual ACM symposium on Parallel algorithms and architectures
External memory algorithms and data structures: dealing with massive data

ACM Computing Surveys (CSUR)
Cache-oblivious priority queue and graph algorithm applications

STOC '02 Proceedings of the thiry-fourth annual ACM symposium on Theory of computing
A locality-preserving cache-oblivious dynamic dictionary

SODA '02 Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms
Optimal organizations for pipelined hierarchical memories

Proceedings of the fourteenth annual ACM symposium on Parallel algorithms and architectures
Towards a theory of cache-efficient algorithms

Journal of the ACM (JACM)
The set-associative cache performance of search trees

SODA '03 Proceedings of the fourteenth annual ACM-SIAM symposium on Discrete algorithms
More on Weighted Servers or FIFO is Better than LRU

MFCS '02 Proceedings of the 27th International Symposium on Mathematical Foundations of Computer Science
A Blocked All-Pairs Shortest-Path Algorithm

SWAT '00 Proceedings of the 7th Scandinavian Workshop on Algorithm Theory
A Characterization of Temporal Locality and Its Portability across Memory Hierarchies

ICALP '01 Proceedings of the 28th International Colloquium on Automata, Languages and Programming,
Seamless Integration of Parallelism and Memory Hierarchy

ICALP '02 Proceedings of the 29th International Colloquium on Automata, Languages and Programming
Designing Practical Efficient Algorithms for Symmetric Multiprocessors

ALENEX '99 Selected papers from the International Workshop on Algorithm Engineering and Experimentation
Efficient Sorting Using Registers and Caches

WAE '00 Proceedings of the 4th International Workshop on Algorithm Engineering
Using PRAM Algorithms on a Uniform-Memory-Access Shared-Memory Architecture

WAE '01 Proceedings of the 5th International Workshop on Algorithm Engineering
Fractal Matrix Multiplication: A Case Study on Portability of Cache Performance

WAE '01 Proceedings of the 5th International Workshop on Algorithm Engineering
On the Space and Access Complexity of Computation DAGs

WG '00 Proceedings of the 26th International Workshop on Graph-Theoretic Concepts in Computer Science
External memory algorithms

Handbook of massive data sets
Cache-Oblivious Algorithms

FOCS '99 Proceedings of the 40th Annual Symposium on Foundations of Computer Science
More on weighted servers or FIFO is better than LRU

Theoretical Computer Science
Proximity Mergesort: optimal in-place sorting in the cache-oblivious model

SODA '04 Proceedings of the fifteenth annual ACM-SIAM symposium on Discrete algorithms
A blocked all-pairs shortest-paths algorithm

Journal of Experimental Algorithmics (JEA)
A locality-preserving cache-oblivious dynamic dictionary

Journal of Algorithms
An Address Dependence Model of Computation for Hierarchical Memories with Pipelined Transfer

IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 8 - Volume 09
Moderately hard, memory-bound functions

ACM Transactions on Internet Technology (TOIT)
Quantifying Locality In The Memory Access Patterns of HPC Applications

SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
A hierarchical model of data locality

Conference record of the 33rd ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Adaptive Strassen and ATLAS's DGEMM: A Fast Square-Matrix Multiply for Modern High-Performance Systems

HPCASIA '05 Proceedings of the Eighth International Conference on High-Performance Computing in Asia-Pacific Region
The memory behavior of cache oblivious stencil computations

The Journal of Supercomputing
Provably good multicore cache performance for divide-and-conquer algorithms

Proceedings of the nineteenth annual ACM-SIAM symposium on Discrete algorithms
Hierarchical memory with block transfer

SFCS '87 Proceedings of the 28th Annual Symposium on Foundations of Computer Science
Combating I-O bottleneck using prefetching: model, algorithms, and ramifications

The Journal of Supercomputing
On the limits of cache-oblivious rational permutations

Theoretical Computer Science
Algorithms and data structures for external memory

Foundations and Trends® in Theoretical Computer Science
A Bridging Model for Multi-core Computing

ESA '08 Proceedings of the 16th annual European symposium on Algorithms
On the space---time of optimal, approximate and streaming algorithms for synopsis construction problems

The VLDB Journal — The International Journal on Very Large Data Bases
A unified model for multicore architectures

IFMT '08 Proceedings of the 1st international forum on Next-generation multicore/manycore technologies
On approximating the ideal random access machine by physical machines

Journal of the ACM (JACM)
psort, Yet Another Fast Stable Sorting Software

SEA '09 Proceedings of the 8th International Symposium on Experimental Algorithms
Cache-optimal algorithms for option pricing

ACM Transactions on Mathematical Software (TOMS)
Evaluating multicore algorithms on the unified memory model

Scientific Programming - Software Development for Multi-core Computing Systems
Algorithms for memory hierarchies: advanced lectures

Algorithms for memory hierarchies: advanced lectures
A competitive online algorithm for the paging problem with "shelf" memory

COCOON'99 Proceedings of the 5th annual international conference on Computing and combinatorics
On the limits of cache-oblivious matrix transposition

TGC'06 Proceedings of the 2nd international conference on Trustworthy global computing
Low depth cache-oblivious algorithms

Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
Searching

Algorithms and theory of computation handbook
A bridging model for multi-core computing

Journal of Computer and System Sciences
Cache-oblivious simulation of parallel programs

IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Management of Multilevel, Multiclient Cache Hierarchies with Application Hints

ACM Transactions on Computer Systems (TOCS)
Efficient stack distance computation for priority replacement policies

Proceedings of the 8th ACM International Conference on Computing Frontiers
Cache-Oblivious Algorithms

ACM Transactions on Algorithms (TALG)
Algorithmic ramifications of prefetching in memory hierarchy

HiPC'06 Proceedings of the 13th international conference on High Performance Computing
Optimizing explicit data transfers for data parallel applications on the cell architecture

ACM Transactions on Architecture and Code Optimization (TACO) - HIPEAC Papers
A memory access model for highly-threaded many-core architectures

Future Generation Computer Systems
Optimal eviction policies for stochastic address traces

Theoretical Computer Science
Computing with Time-Varying Data: Sequential Complexity and Parallel Speed-Up

Theory of Computing Systems
Processor--Time Tradeoffs under Bounded-Speed Message Propagation: Part I, Upper Bounds

Theory of Computing Systems

Quantified Score

Hi-index	0.01

Visualization

Abstract

In this paper we introduce the Hierarchical Memory Model (HMM) of computation. It is intended to model computers with multiple levels in the memory hierarchy. Access to memory location x is assumed to take time ⌈ log x ⌉. Tight lower and upper bounds are given in this model for the time complexity of searching, sorting, matrix multiplication and FFT. Efficient algorithms in this model utilize locality of reference by bringing data into fast memory and using them several times before returning them to slower memory. It is shown that the circuit simulation problem has inherently poor locality of reference. The results are extended to HMM's where memory access time is given by an arbitrary (nondecreasing) function. Tight upper and lower bounds are obtained for HMM's with polynomial memory access time; the algorithms for searching, FFT and matrix multiplication are shown to be optimal for arbitrary memory access time. On-line memory management algorithms for the HMM model are also considered. An algorithm that uses LRU policy at the successive “levels” of the memory hierarchy is shown to be optimal.