A Performance Study on Bounteous Transfer in Multiprocessor Sectored Caches

Authors:
Kuang-Chih Liu;Chung-Ta King
Affiliations:
Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan, 30043, R. O. C. kcliu@cs.nthu.edu.tw;Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan, 30043, R. O. C. king@cs.nthu.edu.tw
Venue:
The Journal of Supercomputing - Special issue: high performance computing systems
Year:
1997

Citing 14
Cited 0

Coherency for multiprocessor virtual address caches

ASPLOS II Proceedings of the second international conference on Architectual support for programming languages and operating systems
Delayed consistency and its effects on the miss rate of parallel programs

Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Adjustable block size coherent caches

ISCA '92 Proceedings of the 19th annual international symposium on Computer architecture
The detection and elimination of useless misses in multiprocessors

ISCA '93 Proceedings of the 20th annual international symposium on computer architecture
Evaluation of release consistent software distributed shared memory on emerging network technology

ISCA '93 Proceedings of the 20th annual international symposium on computer architecture
Cache write policies and performance

ISCA '93 Proceedings of the 20th annual international symposium on computer architecture
Using virtual lines to enhance locality exploitation

ICS '94 Proceedings of the 8th international conference on Supercomputing
Data prefetching for high-performance processors

Data prefetching for high-performance processors
The SPLASH-2 programs: characterization and methodological considerations

ISCA '95 Proceedings of the 22nd annual international symposium on Computer architecture
Advanced Multi-Microprocessor Bus Architectures

Advanced Multi-Microprocessor Bus Architectures
The Augmint multiprocessor simulation toolkit for Intel x86 architectures

ICCD '96 Proceedings of the 1996 International Conference on Computer Design, VLSI in Computers and Processors
On the effectiveness of sectored caches in reducing false sharing misses

ICPADS '97 Proceedings of the 1997 International Conference on Parallel and Distributed Systems
A low-overhead coherence solution for multiprocessors with private cache memories

ISCA '84 Proceedings of the 11th annual international symposium on Computer architecture
Two techniques for improving performance on bus-based multiprocessors

HPCA '95 Proceedings of the 1st IEEE Symposium on High-Performance Computer Architecture

Quantified Score

Hi-index	0.00

Visualization

Abstract

We demonstrate approaches to the static parallelization of loops andrecursions on the example of the polynomial product. Phrased as a loop nest,the polynomial product can be parallelized automatically by applying aspace-time mapping technique based ...