Data buffering: run-time versus compile-time support

Authors:
H. Mulder
Affiliations:
Section Computer Architecture and Digital Systems, Department of Electrical Engineering, Delft University of Technology, PO Box 5031, 2600 AG Delft, The Netherlands
Venue:
ASPLOS III Proceedings of the third international conference on Architectural support for programming languages and operating systems
Year:
1989

Citing 9
Cited 4

Compilers: principles, techniques, and tools

Compilers: principles, techniques, and tools
Global register allocation at link time

SIGPLAN '86 Proceedings of the 1986 SIGPLAN symposium on Compiler construction
And Now a Case for More Complex Instruction Sets

Computer
Lisp on a reduced-instruction-set processor: characterization and optimization

Lisp on a reduced-instruction-set processor: characterization and optimization
Register windows vs. register allocation

PLDI '88 Proceedings of the ACM SIGPLAN 1988 conference on Programming Language design and Implementation
Register allocation for free: The C machine stack cache

ASPLOS I Proceedings of the first international symposium on Architectural support for programming languages and operating systems
RISC I: A Reduced Instruction Set VLSI Computer

ISCA '81 Proceedings of the 8th annual symposium on Computer Architecture
A portable machine-independent global optimizer--design and measurements

A portable machine-independent global optimizer--design and measurements
Reduced instruction set computer architectures for vlsi (microprocessor, risc, multiple-windows - of - registers)

Reduced instruction set computer architectures for vlsi (microprocessor, risc, multiple-windows - of - registers)

Register allocation across procedure and module boundaries

PLDI '90 Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation
Mapping concurrent programs to VLIW processors

PPOPP '91 Proceedings of the third ACM SIGPLAN symposium on Principles and practice of parallel programming
Flexible register management for sequential programs

ISCA '91 Proceedings of the 18th annual international symposium on Computer architecture
Processor Architecture and Data Buffering

IEEE Transactions on Computers

Quantified Score

Hi-index	0.00

Visualization

Abstract

Data-dependency, branch, and memory-access penalties are main constraints on the performance of high-speed microprocessors. The memory-access penalties concern both penalties imposed by external memory (e.g. cache) or by under utilization of the local processor memory (e.g. registers). This paper focuses solely on methods of increasing the utilization of data memory, local to the processor (registers or register-oriented buffers).A utilization increase of local processor memory is possible by means of compile-time software, run-time hardware, or a combination of both. This paper looks at data buffers which perform solely because of the compile-time software (single register sets); those which operate mainly through hardware but with possible software assistance (multiple register sets); and those intended to operate transparently with main memory implying no software assistance whatsoever (stack buffers). This paper shows that hardware buffering schemes cannot replace compile-time effort, but at most can reduce the complexity of this effort. It shows the utility increase of applying register allocation for multiple register sets. The paper also shows a potential utility decrease inherent to stack buffers. The observation that a single register set, allocated by means of interprocedural allocation, performs competitively with both multiple register set and stack buffer emphasizes the significance of the conclusion