A comparison of two pipeline organizations

Authors:
Michael Golden;Trevor Mudge
Affiliations:
Electrical Engineering and Computer Science Department, University of Michigan, 1301 Beal Avenue, Ann Arbor, Michigan;Electrical Engineering and Computer Science Department, University of Michigan, 1301 Beal Avenue, Ann Arbor, Michigan
Venue:
MICRO 27 Proceedings of the 27th annual international symposium on Microarchitecture
Year:
1994

Citing 8
Cited 1

MIPS RISC architectures

MIPS RISC architectures
Alpha architecture reference manual

Alpha architecture reference manual
Performance optimization of pipelined primary cache

ISCA '92 Proceedings of the 19th annual international symposium on Computer architecture
Instruction-level parallel processing: history, overview, and perspective

The Journal of Supercomputing - Special issue on instruction-level parallelism
The superblock: an effective technique for VLIW and superscalar compilation

The Journal of Supercomputing - Special issue on instruction-level parallelism
Designing the TFP Microprocessor

IEEE Micro
PowerPC 601 and Alpha 21064: A Tale of Two RISCs

Computer
A brief survey of papers on scheduling for pipelined processors

ACM SIGPLAN Notices

Streamlining data cache access with fast address calculation

ISCA '95 Proceedings of the 22nd annual international symposium on Computer architecture

Quantified Score

Hi-index	0.00

Visualization

Abstract

We examine two pipeline structures which are employed in commercial microprocessors. The first is the load-use interlock (LUI) pipeline, which employs an interlock to ensure correct operation during load-use hazards. The second is the address-generation interlock (AGI) pipeline. It eliminates the load-use hazard, but has an address-generation hazard which requires an address-generation interlock for correct operation. We compare the performance of these two pipelines on existing binaries and on applications which have been recompiled with a local code scheduler that understands the difference in the pipeline structures. When branch prediction is more than 80% accurate and the data cache access time is greater than two cycles, the AGI pipeline performs significantly better than the LUI pipeline on existing binaries. Recompiling the benchmarks with a new local code scheduler provides little additional performance improvement.