Multithreaded processor architectures

Authors:
Gregory T. Byrd;Mark A. Holliday
Affiliations:
-;-
Venue:
IEEE Spectrum
Year:
1995

Citing 0
Cited 11

Limits on the performance benefits of multithreading and prefetching

Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Analysis of performance bottlenecks in multithreaded multiprocessor systems

Fundamenta Informaticae - Application of concurrency to system design
Effects of Multithreading on Data and Workload Distribution for Distributed-Memory Multiprocessors

IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
A Queuing Model of a Multi-threaded Architecture: A Case Study

PaCT '999 Proceedings of the 5th International Conference on Parallel Computing Technologies
Rapid and low-cost context-switch through embedded processor customization for real-time and control applications

Proceedings of the 43rd annual Design Automation Conference
Nondeterministic Multithreading

IEEE Transactions on Computers
Cross-layer customization for rapid and low-cost task preemption in multitasked embedded systems

ACM Transactions on Embedded Computing Systems (TECS)
Mostly static program partitioning of binary executables

ACM Transactions on Programming Languages and Systems (TOPLAS)
Thread allocation in CMP-based multithreaded network processors

Parallel Computing
Performance limitations of block-multithreaded distributed-memory systems

Winter Simulation Conference
Analysis of Performance Bottlenecks in Multithreaded Multiprocessor Systems

Fundamenta Informaticae - Application of Concurrency to System Design

Quantified Score

Hi-index	0.09

Visualization

Abstract

The authors describe how independent streams of instructions, interwoven on a single processor, fill its otherwise idle cycles and so boost its performance. They detail how such multithreaded architectures take the tack of hiding latency by supporting multiple concurrent streams of execution. When a long-latency operation occurs in one of the threads, another begins execution. In this way, useful work is performed while the time-consuming operation is completed