Branch Prediction Topologies for SMT Architectures

Authors:
Guilherme Dal Pizzol;Philippe O. A. Navaux
Affiliations:
Federal University of Rio Grande do Sul, Brazil;Federal University of Rio Grande do Sul, Brazil
Venue:
SBAC-PAD '05 Proceedings of the 17th International Symposium on Computer Architecture and High Performance Computing
Year:
2005

Citing 0
Cited 1

Accurate branch prediction for short threads

Proceedings of the 13th international conference on Architectural support for programming languages and operating systems

Abstract

The exploitation of instruction-level parallelism in superscalar architectures is limited by data and control dependencies. Simultaneous Multi-Threaded (SMT) architectures can exploit another level of parallelism, called thread-level parallelism, to fetch and execute instructions from different tasks at the same time. While one task is blocked by control or data dependencies, other tasks may continue executing, thus masking the latencies caused by mispredicted branches and memory accesses and increasing the utilization of functional units. However, the design of SMT architectures brings new challenges, such as determining the most efficient way to share resources among different threads. In this paper, we present different branch prediction topologies for SMT architectures. We show that the best results are obtained by matching the number of i-cache modules (fetch width) with the number of branch prediction modules (number of lookups and updates), and that increasing the number of modules also helps increase clock rates. Moreover, contention on the branch prediction lookup and update buses cannot be ignored in such architectures.
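
The contention argument can be illustrated with a toy model. The sketch below is not the authors' simulator. It assumes that each fetching thread issues one predictor lookup per cycle, that each predictor module serves one lookup per cycle, and that thread t is statically mapped to module t % num_modules. The function name, parameters, and mapping are all hypothetical.

```python
from collections import Counter


def stalled_lookups(fetch_width: int, num_modules: int, cycles: int = 1) -> int:
    """Count lookups that cannot be served in the cycle they are issued.

    Assumptions (illustrative only, not taken from the paper):
      - fetch_width threads each issue one lookup per cycle
      - each predictor module has a single lookup port
      - thread t always uses module t % num_modules
    """
    stalled = 0
    for _ in range(cycles):
        # Number of lookups directed at each module in this cycle.
        demand = Counter(t % num_modules for t in range(fetch_width))
        # One lookup per module is served; the others are counted as stalled.
        stalled += sum(n - 1 for n in demand.values())
    return stalled


if __name__ == "__main__":
    for modules in (1, 2, 4):
        print(modules, stalled_lookups(fetch_width=4, num_modules=modules))
```

Under these assumptions, a fetch width of 4 produces stalled lookups every cycle with one or two modules and none once the module count matches the fetch width. The model ignores update traffic and does not carry stalled lookups over to later cycles, so it only hints at the effect the abstract describes.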