Implicit transactional memory in kilo-instruction multiprocessors

Authors:
Marco Galluzzi;Enrique Vallejo;Adrián Cristal;Fernando Vallejo;Ramón Beivide;Per Stenström;James E. Smith;Mateo Valero
Affiliations:
Departament d'Arquitectura de Computadors, Universitat Politècnica de Catalunya;Grupo de Arquitectura de Computadores, Universidad de Cantabria;Barcelona Supercomputing Center;Grupo de Arquitectura de Computadores, Universidad de Cantabria;Grupo de Arquitectura de Computadores, Universidad de Cantabria;Dept. of Computer Science and Engineering, Chalmers University of Technology;Dept. of Electrical and Computer Engineering, University of Wisconsin-Madison;Departament d'Arquitectura de Computadors, Universitat Politècnica de Catalunya and Barcelona Supercomputing Center
Venue:
ACSAC'07 Proceedings of the 12th Asia-Pacific conference on Advances in Computer Systems Architecture
Year:
2007

Citing 22
Cited 2

Transactional memory: architectural support for lock-free data structures

ISCA '93 Proceedings of the 20th annual international symposium on computer architecture
Combined performance gains of simple cache protocol extensions

ISCA '94 Proceedings of the 21st annual international symposium on Computer architecture
Internal organization of the Alpha 21164, a 300-MHz 64-bit quad-issue CMOS RISC microprocessor

Digital Technical Journal - Special 10th anniversary issue
Using speculative retirement and larger instruction windows to narrow the performance gap between memory consistency models

Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures
Is SC + ILP = RC?

ISCA '99 Proceedings of the 26th annual international symposium on Computer architecture
Delaying physical register allocation through virtual-physical registers

Proceedings of the 32nd annual ACM/IEEE international symposium on Microarchitecture
Memory consistency and event ordering in scalable shared-memory multiprocessors

ISCA '90 Proceedings of the 17th annual international symposium on Computer Architecture
Transactional lock-free execution of lock-based programs

Proceedings of the 10th international conference on Architectural support for programming languages and operating systems
Speculative synchronization: applying thread-level speculation to explicitly parallel applications

Proceedings of the 10th international conference on Architectural support for programming languages and operating systems
Temporally silent stores

Proceedings of the 10th international conference on Architectural support for programming languages and operating systems
Multiprocessors Should Support Simple Memory-Consistency Models

Computer
Simics: A Full System Simulation Platform

Computer
Speculative Lock Reordering: Optimistic Out-of-Order Execution of Critical Sections

IPDPS '03 Proceedings of the 17th International Symposium on Parallel and Distributed Processing
Reducing Design Complexity of the Load/Store Queue

Proceedings of the 36th annual IEEE/ACM International Symposium on Microarchitecture
A first glance at Kilo-instruction based multiprocessors

Proceedings of the 1st conference on Computing frontiers
Transactional Memory Coherence and Consistency

Proceedings of the 31st annual international symposium on Computer architecture
Continual flow pipelines

ASPLOS XI Proceedings of the 11th international conference on Architectural support for programming languages and operating systems
Scalable Load and Store Processing in Latency Tolerant Processors

Proceedings of the 32nd annual international symposium on Computer Architecture
Out-of-Order Commit Processors

HPCA '04 Proceedings of the 10th International Symposium on High Performance Computer Architecture
Kilo-Instruction Processors: Overcoming the Memory Wall

IEEE Micro
Cherry-MP: Correctly Integrating Checkpointed Early Resource Recycling in Chip Multiprocessors

Proceedings of the 38th annual IEEE/ACM International Symposium on Microarchitecture
A case for resource-conscious out-of-order processors

IEEE Computer Architecture Letters

InvisiFence: performance-transparent memory ordering in conventional multiprocessors

Proceedings of the 36th annual international symposium on Computer architecture
Efficient sequential consistency via conflict ordering

ASPLOS XVII Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Although they have been the main server technology for many years, multiprocessors are undergoing a renaissance due to multi-core chips and the attractive scalability properties of combining a number of such multi-core chips into a system. The widespread use of multiprocessor systems will make performance losses due to consistency models and synchronization styles of popular programming models even more evident than they already are. Known architectural approaches to combat these losses are generally too complex, too specialized, or not transparent to software. In this article, we introduce implicit transactional memory as a generalized architectural concept to remove unnecessary performance losses caused by consistency models and synchronization styles. We show how the concept of implicit transactions can be implemented with low complexity by leveraging the multicheckpoint mechanism of the Kilo-Instruction Processor. By relying on a general speculation substrate, this method supports even the strictest consistency model - sequential consistency - potentially as effectively as weaker models and it allows multiple threads to speculatively execute critical sections, beyond barriers and event synchronizations.