McRT-STM: a high performance software transactional memory system for a multi-core runtime

Authors:
Bratin Saha;Ali-Reza Adl-Tabatabai;Richard L. Hudson;Chi Cao Minh;Benjamin Hertzberg
Affiliations:
Intel Corporation;Intel Corporation;Intel Corporation;Stanford University, Palo Alto, California;Stanford University, Palo Alto, California
Venue:
Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming
Year:
2006

Citing 21
Cited 142

Simple generational garbage collection and fast allocation

Software—Practice & Experience
A comparative performance evaluation of write barrier implementation

OOPSLA '92 conference proceedings on Object-oriented programming systems, languages, and applications
Transactional memory: architectural support for lock-free data structures

ISCA '93 Proceedings of the 20th annual international symposium on computer architecture
Software transactional memory

Proceedings of the fourteenth annual ACM symposium on Principles of distributed computing
Hoard: a scalable memory allocator for multithreaded applications

ASPLOS IX Proceedings of the ninth international conference on Architectural support for programming languages and operating systems
Safe memory reclamation for dynamic lock-free objects using atomic reads and writes

Proceedings of the twenty-first annual symposium on Principles of distributed computing
Transaction Processing: Concepts and Techniques

Transaction Processing: Concepts and Techniques
Multiple Reservations and the Oklahoma Update

IEEE Parallel & Distributed Technology: Systems & Technology
A Practical Multi-word Compare-and-Swap Operation

DISC '02 Proceedings of the 16th International Conference on Distributed Computing
Software transactional memory for dynamic-sized data structures

Proceedings of the twenty-second annual symposium on Principles of distributed computing
Language support for lightweight transactions

OOPSLA '03 Proceedings of the 18th annual ACM SIGPLAN conference on Object-oriented programing, systems, languages, and applications
Programming with transactional coherence and consistency (TCC)

ASPLOS XI Proceedings of the 11th international conference on Architectural support for programming languages and operating systems
The Open Runtime Platform: a flexible high-performance managed runtime environment: Research Articles

Concurrency and Computation: Practice & Experience - 2002 ACM Java Grande—ISCOPE Conference Part I
Composable memory transactions

Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming
Revocable locks for non-blocking programming

Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming
Design tradeoffs in modern software transactional memory systems

LCR '04 Proceedings of the 7th workshop on Workshop on languages, compilers, and run-time support for scalable systems
Virtualizing Transactional Memory

Proceedings of the 32nd annual international symposium on Computer Architecture
Advanced contention management for dynamic software transactional memory

Proceedings of the twenty-fourth annual ACM symposium on Principles of distributed computing
Multi-Core to the Masses

Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques
X10: an object-oriented approach to non-uniform cluster computing

OOPSLA '05 Proceedings of the 20th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
Compiler and runtime support for efficient software transactional memory

Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation

McRT-Malloc: a scalable transactional memory allocator

Proceedings of the 5th international symposium on Memory management
The Atomos transactional programming language

Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation
Compiler and runtime support for efficient software transactional memory

Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation
Architectural Semantics for Practical Transactional Memory

Proceedings of the 33rd annual international symposium on Computer Architecture
A flexible framework for implementing software transactional memory

Proceedings of the 21st annual ACM SIGPLAN conference on Object-oriented programming systems, languages, and applications
Unlocking Concurrency

Queue - Computer Architecture
Architectural Support for Software Transactional Memory

Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture
A practical FPGA-based framework for novel CMP research

Proceedings of the 2007 ACM/SIGDA 15th international symposium on Field programmable gate arrays
Open nesting in software transactional memory

Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming
Nonblocking transactions without indirection using alert-on-update

Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures
Time-based transactional memory with scalable time bases

Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures
An effective hybrid transactional memory system with strong isolation guarantees

Proceedings of the 34th annual international symposium on Computer architecture
Performance pathologies in hardware transactional memory

Proceedings of the 34th annual international symposium on Computer architecture
An integrated hardware-software approach to flexible transactional memory

Proceedings of the 34th annual international symposium on Computer architecture
Enforcing isolation and ordering in STM

Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation
Understanding Tradeoffs in Software Transactional Memory

Proceedings of the International Symposium on Code Generation and Optimization
Code Generation and Optimization for Transactional Memory Constructs in an Unmanaged Language

Proceedings of the International Symposium on Code Generation and Optimization
ATLAS: a chip-multiprocessor with transactional memory support

Proceedings of the conference on Design, automation and test in Europe
Enabling scalability and performance in a large scale CMP environment

Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Semantics of transactional memory and automatic mutual exclusion

Proceedings of the 35th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Modeling optimistic concurrency using quantitative dependence analysis

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Transactional boosting: a methodology for highly-concurrent transactional objects

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Concurrent GC leveraging transactional memory

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Toward high performance nonblocking software transactional memory

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Dynamic performance tuning of word-based software transactional memory

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Software transactional memory for large scale clusters

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Practical experiences with Java software transactional memory

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Optimistic parallelism benefits from data partitioning

Proceedings of the 13th international conference on Architectural support for programming languages and operating systems
Streamware: programming general-purpose multicore processors using streams

Proceedings of the 13th international conference on Architectural support for programming languages and operating systems
Concurrency control with data coloring

Proceedings of the 2008 ACM SIGPLAN workshop on Memory systems performance and correctness: held in conjunction with the Thirteenth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '08)
The potential for variable-granularity access tracking for optimistic parallelism

Proceedings of the 2008 ACM SIGPLAN workshop on Memory systems performance and correctness: held in conjunction with the Thirteenth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '08)
Distributed computing and the multicore revolution

ACM SIGACT News
Transactional memory

Communications of the ACM - Web science
Inferring locks for atomic sections

Proceedings of the 2008 ACM SIGPLAN conference on Programming language design and implementation
Against lock-based semantics for transactional memory

Proceedings of the twentieth annual symposium on Parallelism in algorithms and architectures
Kicking the tires of software transactional memory: why the going gets tough

Proceedings of the twentieth annual symposium on Parallelism in algorithms and architectures
RingSTM: scalable transactions with a single atomic instruction

Proceedings of the twentieth annual symposium on Parallelism in algorithms and architectures
Irrevocable transactions and their applications

Proceedings of the twentieth annual symposium on Parallelism in algorithms and architectures
Dreadlocks: efficient deadlock detection

Proceedings of the twentieth annual symposium on Parallelism in algorithms and architectures
Commit phase in timestamp-based stm

Proceedings of the twentieth annual symposium on Parallelism in algorithms and architectures
Flexible Decoupled Transactional Memory Support

ISCA '08 Proceedings of the 35th Annual International Symposium on Computer Architecture
Software transactional memory: why is it only a research toy?

Communications of the ACM - Remembering Jim Gray
CAR-STM: scheduling-based collision avoidance and resolution for software transactional memory

Proceedings of the twenty-seventh ACM symposium on Principles of distributed computing
Lee-TM: A Non-trivial Benchmark Suite for Transactional Memory

ICA3PP '08 Proceedings of the 8th international conference on Algorithms and Architectures for Parallel Processing
Maintaining Consistent Transactional States without a Global Clock

SIROCCO '08 Proceedings of the 15th international colloquium on Structural Information and Communication Complexity
A Uniform Transactional Execution Environment for Java

ECOOP '08 Proceedings of the 22nd European conference on Object-Oriented Programming
A Model of Dynamic Separation for Transactional Memory

CONCUR '08 Proceedings of the 19th international conference on Concurrency Theory
Pillar: A Parallel Implementation Language

Languages and Compilers for Parallel Computing
Design and implementation of transactional constructs for C/C++

Proceedings of the 23rd ACM SIGPLAN conference on Object-oriented programming systems languages and applications
Software Transactional Memory: Why Is It Only a Research Toy?

Queue - The Concurrency Problem
Feedback-directed barrier optimization in a strongly isolated STM

Proceedings of the 36th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages
xCalls: safe I/O in memory transactions

Proceedings of the 4th ACM European conference on Computer systems
An analytic framework for performance modeling of software transactional memory

Computer Networks: The International Journal of Computer and Telecommunications Networking
Implementation and Use of Transactional Memory with Dynamic Separation

CC '09 Proceedings of the 18th International Conference on Compiler Construction: Held as Part of the Joint European Conferences on Theory and Practice of Software, ETAPS 2009
Adaptive Read Validation in Time-Based Software Transactional Memory

Euro-Par 2008 Workshops - Parallel Processing
Modeling software transactional memory with AnyLogic

Proceedings of the 2nd International Conference on Simulation Tools and Techniques
QuakeTM: parallelizing a complex sequential application using transactional memory

Proceedings of the 23rd international conference on Supercomputing
Software transactional memory for multicore embedded systems

Proceedings of the 2009 ACM SIGPLAN/SIGBED conference on Languages, compilers, and tools for embedded systems
Stretching transactional memory

Proceedings of the 2009 ACM SIGPLAN conference on Programming language design and implementation
Reducing Memory Ordering Overheads in Software Transactional Memory

Proceedings of the 7th annual IEEE/ACM International Symposium on Code Generation and Optimization
An Analytic Model for Optimistic STM with Lazy Locking

ASMTA '09 Proceedings of the 16th International Conference on Analytical and Stochastic Modeling Techniques and Applications
Software Transactional Memory on Relaxed Memory Models

CAV '09 Proceedings of the 21st International Conference on Computer Aided Verification
NZTM: nonblocking zero-indirection transactional memory

Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures
Optimizing transactions for captured memory

Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures
A lightweight in-place implementation for software thread-level speculation

Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures
Partial memoization of concurrency and communication

Proceedings of the 14th ACM SIGPLAN international conference on Functional programming
Certifying concurrent programs using transactional memory

Journal of Computer Science and Technology
On the energy-efficiency of software transactional memory

Proceedings of the 22nd Annual Symposium on Integrated Circuits and System Design: Chip on the Dunes
NePaLTM: Design and Implementation of Nested Parallelism for Transactional Memory Systems

Genoa Proceedings of the 23rd European Conference on ECOOP 2009 --- Object-Oriented Programming
Reducing Rollbacks of Transactional Memory Using Ordered Shared Locks

Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Structure-driven optimizations for amorphous data-parallel programs

Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
Scheduling support for transactional memory contention management

Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
A practical concurrent binary search tree

Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
TMBean: Optimistic Concurrency in Application Servers Using Transactional Memory

OTM '09 Proceedings of the Confederated International Conferences, CoopIS, DOA, IS, and ODBASE 2009 on On the Move to Meaningful Internet Systems: Part I
On the Impact of Serializing Contention Management on STM Performance

OPODIS '09 Proceedings of the 13th International Conference on Principles of Distributed Systems
Coarse-grained transactions

Proceedings of the 37th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Dynamic filtering: multi-purpose architecture support for language runtime systems

Proceedings of the fifteenth edition of ASPLOS on Architectural support for programming languages and operating systems
From lock to correct and efficient software transactional memory

Proceedings of the 2010 Workshop on Interaction between Compilers and Computer Architecture
Evaluation of AMD's advanced synchronization facility within a complete transactional memory stack

Proceedings of the 5th European conference on Computer systems
Transactional memory support for scalable and transparent parallelization of multiplayer games

Proceedings of the 5th European conference on Computer systems
An efficient software transactional memory using commit-time invalidation

Proceedings of the 8th annual IEEE/ACM international symposium on Code generation and optimization
Language support and compiler optimizations for STM and transactional boosting

ICDCIT'07 Proceedings of the 4th international conference on Distributed computing and internet technology
Exploring data reusing of failed transaction

APPT'07 Proceedings of the 7th international conference on Advanced parallel processing technologies
Supporting speculative parallelization in the presence of dynamic data structures

PLDI '10 Proceedings of the 2010 ACM SIGPLAN conference on Programming language design and implementation
Making nested parallel transactions practical using lightweight hardware support

Proceedings of the 24th ACM International Conference on Supercomputing
Implementing and evaluating nested parallel transactions in software transactional memory

Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
TLRW: return of the read-write lock

Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
Transactional predication: high-performance concurrent sets and maps for STM

Proceedings of the 29th ACM SIGACT-SIGOPS symposium on Principles of distributed computing
Hardware transactional memory: A high performance parallel programming model

Journal of Systems Architecture: the EUROMICRO Journal
A model of dynamic separation for transactional memory

Information and Computation
Transactional memory

Journal of Parallel and Distributed Computing
Implementation tradeoffs in the design of flexible transactional memory support

Journal of Parallel and Distributed Computing
Avoiding deadlock avoidance

Proceedings of the 19th international conference on Parallel architectures and compilation techniques
Concurrency by modularity: design patterns, a case in point

Proceedings of the ACM international conference on Object oriented programming systems languages and applications
LV*: a class of lazy versioning HTMs for low-cost integration of transactional memory systems

Proceedings of the Second International Forum on Next-Generation Multicore/Manycore Technologies
Semantics of transactional memory and automatic mutual exclusion

ACM Transactions on Programming Languages and Systems (TOPLAS)
Lock-free and scalable multi-version software transactional memory

Proceedings of the 16th ACM symposium on Principles and practice of parallel programming
Algorithms for optimally arranging multicore memory structures

EURASIP Journal on Embedded Systems
Single-version STMs can be multi-version permissive

ICDCN'11 Proceedings of the 12th international conference on Distributed computing and networking
Formal reasoning about lazy-STM programs

Journal of Computer Science and Technology
Hardware acceleration of transactional memory on commodity systems

Proceedings of the sixteenth international conference on Architectural support for programming languages and operating systems
Hybrid NOrec: a case study in the effectiveness of best effort hardware transactional memory

Proceedings of the sixteenth international conference on Architectural support for programming languages and operating systems
NV-Heaps: making persistent objects fast and safe with next-generation, non-volatile memories

Proceedings of the sixteenth international conference on Architectural support for programming languages and operating systems
RMS-TM: a comprehensive benchmark suite for transactional memory systems

Proceedings of the 2nd ACM/SPEC International Conference on Performance engineering
Compiler-assisted selection of a software transactional memory system

ARCS'11 Proceedings of the 24th international conference on Architecture of computing systems
Efficient partial roll-backing mechanism for transactional memory systems

Transactions on high-performance embedded architectures and compilers III
Proving isolation properties for software transactional memory

ESOP'11/ETAPS'11 Proceedings of the 20th European conference on Programming languages and systems: part of the joint European conferences on theory and practice of software
Coping with context switches in lock-based software transactional memory

Proceedings of the 4th Annual International Conference on Systems and Storage
A study of transactional memory vs. locks in practice

Proceedings of the twenty-third annual ACM symposium on Parallelism in algorithms and architectures
Composable, nestable, pessimistic atomic statements

Proceedings of the 2011 ACM international conference on Object oriented programming systems languages and applications
LUTS: a lightweight user-level transaction scheduler

ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part I
Verification of STM on relaxed memory models

Formal Methods in System Design
Conflict detection and validation strategies for software transactional memory

DISC'06 Proceedings of the 20th international conference on Distributed Computing
Transactional locking II

DISC'06 Proceedings of the 20th international conference on Distributed Computing
A lazy snapshot algorithm with eager validation

DISC'06 Proceedings of the 20th international conference on Distributed Computing
AGC: adaptive global clock in software transactional memory

Proceedings of the 2012 International Workshop on Programming Models and Applications for Multicores and Manycores
Reducing false aborts in STM systems

ICA3PP'10 Proceedings of the 10th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
Applying transactional memory to concurrency bugs

ASPLOS XVII Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems
STM in the small: trading generality for performance in software transactional memory

Proceedings of the 7th ACM european conference on Computer Systems
TM2C: a software transactional memory for many-cores

Proceedings of the 7th ACM european conference on Computer Systems
Software transactional memory validation – time and space considerations

Transactions on High-Performance Embedded Architectures and Compilers IV
Improving performance by reducing aborts in hardware transactional memory

HiPEAC'10 Proceedings of the 5th international conference on High Performance Embedded Architectures and Compilers
Efficient transaction nesting in hardware transactional memory

ARCS'10 Proceedings of the 23rd international conference on Architecture of Computing Systems
On the impact of serializing contention management on STM performance

Journal of Parallel and Distributed Computing
STM concurrency control for embedded real-time software with tighter time bounds

Proceedings of the 49th Annual Design Automation Conference
STM concurrency control for multicore embedded real-time software: time bounds and tradeoffs

Proceedings of the 27th Annual ACM Symposium on Applied Computing
Compiler support for fine-grain software-only checkpointing

CC'12 Proceedings of the 21st international conference on Compiler Construction
Delegation and nesting in best-effort hardware transactional memory

Proceedings of the twenty-fourth annual ACM symposium on Parallelism in algorithms and architectures
HydraVM: extracting parallelism from legacy sequential code using STM

HotPar'12 Proceedings of the 4th USENIX conference on Hot Topics in Parallelism
Subobject transactional memory

COORDINATION'12 Proceedings of the 14th international conference on Coordination Models and Languages
Capturing transactional memory application's behavior --- the prerequisite for performance analysis

MSEPT'12 Proceedings of the 2012 international conference on Multicore Software Engineering, Performance, and Tools
Evaluation of Blue Gene/Q hardware support for transactional memories

Proceedings of the 21st international conference on Parallel architectures and compilation techniques
Sandboxing transactional memory

Proceedings of the 21st international conference on Parallel architectures and compilation techniques
Towards a software transactional memory for graphics processors

EG PGV'10 Proceedings of the 10th Eurographics conference on Parallel Graphics and Visualization
A transactional runtime system for the Cell/BE architecture

Journal of Parallel and Distributed Computing
What scientific applications can benefit from hardware transactional memory?

SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Optimizing software runtime systems for speculative parallelization

ACM Transactions on Architecture and Code Optimization (TACO) - Special Issue on High-Performance Embedded Architectures and Compilers
Restricted admission control in view-oriented transactional memory

The Journal of Supercomputing
Improving performance of software transactional memory through contention locality

The Journal of Supercomputing
Verifying safety and liveness for the FlexTM hybrid transactional memory

Proceedings of the Conference on Design, Automation and Test in Europe
FBLT: a real-time contention manager with improved schedulability

Proceedings of the Conference on Design, Automation and Test in Europe
Transactionalizing legacy code: an experience report using GCC and Memcached

Proceedings of the 19th international conference on Architectural support for programming languages and operating systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Applications need to become more concurrent to take advantage of the increased computational power provided by chip level multiprocessing. Programmers have traditionally managed this concurrency using locks (mutex based synchronization). Unfortunately, lock based synchronization often leads to deadlocks, makes fine-grained synchronization difficult, hinders composition of atomic primitives, and provides no support for error recovery. Transactions avoid many of these problems, and therefore, promise to ease concurrent programming.We describe a software transactional memory (STM) system that is part of McRT, an experimental Multi-Core RunTime. The McRT-STM implementation uses a number of novel algorithms, and supports advanced features such as nested transactions with partial aborts, conditional signaling within a transaction, and object based conflict detection for C/C++ applications. The McRT-STM exports interfaces that can be used from C/C++ programs directly or as a target for compilers translating higher level linguistic constructs.We present a detailed performance analysis of various STM design tradeoffs such as pessimistic versus optimistic concurrency, undo logging versus write buffering, and cache line based versus object based conflict detection. We also show a MCAS implementation that works on arbitrary values, coexists with the STM, and can be used as a more efficient form of transactional memory. To provide a baseline we compare the performance of the STM with that of fine-grained and coarse-grained locking using a number of concurrent data structures on a 16-processor SMP system. We also show our STM performance on a non-synthetic workload -- the Linux sendmail application.