A transactional memory with automatic performance tuning

Authors:
Qingping Wang;Sameer Kulkarni;John Cavazos;Michael Spear
Affiliations:
Lehigh University;University of Delaware;University of Delaware;Lehigh University
Venue:
ACM Transactions on Architecture and Code Optimization (TACO) - HIPEAC Papers
Year:
2012

Citing 39
Cited 3

Case-based reasoning: foundational issues, methodological variations, and system approaches

AI Communications
Evolving neural networks through augmenting topologies

Evolutionary Computation
A Machine Learning Approach to Automatic Production of Compiler Heuristics

AIMSA '02 Proceedings of the 10th International Conference on Artificial Intelligence: Methodology, Systems, and Applications
A comparison of empirical and model-driven optimization

PLDI '03 Proceedings of the ACM SIGPLAN 2003 conference on Programming language design and implementation
Meta optimization: improving compiler heuristics with machine learning

PLDI '03 Proceedings of the ACM SIGPLAN 2003 conference on Programming language design and implementation
Language support for lightweight transactions

OOPSLA '03 Proceedings of the 18th annual ACM SIGPLAN conference on Object-oriented programing, systems, languages, and applications
A Dynamically Tuned Sorting Library

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
Fast searches for effective optimization phase sequences

Proceedings of the ACM SIGPLAN 2004 conference on Programming language design and implementation
Locality phase prediction

ASPLOS XI Proceedings of the 11th international conference on Architectural support for programming languages and operating systems
Optimizing Sorting with Genetic Algorithms

Proceedings of the international symposium on Code generation and optimization
Predicting Unroll Factors Using Supervised Classification

Proceedings of the international symposium on Code generation and optimization
ACME: adaptive compilation made efficient

LCTES '05 Proceedings of the 2005 ACM SIGPLAN/SIGBED conference on Languages, compilers, and tools for embedded systems
Automatic Tuning of Inlining Heuristics

SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Selecting Software Phase Markers with Code Structure Analysis

Proceedings of the International Symposium on Code Generation and Optimization
Using Machine Learning to Focus Iterative Optimization

Proceedings of the International Symposium on Code Generation and Optimization
Optimizing memory transactions

Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation
Method-specific dynamic compilation using logistic regression

Proceedings of the 21st annual ACM SIGPLAN conference on Object-oriented programming systems, languages, and applications
Dynamic performance tuning of word-based software transactional memory

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Iterative optimization in the polyhedral model: part ii, multidimensional time

Proceedings of the 2008 ACM SIGPLAN conference on Programming language design and implementation
RingSTM: scalable transactions with a single atomic instruction

Proceedings of the twentieth annual symposium on Parallelism in algorithms and architectures
Irrevocable transactions and their applications

Proceedings of the twentieth annual symposium on Parallelism in algorithms and architectures
Practical weak-atomicity semantics for java stm

Proceedings of the twentieth annual symposium on Parallelism in algorithms and architectures
Lee-TM: A Non-trivial Benchmark Suite for Transactional Memory

ICA3PP '08 Proceedings of the 8th international conference on Algorithms and Architectures for Parallel Processing
Design and implementation of transactional constructs for C/C++

Proceedings of the 23rd ACM SIGPLAN conference on Object-oriented programming systems languages and applications
How much parallelism is there in irregular applications?

Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming
Reducing Memory Ordering Overheads in Software Transactional Memory

Proceedings of the 7th annual IEEE/ACM International Symposium on Code Generation and Optimization
Taking the heat off transactions: Dynamic selection of pessimistic concurrency control

IPDPS '09 Proceedings of the 2009 IEEE International Symposium on Parallel&Distributed Processing
Adaptive Locks: Combining Transactions and Locks for Efficient Concurrency

PACT '09 Proceedings of the 2009 18th International Conference on Parallel Architectures and Compilation Techniques
NOrec: streamlining STM by abolishing ownership records

Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
Signatures in transactional memory systems

Signatures in transactional memory systems
An efficient software transactional memory using commit-time invalidation

Proceedings of the 8th annual IEEE/ACM international symposium on Code generation and optimization
Lightweight, robust adaptivity for software transactional memory

Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
TLRW: return of the read-write lock

Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
Transactional Memory, 2nd Edition

Transactional Memory, 2nd Edition
Eigenbench: A simple exploration tool for orthogonal TM characteristics

IISWC '10 Proceedings of the IEEE International Symposium on Workload Characterization (IISWC'10)
Hybrid NOrec: a case study in the effectiveness of best effort hardware transactional memory

Proceedings of the sixteenth international conference on Architectural support for programming languages and operating systems
Optimizing hybrid transactional memory: the importance of nonspeculative operations

Proceedings of the twenty-third annual ACM symposium on Parallelism in algorithms and architectures
Transactional locking II

DISC'06 Proceedings of the 20th international conference on Distributed Computing
Adaptive software transactional memory

DISC'05 Proceedings of the 19th international conference on Distributed Computing

Visualizing transactional memory

Proceedings of the 21st international conference on Parallel architectures and compilation techniques
Tightfit: adaptive parallelization with foresight

Proceedings of the 2013 9th Joint Meeting on Foundations of Software Engineering
Performance evaluation of View-Oriented Transactional Memory

Parallel Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

A significant obstacle to the acceptance of transactional memory (TM) in real-world parallel programs is the abundance of substantially different TM algorithms. Each TM algorithm appears well-suited to certain workload characteristics, but the best choice of algorithm is sensitive to program inputs, available cores, and program phases. Furthermore, operating system and hardware characteristics can affect which algorithm is best, with tradeoffs changing across iterations of a single ISA. This paper introduces methods for constructing policies to dynamically select the most appropriate TM algorithm based on static and dynamic information. We leverage intraprocedural static analysis to create a static profile of the application. We also introduce a low-overhead framework for dynamic profiling of a running transactional application. Armed with these complementary descriptions of a program's behavior, we present novel expert adaptivity policies as well as machine learning policies that are trained off-line using simple microbenchmarks. In our evaluation, we find that both the expert and learned policies provide better performance than any single TM algorithm across the entire STAMP benchmark suite. In addition, policies that combine expert and learned policies offer the best combination of performance, maintainability, and flexibility.