Reconstructing hardware transactional memory for workload optimized systems

Authors:
Kunal Korgaonkar;Prabhat Jain;Deepak Tomar;Kashyap Garimella;Veezhinathan Kamakoti
Affiliations:
Indian Institute of Technology Madras, Chennai, India;Indian Institute of Technology Madras, Chennai, India;Indian Institute of Technology Madras, Chennai, India;Indian Institute of Technology Madras, Chennai, India;Indian Institute of Technology Madras, Chennai, India
Venue:
APPT'11 Proceedings of the 9th international conference on Advanced parallel processing technologies
Year:
2011

Abstract

Workload optimized systems, which consist of a large number of general and special purpose cores and support shared memory programming, are slowly becoming prevalent. One of the major impediments to effective parallel programming on these systems is lock-based synchronization. An alternative synchronization solution called Transactional Memory (TM) is currently being explored. We observe that most of the TM design proposals in the literature are tailored to the constraints of general purpose computing platforms. Given that workload optimized systems utilize wider hardware design spaces and on-chip parallelism, we argue that Hardware Transactional Memory (HTM) can be a suitable implementation choice for these systems. We re-evaluate the criteria to be satisfied by an HTM and identify possible scope for relaxations in the context of workload optimized systems. Based on the relaxed criteria, we demonstrate the scope for building HTM design variants such that each variant caters to a specific workload requirement. We carry out suitable experiments to bring out the trade-offs between the design variants. Overall, we show how knowledge about the workload is extremely useful for making appropriate design choices in a workload optimized HTM.