Performance modeling and optimization of parallel out-of-core tensor contractions

  • Authors:
  • Xiaoyang Gao;Swarup Kumar Sahoo;Chi-Chung Lam;J. Ramanujam;Qingda Lu;Gerald Baumgartner;P. Sadayappan

  • Affiliations:
  • The Ohio State University;The Ohio State University;The Ohio State University;Louisiana State University;The Ohio State University;Louisiana State University;The Ohio State University

  • Venue:
  • Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

The Tensor Contraction Engine (TCE) is a domain-specific compiler for implementing complex tensor contraction expressions arising in quantum chemistry applications modeling electronic structure. This paper develops a performance model for tensor contractions, considering both disk I/O as well as inter-processor communication costs, to facilitate performance-model driven loop optimization for this domain. Experimental results are provided that demonstrate the accuracy and effectiveness of the model.