FFTs in external or hierarchical memory
The Journal of Supercomputing
Computational frameworks for the fast Fourier transform
Computational frameworks for the fast Fourier transform
Hi-index | 0.00 |
In this paper a new approach is presented in order to overlap all communication intensive steps appearing in the four-step FFT algorithm--initial data distribution, matrix transpose, and final data collection--with computation. The presented method is based on a Kronecker product factorization of the four-step FFT algorithm.