David A. Carlson
Efficient 2D FFT implementation on mediaprocessors
Parallel Computing
A transpose-free in-place SIMD optimized FFT
ACM Transactions on Architecture and Code Optimization (TACO)