Parallel Implementation of Multidimensional Transforms without Interprocessor Communication
IEEE Transactions on Computers
Computation on Programmable Graphics Hardware
IEEE Computer Graphics and Applications
Radiative Heat Transfer Simulation Using Programmable Graphics Hardware
ICIS-COMSAR '06 Proceedings of the 5th IEEE/ACIS International Conference on Computer and Information Science and 1st IEEE/ACIS International Workshop on Component-Based Software Engineering,Software Architecture and Reuse
Streaming Algorithms for Biological Sequence Alignment on GPUs
IEEE Transactions on Parallel and Distributed Systems
Cache-efficient numerical algorithms using graphics hardware
Parallel Computing
Editorial: Special issue: General-purpose processing using graphics processing units
Journal of Parallel and Distributed Computing
High performance discrete Fourier transforms on graphics processors
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Exploiting the Power of GPUs for Asymmetric Cryptography
CHES '08 Proceeding sof the 10th international workshop on Cryptographic Hardware and Embedded Systems
Mars: a MapReduce framework on graphics processors
Proceedings of the 17th international conference on Parallel architectures and compilation techniques
Computer
Memory Locality Exploitation Strategies for FFT on the CUDA Architecture
High Performance Computing for Computational Science - VECPAR 2008
Experiences with Mapping Non-linear Memory Access Patterns into GPUs
ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I
Improving Performance of Matrix Multiplication and FFT on GPU
ICPADS '09 Proceedings of the 2009 15th International Conference on Parallel and Distributed Systems
Large-scale FFT on GPU clusters
Proceedings of the 24th ACM International Conference on Supercomputing
IEEE Wireless Communications
Hi-index | 0.00 |
The next generation Graphics Processing Units (GPUs) are being considered for non-graphics applications. Millimeter wave (60Ghz) wireless networks that are capable of multi-gigabit per second (Gbps) transfer rates require a significant baseband throughput. In this work, we consider the baseband of WirelessHD, a 60GHz communications system, which can provide a data rate of up to 3.8Gbps over a short range wireless link. Thus, we explore the feasibility of achieving gigabit baseband throughput using the GPUs. One of the most computationally intensive functions commonly used in baseband communications, the Fast Fourier Transform (FFT) algorithm, is implemented on an NVIDIA GPU using their general-purpose computing platform called the Compute Unified Device Architecture (CUDA). The paper, first, investigates the implementation of an FFT algorithm using the GPU hardware and exploiting the computational capability available. It then outlines the limitations discovered and the methods used to overcome these challenges. Finally a new algorithm to compute FFT is proposed, which reduces interprocessor communication. It is further optimized by improving memory access, enabling the processing rate to exceed 4Gbps, achieving a processing time of a 512-point FFT in less than 200 ns using a two-GPU solution.