Scan primitives for vector computers
Proceedings of the 1990 ACM/IEEE conference on Supercomputing
Automatic Data Structure Selection and Transformation for Sparse Matrix Computations
IEEE Transactions on Parallel and Distributed Systems
Segmented Operations for Sparse Matrix Computation on Vector Multiprocessors
Segmented Operations for Sparse Matrix Computation on Vector Multiprocessors
Sparse matrix solvers on the GPU: conjugate gradients and multigrid
ACM SIGGRAPH 2003 Papers
Understanding the efficiency of GPU algorithms for matrix-matrix multiplication
Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware
Exploring Graphics Processor Performance for General Purpose Applications
DSD '05 Proceedings of the 8th Euromicro Conference on Digital System Design
A memory model for scientific algorithms on graphics processors
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Performance-Energy Tradeoffs for Matrix Multiplication on FPGA-Based Mixed-Mode Chip Multiprocessors
ISQED '07 Proceedings of the 8th International Symposium on Quality Electronic Design
Scan primitives for GPU computing
Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS symposium on Graphics hardware
Optimising data movement rates for parallel processing applications on graphics processors
PDCN'07 Proceedings of the 25th conference on Proceedings of the 25th IASTED International Multi-Conference: parallel and distributed computing and networks
Exploring weak scalability for FEM calculations on a GPU-enhanced cluster
Parallel Computing
Studying Thermal Management for Graphics-Processor Architectures
ISPASS '05 Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2005
Efficient gather and scatter operations on graphics processors
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Fast scan algorithms on graphics processors
Proceedings of the 22nd annual international conference on Supercomputing
On the energy efficiency of graphics processing units for scientific computing
IPDPS '09 Proceedings of the 2009 IEEE International Symposium on Parallel&Distributed Processing
Analysis of Parallel Algorithms for Energy Conservation in Scalable Multicore Architectures
ICPP '09 Proceedings of the 2009 International Conference on Parallel Processing
Towards optimizing energy costs of algorithms for shared memory architectures
Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
Energy-aware high performance computing with graphic processing units
HotPower'08 Proceedings of the 2008 conference on Power aware computing and systems
Hi-index | 0.00 |
GPU has recently gained considerable attention in getting significant performance, for application raging from scientific computing to database sorting and search. General-purpose computing on GPU can easily reduce the execution time but results in an associated increase in the energy consumption. This paper analyzes energy consumption of parallel algorithms executing on GPU and provide a methodology for energy scalability while satisfying performance requirements. Then parallel prefix sum are analyzed to illustrate our method for energy conservation. We experimentally evaluate Sparse Matrix-Vector Multiply using the method for energy scalability and the results show that the number of blocks, memory choice and task scheduling are the important characterizes to trade-offs the performance and the energy consumption on GPU.