A quasi-minimal residual variant of the Bi-CGSTAB algorithm for nonsymmetric systems
SIAM Journal on Scientific Computing
Proceedings of the 27th annual international symposium on Computer architecture
Basic Linear Algebra Subprograms for Fortran Usage
ACM Transactions on Mathematical Software (TOMS)
Stream processor architecture
ICCD '02 Proceedings of the 2002 IEEE International Conference on Computer Design: VLSI in Computers and Processors (ICCD'02)
A programming system for the imagine media processor
A programming system for the imagine media processor
Programmable Stream Processors
Computer
The vlsi implementation and evaluation of area- and energy-efficient streaming media processors
The vlsi implementation and evaluation of area- and energy-efficient streaming media processors
Analysis and Performance Results of a Molecular Modeling Application on Merrimac
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Merrimac: Supercomputing with Streams
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Simulation-based evaluation of the Imagine stream processor with scientific programs
International Journal of High Performance Computing and Networking
Hi-index | 0.00 |
Several representative scientific computing applications have been mapped on the stream processor. But most of them are computation-intensive kernels or synthetic benchmarks. In this paper, we implement and optimize a complete data-intensive hydrodynamics application, QNJ-5, on the stream processor which is designed for computation-intensive applications. Different from other stream programs, how to relieve memory access pressure is especially important to this stream program. Simulation results show that StreamQNJ-5 gets an ultimate speedup of 2.97 and 1.11 over original FORTRAN QNJ-5 on a Xeon and Iantium processor, respectively.