Fast parallel algorithms for QR and triangular factorization
SIAM Journal on Scientific and Statistical Computing
SIAM Journal on Scientific and Statistical Computing
Journal of VLSI Signal Processing Systems - Special issue: application specific array processors
A truly two-dimensional systolic array FPGA implementation of QR decomposition
ACM Transactions on Embedded Computing Systems (TECS)
Application specific processors for the autoregressive signal analysis
PPAM'09 Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part I
A unified co-processor architecture for matrix decomposition
Journal of Computer Science and Technology
FPGA implementation of the conjugate gradient method
PPAM'05 Proceedings of the 6th international conference on Parallel Processing and Applied Mathematics
Scalable matrix decompositions with multiple cores on FPGAs
Microprocessors & Microsystems
Hi-index | 0.00 |
A new parallel processor structure for Givens QR-decomposition intended for the FPGA implementation is presented. The structure is derivedusing methodof mapping regular algorithms using affine transformations of the algorithm graph. The methodsupp orts pipelined processor unit design, and provides efficient hardware utilization. An example of the implementation of this structure in the Xilinx Virtex FPGA devices is presented.