Space-time adaptive processing (STAP) refers to a class of methods for detecting targets using an array of sensors. Various STAP methods use similar operations on different data or in different orders. We have developed a portable, parallel library of subroutines for prototyping STAP methods. The subroutines work on the IBM SP2 and the Intel Paragon under three different operating systems and three different communication libraries, and can also be configured for other systems. We provide execution-time models for predicting the performance of each subroutine. Using the library routines, we created a parallel version of element-space pre-Doppler processing, three parallel versions of higher-order post-Doppler processing, and two versions of PRI-staggered post-Doppler processing. We also implemented a fourth version of higher-order post-Doppler processing, the hybrid method, which combines fine-grain and coarse-grain parallelism to reduce execution time. The hybrid method can improve performance when a large number of processors is available. Our execution-time models generally identify the fastest method and predict execution times to within 10 percent for large test cases.
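To make the idea of an execution-time model concrete, the following is a minimal sketch, not the models from the paper. It assumes a generic latency-plus-bandwidth cost for communication and perfectly divisible computation; the names `Machine`, `MethodCost`, `predicted_time`, `best_method`, and `relative_error`, and every parameter, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Machine:
    """Hypothetical machine parameters (assumed, not taken from the paper)."""
    flop_rate: float      # sustained floating-point operations per second, per processor
    latency: float        # fixed startup cost per message, in seconds
    inv_bandwidth: float  # transfer cost per byte, in seconds


@dataclass(frozen=True)
class MethodCost:
    """Hypothetical operation counts for one parallel method."""
    name: str
    flops: float       # total floating-point operations across all processors
    messages: int      # messages sent by each processor
    bytes_sent: float  # bytes sent by each processor


def predicted_time(cost, machine, processors):
    """Predict execution time as compute time plus communication time."""
    # Assumption: computation divides evenly among the processors.
    compute = cost.flops / (processors * machine.flop_rate)
    # Assumption: each message costs a fixed latency plus a per-byte charge.
    communicate = (cost.messages * machine.latency
                   + cost.bytes_sent * machine.inv_bandwidth)
    return compute + communicate


def best_method(costs, machine, processors):
    """Return the method with the smallest predicted execution time."""
    return min(costs, key=lambda c: predicted_time(c, machine, processors))


def relative_error(predicted, measured):
    """Fractional difference between a predicted and a measured time."""
    return abs(predicted - measured) / measured
```

Under these assumptions, one would call `best_method` with a list of `MethodCost` entries for the candidate methods, then use `relative_error` against a measured run to check whether a prediction falls within a target such as 0.10.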