A nationwide parallel computing environment
Communications of the ACM
ACM Transactions on Mathematical Software (TOMS)
Programming tools and environments
Communications of the ACM
Algorithmic Redistribution Methods for Block-Cyclic Decompositions
IEEE Transactions on Parallel and Distributed Systems
Efficient Algorithms for Block-Cyclic Array Redistribution Between Processor Sets
IEEE Transactions on Parallel and Distributed Systems
Blocked algorithms and software for reduction of a regular matrix pair to generalized Schur form
ACM Transactions on Mathematical Software (TOMS)
Parallel Partial Stabilizing Algorithms for Large Linear Control Systems
The Journal of Supercomputing
Computational Economics - Computational Studies at Stanford
High-cost CFD on a low-cost cluster
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
Matrix Multiplication on Heterogeneous Platforms
IEEE Transactions on Parallel and Distributed Systems
A Proposal for a Heterogeneous Cluster ScaLAPACK (Dense Linear Solvers)
IEEE Transactions on Computers
Making sparse Gaussian elimination scalable by static pivoting
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Automatically tuned linear algebra software
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Application of a high performance parallel eigensolver to electronic structure calculations
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Design, implementation and testing of extended and mixed precision BLAS
ACM Transactions on Mathematical Software (TOMS)
Distribution Assignment Placement: Effective Optimization of Redistribution Costs
IEEE Transactions on Parallel and Distributed Systems
Numerical libraries and the grid: the GrADS experiments with ScaLAPACK
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Design and implementation of FMPL, a fast message-passing library for remote memory operations
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Component-based derivation of a parallel stiff ODE solver implemented in a cluster of computers
International Journal of Parallel Programming
Journal of Computational and Applied Mathematics - Special issue: Proceedings of the 9th International Congress on computational and applied mathematics
Parallel algorithms for LQ optimal control of discrete-time periodic linear systems
Journal of Parallel and Distributed Computing
Dense linear algebra kernels on heterogeneous platforms: redistribution issues
Parallel Computing - Parallel matrix algorithms and applications
Applying NetSolve's Network-Enabled Server
IEEE Computational Science & Engineering
A comparison of parallel solvers for diagonally dominant and general narrow-banded linear systems
Parallel numerical linear algebra
A Grid Computing Environment for Enabling Large Scale Quantum Mechanical Simulations
GRID '00 Proceedings of the First IEEE/ACM International Workshop on Grid Computing
Performance Prediction and Analysis of Parallel Out-Of-Core Matrix Factorization
HiPC '00 Proceedings of the 7th International Conference on High Performance Computing
Parallel Factorizations with Algorithmic Blocking
ICCS '01 Proceedings of the International Conference on Computational Sciences-Part I
Cluster Configuration Aided by Simulation
ICCS '01 Proceedings of the International Conference on Computational Sciences-Part I
Semi-automatic Generation of Web-Based Computing Environments for Software Libraries
ICCS '02 Proceedings of the International Conference on Computational Science-Part I
Parallel Out-of-Core Matrix Inversion
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Mixed Parallel Implementations of Strassen and Winograd Matrix Multiplication Algorithms
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Parallel Two-Stage Reduction of a Regular Matrix Pair to Hessenberg-Triangular Form
PARA '00 Proceedings of the 5th International Workshop on Applied Parallel Computing, New Paradigms for HPC in Industry and Academia
Co-array Fortran for Full and Sparse Matrices
PARA '02 Proceedings of the 6th International Conference on Applied Parallel Computing Advanced Scientific Computing
Enhanced Services for Remote Model Reduction of Large-Scale Dense Linear Systems
PARA '02 Proceedings of the 6th International Conference on Applied Parallel Computing Advanced Scientific Computing
PARA '02 Proceedings of the 6th International Conference on Applied Parallel Computing Advanced Scientific Computing
PaCT '99 Proceedings of the 5th International Conference on Parallel Computing Technologies
SSA, SVD, QR-cp, and RBF Model Reduction
ICANN '02 Proceedings of the International Conference on Artificial Neural Networks
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
Solving Discrete-Time Periodic Riccati Equations on a Cluster (Research Note)
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
Parallel Implementation of a Block Algorithm for Matrix 1-Norm Estimation
Euro-Par '01 Proceedings of the 7th International Euro-Par Conference Manchester on Parallel Processing
Solving Stable Stein Equations on Distributed Memory Computers
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
High-Speed LANs: New Environments for Parallel and Distributed Applications
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
A New Parallel Approach to the Toeplitz Inverse Eigenproblem Using Newton-like Methods
VECPAR '00 Selected Papers and Invited Talks from the 4th International Conference on Vector and Parallel Processing
A Parallel Algorithm for Solving the Toeplitz Least Squares Problem
VECPAR '00 Selected Papers and Invited Talks from the 4th International Conference on Vector and Parallel Processing
ParNum '99 Proceedings of the 4th International ACPC Conference Including Special Tracks on Parallel Numerics and Parallel Computing in Image Processing, Video Processing, and Multimedia: Parallel Computation
Blocking Techniques in Numerical Software
ParNum '99 Proceedings of the 4th International ACPC Conference Including Special Tracks on Parallel Numerics and Parallel Computing in Image Processing, Video Processing, and Multimedia: Parallel Computation
Knowledge Discovery in Auto-tuning Parallel Numerical Library
Progress in Discovery Science, Final Report of the Japanese Discovery Science Project
Solving the Inverse Toeplitz Eigenproblem Using ScaLAPACK and MPI
Proceedings of the 6th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Heterogeneous Networks of Workstations and the Parallel Matrix Multiplication
Proceedings of the 8th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
PVM Implementation of Heterogeneous ScaLAPACK Dense Linear Solvers
Proceedings of the 6th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
An Adaptive Working Set Algorithm
Messung, Modellierung und Bewertung von Rechensystemen, 2. GI/NTG-Fachtagung
An Efficient Parallel Algorithm for the Symmetric Tridiagonal Eigenvalue Problem
VECPAR '00 Selected Papers and Invited Talks from the 4th International Conference on Vector and Parallel Processing
Early evaluation of the IBM p690
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
QR factorization for shared memory and message passing
Parallel Computing
SuperLU_DIST: A scalable distributed-memory sparse direct solver for unsymmetric linear systems
ACM Transactions on Mathematical Software (TOMS)
QR factorization with Morton-ordered quadtree matrices for memory re-use and parallelism
Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming
Nonlinear optimization and parallel computing
Parallel Computing - Special issue: Parallel computing in numerical optimization
Parallel Computing - Special issue: Parallel computing in numerical optimization
NetSolve: A Network-Enabled Solver: Examples and Users
HCW '98 Proceedings of the Seventh Heterogeneous Computing Workshop
Algorithm 826: A parallel eigenvalue routine for complex Hessenberg matrices
ACM Transactions on Mathematical Software (TOMS)
Matrix-Matrix Multiplication on Heterogeneous Platforms
ICPP '00 Proceedings of the 2000 International Conference on Parallel Processing
Algorithm engineering for parallel computation
Experimental algorithmics
A decoupled scheduling approach for Grid application development environments
Journal of Parallel and Distributed Computing - Special issue on computational grids
Mathematical software: past, present, and future
Computational science, mathematics and software
Numerical algorithm delivery mechanisms
Computational science, mathematics and software
SDPARA: semiDefinite programming algorithm paRAllel version
Parallel Computing
On variable blocking factor in a parallel dynamic block-Jacobi SVD algorithm
Parallel Computing - Parallel matrix algorithms and applications (PMAA '02)
State-space truncation methods for parallel model reduction of large-scale systems
Parallel Computing - Special issue: Parallel and distributed scientific and engineering computing
Self-adapting software for numerical linear algebra and LAPACK for clusters
Parallel Computing - Special issue: Parallel and distributed scientific and engineering computing
Grid resource management
Architecture of an automatically tuned linear algebra library
Parallel Computing
Parallel and fully recursive multifrontal sparse Cholesky
Future Generation Computer Systems - Special issue: Selected numerical algorithms
On the performance of parallel factorization of out-of-core matrices
Parallel Computing
Distributed parallel computing using navigational programming
International Journal of Parallel Programming
Parallel algorithms for Markov chain Monte Carlo methods in latent spatial Gaussian models
Statistics and Computing
An Extension of Fortran for High Performance Parallel Computing
Programming and Computer Software
A Framework for Approximating Eigenpairs in Electronic Structure Computations
Computing in Science and Engineering
ProtoMol, an object-oriented framework for prototyping novel algorithms for molecular dynamics
ACM Transactions on Mathematical Software (TOMS)
An Efficient Parallel Algorithm to Solve Block-Toeplitz Systems
The Journal of Supercomputing
Encyclopedia of Computer Science
Numerical Libraries and Tools for Scalable Parallel Cluster Computing
International Journal of High Performance Computing Applications
Static LU Decomposition on Heterogeneous Platforms
International Journal of High Performance Computing Applications
The GrADS Project: Software Support for High-Level Grid Application Development
International Journal of High Performance Computing Applications
Numerical Libraries and the Grid
International Journal of High Performance Computing Applications
An overview of the Advanced CompuTational Software (ACTS) collection
ACM Transactions on Mathematical Software (TOMS) - Special issue on the Advanced CompuTational Software (ACTS) Collection
An overview of the Trilinos project
ACM Transactions on Mathematical Software (TOMS) - Special issue on the Advanced CompuTational Software (ACTS) Collection
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Parallel Computing - Heterogeneous computing
A Component Architecture for High-Performance Scientific Computing
International Journal of High Performance Computing Applications
High Performance Remote Memory Access Communication: The ARMCI Approach
International Journal of High Performance Computing Applications
Memory efficient parallel matrix multiplication operation for irregular problems
Proceedings of the 3rd conference on Computing frontiers
ABCLib_DRSSED: A parallel eigensolver with an auto-tuning facility
Parallel Computing
A parallel hybrid banded system solver: the SPIKE algorithm
Parallel Computing - Parallel matrix algorithms and applications (PMAA'04)
Error bounds from extra-precise iterative refinement
ACM Transactions on Mathematical Software (TOMS)
Self-adapting numerical software (SANS) effort
IBM Journal of Research and Development
New grid scheduling and rescheduling methods in the GrADS project
International Journal of Parallel Programming - Special issue: The next generation software program
Scalable Hybrid Designs for Linear Algebra on Reconfigurable Computing Systems
ICPADS '06 Proceedings of the 12th International Conference on Parallel and Distributed Systems - Volume 1
Making a Supercomputer Do What You Want: High-Level Tools for Parallel Programming
Computing in Science and Engineering
A new singular value decomposition algorithm suited to parallelization and preliminary results
ACST'06 Proceedings of the 2nd IASTED international conference on Advances in computer science and technology
Large-scale electronic structure calculations of high-Z metals on the BlueGene/L platform
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
SIPs: Shift-and-invert parallel spectral transformations
ACM Transactions on Mathematical Software (TOMS)
OpenMP issues arising in the development of parallel BLAS and LAPACK libraries
Scientific Programming - OpenMP
Parallel Computing Algorithms and Applications
Computing in Science and Engineering
Scheduling Messages For Data Redistribution: An Experimental Study
International Journal of High Performance Computing Applications
An efficient direct parallel spectral-element solver for separable elliptic problems
Journal of Computational Physics
High Performance Development for High End Computing With Python Language Wrapper (PLW)
International Journal of High Performance Computing Applications
Parallelizing MCMC for Bayesian spatiotemporal geostatistical models
Statistics and Computing
Parallelism of double divide and conquer algorithm for singular value decomposition
PDCN'07 Proceedings of the 25th IASTED International Multi-Conference: parallel and distributed computing and networks
PyTrilinos: High-performance distributed-memory solvers for Python
ACM Transactions on Mathematical Software (TOMS)
On the design of interfaces to sparse direct solvers
ACM Transactions on Mathematical Software (TOMS)
BlueGene/L applications: Parallelism On a Massive Scale
International Journal of High Performance Computing Applications
Matrix product on heterogeneous master-worker platforms
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Designing polylibraries to speed up linear algebra computations
International Journal of High Performance Computing and Networking
Partial stabilisation of large-scale discrete-time linear control systems
International Journal of Computational Science and Engineering
Parallel block tridiagonalization of real symmetric matrices
Journal of Parallel and Distributed Computing
Parallelization of a method for the solution of the inverse additive singular value problem
MATH'05 Proceedings of the 8th WSEAS International Conference on Applied Mathematics
Analyzing memory access intensity in parallel programs on multicore
Proceedings of the 22nd annual international conference on Supercomputing
Architecture of Qbox: a scalable first-principles molecular dynamics code
IBM Journal of Research and Development
ISTASC'04 Proceedings of the 4th WSEAS International Conference on Systems Theory and Scientific Computation
Combining building blocks for parallel multi-level matrix multiplication
Parallel Computing
Families of algorithms related to the inversion of a Symmetric Positive Definite matrix
ACM Transactions on Mathematical Software (TOMS)
Performance modeling of parallel applications for grid scheduling
Journal of Parallel and Distributed Computing
Parallel computation of the eigenvalues of symmetric Toeplitz matrices through iterative methods
Journal of Parallel and Distributed Computing
A multi-level parallel simulation approach to electron transport in nano-scale transistors
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Compatibility of ScaLAPACK with the Discrete Wavelet Transform
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part I: ICCS 2007
Parallel Algorithms for Triangular Periodic Sylvester-Type Matrix Equations
Euro-Par '08 Proceedings of the 14th international Euro-Par conference on Parallel Processing
Solving linear-quadratic optimal control problems on parallel computers
Optimization Methods & Software
How to Write Fast Numerical Code: A Small Introduction
Generative and Transformational Techniques in Software Engineering II
High Performance Computing for Computational Science - VECPAR 2008
Parallel Eigensolvers for a Discretized Radiative Transfer Problem
High Performance Computing for Computational Science - VECPAR 2008
QR factorization for the Cell Broadband Engine
Scientific Programming - High Performance Computing with the Cell Broadband Engine
A tearing-based hybrid parallel banded linear system solver
Journal of Computational and Applied Mathematics
Interfaces for parallel numerical linear algebra libraries in high level languages
Advances in Engineering Software
Fast (Parallel) Dense Linear System Solvers in C-XSC Using Error Free Transformations and BLAS
Numerical Validation in Current Hardware Architectures
A Note on Solving Problem 7 of the SIAM 100-Digit Challenge Using C-XSC
Numerical Validation in Current Hardware Architectures
A new algorithm for singular value decomposition and its parallelization
Parallel Computing
Domain decomposition solution of nonlinear two-dimensional parabolic problems by random trees
Journal of Computational Physics
A compositional framework for developing parallel programs on two-dimensional arrays
International Journal of Parallel Programming
Non-splitting Tridiagonalization of Complex Symmetric Matrices
ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I
A Parallel Nonnegative Tensor Factorization Algorithm for Mining Global Climate Data
ICCS 2009 Proceedings of the 9th International Conference on Computational Science
Parallel solution of large-scale algebraic Bernoulli equations with the matrix sign function method
International Journal of Computational Science and Engineering
Communication-optimal parallel and sequential Cholesky decomposition: extended abstract
Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures
A Grid framework to enable parallel and concurrent TMA image analyses
International Journal of Grid and Utility Computing
PyACTS: a python based interface to ACTS tools and parallel scientific applications
International Journal of Parallel Programming
Study of neural net training methods in parallel and distributed architectures
Future Generation Computer Systems
On the Need for a Consortium of Capability Centers
International Journal of High Performance Computing Applications
International Journal of High Performance Computing Applications
Design patterns for multiphysics modeling in Fortran 2003 and C++
ACM Transactions on Mathematical Software (TOMS)
Low cost high performance uncertainty quantification
Proceedings of the 2nd Workshop on High Performance Computational Finance
PyPnetCDF: A high level framework for parallel access to netCDF files
Advances in Engineering Software
Parallel double divide and conquer and its evaluation on a super computer
PDCS '07 Proceedings of the 19th IASTED International Conference on Parallel and Distributed Computing and Systems
Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
Comparative study of one-sided factorizations with multiple software packages on multi-core hardware
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
A study on quaternion block quasi-tridiagonal systems
Computers & Mathematics with Applications
Time-memory trade-offs using sparse matrix methods for large-scale eigenvalue problems
ICCSA'03 Proceedings of the 2003 international conference on Computational science and its applications: Part I
A vector-parallel FFT with a user-specifiable data distribution scheme
ISPA'03 Proceedings of the 2003 international conference on Parallel and distributed processing and applications
PyACTS: a high-level framework for fast development of high performance applications
VECPAR'06 Proceedings of the 7th international conference on High performance computing for computational science
Evaluation of linear solvers for astrophysics transfer problems
VECPAR'06 Proceedings of the 7th international conference on High performance computing for computational science
Parallelisation of sparse grids for large scale data analysis
ICCS'03 Proceedings of the 2003 international conference on Computational science: Part III
A parallel Newton-GMRES algorithm for solving large scale nonlinear systems
VECPAR'02 Proceedings of the 5th international conference on High performance computing for computational science
Translation schemes for the HPJava parallel programming language
LCPC'01 Proceedings of the 14th international conference on Languages and compilers for parallel computing
International Journal of High Performance Computing Applications
Parallel variants of the multishift QZ algorithm with advanced deflation techniques
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
High-level user interfaces for the DOE ACTS collection
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
Three algorithms for Cholesky factorization on distributed memory using packed storage
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
New data distribution for solving triangular systems on distributed memory machines
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
Distributed SILC: an easy-to-use interface for MPI-based parallel matrix computation libraries
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
Parallel implementation of a neural net training application in a heterogeneous grid environment
OTM'07 Proceedings of the 2007 OTM confederated international conference on On the move to meaningful internet systems: CoopIS, DOA, ODBASE, GADA, and IS - Volume Part II
Implementing effective data management policies in distributed and grid computing environments
PPAM'07 Proceedings of the 7th international conference on Parallel processing and applied mathematics
Hybrid parallel programming with MPI and unified parallel C
Proceedings of the 7th ACM international conference on Computing frontiers
HieraAnalyses – a tool for hierarchical analysis of parallel programs
International Journal of High Performance Systems Architecture
Block Householder computation of sparse matrix singular values
SpringSim '10 Proceedings of the 2010 Spring Simulation Multiconference
NPC'10 Proceedings of the 2010 IFIP international conference on Network and parallel computing
Algorithmic issues in grid computing
Algorithms and theory of computation handbook
Scalable Tile Communication-Avoiding QR Factorization on Multicore Cluster Systems
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
CFD parallel simulation using Getfem++ and MUMPS
Euro-Par'10 Proceedings of the 16th international Euro-Par conference on Parallel processing: Part II
Scheduling parallel eigenvalue computations in a quantum chemistry code
Euro-Par'10 Proceedings of the 16th international Euro-Par conference on Parallel processing: Part II
Adams-Bashforth and Adams-Moulton methods for solving differential Riccati equations
Computers & Mathematics with Applications
EUROMICRO-PDP'02 Proceedings of the 10th Euromicro conference on Parallel, distributed and network-based processing
Towards the design of an automatically tuned linear algebra library
EUROMICRO-PDP'02 Proceedings of the 10th Euromicro conference on Parallel, distributed and network-based processing
Modified valence force field approach for phonon dispersion: from zinc-blende bulk to nanowires
Journal of Computational Electronics
More on JACE: new functionalities, new experiments
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
QCG-OMPI: MPI applications on grids
Future Generation Computer Systems
A piecewise-linearized algorithm based on the Krylov subspace for solving stiff ODEs
Journal of Computational and Applied Mathematics
A Novel Parallel QR Algorithm for Hybrid Distributed Memory HPC Systems
SIAM Journal on Scientific Computing
VECPAR'10 Proceedings of the 9th international conference on High performance computing for computational science
A fast semi-implicit method for anisotropic diffusion
Journal of Computational Physics
ACS'06 Proceedings of the 6th WSEAS international conference on Applied computer science
ICCSA'11 Proceedings of the 2011 international conference on Computational science and its applications - Volume Part I
Journal of Computational and Applied Mathematics
Communication-optimal parallel 2.5D matrix multiplication and LU factorization algorithms
Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part II
Strategies for Rescheduling Tightly-Coupled Parallel Applications in Multi-Cluster Grids
Journal of Grid Computing
Formal analysis of MPI-based parallel programs
Communications of the ACM
Algorithm 915, SuiteSparseQR: Multifrontal multithreaded rank-revealing sparse QR factorization
ACM Transactions on Mathematical Software (TOMS)
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Checkpointing strategies for parallel jobs
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Scalable stochastic optimization of complex energy systems
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Improving communication performance in dense linear algebra via topology aware collectives
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
A threaded SPIKE algorithm for solving general banded systems
Parallel Computing
MR3-SMP: A symmetric tridiagonal eigensolver for multi-core architectures
Parallel Computing
Journal of Computational Physics
Communication-optimal Parallel and Sequential Cholesky Decomposition
SIAM Journal on Scientific Computing
HiPC'06 Proceedings of the 13th international conference on High Performance Computing
An efficient parallel solution of complex Toeplitz linear systems
PPAM'05 Proceedings of the 6th international conference on Parallel Processing and Applied Mathematics
A Fortran evolution of mpC parallel programming language
PPAM'05 Proceedings of the 6th international conference on Parallel Processing and Applied Mathematics
PPAM'05 Proceedings of the 6th international conference on Parallel Processing and Applied Mathematics
Mobile pipelines: parallelizing left-looking algorithms using navigational programming
HiPC'05 Proceedings of the 12th international conference on High Performance Computing
A parallel solution of Hermitian Toeplitz linear systems
ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part I
High performance matrix inversion based on LU factorization for multicore architectures
Proceedings of the 2011 ACM international workshop on Many task computing on grids and supercomputers
Parallel solution of large-scale and sparse generalized algebraic Riccati equations
Euro-Par'06 Proceedings of the 12th international conference on Parallel Processing
Parallelising matrix operations on clusters for an optimal control-based quantum compiler
Euro-Par'06 Proceedings of the 12th international conference on Parallel Processing
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Parallel model reduction of large linear descriptor systems via balanced truncation
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Three parallel algorithms for solving nonlinear systems and optimization problems
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Numerical integration of the differential Riccati equation: a high performance computing approach
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
An efficient and stable parallel solution for non-symmetric Toeplitz linear systems
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Using aspects for supporting procedural modules in # programming
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Parallelization of divide-and-conquer eigenvector accumulation
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Broadcast-based parallel LU factorization
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Rapid development of high-performance out-of-core solvers
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
Performance evaluation of a parallel algorithm for a radiative transfer problem
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
Algorithm-based fault tolerance for dense matrix factorizations
Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming
Sparse matrices in Matlab*P: design and implementation
HiPC'04 Proceedings of the 11th international conference on High Performance Computing
Object-oriented, parallel finite element framework with dynamic load balancing
Advances in Engineering Software
The symmetric–Toeplitz linear system problem in parallel
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part I
Parallel resolution with Newton algorithms of the inverse non-symmetric eigenvalue problem
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part I
Coupled fusion simulation using the common component architecture
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part I
An implementation of the matrix multiplication algorithm SUMMA in mpF
PaCT'05 Proceedings of the 8th international conference on Parallel Computing Technologies
Automatic performance optimization of the discrete Fourier transform on distributed memory computers
ISPA'06 Proceedings of the 4th international conference on Parallel and Distributed Processing and Applications
Journal of Computational Physics
On aggressive early deflation in parallel variants of the QR algorithm
PARA'10 Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume Part I
PARA'10 Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume Part I
Parallel solution of narrow banded diagonally dominant linear systems
PARA'10 Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume 2
Low-cost data uncertainty quantification
Concurrency and Computation: Practice & Experience
Computing matrix functions solving coupled differential models
Mathematical and Computer Modelling: An International Journal
Enabling and scaling matrix computations on heterogeneous multi-core and multi-GPU systems
Proceedings of the 26th ACM international conference on Supercomputing
Concurrency and Computation: Practice & Experience
A scalable framework for heterogeneous GPU-based clusters
Proceedings of the twenty-fourth annual ACM symposium on Parallelism in algorithms and architectures
Communication-optimal parallel algorithm for Strassen's matrix multiplication
Proceedings of the twenty-fourth annual ACM symposium on Parallelism in algorithms and architectures
A preconditioning technique for Schur complement systems arising in stochastic optimization
Computational Optimization and Applications
Communication-optimal Parallel and Sequential QR and LU Factorizations
SIAM Journal on Scientific Computing
Reliable Eigenvalues of Symmetric Tridiagonals
SIAM Journal on Matrix Analysis and Applications
Speeding up solving of differential matrix Riccati equations using GPGPU computing and MATLAB
Concurrency and Computation: Practice & Experience
Processor allocation for optimistic parallelization of irregular programs
ICCSA'12 Proceedings of the 12th international conference on Computational Science and Its Applications - Volume Part I
Incomplete cyclic reduction of banded and strictly diagonally dominant linear systems
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Cache blocking for linear algebra algorithms
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Distributed QR factorization based on randomized algorithms
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Implementations of main algorithms for generalized eigenproblem on GPU accelerator
ICSI'12 Proceedings of the Third international conference on Advances in Swarm Intelligence - Volume Part II
A framework for the application of metaheuristics to tasks-to-processors assignation problems
The Journal of Supercomputing
High-performance general solver for extremely large-scale semidefinite programming problems
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
A first step towards automatically building network representations
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Toward scalable matrix multiply on multithreaded architectures
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
PaCT'07 Proceedings of the 9th international conference on Parallel Computing Technologies
A high-level Fortran interface to parallel matrix algebra
Proceedings of the Second International Conference on Computational Science, Engineering and Information Technology
Graph expansion and communication costs of fast matrix multiplication
Journal of the ACM (JACM)
Efficient multidimensional data redistribution for resizable parallel computations
ISPA'07 Proceedings of the 5th international conference on Parallel and Distributed Processing and Applications
From serial loops to parallel execution on distributed systems
Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
A checkpoint-on-failure protocol for algorithm-based recovery in standard MPI
Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
Fast parallel algorithms for blocked dense matrix multiplication on shared memory architectures
ICA3PP'12 Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
Journal of Parallel and Distributed Computing
High-performance bidiagonal reduction using tile algorithms on homogeneous multicore architectures
ACM Transactions on Mathematical Software (TOMS)
Efficient generalized Hessenberg form and applications
ACM Transactions on Mathematical Software (TOMS)
Hierarchical QR factorization algorithms for multi-core clusters
Parallel Computing
Implementing OmpSs support for regions of data in architectures with multiple address spaces
Proceedings of the 27th international ACM conference on International conference on supercomputing
Toward a scalable multi-GPU eigensolver via compute-intensive kernels and efficient communication
Proceedings of the 27th international ACM conference on International conference on supercomputing
Parallel reduction to Hessenberg form with algorithm-based fault tolerance
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Scalable matrix computations on large scale-free graphs using 2D graph partitioning
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Multiphysics simulations: Challenges and opportunities
International Journal of High Performance Computing Applications