We investigate several ways to improve the performance of sparse LU factorization with partial pivoting, as used to solve unsymmetric linear systems. We introduce the notion of unsymmetric supernodes to perform most of the numerical computation in dense matrix kernels. We introduce unsymmetric supernode-panel updates and two-dimensional data partitioning to better exploit the memory hierarchy. We use Gilbert and Peierls's depth-first search with Eisenstat and Liu's symmetric structural reductions to speed up symbolic factorization. We have developed a sparse LU code using all these ideas. We present experiments demonstrating that it is significantly faster than earlier partial pivoting codes. We also compare its performance with UMFPACK, which uses a multifrontal approach; our code is very competitive in time and storage requirements, especially for large problems.
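The depth-first search mentioned above predicts which entries of a sparse triangular solve will be nonzero before any arithmetic is done. The following is a minimal Python sketch of that idea, not the authors' implementation: the function name `reach` and the list-of-lists column layout are assumptions made for illustration, and the symmetric structural reductions of Eisenstat and Liu are omitted.

```python
def reach(L_cols, b_pattern):
    """Predict the nonzero pattern of x in the sparse solve L x = b.

    L_cols[j] is a hypothetical layout: the row indices i > j for which
    L[i, j] is structurally nonzero. b_pattern lists the nonzero rows of b.
    The result lists the nonzero rows of x in a topological order, so that
    column j is always processed before any row i it updates.
    """
    n = len(L_cols)
    visited = [False] * n
    postorder = []
    for start in b_pattern:
        if visited[start]:
            continue
        visited[start] = True
        # Explicit stack of (column, index of next neighbour to try),
        # which avoids recursion limits on long dependency chains.
        stack = [(start, 0)]
        while stack:
            j, k = stack[-1]
            rows = L_cols[j]
            # Skip neighbours that have already been discovered.
            while k < len(rows) and visited[rows[k]]:
                k += 1
            if k < len(rows):
                i = rows[k]
                stack[-1] = (j, k + 1)
                visited[i] = True
                stack.append((i, 0))
            else:
                stack.pop()
                postorder.append(j)
    # Reverse postorder of a DFS is a topological order of the reached nodes.
    return postorder[::-1]


# Structure of a 5 x 5 unit lower triangular matrix (illustrative data).
L_cols = [[2], [3], [3, 4], [4], []]
print(reach(L_cols, [0]))  # → [0, 2, 3, 4]
```

Because only the columns reachable from the nonzeros of `b` are visited, the symbolic work in this sketch is proportional to the number of structural entries actually touched rather than to the matrix dimension.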