Solving polynomial systems for curve, surface and solid modeling
SMA '93 Proceedings on the second ACM symposium on Solid modeling and applications
Matrix Partitioning on a Virtual Shared Memory Parallel Machine
IEEE Transactions on Parallel and Distributed Systems
The computation of elementary unitary matrices
ACM Transactions on Mathematical Software (TOMS)
Solving algebraic systems using matrix computations
ACM SIGSAM Bulletin
Data-centric multi-level blocking
Proceedings of the ACM SIGPLAN 1997 conference on Programming language design and implementation
IES3: a fast integral equation solver for efficient 3-dimensional extraction
ICCAD '97 Proceedings of the 1997 IEEE/ACM international conference on Computer-aided design
Automatic selection of high-order transformations in the IBM XL FORTRAN compilers
IBM Journal of Research and Development - Special issue: performance analysis and its impact on design
Compiler blockability of dense matrix factorizations
ACM Transactions on Mathematical Software (TOMS)
Efficient Householder QR factorization for superscalar processors
ACM Transactions on Mathematical Software (TOMS)
MAPC: a library for efficient and exact manipulation of algebraic points and curves
SCG '99 Proceedings of the fifteenth annual symposium on Computational geometry
High-level semantic optimization of numerical codes
ICS '99 Proceedings of the 13th international conference on Supercomputing
Nonlinear array layouts for hierarchical memory systems
ICS '99 Proceedings of the 13th international conference on Supercomputing
Algorithmic Redistribution Methods for Block-Cyclic Decompositions
IEEE Transactions on Parallel and Distributed Systems
Parallel Partial Stabilizing Algorithms for Large Linear Control Systems
The Journal of Supercomputing
OoLALA: an object oriented analysis and design of numerical linear algebra
OOPSLA '00 Proceedings of the 15th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
An observation on bisection software for the symmetric tridiagonal eigenvalue problem
ACM Transactions on Mathematical Software (TOMS)
Band reduction algorithms revisited
ACM Transactions on Mathematical Software (TOMS)
A framework for symmetric band reduction
ACM Transactions on Mathematical Software (TOMS)
Algorithm 807: The SBR Toolbox—software for successive band reduction
ACM Transactions on Mathematical Software (TOMS)
Automatic translation of Fortran to JVM bytecode
Proceedings of the 2001 joint ACM-ISCOPE conference on Java Grande
The quest for petascale computing
Computing in Science and Engineering
Optimizing locality for ODE solvers
ICS '01 Proceedings of the 15th international conference on Supercomputing
A recursive formulation of Cholesky factorization of a matrix in packed storage
ACM Transactions on Mathematical Software (TOMS)
Proximal support vector machine classifiers
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
A decoupling method for analysis of coupled RLC interconnects
Proceedings of the 12th ACM Great Lakes symposium on VLSI
Symbolic and numeric methods for exploiting structure in constructing resultant matrices
Journal of Symbolic Computation
Making sparse Gaussian elimination scalable by static pivoting
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Automatically tuned linear algebra software
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
MultiMATLAB: integrating MATLAB with high-performance parallel computing
SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
ACM Transactions on Mathematical Software (TOMS)
Renovating the collected algorithms from ACM
ACM Transactions on Mathematical Software (TOMS)
Synthesizing sounds from rigid-body simulations
Proceedings of the 2002 ACM SIGGRAPH/Eurographics symposium on Computer animation
Fitting nature's basic functions part I: polynomials and linear least squares
Computing in Science and Engineering
An updated set of basic linear algebra subprograms (BLAS)
ACM Transactions on Mathematical Software (TOMS)
Design, implementation and testing of extended and mixed precision BLAS
ACM Transactions on Mathematical Software (TOMS)
On computing Givens rotations reliably and efficiently
ACM Transactions on Mathematical Software (TOMS)
Array form representation of idiom recognition system for numerical programs
Proceedings of the 2001 conference on APL: an arrays odyssey
Numerical study of quantum resonances in chaotic scattering
Journal of Computational Physics
ACM Transactions on Mathematical Software (TOMS)
ACM Transactions on Mathematical Software (TOMS)
Applications performance under OSF/1 AD and SUNMOS on Intel Paragon XP/S-15
Proceedings of the 1994 ACM/IEEE conference on Supercomputing
Data-Centric Transformations for Locality Enhancement
International Journal of Parallel Programming
Dynamic ordering for a parallel block-Jacobi SVD algorithm
Parallel Computing - Parallel matrix algorithms and applications
An iterated eigenvalue algorithm for approximating roots of univariate polynomials
Journal of Symbolic Computation - Computer algebra: Selected papers from ISSAC 2001
Linear Algebra Libraries for High-Performance Computers: A Personal Perspective
IEEE Parallel & Distributed Technology: Systems & Technology
Applying NetSolve's Network-Enabled Server
IEEE Computational Science & Engineering
Computing in Science and Engineering
The Decompositional Approach to Matrix Computation
Computing in Science and Engineering
Solving Systems of Polynomial Equations
IEEE Computer Graphics and Applications
Very large electronic structure calculations using an out-of-core filter-diagonalization method
Journal of Computational Physics
Applied Numerical Mathematics
A Grid Computing Environment for Enabling Large Scale Quantum Mechanical Simulations
GRID '00 Proceedings of the First IEEE/ACM International Workshop on Grid Computing
Solving Orthogonal Matrix Differential Systems in Mathematica
ICCS '02 Proceedings of the International Conference on Computational Science-Part III
LAWRA Workshop: Linear Algebra with Recursive Algorithms: http://lawra.uni-c.dk/lawra/
HPCN Europe 2000 Proceedings of the 8th International Conference on High-Performance Computing and Networking
Parallel Triangular Sylvester-Type Matrix Equation Solvers for SMP Systems Using Recursive Blocking
PARA '00 Proceedings of the 5th International Workshop on Applied Parallel Computing, New Paradigms for HPC in Industry and Academia
LAWRA: Linear Algebra with Recursive Algorithms
PARA '00 Proceedings of the 5th International Workshop on Applied Parallel Computing, New Paradigms for HPC in Industry and Academia
High Performance Cholesky Factorization via Blocking and Recursion That Uses Minimal Storage
PARA '00 Proceedings of the 5th International Workshop on Applied Parallel Computing, New Paradigms for HPC in Industry and Academia
A Fast Minimal Storage Symmetric Indefinite Solver
PARA '00 Proceedings of the 5th International Workshop on Applied Parallel Computing, New Paradigms for HPC in Industry and Academia
Parallel Two-Stage Reduction of a Regular Matrix Pair to Hessenberg-Triangular Form
PARA '00 Proceedings of the 5th International Workshop on Applied Parallel Computing, New Paradigms for HPC in Industry and Academia
Parallel Two-Sided Sylvester-Type Matrix Equation Solvers for SMP Systems Using Recursive Blocking
PARA '02 Proceedings of the 6th International Conference on Applied Parallel Computing Advanced Scientific Computing
PARA '02 Proceedings of the 6th International Conference on Applied Parallel Computing Advanced Scientific Computing
PARA '02 Proceedings of the 6th International Conference on Applied Parallel Computing Advanced Scientific Computing
Collective Principal Component Analysis from Distributed, Heterogeneous Data
PKDD '00 Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery
Bundle Adjustment - A Modern Synthesis
ICCV '99 Proceedings of the International Workshop on Vision Algorithms: Theory and Practice
Abstraction of Expectation Functions Using Gaussian Distributions
VMCAI 2003 Proceedings of the 4th International Conference on Verification, Model Checking, and Abstract Interpretation
Parallel Implementation of a Block Algorithm for Matrix 1-Norm Estimation
Euro-Par '01 Proceedings of the 7th International Euro-Par Conference Manchester on Parallel Processing
An Evaluation of Java for Numerical Computing
ISCOPE '98 Proceedings of the Second International Symposium on Computing in Object-Oriented Parallel Environments
Typhoon Analysis and Data Mining with Kernel Methods
SVM '02 Proceedings of the First International Workshop on Pattern Recognition with Support Vector Machines
3D Clothes Modeling from Photo Cloned Human Body
VW '00 Proceedings of the Second International Conference on Virtual Worlds
Exploiting On-Chip Memory Bandwidth in the VIRAM Compiler
IMS '00 Revised Papers from the Second International Workshop on Intelligent Memory Systems
Advanced environments for parallel and distributed applications: a view of current status
Parallel Computing - Special issue: Advanced environments for parallel and distributed computing
Future Generation Computer Systems - Special issue: Geometric numerical algorithms
Journal of Computational Physics
Fast accurate computation of large-scale IP traffic matrices from link loads
SIGMETRICS '03 Proceedings of the 2003 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
FRONTIERS '96 Proceedings of the 6th Symposium on the Frontiers of Massively Parallel Computation
Algorithm-Based Diskless Checkpointing for Fault-Tolerant Matrix Operations
FTCS '95 Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing
Linear algebra operators for GPU implementation of numerical algorithms
ACM SIGGRAPH 2003 Papers
Mathematical software: past, present, and future
Computational science, mathematics and software
Numerical algorithm delivery mechanisms
Computational science, mathematics and software
Sourcebook of parallel computing
ACM Transactions on Programming Languages and Systems (TOPLAS)
Tomography-based overlay network monitoring
Proceedings of the 3rd ACM SIGCOMM conference on Internet measurement
Incompressible limits of lattice Boltzmann equations using multiple relaxation times
Journal of Computational Physics
Structure preservation: a challenge in computational control
Future Generation Computer Systems - Selected papers on theoretical and computational aspects of structural dynamical systems in linear algebra and control
On variable blocking factor in a parallel dynamic block-Jacobi SVD algorithm
Parallel Computing - Parallel matrix algorithms and applications (PMAA '02)
Matrix bidiagonalization: implementation and evaluation on the Trident processor
Neural, Parallel & Scientific Computations
Transforming Complex Loop Nests for Locality
The Journal of Supercomputing
Fast solution of large N × N matrix equations in an MIMD-SIMD hybrid system
Parallel Computing - Special issue: Parallel and distributed scientific and engineering computing
Self-adapting software for numerical linear algebra and LAPACK for clusters
Parallel Computing - Special issue: Parallel and distributed scientific and engineering computing
Using randomization to make recursive matrix algorithms practical
Journal of Functional Programming
Performance optimization of RK methods using block-based pipelining
Performance analysis and grid computing
Finite Elements in Analysis and Design
Generating node coordinates for shortest-path computations in transportation networks
Journal of Experimental Algorithmics (JEA)
High-performance linear algebra algorithms using new generalized data structures for matrices
IBM Journal of Research and Development
An algebraic approach to practical and scalable overlay network monitoring
Proceedings of the 2004 conference on Applications, technologies, architectures, and protocols for computer communications
A Framework for Approximating Eigenpairs in Electronic Structure Computations
Computing in Science and Engineering
ProtoMol, an object-oriented framework for prototyping novel algorithms for molecular dynamics
ACM Transactions on Mathematical Software (TOMS)
Journal of Computational and Applied Mathematics
Advances in Engineering Software
Semi-formal design of reliable mesh generation systems
Advances in Engineering Software
Supporting Cluster-Based Network Services on Functionally Symmetric Software Architecture
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Spherical blend skinning: a real-time deformation of articulated models
Proceedings of the 2005 symposium on Interactive 3D graphics and games
Understanding the efficiency of GPU algorithms for matrix-matrix multiplication
Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware
Subband decomposition approach for the simulation of quantum electron transport in nanostructures
Journal of Computational Physics
Automatic blocking of QR and LU factorizations for locality
MSP '04 Proceedings of the 2004 workshop on Memory system performance
Multicategory Proximal Support Vector Machine Classifiers
Machine Learning
A fully portable high performance minimal storage hybrid format Cholesky algorithm
ACM Transactions on Mathematical Software (TOMS)
The GrADS Project: Software Support for High-Level Grid Application Development
International Journal of High Performance Computing Applications
An overview of the Advanced CompuTational Software (ACTS) collection
ACM Transactions on Mathematical Software (TOMS) - Special issue on the Advanced CompuTational Software (ACTS) Collection
An overview of SuperLU: Algorithms, implementation, and user interface
ACM Transactions on Mathematical Software (TOMS) - Special issue on the Advanced CompuTational Software (ACTS) Collection
An overview of the Trilinos project
ACM Transactions on Mathematical Software (TOMS) - Special issue on the Advanced CompuTational Software (ACTS) Collection
Applied Numerical Mathematics - 6th IMACS International symposium on iterative methods in scientific computing
Efficient, causal camera tracking in unprepared environments
Computer Vision and Image Understanding
Multisurface Proximal Support Vector Machine Classification via Generalized Eigenvalues
IEEE Transactions on Pattern Analysis and Machine Intelligence
Journal of Computational Physics
A comparative investigation on subspace dimension determination
Neural Networks - 2004 Special issue: New developments in self-organizing systems
Jacobian Conditioning Analysis for Model Validation
Neural Computation
Programming and Computing Software
Symbolic-numeric efficient solution of optimal control problems for multibody systems
Journal of Computational and Applied Mathematics - Special issue: International workshop on the technological aspects of mathematics
IEEE Transactions on Pattern Analysis and Machine Intelligence
Computer Memory and Arithmetic: A Look under the Hood
Computing in Science and Engineering
Algorithm 853: An efficient algorithm for solving rank-deficient least squares problems
ACM Transactions on Mathematical Software (TOMS)
Osprey: a practical type system for validating dimensional unit correctness of C programs
Proceedings of the 28th international conference on Software engineering
Large-scale text categorization by batch mode active learning
Proceedings of the 15th international conference on World Wide Web
An approximate arrangement algorithm for semi-algebraic curves
Proceedings of the twenty-second annual symposium on Computational geometry
Pivoting for structured matrices and rational tangential interpolation
Contemporary mathematics
Optimizing locality and scalability of embedded Runge-Kutta solvers using block-based pipelining
Journal of Parallel and Distributed Computing
A parallel hybrid banded system solver: the SPIKE algorithm
Parallel Computing - Parallel matrix algorithms and applications (PMAA'04)
Building the functional performance model of a processor
Proceedings of the 2006 ACM symposium on Applied computing
Improving the performance of reduction to Hessenberg form
ACM Transactions on Mathematical Software (TOMS)
Error bounds from extra-precise iterative refinement
ACM Transactions on Mathematical Software (TOMS)
Algorithm 854: Fortran 77 subroutines for computing the eigenvalues of Hamiltonian matrices II
ACM Transactions on Mathematical Software (TOMS)
Exploiting semidefinite relaxations in constraint programming
Computers and Operations Research
Self-adapting numerical software (SANS) effort
IBM Journal of Research and Development
Journal of Computational and Applied Mathematics
Scalable Hybrid Designs for Linear Algebra on Reconfigurable Computing Systems
ICPADS '06 Proceedings of the 12th International Conference on Parallel and Distributed Systems - Volume 1
Benchmarking of high throughput computing applications on Grids
Parallel Computing
A new singular value decomposition algorithm suited to parallelization and preliminary results
ACST'06 Proceedings of the 2nd IASTED international conference on Advances in computer science and technology
Block algorithms for reordering standard and generalized Schur forms
ACM Transactions on Mathematical Software (TOMS)
The design and implementation of the MRRR algorithm
ACM Transactions on Mathematical Software (TOMS)
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
SMCtools '06 Proceeding from the 2006 workshop on Tools for solving structured Markov chains
Linear algebra operators for GPU implementation of numerical algorithms
SIGGRAPH '05 ACM SIGGRAPH 2005 Courses
Using dense storage to solve small sparse linear systems
ACM Transactions on Mathematical Software (TOMS)
Journal of Computational Physics
Analytically tractable case of fuzzy c-means clustering
Pattern Recognition
Multilevel domain decomposition for electronic structure calculations
Journal of Computational Physics
On parameter and state estimation for linear differential-algebraic equations
Automatica (Journal of IFAC)
Growth factor and expected growth factor of some pivoting strategies
Journal of Computational and Applied Mathematics
The memory behavior of cache oblivious stencil computations
The Journal of Supercomputing
SIPs: Shift-and-invert parallel spectral transformations
ACM Transactions on Mathematical Software (TOMS)
A conversion of an SDP having free variables into the standard form SDP
Computational Optimization and Applications
An evaluation of Java for numerical computing
Scientific Programming
JLAPACK - compiling LAPACK Fortran to Java
Scientific Programming
Recursive approach in sparse matrix LU factorization
Scientific Programming
OpenMP programming for a global inverse model
Scientific Programming - Hidden Markov Models
OpenMP issues arising in the development of parallel BLAS and LAPACK libraries
Scientific Programming - OpenMP
Improving locality for ODE solvers by program transformations
Scientific Programming
Numerical studies of time-independent and time-dependent scattering by several elliptical cylinders
Journal of Computational and Applied Mathematics
An experimental comparison of cache-oblivious and cache-conscious programs
Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures
Supermatrix out-of-order scheduling of matrix operations for SMP and multi-core architectures
Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures
Preventing Over-Fitting during Model Selection via Bayesian Regularisation of the Hyper-Parameters
The Journal of Machine Learning Research
ACM Transactions on Mathematical Software (TOMS)
deal.II—A general-purpose object-oriented finite element library
ACM Transactions on Mathematical Software (TOMS)
Implementation of a primal-dual method for SDP on a shared memory parallel architecture
Computational Optimization and Applications
High Performance Development for High End Computing With Python Language Wrapper (PLW)
International Journal of High Performance Computing Applications
Prony analysis for power system transient harmonics
EURASIP Journal on Applied Signal Processing
Quality-of-service class specific traffic matrices in IP/MPLS networks
Proceedings of the 7th ACM SIGCOMM conference on Internet measurement
Algebra-based scalable overlay network monitoring: algorithms, evaluation, and applications
IEEE/ACM Transactions on Networking (TON)
Block variants of Hammarling's method for solving Lyapunov equations
ACM Transactions on Mathematical Software (TOMS)
Parallel unsymmetric-pattern multifrontal sparse LU with column preordering
ACM Transactions on Mathematical Software (TOMS)
On the design of interfaces to sparse direct solvers
ACM Transactions on Mathematical Software (TOMS)
Improving the parallelism of iterative methods by aggressive loop fusion
The Journal of Supercomputing
A new implementation of the CMRH method for solving dense linear systems
Journal of Computational and Applied Mathematics
High performance dense linear algebra on a spatially distributed processor
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
SuperMatrix: a multithreaded runtime scheduling system for algorithms-by-blocks
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Local analysis of the feasible primal-dual interior-point method
Computational Optimization and Applications
Anatomy of high-performance matrix multiplication
ACM Transactions on Mathematical Software (TOMS)
Cache efficient bidiagonalization using BLAS 2.5 operators
ACM Transactions on Mathematical Software (TOMS)
Designing polylibraries to speed up linear algebra computations
International Journal of High Performance Computing and Networking
Partial stabilisation of large-scale discrete-time linear control systems
International Journal of Computational Science and Engineering
Consistent computation of first- and second-order differential quantities for surface meshes
Proceedings of the 2008 ACM symposium on Solid and physical modeling
Parallelization of a method for the solution of the inverse additive singular value problem
MATH'05 Proceedings of the 8th WSEAS International Conference on Applied Mathematics
A parallel algorithm based on a variant of the Kalman filter for solving the RLS problem
ISCGAV'04 Proceedings of the 4th WSEAS International Conference on Signal Processing, Computational Geometry & Artificial Vision
ISTASC'04 Proceedings of the 4th WSEAS International Conference on Systems Theory and Scientific Computation
Co-arrays in the next Fortran Standard
Scientific Programming - Fortran Programming Language and Scientific Programming: 50 Years of Mutual Growth
ACM Transactions on Mathematical Software (TOMS)
Families of algorithms related to the inversion of a Symmetric Positive Definite matrix
ACM Transactions on Mathematical Software (TOMS)
Algorithm 880: A testing infrastructure for symmetric tridiagonal eigensolvers
ACM Transactions on Mathematical Software (TOMS)
The impact of paravirtualized memory hierarchy on linear algebra computational kernels and software
HPDC '08 Proceedings of the 17th international symposium on High performance distributed computing
A PIC-MCC code for simulation of streamer propagation in air
Journal of Computational Physics
Parallel computation of the eigenvalues of symmetric Toeplitz matrices through iterative methods
Journal of Parallel and Distributed Computing
Organizing rushes video by visually similar setting
CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
A 7-step approach to the design and implementation of parallel algorithms
MATH'05 Proceedings of the 7th WSEAS International Conference on Applied Mathematics
Parallelizing CAD: a timely research agenda for EDA
Proceedings of the 45th annual Design Automation Conference
Dense Linear Algebra over Word-Size Prime Fields: the FFLAS and FFPACK Packages
ACM Transactions on Mathematical Software (TOMS)
Algorithm 887: CHOLMOD, Supernodal Sparse Cholesky Factorization and Update/Downdate
ACM Transactions on Mathematical Software (TOMS)
Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines
Scientific Programming
Multiple extremal eigenpairs by the power method
Journal of Computational Physics
Algorithm 888: Spherical Harmonic Transform Algorithms
ACM Transactions on Mathematical Software (TOMS)
A multi-level parallel simulation approach to electron transport in nano-scale transistors
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Adaptive Loop Tiling for a Multi-cluster CMP
ICA3PP '08 Proceedings of the 8th international conference on Algorithms and Architectures for Parallel Processing
Interval Subroutine Library Mission
Reliable Implementation of Real Number Algorithms: Theory and Practice
ICAISC '08 Proceedings of the 9th international conference on Artificial Intelligence and Soft Computing
Tridiagonalizing Complex Symmetric Matrices in Waveguide Simulations
ICCS '08 Proceedings of the 8th international conference on Computational Science, Part I
Performance Model for Parallel Mathematical Libraries Based on Historical Knowledgebase
Euro-Par '08 Proceedings of the 14th international Euro-Par conference on Parallel Processing
Solving the quadratic trust-region subproblem in a low-memory BFGS framework
Optimization Methods & Software - The Joint EUROPT-OMS Conference on Optimization, 4-7 July, 2007, Prague, Czech Republic, Part I
A sparse nonsymmetric eigensolver for distributed memory architectures
International Journal of Parallel, Emergent and Distributed Systems
Solving linear-quadratic optimal control problems on parallel computers
Optimization Methods & Software
Probability-one homotopy maps for mixed complementarity problems
Computational Optimization and Applications
CONTEST: A Controllable Test Matrix Toolbox for MATLAB
ACM Transactions on Mathematical Software (TOMS)
Dynamic Supernodes in Sparse Cholesky Update/Downdate and Triangular Solves
ACM Transactions on Mathematical Software (TOMS)
How to Write Fast Numerical Code: A Small Introduction
Generative and Transformational Techniques in Software Engineering II
Journal of Computational and Applied Mathematics
Design for Interoperability in stapl: pMatrices and Linear Algebra Algorithms
Languages and Compilers for Parallel Computing
Journal of Computational Physics
SBA: A software package for generic sparse bundle adjustment
ACM Transactions on Mathematical Software (TOMS)
Adaptive Winograd's matrix multiplications
ACM Transactions on Mathematical Software (TOMS)
Algorithm 894: On a block Schur-Parlett algorithm for ϕ-functions based on the sep-inverse estimate
ACM Transactions on Mathematical Software (TOMS)
High Performance Computing for Computational Science - VECPAR 2008
A Grid-Aware Web Portal with Advanced Service Trading for Linear Algebra Calculations
High Performance Computing for Computational Science - VECPAR 2008
On the Implementation of Boundary Element Engineering Codes on the Cell Broadband Engine
High Performance Computing for Computational Science - VECPAR 2008
Light interaction with human skin: from believable images to predictable models
ACM SIGGRAPH ASIA 2008 courses
Preconditioned Lanczos method for generalized Toeplitz eigenvalue problems
Journal of Computational and Applied Mathematics
Tailored least-squares solvers implementation for high-performance gravity field research
Computers & Geosciences
A tearing-based hybrid parallel banded linear system solver
Journal of Computational and Applied Mathematics
On the time-splitting scheme used in the Princeton Ocean Model
Journal of Computational Physics
A modeling-based classification algorithm validated with simulated data
Proceedings of the 40th Conference on Winter Simulation
Brief paper: Perturbation analysis and condition numbers of symmetric algebraic Riccati equations
Automatica (Journal of IFAC)
Anasazi software for the numerical solution of large-scale eigenvalue problems
ACM Transactions on Mathematical Software (TOMS)
Programming matrix algorithms-by-blocks for thread-level parallelism
ACM Transactions on Mathematical Software (TOMS)
Mapping the LU decomposition on a many-core architecture: challenges and solutions
Proceedings of the 6th ACM conference on Computing frontiers
Detecting Abnormal Trend Evolution over Multiple Data Streams
APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Parallelization of Advection-Diffusion-Chemistry Modules
Large-Scale Scientific Computing
A Note on Solving Problem 7 of the SIAM 100-Digit Challenge Using C-XSC
Numerical Validation in Current Hardware Architectures
A new algorithm for singular value decomposition and its parallelization
Parallel Computing
PetaBricks: a language and compiler for algorithmic choice
Proceedings of the 2009 ACM SIGPLAN conference on Programming language design and implementation
Efficient solution of the Schroedinger-Poisson equations in layered semiconductor devices
Journal of Computational Physics
Fast and Stable Polynomial Equation Solving and Its Application to Computer Vision
International Journal of Computer Vision
Non-splitting Tridiagonalization of Complex Symmetric Matrices
ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I
A Note on Auto-tuning GEMM for GPUs
ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I
A Parallel Nonnegative Tensor Factorization Algorithm for Mining Global Climate Data
ICCS 2009 Proceedings of the 9th International Conference on Computational Science
Advanced service trading for scientific computing over the grid
The Journal of Supercomputing
pCMALib: a parallel Fortran 90 library for the evolution strategy with covariance matrix adaptation
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Journal of Computational Physics
PSwarm: a hybrid solver for linearly constrained global derivative-free optimization
Optimization Methods & Software - Global Optimization
PyACTS: a python based interface to ACTS tools and parallel scientific applications
International Journal of Parallel Programming
A robust and efficient harmonic balance (HB) using direct solution of HB Jacobian
Proceedings of the 46th Annual Design Automation Conference
On the Need for a Consortium of Capability Centers
International Journal of High Performance Computing Applications
International Journal of High Performance Computing Applications
Applied Numerical Mathematics - 6th IMACS International symposium on iterative methods in scientific computing
ACM Transactions on Mathematical Software (TOMS)
Cache-optimal algorithms for option pricing
ACM Transactions on Mathematical Software (TOMS)
Dictionary learning for sparse approximations with the majorization method
IEEE Transactions on Signal Processing
Triangular matrix inversion on Graphics Processing Unit
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
Implementing sparse matrix-vector multiplication on throughput-oriented processors
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
Multidimensional first and second order symmetric Strang splitting for hyperbolic systems
Applied Numerical Mathematics
TOKAM-3D: A 3D fluid code for transport and turbulence in the edge plasma of Tokamaks
Journal of Computational Physics
Computation in multicriteria matroid optimization
Journal of Experimental Algorithmics (JEA)
Applying recursion to serial and parallel QR factorization leads to better performance
IBM Journal of Research and Development
Minimal-storage high-performance Cholesky factorization via blocking and recursion
IBM Journal of Research and Development
A study on quaternion block quasi-tridiagonal systems
Computers & Mathematics with Applications
Standardized mixed language programming for Fortran and C
ACM SIGPLAN Fortran Forum
Reservoir Size, Spectral Radius and Connectivity in Static Classification Problems
ICANN '09 Proceedings of the 19th International Conference on Artificial Neural Networks: Part I
Scaling LAPACK panel operations using parallel cache assignment
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
Airborne smoothing and mapping using vision and inertial sensors
ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Using Python for large scale linear algebra applications
Future Generation Computer Systems - Special section: Complex problem-solving environments for grid computing
Symbolic-numeric efficient solution of optimal control problems for multibody systems
Journal of Computational and Applied Mathematics - Special issue: International workshop on the technological aspects of mathematics
WSEAS TRANSACTIONS on COMMUNICATIONS
SE '08 Proceedings of the IASTED International Conference on Software Engineering
EFCOSS: An interactive environment facilitating optimal experimental design
ACM Transactions on Mathematical Software (TOMS)
Rectangular full packed format for Cholesky's algorithm: factorization, solution, and inversion
ACM Transactions on Mathematical Software (TOMS)
Heuristic approach for multiple queries of 3D N-finger frictional force closure grasp
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Zirkonium: Non-invasive software for sound spatialisation
Organised Sound
Algorithms for memory hierarchies: advanced lectures
Algorithms for memory hierarchies: advanced lectures
Discretization correction of general integral PSE Operators for particle methods
Journal of Computational Physics
Time-memory trade-offs using sparse matrix methods for large-scale eigenvalue problems
ICCSA'03 Proceedings of the 2003 international conference on Computational science and its applications: Part I
VECPAR'06 Proceedings of the 7th international conference on High performance computing for computational science
PyACTS: a high-level framework for fast development of high performance applications
VECPAR'06 Proceedings of the 7th international conference on High performance computing for computational science
Evaluation of linear solvers for astrophysics transfer problems
VECPAR'06 Proceedings of the 7th international conference on High performance computing for computational science
Semantic-based service trading: application to linear algebra
VECPAR'06 Proceedings of the 7th international conference on High performance computing for computational science
Self-adapting software for numerical linear algebra library routines on clusters
ICCS'03 Proceedings of the 2003 international conference on Computational science: Part III
A parallel algorithm for systems of convection-diffusion equations
NMA'06 Proceedings of the 6th international conference on Numerical methods and applications
Improving data locality by chunking
CC'03 Proceedings of the 12th international conference on Compiler construction
Superquadratic convergence of DLASQ for computing matrix singular values
Journal of Computational and Applied Mathematics
Parallel variants of the multishift QZ algorithm with advanced deflation techniques
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
LAPACK-style codes for pivoted Cholesky and QR updating
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
Minimal data copy for dense linear algebra factorization
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
Rectangular full packed format for LAPACK algorithms timings on several computers
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
PPAM'07 Proceedings of the 7th international conference on Parallel processing and applied mathematics
PPAM'07 Proceedings of the 7th international conference on Parallel processing and applied mathematics
Parallel tiled QR factorization for multicore architectures
PPAM'07 Proceedings of the 7th international conference on Parallel processing and applied mathematics
Parallel solution of band linear systems in model reduction
PPAM'07 Proceedings of the 7th international conference on Parallel processing and applied mathematics
Minimum variance associations: discovering relationships in numerical data
PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Optimization of BLAS on the cell processor
HiPC'08 Proceedings of the 15th international conference on High performance computing
Scheduling two-sided transformations using tile algorithms on multicore architectures
Scientific Programming
Towards dense linear algebra for hybrid GPU accelerated manycore systems
Parallel Computing
Low complexity DFT-domain noise PSD tracking using high-resolution periodograms
EURASIP Journal on Advances in Signal Processing
Compact integration factor methods for complex domains and adaptive mesh refinement
Journal of Computational Physics
Handling task dependencies under strided and aliased references
Proceedings of the 24th ACM International Conference on Supercomputing
Managing the complexity of lookahead for LU factorization with pivoting
Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
Journal of Computational and Applied Mathematics
Blind detection of interleaver parameters for non-binary coded data streams
ICC'09 Proceedings of the 2009 IEEE international conference on Communications
Time-varying root-locus of large-signal LC oscillators
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Parallel Colt: A High-Performance Java Library for Scientific Computing and Image Processing
ACM Transactions on Mathematical Software (TOMS)
ACM Transactions on Mathematical Software (TOMS)
Mathematics and Computers in Simulation
Identifying software usage at HPC centers with the automatic library tracking database
Proceedings of the 2010 TeraGrid Conference
Journal of Computational Physics
Light & Skin Interactions: Simulations for Computer Graphics Applications
Light & Skin Interactions: Simulations for Computer Graphics Applications
Journal of Computational Physics
Algebraic and numerical algorithms
Algorithms and theory of computation handbook
On parallelizing the MRRR algorithm for data-parallel coprocessors
PPAM'09 Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part I
A Parallel Implementation of Electron-Phonon Scattering in Nanoelectronic Devices up to 95k Cores
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
An Improved MAGMA GEMM for Fermi Graphics Processing Units
International Journal of High Performance Computing Applications
Joint data QR-detection and Kalman estimation for OFDM time-varying Rayleigh channel complex gains
IEEE Transactions on Communications
Parallel ICA methods for EEG neuroimaging
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Algorithm engineering: bridging the gap between algorithm theory and practice
Algorithm engineering: bridging the gap between algorithm theory and practice
Architecture and implementation of a distributed reconfigurable metacomputer
ISPDC'03 Proceedings of the Second international conference on Parallel and distributed computing
Partitioned Triangular Tridiagonalization
ACM Transactions on Mathematical Software (TOMS)
Exact solutions to linear systems of equations using output sensitive lifting
ACM Communications in Computer Algebra
Solving cubics by polynomial fitting
Journal of Computational and Applied Mathematics
SIAM Journal on Scientific Computing
A Novel Parallel QR Algorithm for Hybrid Distributed Memory HPC Systems
SIAM Journal on Scientific Computing
On the vectorization of engineering codes using multimedia instructions
VECPAR'10 Proceedings of the 9th international conference on High performance computing for computational science
Numerical library reuse in parallel and distributed platforms
VECPAR'10 Proceedings of the 9th international conference on High performance computing for computational science
On a strategy for spectral clustering with parallel computation
VECPAR'10 Proceedings of the 9th international conference on High performance computing for computational science
Toward high-quality modal contact sound
ACM SIGGRAPH 2011 papers
ACS'06 Proceedings of the 6th WSEAS international conference on Applied computer science
Journal of Computational Physics
Staged static techniques to efficiently implement array copy semantics in a MATLAB JIT compiler
CC'11/ETAPS'11 Proceedings of the 20th international conference on Compiler construction: part of the joint European conferences on theory and practice of software
On the computation of protein backbones by using artificial backbones of hydrogens
Journal of Global Optimization
Complexity Reduction by Using QR-Based Scheme in Computing Capacity for Optimal Transmission
Wireless Personal Communications: An International Journal
ICCSA'11 Proceedings of the 2011 international conference on Computational science and its applications - Volume Part I
Solving eigenvalue problems on curved surfaces using the Closest Point Method
Journal of Computational Physics
Smart cache cleaning: energy efficient vulnerability reduction in embedded processors
CASES '11 Proceedings of the 14th international conference on Compilers, architectures and synthesis for embedded systems
Knowledge-based automatic generation of partitioned matrix expressions
CASC'11 Proceedings of the 13th international conference on Computer algebra in scientific computing
FATODE: a library for forward, adjoint and tangent linear integration of stiff systems
Proceedings of the 19th High Performance Computing Symposia
Algorithm 915, SuiteSparseQR: Multifrontal multithreaded rank-revealing sparse QR factorization
ACM Transactions on Mathematical Software (TOMS)
Partial factorization of a dense symmetric indefinite matrix
ACM Transactions on Mathematical Software (TOMS)
Atomistic nanoelectronic device engineering with sustained performances up to 1.44 PFlop/s
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
A threaded SPIKE algorithm for solving general banded systems
Parallel Computing
MR3-SMP: A symmetric tridiagonal eigensolver for multi-core architectures
Parallel Computing
Parallel filtering in global gyrokinetic simulations
Journal of Computational Physics
Optimizing Halley's Iteration for Computing the Matrix Polar Decomposition
SIAM Journal on Matrix Analysis and Applications
Quasi-Newton Methods on Grassmannians and Multilinear Approximations of Tensors
SIAM Journal on Scientific Computing
Communication-optimal Parallel and Sequential Cholesky Decomposition
SIAM Journal on Scientific Computing
Superconvergent Functional Estimates from Summation-By-Parts Finite-Difference Discretizations
SIAM Journal on Scientific Computing
Sensitivity Analysis of Limit-Cycle Oscillating Hybrid Systems
SIAM Journal on Scientific Computing
Applying data copy to improve memory performance of general array computations
LCPC'05 Proceedings of the 18th international conference on Languages and Compilers for Parallel Computing
Towards high performance discrete-event simulations of smart electric grids
Proceedings of the first international workshop on High performance computing, networking and analytics for the power grid
An introduction to GPU accelerated surgical simulation
ISBMS'06 Proceedings of the Third international conference on Biomedical Simulation
Aggressive loop fusion for improving locality and parallelism
ISPA'05 Proceedings of the Third international conference on Parallel and Distributed Processing and Applications
An evaluation methodology for computational grids
HPCC'05 Proceedings of the First international conference on High Performance Computing and Communications
Interfacing with the numerical homotopy algorithms in PHCpack
ICMS'06 Proceedings of the Second international conference on Mathematical Software
Eigen-Genomic System Dynamic-Pattern Analysis (ESDA): Modeling mRNA Degradation and Self-Regulation
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
High performance matrix inversion based on LU factorization for multicore architectures
Proceedings of the 2011 ACM international workshop on Many task computing on grids and supercomputers
Parallel solution of large-scale and sparse generalized algebraic Riccati equations
Euro-Par'06 Proceedings of the 12th international conference on Parallel Processing
Parallelising matrix operations on clusters for an optimal control-based quantum compiler
Euro-Par'06 Proceedings of the 12th international conference on Parallel Processing
Parallel model reduction of large linear descriptor systems via balanced truncation
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Parallel boundary elements: a portable 3-D elastostatic implementation for shared memory systems
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Numerical integration of the differential Riccati equation: a high performance computing approach
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Comparison of different parallel modified Gram-Schmidt algorithms
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
High performance linear algebra algorithms: an introduction
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
Applying software testing metrics to LAPACK
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
A new array format for symmetric and triangular matrices
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
Semi-automatic generation of grid computing interfaces for numerical software libraries
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
ALPS: a software framework for parallel space-time adaptive processing
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
The inner structure of sensitivities in nodal based shape optimisation
Computational Mechanics
Output error estimation for summation-by-parts finite-difference schemes
Journal of Computational Physics
A new time-dependent complexity reduction method for biochemical systems
Transactions on Computational Systems Biology I
Journal of Computational Physics
A compatible Lagrangian hydrodynamic scheme for multicomponent flows with mixing
Journal of Computational Physics
On aggressive early deflation in parallel variants of the QR algorithm
PARA'10 Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume Part I
PARA'10 Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume Part I
The algorithm of multiple relatively robust representations for multi-core processors
PARA'10 Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume Part I
Efficient implementation of interval matrix multiplication
PARA'10 Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume 2
AI'11 Proceedings of the 24th international conference on Advances in Artificial Intelligence
Advances in Data Analysis and Classification
To CG or to HDG: A Comparative Study
Journal of Scientific Computing
Journal of Scientific Computing
Journal of Computational Physics
Semi-automatic sparse preconditioners for high-order finite element methods on non-uniform meshes
Journal of Computational Physics
Two-stage least squares and indirect least squares algorithms for simultaneous equations models
Journal of Computational and Applied Mathematics
Computing eigenvectors of block tridiagonal matrices based on twisted block factorizations
Journal of Computational and Applied Mathematics
High-Performance matrix-vector multiplication on the GPU
Euro-Par'11 Proceedings of the 2011 international conference on Parallel Processing
Estimating conditioning of BVPs for ODEs
Mathematical and Computer Modelling: An International Journal
Algebraic stabilization of explicit numerical integration for extremely stiff reaction networks
Journal of Computational Physics
A hybridization between memetic algorithm and semidefinite relaxation for the max-cut problem
Proceedings of the 14th annual conference on Genetic and evolutionary computation
ACM Transactions on Mathematical Software (TOMS)
Error handling in Fortran 2003
ACM SIGPLAN Fortran Forum
An Algebraic Multigrid Method with Guaranteed Convergence Rate
SIAM Journal on Scientific Computing
A New Truncation Strategy for the Higher-Order Singular Value Decomposition
SIAM Journal on Scientific Computing
Divide and Conquer on Hybrid GPU-Accelerated Multicore Systems
SIAM Journal on Scientific Computing
Communication-optimal Parallel and Sequential QR and LU Factorizations
SIAM Journal on Scientific Computing
Computing All or Some Eigenvalues of Symmetric $\mathcal{H}_{\ell}$-Matrices
SIAM Journal on Scientific Computing
Detecting Localization in an Invariant Subspace
SIAM Journal on Scientific Computing
CALU: A Communication Optimal LU Factorization Algorithm
SIAM Journal on Matrix Analysis and Applications
Speeding up solving of differential matrix Riccati equations using GPGPU computing and MATLAB
Concurrency and Computation: Practice & Experience
New level-3 BLAS kernels for Cholesky factorization
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Cache blocking for linear algebra algorithms
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Reducing the amount of pivoting in symmetric indefinite systems
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Performance analysis of parallel alternating directions algorithm for time dependent problems
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Auto-tuning dense vector and matrix-vector operations for Fermi GPUs
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Implementations of main algorithms for generalized eigenproblem on GPU accelerator
ICSI'12 Proceedings of the Third international conference on Advances in Swarm Intelligence - Volume Part II
Self-taught dimensionality reduction on the high-dimensional small-sized data
Pattern Recognition
Statically typed matrix: in C++ library
Proceedings of the Fifth Balkan Conference in Informatics
A web-based structural health monitoring sensor network
International Journal of Computer Applications in Technology
Families of Algorithms for Reducing a Matrix to Condensed Form
ACM Transactions on Mathematical Software (TOMS)
ArtSurf: a method for deformable partial matching of protein small-molecule binding sites
Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine
Design of multi-dimensional transfer functions using dimensional reduction
EUROVIS'07 Proceedings of the 9th Joint Eurographics / IEEE VGTC conference on Visualization
Locality optimized shared-memory implementations of iterated Runge-Kutta methods
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Bob: a free signal processing and machine learning toolbox for researchers
Proceedings of the 20th ACM international conference on Multimedia
Concurrency and Computation: Practice & Experience
DVFS-control techniques for dense linear algebra operations on multi-core processors
Computer Science - Research and Development
Computer Science - Research and Development
Avoiding communication through a multilevel LU factorization
Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
Swarm capability of finding eigenvalues
AIMSA'12 Proceedings of the 15th international conference on Artificial Intelligence: methodology, systems, and applications
Accelerating Linear System Solutions Using Randomization Techniques
ACM Transactions on Mathematical Software (TOMS)
Level-3 Cholesky Factorization Routines Improve Performance of Many Cholesky Algorithms
ACM Transactions on Mathematical Software (TOMS)
Elemental: A New Framework for Distributed Memory Dense Matrix Computations
ACM Transactions on Mathematical Software (TOMS)
A pure L1-norm principal component analysis
Computational Statistics & Data Analysis
Euro-Par'12 Proceedings of the 18th international conference on Parallel processing workshops
High-performance bidiagonal reduction using tile algorithms on homogeneous multicore architectures
ACM Transactions on Mathematical Software (TOMS)
Blocked Schur algorithms for computing the matrix square root
PARA'12 Proceedings of the 11th international conference on Applied Parallel and Scientific Computing
Performance modeling of pipelined linear algebra architectures on FPGAs
ARC'13 Proceedings of the 9th international conference on Reconfigurable Computing: architectures, tools, and applications
SemCache: semantics-aware caching for efficient GPU offloading
Proceedings of the 27th international ACM conference on International conference on supercomputing
Toward a scalable multi-GPU eigensolver via compute-intensive kernels and efficient communication
Proceedings of the 27th international ACM conference on International conference on supercomputing
Robust multilevel solvers for high-contrast anisotropic multiscale problems
Journal of Computational and Applied Mathematics
Scaling LAPACK panel operations using parallel cache assignment
ACM Transactions on Mathematical Software (TOMS)
Algorithm 930: FACTORIZE: An object-oriented linear system solver for MATLAB
ACM Transactions on Mathematical Software (TOMS)
Cache efficient implementation for block matrix operations
Proceedings of the High Performance Computing Symposium
Towards a functional run-time for dense NLA domain
Proceedings of the 2nd ACM SIGPLAN workshop on Functional high-performance computing
Parallel reduction to Hessenberg form with algorithm-based fault tolerance
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Patchwork algorithm for the parallel computation of the Green's function in open systems
Journal of Computational Electronics
A multicore solution to Block-Toeplitz linear systems of equations
The Journal of Supercomputing
ACM Transactions on Mathematical Software (TOMS)
CPU-GPU hybrid bidiagonal reduction with soft error resilience
ScalA '13 Proceedings of the Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems
SE-HPCCSE '13 Proceedings of the 1st International Workshop on Software Engineering for High Performance Computing in Computational Science and Engineering
C2FPGA-A dependency-timing graph design methodology
Journal of Parallel and Distributed Computing
A shift strategy for superquadratic convergence in the dqds algorithm for singular values
Journal of Computational and Applied Mathematics
A case study in mechanically deriving dense linear algebra code
International Journal of High Performance Computing Applications
Application-tailored linear algebra algorithms: A search-based approach
International Journal of High Performance Computing Applications
VBARMS: A variable block algebraic recursive multilevel solver for sparse linear systems
Journal of Computational and Applied Mathematics
A Basic Linear Algebra Compiler
Proceedings of Annual IEEE/ACM International Symposium on Code Generation and Optimization
Efficient search for inputs causing high floating-point errors
Proceedings of the 19th ACM SIGPLAN symposium on Principles and practice of parallel programming
Fast iterative graph computation with block updates
Proceedings of the VLDB Endowment
Optimally packed chains of bulges in multishift QR algorithms
ACM Transactions on Mathematical Software (TOMS)
Scalable matrix decompositions with multiple cores on FPGAs
Microprocessors & Microsystems
Fast matrix decomposition in F2
Journal of Computational and Applied Mathematics
Basic Singular Spectrum Analysis and forecasting with R
Computational Statistics & Data Analysis
Detecting the causes of ill-conditioning in structural finite element models
Computers and Structures
Speeding up NEC electromagnetic simulation using GPU technology for antenna design problems
International Journal of Computing Science and Mathematics
Feature selection with SVD entropy: Some modification and extension
Information Sciences: an International Journal
Classification of brain activation via spatial Bayesian variable selection in fMRI regression
Advances in Data Analysis and Classification
Amesos2 and Belos: Direct and iterative solvers for large sparse linear systems
Scientific Programming
Stability of rootfinding for barycentric Lagrange interpolants
Numerical Algorithms
Trends and outlook for the massive-scale analytics stack
IBM Journal of Research and Development