Memory storage patterns in parallel processing
Memory storage patterns in parallel processing
Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology
ICS '97 Proceedings of the 11th international conference on Supercomputing
Optimisation of component-based applications within a grid environment
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Parallel Programming Using Skeleton Functions
PARLE '93 Proceedings of the 5th International PARLE Conference on Parallel Architectures and Languages Europe
Runtime Interprocedural Data Placement Optimisation for Lazy Parallel Libraries (Extended Abstract)
Euro-Par '97 Proceedings of the Third International Euro-Par Conference on Parallel Processing
Array Design and Expression Evaluation in POOMA II
ISCOPE '98 Proceedings of the Second International Symposium on Computing in Object-Oriented Parallel Environments
ISCOPE '98 Proceedings of the Second International Symposium on Computing in Object-Oriented Parallel Environments
ISCOPE '98 Proceedings of the Second International Symposium on Computing in Object-Oriented Parallel Environments
IPDPS '00 Proceedings of the 14th International Symposium on Parallel and Distributed Processing
A Proposal for a Set of Parallel Basic Linear Algebra Subprograms
A Proposal for a Set of Parallel Basic Linear Algebra Subprograms
LAPACK Working Note 102: IML++ v. 1.2: Iterative Methods Library Reference Guide
LAPACK Working Note 102: IML++ v. 1.2: Iterative Methods Library Reference Guide
Incorporating application semantics and control into compilation
DSL'97 Proceedings of the Conference on Domain-Specific Languages on Conference on Domain-Specific Languages (DSL), 1997
Overcoming barriers to restructuring in a modular visualisation environment
LCR '04 Proceedings of the 7th workshop on Workshop on languages, compilers, and run-time support for scalable systems
Statistical Models for Empirical Search-Based Performance Tuning
International Journal of High Performance Computing Applications
DESOLA: An active linear algebra library using delayed evaluation and runtime code generation
Science of Computer Programming
A domain-specific interpreter for parallelizing a large mixed-language visualisation application
LCPC'05 Proceedings of the 18th international conference on Languages and Compilers for Parallel Computing
Optimization techniques for efficient HTA programs
Parallel Computing
Hi-index | 0.00 |
We argue that delayed-evaluation, self-optimising scientific software components, which dynamically change their behaviour according to their calling context at runtime offer a possible way of bridging the apparent conflict between the quality of scientific software and its performance. Rather than equipping scientific software components with a performance interface which allows the caller to supply the context information that is lost when building abstract software components, we propose to recapture this lost context information at runtime. This paper is accompanied by a public release of a parallel linear algebra library with both C and C++ language interfaces which implements this proposal. We demonstrate the usability of this library by showing that it can be used to supply linear algebra component functionality to an existing external software package. We give preliminary performance figures and discuss avenues for future work.