Managing Multicore with OpenMP (Extended Abstract)
Proceedings of the 15th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Performance Improvement Using Parallel Simulation Protocol and Time Warp for DEVS Based Applications
DS-RT '08 Proceedings of the 2008 12th IEEE/ACM International Symposium on Distributed Simulation and Real-Time Applications
Survey on Parallel Programming Model
NPC '08 Proceedings of the IFIP International Conference on Network and Parallel Computing
Proceedings of the 4th workshop on Declarative aspects of multicore programming
CompSysTech '08 Proceedings of the 9th International Conference on Computer Systems and Technologies and Workshop for PhD Students in Computing
Semi-automatic parallelization of direct and inverse problems for geothermal simulation
Proceedings of the 2009 ACM symposium on Applied Computing
Multi-core parallelization in Clojure: a case study
Proceedings of the 6th European Lisp Workshop
Journal of Computational Physics
Numerical Simulations of Unsteady Shock Wave Interactions Using SaC and Fortran-90
PaCT '09 Proceedings of the 10th International Conference on Parallel Computing Technologies
Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Using OpenMP vs. Threading Building Blocks for Medical Imaging on Multi-cores
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Proceedings of the 2009 International Conference on Hybrid Information Technology
The Quest for Parallel Reasoning on the Semantic Web
AMT '09 Proceedings of the 5th International Conference on Active Media Technology
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
PLOS 2009: fifth workshop on programming languages and operating systems
ACM SIGOPS Operating Systems Review
State-of-the-art in heterogeneous computing
Scientific Programming
Online mapping of MPI-2 dynamic tasks to processes and threads
International Journal of High Performance Systems Architecture
BCYCLIC: A parallel block tridiagonal matrix cyclic solver
Journal of Computational Physics
Understanding complex multithreaded software systems by using trace visualization
Proceedings of the 5th international symposium on Software visualization
Empirical analysis of parallelism overheads on CMPs
PPAM'09 Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part I
Automatic Run-time Parallelization and Transformation of I/O
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
Generators-of-generators library with optimization capabilities in fortress
Euro-Par'10 Proceedings of the 16th international Euro-Par conference on Parallel processing: Part II
Programming CUDA-based GPUs to simulate two-layer shallow water flows
Euro-Par'10 Proceedings of the 16th international Euro-Par conference on Parallel processing: Part II
Challenges and issues of supporting task parallelism in MPI
EuroMPI'10 Proceedings of the 17th European MPI users' group meeting conference on Recent advances in the message passing interface
Algorithm engineering: bridging the gap between algorithm theory and practice
Algorithm engineering: bridging the gap between algorithm theory and practice
Task management for irregular-parallel workloads on the GPU
Proceedings of the Conference on High Performance Graphics
Heterogeneous spline surface intersections
Proceedings of the 26th Spring Conference on Computer Graphics
Journal of Computational Physics
Parallel programming for multimedia applications
Multimedia Tools and Applications
International Journal of High Performance Computing Applications
Light-weight communications on Intel's single-chip cloud computer processor
ACM SIGOPS Operating Systems Review
Breadth in depth: a 1st year introduction to parallel programming
Proceedings of the 42nd ACM technical symposium on Computer science education
A Block FSAI-ILU Parallel Preconditioner for Symmetric Positive Definite Linear Systems
SIAM Journal on Scientific Computing
RMS-TM: a comprehensive benchmark suite for transactional memory systems
Proceedings of the 2nd ACM/SPEC International Conference on Performance engineering
Using aspect-orientation to simplify concurrent programming
Proceedings of the tenth international conference on Aspect-oriented software development companion
GPGPU-based parallel computation: application to molecular dynamics problems
COMPUTE '11 Proceedings of the Fourth Annual ACM Bangalore Conference
Mint: realizing CUDA performance in 3D stencil methods with annotated C
Proceedings of the international conference on Supercomputing
A framework for modeling mosquito vectors
Proceedings of the 2010 Summer Computer Simulation Conference
PERCS: the IBM power7-IH high-performance computing system
IBM Journal of Research and Development
PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
OpenMPspy: Leveraging quality assurance for parallel software
Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part II
Cache size in a cost model for heterogeneous skeletons
Proceedings of the fifth international workshop on High-level parallel programming and applications
A survey on parallel ant colony optimization
Applied Soft Computing
Sisal 3.2 language features overview
PaCT'11 Proceedings of the 11th international conference on Parallel computing technologies
PaCT'11 Proceedings of the 11th international conference on Parallel computing technologies
Parallel appearance-adaptive models for real-time object tracking using particle swarm optimization
ICCCI'11 Proceedings of the Third international conference on Computational collective intelligence: technologies and applications - Volume Part II
Simulation of one-layer shallow water systems on multicore and CUDA architectures
The Journal of Supercomputing
Algorithm 915, SuiteSparseQR: Multifrontal multithreaded rank-revealing sparse QR factorization
ACM Transactions on Mathematical Software (TOMS)
MIRAGE'11 Proceedings of the 5th international conference on Computer vision/computer graphics collaboration techniques
Parallel implementations of gusfield's cut tree algorithm
ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part I
Journal of Computational Physics
fMRI analysis on the GPU-Possibilities and challenges
Computer Methods and Programs in Biomedicine
Provenance for MapReduce-based data-intensive workflows
Proceedings of the 6th workshop on Workflows in support of large-scale science
The Karlsruhe Institute of Technology translation systems for the WMT 2011
WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Automatic parallelization of recursive functions using quantifier elimination
FLOPS'10 Proceedings of the 10th international conference on Functional and Logic Programming
Performance evaluation of OpenMP-based algorithms for handling Kronecker descriptors
Journal of Parallel and Distributed Computing
Algorithmic skeletons for multi-core, multi-GPU systems and clusters
International Journal of High Performance Computing and Networking
Towards a codelet-based runtime for exascale computing: position paper
Proceedings of the 2nd International Workshop on Adaptive Self-Tuning Computing Systems for the Exaflop Era
Structural and Multidisciplinary Optimization
True 4D image denoising on the GPU
Journal of Biomedical Imaging - Special issue on Parallel Computation in Medical Imaging Applications
Solving the flexible job shop problem on GPU
ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part II
SIDE'12 Proceedings of the 2012 international conference on Swarm and Evolutionary Computation
An object-oriented bulk synchronous parallel library for multicore programming
Concurrency and Computation: Practice & Experience
Code Parallelization for Multi-Core Software Defined Radio Platforms with OpenMP
Journal of Signal Processing Systems
Parallel algorithm for landform attributes representation on multicore and Multi-GPU systems
ICCSA'12 Proceedings of the 12th international conference on Computational Science and Its Applications - Volume Part I
Multiscale modeling of heterogeneous media applying AEH to 3d bodies
ICCSA'12 Proceedings of the 12th international conference on Computational Science and Its Applications - Volume Part I
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Hierarchical parallel approach in vascular network modeling: hybrid MPI+OpenMP implementation
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Autotuning of adaptive mesh refinement PDE solvers on shared memory architectures
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Real-time multi-view human motion tracking using particle swarm optimization with resampling
AMDO'12 Proceedings of the 7th international conference on Articulated Motion and Deformable Objects
CEFP'11 Proceedings of the 4th Summer School conference on Central European Functional Programming School
Reasoning about multi-process systems with the box calculus
CEFP'11 Proceedings of the 4th Summer School conference on Central European Functional Programming School
Coalition threading: combining traditional andnon-traditional parallelism to maximize scalability
Proceedings of the 21st international conference on Parallel architectures and compilation techniques
Journal of Computing Sciences in Colleges
Hybrid parallelization for multi-view visualization of time-dependent simulation data
EG PGV'09 Proceedings of the 9th Eurographics conference on Parallel Graphics and Visualization
Small and large deformation analysis with the p- and B-spline versions of the Finite Cell Method
Computational Mechanics
Cache-sensitive MapReduce DGEMM algorithms for shared memory architectures
Proceedings of the South African Institute for Computer Scientists and Information Technologists Conference
Performance optimisation of sequential programs on multi-core processors
Proceedings of the South African Institute for Computer Scientists and Information Technologists Conference
Computers and Industrial Engineering
A Framework for Modeling Genetically-Aware Mosquito Vectors for Sterile Insect Technique
International Journal of Agent Technologies and Systems
Cellular Automata and GPGPU: An Application to Lava Flow Modeling
International Journal of Grid and High Performance Computing
Comparing the performance of stochastic simulation on GPUs and OpenMP
International Journal of Computational Science and Engineering
Parallel Bayesian inference of range and reflectance from LaDAR profiles
Journal of Parallel and Distributed Computing
Pyjama: OpenMP-like implementation for Java, with GUI extensions
Proceedings of the 2013 International Workshop on Programming Models and Applications for Multicores and Manycores
Multi-core implementation of the differential ant-stigmergy algorithm for numerical optimization
The Journal of Supercomputing
Optimized contrast enhancement for real-time image and video dehazing
Journal of Visual Communication and Image Representation
Proceedings of the 6th International Conference on Computer Vision / Computer Graphics Collaboration Techniques and Applications
Proceedings of the third ACM SIGPLAN X10 Workshop
Accelerating tandem MS protein database searches using OpenCL
Proceedings of the 3rd international workshop on Emerging computational methods for the life sciences
Triangulating molecular surfaces on multiple GPUs
Proceedings of the 20th European MPI Users' Group Meeting
A parallel dual-scanline algorithm for partitioning parameterized 45-degree polygons
ACM Transactions on Design Automation of Electronic Systems (TODAES) - Special Section on Networks on Chip: Architecture, Tools, and Methodologies
GPU-based approaches for real-time sound source localization using the SRP-PHAT algorithm
International Journal of High Performance Computing Applications
On supernode transformations and multithreading for the longest common subsequence problem
AusPDC '12 Proceedings of the Tenth Australasian Symposium on Parallel and Distributed Computing - Volume 127
Proceedings of the Ninth IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis
Expandable process networks to efficiently specify and explore task, data, and pipeline parallelism
Proceedings of the 2013 International Conference on Compilers, Architectures and Synthesis for Embedded Systems
Range query processing on single and multi GPU environments
Computers and Electrical Engineering
Computers in Biology and Medicine
A Case Study of Implementing Supernode Transformations
International Journal of Parallel Programming
Numerical integration on GPUs for higher order finite elements
Computers & Mathematics with Applications
Approaches to Distributed Execution of Scientific Workflows in Kepler
Fundamenta Informaticae - Scalable Workflow Enactment Engines and Technology
Hi-index | 0.02 |
"I hope that readers will learn to use the full expressibility and power of OpenMP. This book should provide an excellent introduction to beginners, and the performance section should help those with some experience who want to push OpenMP to its limits." --from the foreword by David J. Kuck, Intel Fellow, Software and Solutions Group, and Director, Parallel and Distributed Solutions, Intel Corporation OpenMP, a portable programming interface for shared memory parallel computers, was adopted as an informal standard in 1997 by computer scientists who wanted a unified model on which to base programs for shared memory systems. OpenMP is now used by many software developers; it offers significant advantages over both hand-threading and MPI. Using OpenMP offers a comprehensive introduction to parallel programming concepts and a detailed overview of OpenMP. Using OpenMP discusses hardware developments, describes where OpenMP is applicable, and compares OpenMP to other programming interfaces for shared and distributed memory parallel architectures. It introduces the individual features of OpenMP, provides many source code examples that demonstrate the use and functionality of the language constructs, and offers tips on writing an efficient OpenMP program. It describes how to use OpenMP in full-scale applications to achieve high performance on large-scale architectures, discussing several case studies in detail, and offers in-depth troubleshooting advice. It explains how OpenMP is translated into explicitly multithreaded code, providing a valuable behind-the-scenes account of OpenMP program performance. Finally, Using OpenMP considers trends likely to influence OpenMP development, offering a glimpse of the possibilities of a future OpenMP 3.0 from the vantage point of the current OpenMP 2.5. With multicore computer use increasing, the need for a comprehensive introduction and overview of the standard interface is clear. Using OpenMP provides an essential reference not only for students at both undergraduate and graduate levels but also for professionals who intend to parallelize existing codes or develop new parallel programs for shared memory computer architectures.