Parallel programming
The authors present a new way to achieve scalability in parallel software with OpenMP, their portable alternative to message passing. They discuss its capabilities through specific examples and comparisons with other standard parallel programming models.