Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Intel® threading building blocks
Journal of Computing Sciences in Colleges
Larrabee: a many-core x86 architecture for visual computing
ACM SIGGRAPH 2008 papers
Putting intel® threading building blocks to work
Proceedings of the 1st international workshop on Multicore software engineering
Software engineering for multicore systems: an experience report
Proceedings of the 1st international workshop on Multicore software engineering
YetiSim: a C++ simulation library with execution graphs instead of coroutines
Proceedings of the 2008 Spring simulation multiconference
Design Issues in Parallel Array Languages for Shared Memory
SAMOS '08 Proceedings of the 8th international workshop on Embedded Computer Systems: Architectures, Modeling, and Simulation
Fine-Grained Task Scheduling Using Adaptive Data Structures
Euro-Par '08 Proceedings of the 14th international Euro-Par conference on Parallel Processing
A Design Pattern for Component Oriented Development of Agent Based Multithreaded Applications
Euro-Par '08 Proceedings of the 14th international Euro-Par conference on Parallel Processing
Deque-Free Work-Optimal Parallel STL Algorithms
Euro-Par '08 Proceedings of the 14th international Euro-Par conference on Parallel Processing
Backtracking-based load balancing
Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming
Effective performance measurement and analysis of multithreaded applications
Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming
Load balancing using work-stealing for pipeline parallelism in emerging applications
Proceedings of the 23rd international conference on Supercomputing
Parallel geometric algorithms for multi-core computers
Proceedings of the twenty-fifth annual symposium on Computational geometry
Dependency-aware reordering for parallelizing query optimization in multi-core CPUs
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Developing parallel programs: A design-oriented perspective
IWMSE '09 Proceedings of the 2009 ICSE Workshop on Multicore Software Engineering
Model-driven development of multi-core embedded software
IWMSE '09 Proceedings of the 2009 ICSE Workshop on Multicore Software Engineering
Reducers and other Cilk++ hyperobjects
Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures
Brief announcement: a lower bound for depth-restricted work stealing
Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures
Multi-target C++ implementation of parallel skeletons
Proceedings of the 8th workshop on Parallel/High-Performance Object-Oriented Scientific Computing
Parallel expression template for large vectors
Proceedings of the 8th workshop on Parallel/High-Performance Object-Oriented Scientific Computing
Guiding of smoke animations through variational coupling of simulations at different resolutions
Proceedings of the 2009 ACM SIGGRAPH/Eurographics Symposium on Computer Animation
VERTAF/Multi-Core: A SysML-Based Application Framework for Multi-Core Embedded Software Development
ICA3PP '09 Proceedings of the 9th International Conference on Algorithms and Architectures for Parallel Processing
OSL: Optimized Bulk Synchronous Parallel Skeletons on Distributed Arrays
APPT '09 Proceedings of the 8th International Symposium on Advanced Parallel Processing Technologies
Using OpenMP vs. Threading Building Blocks for Medical Imaging on Multi-cores
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
XJava: Exploiting Parallelism with Object-Oriented Stream Programming
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Searching for Concurrent Design Patterns in Video Games
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Native Client: a sandbox for portable, untrusted x86 native code
Communications of the ACM - Amir Pnueli: Ahead of His Time
Grace: safe multithreaded programming for C/C++
Proceedings of the 24th ACM SIGPLAN conference on Object oriented programming systems languages and applications
PFunc: modern task parallelism for modern high performance computing
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
Exploiting fine-grain thread parallelism on multicore architectures
Scientific Programming - Software Development for Multi-core Computing Systems
Qilin: exploiting parallelism on heterogeneous multiprocessors with adaptive mapping
Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture
GAMBIT: effective unit testing for concurrency libraries
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
Helper locks for fork-join parallel programming
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
Compress-and-conquer for optimal multicore computing
Proceedings of the 5th ACM SIGPLAN workshop on Declarative aspects of multicore programming
Parallel computing: at the interface of high school and industry
Proceedings of the 41st ACM technical symposium on Computer science education
Fine-Grained Multithreading Support for Hybrid Threaded MPI Programming
International Journal of High Performance Computing Applications
The Journal of Supercomputing
A container-iterator parallel programming model
PPAM'07 Proceedings of the 7th international conference on Parallel processing and applied mathematics
Memory models: a case for rethinking parallel languages and hardware
Communications of the ACM
OpenMP extensions for generic libraries
IWOMP'08 Proceedings of the 4th international conference on OpenMP in a new era of parallelism
New challenges of parallel job scheduling
JSSPP'07 Proceedings of the 13th international conference on Job scheduling strategies for parallel processing
Composing parallel software efficiently with lithe
PLDI '10 Proceedings of the 2010 ACM SIGPLAN conference on Programming language design and implementation
The Cilkview scalability analyzer
Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
Brief announcement: serial-parallel reciprocity in dynamic multithreaded languages
Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
Online mapping of MPI-2 dynamic tasks to processes and threads
International Journal of High Performance Systems Architecture
Parallel geometric algorithms for multi-core computers
Computational Geometry: Theory and Applications
STAPL: standard template adaptive parallel library
Proceedings of the 3rd Annual Haifa Experimental Systems Conference
Cohesion: a hybrid memory model for accelerators
Proceedings of the 37th annual international symposium on Computer architecture
Parallel operations of sparse polynomials on multicores: I. multiplication and Poisson bracket
Proceedings of the 4th International Workshop on Parallel and Symbolic Computation
Using memory mapping to support cactus stacks in work-stealing runtime systems
Proceedings of the 19th international conference on Parallel architectures and compilation techniques
A common substrate for cluster computing
HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing
Parallel programming must be deterministic by default
HotPar'09 Proceedings of the First USENIX conference on Hot topics in parallelism
Lithe: enabling efficient composition of parallel libraries
HotPar'09 Proceedings of the First USENIX conference on Hot topics in parallelism
New abstractions for data parallel programming
HotPar'09 Proceedings of the First USENIX conference on Hot topics in parallelism
Embedded Clarity in Filmification of Methods
Proceedings of the 2010 conference on New Trends in Software Methodologies, Tools and Techniques: Proceedings of the 9th SoMeT_10
Object oriented parallelisation of graph algorithms using parallel iterator
AusPDC '10 Proceedings of the Eighth Australasian Symposium on Parallel and Distributed Computing - Volume 107
HotPar'10 Proceedings of the 2nd USENIX conference on Hot topics in parallelism
Lessons from implementing the biCGStab method with SkeTo library
Proceedings of the fourth international workshop on High-level parallel programming and applications
Understanding complex multithreaded software systems by using trace visualization
Proceedings of the 5th international symposium on Software visualization
Empirical analysis of parallelism overheads on CMPs
PPAM'09 Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part I
Self-replicating objects for multicore platforms
ECOOP'10 Proceedings of the 24th European conference on Object-oriented programming
A language-based tuning mechanism for task and pipeline parallelism
Euro-Par'10 Proceedings of the 16th international Euro-Par conference on Parallel processing: Part II
PPSN'10 Proceedings of the 11th international conference on Parallel problem solving from nature: Part II
Estimating and exploiting potential parallelism by source-level dependence profiling
EuroPar'10 Proceedings of the 16th international Euro-Par conference on Parallel processing: Part I
Formal specification of MPI 2.0: Case study in specifying a practical concurrent programming API
Science of Computer Programming
A survey of algorithmic skeleton frameworks: high-level structured parallel programming enablers
Software—Practice & Experience - Focus on Selected PhD Literature Reviews in the Practical Aspects of Software Technology
On the definition of service abstractions for parallel computing
PPAM'09 Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part II
Challenges and issues of supporting task parallelism in MPI
EuroMPI'10 Proceedings of the 17th European MPI users' group meeting conference on Recent advances in the message passing interface
Algorithm engineering: bridging the gap between algorithm theory and practice
Algorithm engineering: bridging the gap between algorithm theory and practice
Introduction to computer graphics
ACM SIGGRAPH ASIA 2010 Courses
Piccolo: building fast, distributed programs with partitioned tables
OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
A multi-core software API for embedded MPSoC environments
MTPP'10 Proceedings of the Second Russia-Taiwan conference on Methods and tools of parallel programming multicomputers
Implementing fusion-equipped parallel skeletons by expression templates
IFL'09 Proceedings of the 21st international conference on Implementation and application of functional languages
Scientific Programming - Exploring Languages for Expressing Medium to Massive On-Chip Parallelism
SpiceC: scalable parallelism via implicit copying and explicit commit
Proceedings of the 16th ACM symposium on Principles and practice of parallel programming
Lifeline-based global load balancing
Proceedings of the 16th ACM symposium on Principles and practice of parallel programming
ACM SIGOPS Operating Systems Review
Parallelization libraries: Characterizing and reducing overheads
ACM Transactions on Architecture and Code Optimization (TACO)
Using aspect-orientation to simplify concurrent programming
Proceedings of the tenth international conference on Aspect-oriented software development companion
A new method for GPU based irregular reductions and its application to k-means clustering
Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units
How many threads to spawn during program multithreading?
LCPC'10 Proceedings of the 23rd international conference on Languages and compilers for parallel computing
CnC-CUDA: declarative programming for GPUs
LCPC'10 Proceedings of the 23rd international conference on Languages and compilers for parallel computing
Frameworks for multi-core architectures: a comprehensive evaluation using 2D/3D image registration
ARCS'11 Proceedings of the 24th international conference on Architecture of computing systems
Proceedings of the 2011 SIGPLAN/SIGBED conference on Languages, compilers and tools for embedded systems
Lightweight parallel accumulators using C++ templates
Proceedings of the 4th International Workshop on Multicore Software Engineering
A programming model for deterministic task parallelism
Proceedings of the 2011 ACM SIGPLAN Workshop on Memory Systems Performance and Correctness
Parallelism orchestration using DoPE: the degree of parallelism executive
Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation
Designing an undergraduate curriculum based on parallelism and concurrency
Proceedings of the 49th Annual Southeast Regional Conference
Leveraging data-structure semantics for efficient algorithmic parallelism
Proceedings of the 8th ACM International Conference on Computing Frontiers
Types, regions, and effects for safe programming with object-oriented parallel frameworks
Proceedings of the 25th European conference on Object-oriented programming
Quantifying the potential task-based dataflow parallelism in MPI applications
Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part I
Parallelizing a real-time physics engine using transactional memory
Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part II
Improving per-node efficiency in the datacenter with new OS abstractions
Proceedings of the 2nd ACM Symposium on Cloud Computing
DistRM: distributed resource management for on-chip many-core systems
CODES+ISSS '11 Proceedings of the seventh IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
A survey on parallel ant colony optimization
Applied Soft Computing
Sisal 3.2 language features overview
PaCT'11 Proceedings of the 11th international conference on Parallel computing technologies
PaCT'11 Proceedings of the 11th international conference on Parallel computing technologies
Trasgo: a nested-parallel programming system
The Journal of Supercomputing
Algorithm 915, SuiteSparseQR: Multifrontal multithreaded rank-revealing sparse QR factorization
ACM Transactions on Mathematical Software (TOMS)
Algorithms and data structures for massively parallel generic adaptive finite element codes
ACM Transactions on Mathematical Software (TOMS)
Introduction to computer graphics
SIGGRAPH Asia 2011 Courses
Design of a Multicore Sparse Cholesky Factorization Using DAGs
SIAM Journal on Scientific Computing
On the simulation of large-scale architectures using multiple application abstraction levels
ACM Transactions on Architecture and Code Optimization (TACO) - HIPEAC Papers
PLDS: Partitioning linked data structures for parallelism
ACM Transactions on Architecture and Code Optimization (TACO) - HIPEAC Papers
Generative Version of the FastFlow Multicore Library
Electronic Notes in Theoretical Computer Science (ENTCS)
Multicore C++ Standard Template Library in a Generative Way
Electronic Notes in Theoretical Computer Science (ENTCS)
Habanero-Java: the new adventures of old X10
Proceedings of the 9th International Conference on Principles and Practice of Programming in Java
Programmable data dependencies and placements
DAMP '12 Proceedings of the 7th workshop on Declarative aspects and applications of multicore programming
Out-of-Core Computations of High-Resolution Level Sets by Means of Code Transformation
Journal of Scientific Computing
Kokkos Array performance-portable manycore programming model
Proceedings of the 2012 International Workshop on Programming Models and Applications for Multicores and Manycores
Deterministic parallel random-number generation for dynamic-multithreading platforms
Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming
A work-stealing scheduler for X10's task parallelism with suspension
Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming
AdaStreams: a type-based programming extension for stream-parallelism with ada 2005
Ada-Europe'10 Proceedings of the 15th Ada-Europe international conference on Reliable Software Technologies
A proposal for user-defined reductions in OpenMP
IWOMP'10 Proceedings of the 6th international conference on Beyond Loop Level Parallelism in OpenMP: accelerators, Tasking and more
A coordination-based model-driven method for parallel application development
MODELS'09 Proceedings of the 2009 international conference on Models in Software Engineering
Proceedings of the 50th Annual Southeast Regional Conference
Towards a codelet-based runtime for exascale computing: position paper
Proceedings of the 2nd International Workshop on Adaptive Self-Tuning Computing Systems for the Exaflop Era
Multi-Target vectorization with MTPS c++ generic library
PARA'10 Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume 2
Proceedings of the 9th conference on Computing Frontiers
Extending a highly parallel data mining algorithm to the intel ® many integrated core architecture
Euro-Par'11 Proceedings of the 2011 international conference on Parallel Processing - Volume 2
Trends and challenges in operating systems---from parallel computing to cloud computing
Concurrency and Computation: Practice & Experience
Effective parallelization of loops in the presence of I/O operations
Proceedings of the 33rd ACM SIGPLAN conference on Programming Language Design and Implementation
Work stealing and persistence-based load balancers for iterative overdecomposed applications
Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing
Data-driven fault tolerance for work stealing computations
Proceedings of the 26th ACM international conference on Supercomputing
CATS: cache aware task-stealing based on online profiling in multi-socket multi-core architectures
Proceedings of the 26th ACM international conference on Supercomputing
Memory-mapping support for reducer hyperobjects
Proceedings of the twenty-fourth annual ACM symposium on Parallelism in algorithms and architectures
Integrating data-intensive cloud computing with multicores and clusters in an HPC course
Proceedings of the 17th ACM annual conference on Innovation and technology in computer science education
HARPPIE: hyper algorithmic recipe for productive parallelism intensive endeavors
Proceedings of the 34th International Conference on Software Engineering
StreamPI: a stream-parallel programming extension for object-oriented programming languages
The Journal of Supercomputing
Case study: stereo vision experiments with multi-core software API on embedded MPSoC environments
The Journal of Supercomputing
Scalable parallel interval propagation for sparse constraint satisfaction problems
PSI'11 Proceedings of the 8th international conference on Perspectives of System Informatics
SP4: scalable programmable packet processing platform
Proceedings of the ACM SIGCOMM 2012 conference on Applications, technologies, architectures, and protocols for computer communication
Parallel computation skeletons with premature termination property
FLOPS'12 Proceedings of the 11th international conference on Functional and Logic Programming
Optimization techniques for efficient HTA programs
Parallel Computing
Task-Based execution of nested OpenMP loops
IWOMP'12 Proceedings of the 8th international conference on OpenMP in a Heterogeneous World
Processor allocation for optimistic parallelization of irregular programs
ICCSA'12 Proceedings of the 12th international conference on Computational Science and Its Applications - Volume Part I
ICCSA'12 Proceedings of the 12th international conference on Computational Science and Its Applications - Volume Part I
Designing digital circuits for FPGAs using parallel genetic algorithms (WIP)
Proceedings of the 2012 Symposium on Theory of Modeling and Simulation - DEVS Integrative M&S Symposium
Autotuning of adaptive mesh refinement PDE solvers on shared memory architectures
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Parameterized model checking of fine grained concurrency
SPIN'12 Proceedings of the 19th international conference on Model Checking Software
Haskell vs. f# vs. scala: a high-level language features and parallelism support comparison
Proceedings of the 1st ACM SIGPLAN workshop on Functional high-performance computing
A meta-scheduler for the par-monad: composable scheduling for the heterogeneous cloud
Proceedings of the 17th ACM SIGPLAN international conference on Functional programming
Oversubscription of computational resources on multicore desktop systems
MSEPT'12 Proceedings of the 2012 international conference on Multicore Software Engineering, Performance, and Tools
LibEE: a multithreaded dependency graph for character animation
Proceedings of the Digital Production Symposium
SP4: scalable programmable packet processing platform
ACM SIGCOMM Computer Communication Review - Special october issue SIGCOMM '12
From sequential programming to flexible parallel execution
Proceedings of the 2012 international conference on Compilers, architectures and synthesis for embedded systems
Work-stealing without the baggage
Proceedings of the ACM international conference on Object oriented programming systems languages and applications
Proceedings of the ACM international conference on Object oriented programming systems languages and applications
Interactive physical simulation on multicore architectures
EG PGV'09 Proceedings of the 9th Eurographics conference on Parallel Graphics and Visualization
MPI-hybrid parallelism for volume rendering on large, multi-core systems
EG PGV'10 Proceedings of the 10th Eurographics conference on Parallel Graphics and Visualization
Load-balanced isosurfacing on multi-GPU clusters
EG PGV'10 Proceedings of the 10th Eurographics conference on Parallel Graphics and Visualization
Characterizing and mitigating work time inflation in task parallel programs
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Fast box-counting algorithm on GPU
Computer Methods and Programs in Biomedicine
Concurrency and Computation: Practice & Experience
Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
An efficient unbounded lock-free queue for multi-core systems
Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
Beyond expert-only parallel programming?
Proceedings of the 2012 ACM workshop on Relaxing synchronization for multicore and manycore scalability
Parallel schedule synthesis for attribute grammars
Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming
P3S: protein structure similarity search
Euro-Par'12 Proceedings of the 18th international conference on Parallel processing workshops
On the parallelization of the SProt measure and the TM-Score algorithm
Euro-Par'12 Proceedings of the 18th international conference on Parallel processing workshops
A high performance parallel DCT with OpenCL on heterogeneous computing environment
Multimedia Tools and Applications
Steal Tree: low-overhead tracing of work stealing schedulers
Proceedings of the 34th ACM SIGPLAN conference on Programming language design and implementation
Scalanytics: a declarative multi-core platform for scalable composable traffic analytics
Proceedings of the 22nd international symposium on High-performance parallel and distributed computing
Holistic run-time parallelism management for time and energy efficiency
Proceedings of the 27th international ACM conference on International conference on supercomputing
Proceedings of the 18th International Conference on 3D Web Technology
Hybrid parallel task placement in X10
Proceedings of the third ACM SIGPLAN X10 Workshop
A divide and conquer approach and a work-optimal parallel algorithm for the LIS problem
Information Processing Letters
Self-adaptive hybrid dynamic power management for many-core systems
Proceedings of the Conference on Design, Automation and Test in Europe
Exploring the Tradeoffs between Programmability and Efficiency in Data-Parallel Accelerators
ACM Transactions on Computer Systems (TOCS)
Developing a unified pipeline with character
Proceedings of the Symposium on Digital Production
Proceedings of the 51st ACM Southeast Conference
Proceedings of the 51st ACM Southeast Conference
A survey of pipelined workflow scheduling: Models and algorithms
ACM Computing Surveys (CSUR)
Examining the expert gap in parallel programming
Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
A classification of scientific visualization algorithms for massive threading
UltraVis '13 Proceedings of the 8th International Workshop on Ultrascale Visualization
Fence-free work stealing on bounded TSO processors
Proceedings of the 19th international conference on Architectural support for programming languages and operating systems
Easy, fast, and energy-efficient object detection on heterogeneous on-chip architectures
ACM Transactions on Architecture and Code Optimization (TACO)
Instant loading for main memory databases
Proceedings of the VLDB Endowment
DWS: Demand-aware Work-Stealing in Multi-programmed Multi-core Architectures
Proceedings of Programming Models and Applications on Multicores and Manycores
Journal of Systems Architecture: the EUROMICRO Journal
Agent-based distributed power management for kilo-core processors
Proceedings of the International Conference on Computer-Aided Design
GLB: lifeline-based global load balancing library in x10
Proceedings of the first workshop on Parallel programming for analytics applications
Friendly barriers: efficient work-stealing with return barriers
Proceedings of the 10th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
Adaptive workload-aware task scheduling for single-ISA asymmetric multicore architectures
ACM Transactions on Architecture and Code Optimization (TACO)
Optimizing image processing on multi-core CPUs with Intel parallel programming technologies
Multimedia Tools and Applications
Characterizing and mitigating work time inflation in task parallel programs
Scientific Programming - Selected Papers from Super Computing 2012
Manycore performance-portability: Kokkos multidimensional array library
Scientific Programming - A New Overview of the Trilinos Project --Part 1
Tpetra, and the use of generic programming in scientific computing
Scientific Programming - A New Overview of the Trilinos Project --Part 1
Hi-index | 0.02 |