Processor Allocation for Horizontal and Vertical Parallelism and Related Speedup Bounds
IEEE Transactions on Computers
Applications considerations in the system design of highly concurrent multiprocessors
IEEE Transactions on Computers
PAM-CRASH on the IBM 3090/VF: an integrated environment for crash analysis
IBM Systems Journal
Further results using the overhead model for parallel systems
IBM Journal of Research and Development
Execution of automatically parallelized APL programs on RP3
IBM Journal of Research and Development
Parallel text retrieval on a high performance supercomputer using the Vector Space Model
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Decoupled access/execute computer architectures
ACM Transactions on Computer Systems (TOCS)
ICCS '02 Proceedings of the International Conference on Computational Science-Part II
Parallel Performance in Multi-physics Simulation
ICCS '02 Proceedings of the International Conference on Computational Science-Part II
Scheduling Divisible Tasks on Heterogeneous Linear Arrays with Applications to Layered Networks
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Dynamic Power Management of Multiprocessor Systems
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Predictability for Real-Time Command and Control
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
A Dynamic Periodicity Detector: Application to Speedup Computation
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
PARA '02 Proceedings of the 6th International Conference on Applied Parallel Computing Advanced Scientific Computing
Extended Overhead Analysis for OpenMP (Research Note)
Euro-Par '02 Proceedings of the 8th International Euro-Par Conference on Parallel Processing
Workload Characterization Issues and Methodologies
Performance Evaluation: Origins and Directions
Branch, Cut, and Price: Sequential and Parallel
Computational Combinatorial Optimization, Optimal or Provably Near-Optimal Solutions [based on a Spring School]
Shared Memory Multiprocessor Support for SAC
IFL '98 Selected Papers from the 10th International Workshop on 10th International Workshop
Efficient Parallel Solution to Calculate All Cycles in Graphs
PARA '02 Proceedings of the 6th International Conference on Applied Parallel Computing Advanced Scientific Computing
Parallel Job Scheduling: A Performance Perspective
Performance Evaluation: Origins and Directions
Instruction-level parallel processors-dynamic and static scheduling tradeoffs
PAS '97 Proceedings of the 2nd AIZU International Symposium on Parallel Algorithms / Architecture Synthesis
Multiprocessor Preprocessing Algorithms for Uniprocessor On-Line Scheduling
ICDCS '01 Proceedings of the The 21st International Conference on Distributed Computing Systems
Parallel application performance on shared high performance reconfigurable computing resources
Performance Evaluation - Performance modelling and evaluation of high-performance parallel and distributed systems
Implementation of Parallel Plasma Particle-In-Cell Codes on SuperSINET based Grid
HPCASIA '05 Proceedings of the Eighth International Conference on High-Performance Computing in Asia-Pacific Region
Communication Links for Distributed Quantum Computation
IEEE Transactions on Computers
IEEE Transactions on Computers
The Future of Parallel Processing
IEEE Transactions on Computers
A shared memory parallel algorithm for data reduction using the singular value decomposition
Proceedings of the 2008 Spring simulation multiconference
A shared memory parallel algorithm for hybrid image classification
SpringSim '07 Proceedings of the 2007 spring simulation multiconference - Volume 2
The role of MPI in development time: a case study
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Benchmark Study of a 3d Parallel Code for the Propagation of Large Subduction Earthquakes
Proceedings of the 15th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
On the exploitation of loop-level parallelism in embedded applications
ACM Transactions on Embedded Computing Systems (TECS)
Model-based performance analysis using block coverage measurements
Journal of Systems and Software
Observations on high-performance machines
AFIPS '67 (Fall) Proceedings of the November 14-16, 1967, fall joint computer conference
AFIPS '69 (Spring) Proceedings of the May 14-16, 1969, spring joint computer conference
Embedded DSP Processor Design: Application Specific Instruction Set Processors
Embedded DSP Processor Design: Application Specific Instruction Set Processors
Controlling chaos: on safe side-effects in data-parallel operations
Proceedings of the 4th workshop on Declarative aspects of multicore programming
On the Performance and Scalability of a GPU-Limited Commodity Cluster
ISVC '08 Proceedings of the 4th International Symposium on Advances in Visual Computing, Part II
A Parallel Architecture for Stateful, High-Speed Intrusion Detection
ICISS '08 Proceedings of the 4th International Conference on Information Systems Security
Roofline: an insightful visual performance model for multicore architectures
Communications of the ACM - A Direct Path to Dependable Software
Observer-invariant histopathology using genetics-based machine learning
Natural Computing: an international journal
Accelerating critical section execution with asymmetric multi-core architectures
Proceedings of the 14th international conference on Architectural support for programming languages and operating systems
Fast circuit simulation on graphics processing units
Proceedings of the 2009 Asia and South Pacific Design Automation Conference
Scheduling ?-Critical Tasks in mixed-parallel applications on a national grid
GRID '07 Proceedings of the 8th IEEE/ACM International Conference on Grid Computing
Performance models for hierarchical grid architectures
GRID '06 Proceedings of the 7th IEEE/ACM International Conference on Grid Computing
An approach for the effective utilization of GP-GPUs in parallel combined simulation
Proceedings of the 40th Conference on Winter Simulation
An Asymptotic Performance/Energy Analysis and Optimization of Multi-core Architectures
ICDCN '09 Proceedings of the 10th International Conference on Distributed Computing and Networking
International Journal of Parallel Programming
Flexible reference-counting-based hardware acceleration for garbage collection
Proceedings of the 36th annual international symposium on Computer architecture
Grid enabled MRP process improvement under distributed database environment
Journal of Systems and Software
Reconfigurable Computing: The Theory and Practice of FPGA-Based Computation
Reconfigurable Computing: The Theory and Practice of FPGA-Based Computation
A new look at the roles of spinning and blocking
Proceedings of the Fifth International Workshop on Data Management on New Hardware
Data-intensive computing for competent genetic algorithms: a pilot study using meandre
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
On single-pass indexing with MapReduce
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Multicore Scheduling for Lightweight Communicating Processes
COORDINATION '09 Proceedings of the 11th International Conference on Coordination Models and Languages
Computational Experience with a Software Framework for Parallel Integer Programming
INFORMS Journal on Computing
International Journal of Geographical Information Science - Distributed Geographic Information Processing Research
A computational science IDE for HPC systems: design and applications
International Journal of Parallel Programming
NePaLTM: Design and Implementation of Nested Parallelism for Transactional Memory Systems
Genoa Proceedings of the 23rd European Conference on ECOOP 2009 --- Object-Oriented Programming
The Cilk++ concurrency platform
Proceedings of the 46th Annual Design Automation Conference
Massively parallel processing: it's déjà vu all over again
Proceedings of the 46th Annual Design Automation Conference
Study of neural net training methods in parallel and distributed architectures
Future Generation Computer Systems
Variable precision distance search for random fractal cluster simulations
WSEAS Transactions on Computers
Extending Amdahl's law in the multicore era
ACM SIGMETRICS Performance Evaluation Review
Parallel processing of Prestack Kirchhoff Time Migration on a PC Cluster
Computers & Geosciences
Heterogeneous multicore parallel programming for graphics processing units
Scientific Programming - Software Development for Multi-core Computing Systems
Vector system performance of the IBM 3090
IBM Systems Journal
Effect of increasing chip density on the evolution of computer architectures
IBM Journal of Research and Development
Parallel parameter study of the Wigner-Poisson equations for RTDs
Computers & Mathematics with Applications
Faster and More Complete Extended Static Checking for the Java Modeling Language
Journal of Automated Reasoning
Reevaluating Amdahl's law in the multicore era
Journal of Parallel and Distributed Computing
Brain derived vision algorithm on high performance architectures
International Journal of Parallel Programming
Compress-and-conquer for optimal multicore computing
Proceedings of the 5th ACM SIGPLAN workshop on Declarative aspects of multicore programming
Profiling-based hardware/software co-exploration for the design of video coding architectures
IEEE Transactions on Circuits and Systems for Video Technology
Polymorphic architectures: from media processing to supercomputing
CompSysTech '09 Proceedings of the International Conference on Computer Systems and Technologies and Workshop for PhD Students in Computing
International Journal of Parallel Programming
The Cilk++ concurrency platform
The Journal of Supercomputing
Paper: Toward a better parallel performance metric
Parallel Computing
High-performance cone beam reconstruction using CUDA compatible GPUs
Parallel Computing
New concepts for parallel object-relational query processing
New concepts for parallel object-relational query processing
Bundle Methods for Regularized Risk Minimization
The Journal of Machine Learning Research
Resource management for finite element codes on shared memory systems
ICCSA'03 Proceedings of the 2003 international conference on Computational science and its applications: PartI
Mt-ADRES: multithreading on coarse-grained reconfigurable architecture
ARC'07 Proceedings of the 3rd international conference on Reconfigurable computing: architectures, tools and applications
High performance computing for disease surveillance
BioSurveillance'07 Proceedings of the 2nd NSF conference on Intelligence and security informatics: BioSurveillance
Evaluating a low-power dual-core architecture
APPT'07 Proceedings of the 7th international conference on Advanced parallel processing technologies
DCGs + memoing = packrat parsing but is it worth it?
PADL'08 Proceedings of the 10th international conference on Practical aspects of declarative languages
Grid computing: experiment management, tool integration, and scientific workflows
Grid computing: experiment management, tool integration, and scientific workflows
Parsing XML using parallel traversal of streaming trees
HiPC'08 Proceedings of the 15th international conference on High performance computing
WiDGET: Wisconsin decoupled grid execution tiles
Proceedings of the 37th annual international symposium on Computer architecture
Modeling critical sections in Amdahl's law and its implications for multicore design
Proceedings of the 37th annual international symposium on Computer architecture
Agent-oriented programming: from prolog to guarded definite clauses
Agent-oriented programming: from prolog to guarded definite clauses
On the costs and benefits of stochasticity in stream processing
Proceedings of the 47th Design Automation Conference
Parallel image thinning through topological operators on shared memory parallel machines
Asilomar'09 Proceedings of the 43rd Asilomar conference on Signals, systems and computers
A dynamic, decentralised search algorithm for efficient data retrieval in a distributed tuple space
AusPDC '10 Proceedings of the Eighth Australasian Symposium on Parallel and Distributed Computing - Volume 107
Estimating parallel performance, a skeleton-based approach
Proceedings of the fourth international workshop on High-level parallel programming and applications
Generic load regulation framework for Erlang
Proceedings of the 9th ACM SIGPLAN workshop on Erlang
Hera-JVM: a runtime system for heterogeneous multi-core architectures
Proceedings of the ACM international conference on Object oriented programming systems languages and applications
Diagnosis, Tuning, and Redesign for Multicore Performance: A Case Study of the Fast Multipole Method
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
High-performance Computing to Simulate Large-scale Industrial Flows in Multistage Compressors
International Journal of High Performance Computing Applications
IEEE Transactions on Information Technology in Biomedicine - Special section on affective and pervasive computing for healthcare
IWOMP'05/IWOMP'06 Proceedings of the 2005 and 2006 international conference on OpenMP shared memory parallel programming
Babylon v2.0: middleware for distributed, parallel, and mobile java applications
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Parallel SAH k-D tree construction
Proceedings of the Conference on High Performance Graphics
Journal of Computational Physics
Measuring software systems scalability for proactive data center management
OTM'10 Proceedings of the 2010 international conference on On the move to meaningful internet systems: Part II
On the energy-performance tradeoff for parallel applications
EPEW'10 Proceedings of the 7th European performance engineering conference on Computer performance engineering
Single-Chip Heterogeneous Computing: Does the Future Include Custom Logic, FPGAs, and GPGPUs?
MICRO '43 Proceedings of the 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture
Observations on tuning a java enterprise application for performance and scalability
IBM Journal of Research and Development
Achieving a single compute device image in OpenCL for multiple GPUs
Proceedings of the 16th ACM symposium on Principles and practice of parallel programming
Parallel points-to analysis for multi-core machines
Proceedings of the 6th International Conference on High Performance and Embedded Architectures and Compilers
Coarse-grained simulation method for performance evaluation of a shared memory system
Proceedings of the 16th Asia and South Pacific Design Automation Conference
Mathematical limits of parallel computation for embedded systems
Proceedings of the 16th Asia and South Pacific Design Automation Conference
Some computer organizations and their effectiveness
IEEE Transactions on Computers
Teaching concurrency-oriented programming with Erlang
Proceedings of the 42nd ACM technical symposium on Computer science education
ARCS'11 Proceedings of the 24th international conference on Architecture of computing systems
Frameworks for multi-core architectures: a comprehensive evaluation using 2D/3D image registration
ARCS'11 Proceedings of the 24th international conference on Architecture of computing systems
An Irregularly Portioned Lagrangian Monte Carlo Method for Turbulent Flow Simulation
Journal of Scientific Computing
Massively Parallel Logic Simulation with GPUs
ACM Transactions on Design Automation of Electronic Systems (TODAES)
Parallel re-initialization of level set functions on distributed unstructured tetrahedral grids
Journal of Computational Physics
Parallelizing join computations of SPARQL queries for large semantic web databases
Proceedings of the 2011 ACM Symposium on Applied Computing
Calculation of the acceleration of parallel programs as a function of the number of threads
ICCOMP'10 Proceedings of the 14th WSEAS international conference on Computers: part of the 14th WSEAS CSCC multiconference - Volume II
Resolving a L2-prefetch-caused parallel nonscaling on Intel Core microarchitecture
Journal of Parallel and Distributed Computing
Parallelism and data movement characterization of contemporary application classes
Proceedings of the twenty-third annual ACM symposium on Parallelism in algorithms and architectures
Automatic generation of executable communication specifications from parallel applications
Proceedings of the international conference on Supercomputing
Database scalability, elasticity, and autonomy in the cloud
DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications - Volume Part I
An effective speedup metric for measuring productivity in large-scale parallel computer systems
The Journal of Supercomputing
Shared-memory, distributed-memory, and mixed-mode parallelisation of a CFD simulation code
Computer Science - Research and Development
Optimized HPL for AMD GPU and multi-core CPU usage
Computer Science - Research and Development
Dark silicon and the end of multicore scaling
Proceedings of the 38th annual international symposium on Computer architecture
Balance principles for algorithm-architecture co-design
HotPar'11 Proceedings of the 3rd USENIX conference on Hot topic in parallelism
Proceedings of the Third Workshop on Large Scale Data Mining: Theory and Applications
DISPAR-tournament: a parallel population reduction operator that behaves like a tournament
EvoApplications'11 Proceedings of the 2011 international conference on Applications of evolutionary computation - Volume Part I
Parametrizing multicore architectures for multiple sequence alignment
Proceedings of the 8th ACM International Conference on Computing Frontiers
A framework for parallel computational physics algorithms on multi-core: SPH in parallel
Advances in Engineering Software
On the scalability of multi-criteria protein structure comparison in the grid
Euro-Par 2010 Proceedings of the 2010 conference on Parallel processing
Accelerating multiple target drug screening on GPUs
Proceedings of the 9th International Conference on Computational Methods in Systems Biology
A survey on parallel ant colony optimization
Applied Soft Computing
Cache efficiency and scalability on multi-core architectures
PaCT'11 Proceedings of the 11th international conference on Parallel computing technologies
Attribute grammar genetic programming algorithm for automatic code parallelization
ICHIT'11 Proceedings of the 5th international conference on Convergence and hybrid information technology
What Hill-Marty model learn from and break through Amdahl's law?
Information Processing Letters
An application of high performance computing to improve linear acoustic simulation
Proceedings of the 14th Communications and Networking Symposium
Corrected model for "predicting the relative performance of CPU"
Proceedings of the 19th High Performance Computing Symposia
Checkpointing strategies for parallel jobs
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Patient specific dosimetry phantoms using multichannel LDDMM of the whole body
Journal of Biomedical Imaging - Special issue on Parallel Computation in Medical Imaging Applications
SpotMPI: a framework for auction-based HPC computing using amazon spot instances
ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part II
SYRANT: SYmmetric resource allocation on not-taken and taken paths
ACM Transactions on Architecture and Code Optimization (TACO) - HIPEAC Papers
Practical experiences on the gridification of financial applications
Proceedings of the fourth workshop on High performance computational finance
Parallelized sigma-point Kalman filtering for structural dynamics
Computers and Structures
Parallel computing for optimal genomic sequence alignment
FSKD'06 Proceedings of the Third international conference on Fuzzy Systems and Knowledge Discovery
RRBS: a fault tolerance model for cluster/grid parallel file system
ISPA'05 Proceedings of the Third international conference on Parallel and Distributed Processing and Applications
Accelerating large semantic web databases by parallel join computations of SPARQL queries
ACM SIGAPP Applied Computing Review
Proceedings of the first annual workshop on High performance computing meets databases
An evaluation of parallel numerical hessian calculations
HPCS'09 Proceedings of the 23rd international conference on High Performance Computing Systems and Applications
ICCSA'06 Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part V
HW/SW co-design of parallel systems
Proceedings of the International Conference on Computer-Aided Design
Misleading energy and performance claims in sub/near threshold digital systems
Proceedings of the International Conference on Computer-Aided Design
Sicstus prolog-the first 25 years
Theory and Practice of Logic Programming - Prolog Systems
Revisiting the combining synchronization technique
Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming
Bottleneck identification and scheduling in multithreaded applications
ASPLOS XVII Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems
Teaching high performance computing parallelizing a real computational science application
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part II
Proceedings of the 8th FPGAWorld Conference
Modularized redundant parallel virtual file system
ACSAC'05 Proceedings of the 10th Asia-Pacific conference on Advances in Computer Systems Architecture
Jockey: guaranteed job latency in data parallel clusters
Proceedings of the 7th ACM european conference on Computer Systems
Multicore scheduling for lightweight communicating processes
Science of Computer Programming
A parallel mutual information based image registration algorithm for applications in remote sensing
ISPA'06 Proceedings of the 4th international conference on Parallel and Distributed Processing and Applications
JetBench: an open source real-time multiprocessor benchmark
ARCS'10 Proceedings of the 23rd international conference on Architecture of Computing Systems
An effective approximation algorithm for the Malleable Parallel Task Scheduling problem
Journal of Parallel and Distributed Computing
Journal of Computational Physics
Assessing the performance limits of parallelized near-threshold computing
Proceedings of the 49th Annual Design Automation Conference
Near-threshold operation for power-efficient computing?: it depends...
Proceedings of the 49th Annual Design Automation Conference
Batch-pipelining for multicore H.264 decoding
Journal of Visual Communication and Image Representation
Amdahl's law for predicting the future of multicores considered harmful
ACM SIGARCH Computer Architecture News
Boosting single thread performance in mobile processors via reconfigurable acceleration
ARC'12 Proceedings of the 8th international conference on Reconfigurable Computing: architectures, tools and applications
HELIX: automatic parallelization of irregular programs for chip multiprocessing
Proceedings of the Tenth International Symposium on Code Generation and Optimization
A fair comparison of modern CPUs and GPUs running the genetic algorithm under the knapsack benchmark
EvoApplications'12 Proceedings of the 2012t European conference on Applications of Evolutionary Computation
On the scalability of the clusters-booster concept: a critical assessment of the DEEP architecture
Proceedings of the Future HPC Systems: the Challenges of Power-Constrained Performance
Power Limitations and Dark Silicon Challenge the Future of Multicore
ACM Transactions on Computer Systems (TOCS)
Robotic clusters: Multi-robot systems as computer clusters
Robotics and Autonomous Systems
MapReduce indexing strategies: Studying scalability and efficiency
Information Processing and Management: an International Journal
Improving the performance of FD constraint solving in a CFLP system
FLOPS'12 Proceedings of the 11th international conference on Functional and Logic Programming
Retrofitted parallelism considered grossly sub-optimal
HotPar'12 Proceedings of the 4th USENIX conference on Hot Topics in Parallelism
Accelerating the simulation of shipboard power systems
Proceedings of the 2011 Grand Challenges on Modeling and Simulation Conference
Advances in Software Engineering
GPGPU implementation of growing neural gas: Application to 3D scene reconstruction
Journal of Parallel and Distributed Computing
Hierarchical parallel approach in vascular network modeling: hybrid MPI+OpenMP implementation
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Parallelization of the discrete chaotic block encryption algorithm
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part II
Approaches to parallelize pareto ranking in NSGA-II algorithm
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part II
A parallel solution for high resolution histological image analysis
Computer Methods and Programs in Biomedicine
Implementing the data center energy productivity metric
ACM Journal on Emerging Technologies in Computing Systems (JETC)
Scalability-based manycore partitioning
Proceedings of the 21st international conference on Parallel architectures and compilation techniques
Visualizing transactional memory
Proceedings of the 21st international conference on Parallel architectures and compilation techniques
ACM Transactions on Mathematical Software (TOMS)
GigaMesh and gilgamesh: –3D multiscale integral invariant cuneiform character extraction
VAST'10 Proceedings of the 11th International conference on Virtual Reality, Archaeology and Cultural Heritage
Knowledge-based out-of-core algorithms for data management in visualization
EUROVIS'06 Proceedings of the Eighth Joint Eurographics / IEEE VGTC conference on Visualization
Improving communication latency with the write-only architecture
Journal of Parallel and Distributed Computing
Efficient backprojection-based synthetic aperture radar computation with many-core processors
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
3D parallel elastodynamic modeling of large subduction earthquakes
PVM/MPI'07 Proceedings of the 14th European conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface
NetSlices: scalable multi-core packet processing in user-space
Proceedings of the eighth ACM/IEEE symposium on Architectures for networking and communications systems
Design space exploration towards a realtime and energy-aware GPGPU-based analysis of biosensor data
Computer Science - Research and Development
Vectorization technology to improve interpreter performance
ACM Transactions on Architecture and Code Optimization (TACO) - Special Issue on High-Performance Embedded Architectures and Compilers
Energy consumption modeling for hybrid computing
Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
Synchronization cannot be implemented as a library
Proceedings of the 2012 ACM conference on High integrity language technology
Improving disk I/O performance in a virtualized system
Journal of Computer and System Sciences
Power challenges may end the multicore era
Communications of the ACM
Proceedings of the Winter Simulation Conference
Two ports of a full evolutionary algorithm onto GPGPU
EA'11 Proceedings of the 10th international conference on Artificial Evolution
On the parallelization of the SProt measure and the TM-Score algorithm
Euro-Par'12 Proceedings of the 18th international conference on Parallel processing workshops
Extending the scope of the controlled logical clock
Cluster Computing
PARA'12 Proceedings of the 11th international conference on Applied Parallel and Scientific Computing
Parallel framework for topology optimization using the method of moving asymptotes
Structural and Multidisciplinary Optimization
Proceedings of the 18th International Conference on 3D Web Technology
Modeling performance of a parallel streaming engine: bridging theory and costs
Proceedings of the 4th ACM/SPEC International Conference on Performance Engineering
Invasive computing in HPC with X10
Proceedings of the third ACM SIGPLAN X10 Workshop
Expanding rural cellular networks with virtual coverage
nsdi'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
Criticality stacks: identifying critical threads in parallel programs using synchronization behavior
Proceedings of the 40th Annual International Symposium on Computer Architecture
Proceedings of the 2013 ACM SIGSIM conference on Principles of advanced discrete simulation
Data-Fu: a language and an interpreter for interaction with read/write linked data
Proceedings of the 22nd international conference on World Wide Web
Parallel efficient aligner of pyrosequencing reads
Proceedings of the 20th European MPI Users' Group Meeting
Journal of Parallel and Distributed Computing
Estimating parallel performance
Journal of Parallel and Distributed Computing
On bottleneck analysis in stochastic stream processing
ACM Transactions on Design Automation of Electronic Systems (TODAES)
A shared matrix unit for a chip multi-core processor
Journal of Parallel and Distributed Computing
A survey of pipelined workflow scheduling: Models and algorithms
ACM Computing Surveys (CSUR)
Parallel scheduling for cyber-physical systems: analysis and case study on a self-driving car
Proceedings of the ACM/IEEE 4th International Conference on Cyber-Physical Systems
Parallel distributed-memory simplex for large-scale stochastic LP problems
Computational Optimization and Applications
Applications of heterogeneous computing in computational and simulation science
International Journal of Computational Science and Engineering
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
ACM SIGOPS 24th Symposium on Operating Systems Principles
Everything you always wanted to know about synchronization but were afraid to ask
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
Using HPX and LibGeoDecomp for scaling HPC applications on heterogeneous supercomputers
ScalA '13 Proceedings of the Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems
Journal of Parallel and Distributed Computing
Modeling the effects of DFS on power consumption in hybrid chip multiprocessors
E2SC '13 Proceedings of the 1st International Workshop on Energy Efficient Supercomputing
Parallelized sub-resource loading for web rendering engine
Journal of Systems Architecture: the EUROMICRO Journal
Journal of Computational Physics
Fast fingerprint identification for large databases
Pattern Recognition
Journal of Real-Time Image Processing
Microprocessors & Microsystems
A hyperscalar dual-core architecture for embedded systems
Microprocessors & Microsystems
Direct distributed memory access for CMPs
Journal of Parallel and Distributed Computing
Parallel flow routing in SWMM 5
Environmental Modelling & Software
Development and Evaluation of Distributed Simulation of Embedded Systems Using Ptolemy and HLA
DS-RT '13 Proceedings of the 2013 IEEE/ACM 17th International Symposium on Distributed Simulation and Real Time Applications
Amdahl's law in the era of process variation
International Journal of High Performance Systems Architecture
Recent progress and challenges in exploiting graphics processors in computational fluid dynamics
The Journal of Supercomputing
Parallel algorithm for evolvable-based boolean synthesis on GPUs
Analog Integrated Circuits and Signal Processing
Extending Amdahl's law and Gustafson's law by evaluating interconnections on multi-core processors
The Journal of Supercomputing
Eliminating unscalable communication in transaction processing
The VLDB Journal — The International Journal on Very Large Data Bases
Exploiting multi-core nodes in peer-to-peer grids
Journal of Parallel and Distributed Computing
Experience with a genetic algorithm implemented on a multiprocessor computer
Structural and Multidisciplinary Optimization
Efficient backprojection-based synthetic aperture radar computation with many-core processors
Scientific Programming - Selected Papers from Super Computing 2012
Hi-index | 0.05 |
For over a decade prophets have voiced the contention that the organization of a single computer has reached its limits and that truly significant advances can be made only by interconnection of a multiplicity of computers in such a manner as to permit cooperative solution. Variously the proper direction has been pointed out as general purpose computers with a generalized interconnection of memories, or as specialized computers with geometrically related memory interconnections and controlled by one or more instruction streams.