Computer
Memory coherence in shared virtual memory systems
ACM Transactions on Computer Systems (TOCS)
PVM: a framework for parallel distributed computing
Concurrency: Practice and Experience
The high performance Fortran handbook
The high performance Fortran handbook
Integrating parallelization strategies for linkage analysis
Computers and Biomedical Research
Techniques for reducing consistency-related communication in distributed shared-memory systems
ACM Transactions on Computer Systems (TOCS)
The directory-based cache coherence protocol for the DASH multiprocessor
ISCA '90 Proceedings of the 17th annual international symposium on Computer Architecture
Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
An integrated compile-time/run-time software distributed shared memory system
Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
A combined-consistency approach: sequential amp; causal-consistency
ACM SIGOPS Operating Systems Review
Distributed shared memory systems with improved barrier synchronization and data transfer
ICS '97 Proceedings of the 11th international conference on Supercomputing
Compiler and software distributed shared memory support for irregular applications
PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
Tradeoffs between false sharing and aggregation in software distributed shared memory
PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
Optimizing communication in HPF programs on fine-grain distributed shared memory
PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
Parallel breadth-first BDD construction
PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
VM-based shared memory on low-latency, remote-memory-access networks
Proceedings of the 24th annual international symposium on Computer architecture
From the I-WAY to the National Technology Grid
Communications of the ACM
A nationwide parallel computing environment
Communications of the ACM
Cashmere-2L: software coherent shared memory on a clustered remote-write network
Proceedings of the sixteenth ACM symposium on Operating systems principles
Clock Trees: Logical Clocks for Programs with Nested Parallelism
IEEE Transactions on Software Engineering
An interaction of coherence protocols and memory consistency models in DSM systems
ACM SIGOPS Operating Systems Review
Per-Node Multithreading and Remote Latency
IEEE Transactions on Computers
Data prefetching for software DSMs
ICS '98 Proceedings of the 12th international conference on Supercomputing
Shared Memory Programming in Metacomputing Environments: The Global Array Approach
The Journal of Supercomputing - Special issue: high performance distributed computing
Using cause-effect analysis to understand the performance of distributed programs
SPDT '98 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
Predicting the performance of distributed virtual shared-memory applications
IBM Systems Journal
Dynamic layout of distributed applications
ISAW '98 Proceedings of the third international workshop on Software architecture
The design, implementation, and evaluation of Jade
ACM Transactions on Programming Languages and Systems (TOPLAS)
A task- and data-parallel programming language based on shared objects
ACM Transactions on Programming Languages and Systems (TOPLAS)
Tapeworm: high-level abstractions of shared accesses
OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
Anonymous Remote Computing: A Paradigm for Parallel Programming on Interconnected Workstations
IEEE Transactions on Software Engineering
False sharing problems in cluster-based disk arrays
Proceedings of the 1999 ACM symposium on Applied computing
An IDA algorithm for optimal spare allocation
Proceedings of the 1999 ACM symposium on Applied computing
Memory sharing predictor: the key to a speculative coherent DSM
ISCA '99 Proceedings of the 26th annual international symposium on Computer architecture
Transparent adaptive parallelism on NOWs using OpenMP
Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming
Mechanisms and policies for supporting fine-grained cycle stealing
ICS '99 Proceedings of the 13th international conference on Supercomputing
Dynamic remote memory acquisition for parallel data mining on ATM-connected PC cluster
ICS '99 Proceedings of the 13th international conference on Supercomputing
Data management in networks: experimental evaluation of a provably good strategy
Proceedings of the eleventh annual ACM symposium on Parallel algorithms and architectures
Proceedings of the 14th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
Ace: a language for parallel programming with customizable protocols
ACM Transactions on Computer Systems (TOCS)
A high-level abstraction of shared accesses
ACM Transactions on Computer Systems (TOCS)
An Efficient and Scalable Approach for Implementing Fault-Tolerant DSM Architectures
IEEE Transactions on Computers
Formal Automatic Verification of Cache Coherence in Multiprocessors with Relaxed Memory Models
IEEE Transactions on Parallel and Distributed Systems
Improving fine-grained irregular shared-memory benchmarks by data reordering
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
WebCom: A Web Based Volunteer Computer
The Journal of Supercomputing
RT-CRM: Real-Time Channel-Based Reflective Memory
IEEE Transactions on Computers
Implementation of a portable software DSM in Java
Proceedings of the 2001 joint ACM-ISCOPE conference on Java Grande
View-based consistency and false sharing effect in distributed shared memory
ACM SIGOPS Operating Systems Review
A synthesis of memory mechanisms for distributed architectures
ICS '01 Proceedings of the 15th international conference on Supercomputing
Contention elimination by replication of sequential sections in distributed shared memory programs
PPoPP '01 Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming
PPoPP '01 Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming
Strategies optimization and integration in DSM
ACM SIGOPS Operating Systems Review
Reducing coherence overhead of barrier synchronization in software DSMs
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
OpenMP on networks of workstations
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Using high performance GIS software to visualize data: a hands-on software demonstration
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
A proposal for preservice student technology competence
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Design and Performance Analysis of a Distributed Java Virtual Machine
IEEE Transactions on Parallel and Distributed Systems
OOPSLA '02 Proceedings of the 17th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
Shared State for Distributed Interactive Data Mining Applications
Distributed and Parallel Databases - Special issue: Parallel and distributed data mining
Run-time support for distributed sharing in safe languages
ACM Transactions on Computer Systems (TOCS)
Delta Execution: A preemptive Java thread migration mechanism
Cluster Computing
Adaptive Sector Grouping to Reduce False Sharing in Distributed RAID
Cluster Computing
An Effective Logical Cache for a Clustered LRC-Based DSM System
Cluster Computing
PHOENIX: A Self Adaptable Monitoring Platform for Cluster Management
Cluster Computing
Performance Tuning Software DSM Applications using Visualisation
The Journal of Supercomputing
COMPaS: A PC-Based SMP Cluster
IEEE Concurrency
Implementing Scoped Behavior for Flexible Distributed Data Sharing
IEEE Concurrency
Programming Languages for CSE: The State of the Art
IEEE Computational Science & Engineering
POEMS: End-to-End Performance Design of Large Parallel Adaptive Computational Systems
IEEE Transactions on Software Engineering
Eliminating synchronization bottlenecks using adaptive replication
ACM Transactions on Programming Languages and Systems (TOPLAS)
Hidden Costs in Avoiding False Sharing in Software DSMs
HiPC '01 Proceedings of the 8th International Conference on High Performance Computing
A Fully Compliant OpenMP Implementationon Software Distributed Shared Memory
HiPC '02 Proceedings of the 9th International Conference on High Performance Computing
A General Data Layout for Distributed Consistency in Data Parallel Applications
HiPC '02 Proceedings of the 9th International Conference on High Performance Computing
DSM-PM2: A Portable Implementation Platform for Multithreaded DSM Consistency Protocols
HIPS '01 Proceedings of the 6th International Workshop on High-Level Parallel Programming Models and Supportive Environments
The Working-Set Based Adaptive Protocol for Software Distributed Shared Memory
HiPC '00 Proceedings of the 7th International Conference on High Performance Computing
Visualizing the Memory Access Behavior of Shared Memory Applications on NUMA Architectures
ICCS '01 Proceedings of the International Conference on Computational Science-Part II
The Affinity Entry Consistency Protocol
ICPP '97 Proceedings of the international Conference on Parallel Processing
Two Layers Distributed Shared Memory
HPCN Europe 2001 Proceedings of the 9th International Conference on High-Performance Computing and Networking
Aurora: Scoped Behavior for Per-Context Optimized Distributed Data Sharing
IPPS '97 Proceedings of the 11th International Symposium on Parallel Processing
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
IPPS '97 Proceedings of the 11th International Symposium on Parallel Processing
Heaps and Stacks in Distributed Shared Memory
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Parallel Data Mining on ATM-Connected PC Cluster and Optimization of Its Execution Environments
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
Priority Based Messaging for Software Distributed Shared Memory
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Controlling Distributed Shared Memory Consistency from High Level Programming Languages
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
Parallel Programming with Interacting Processes
LCPC '99 Proceedings of the 12th International Workshop on Languages and Compilers for Parallel Computing
Towards OpenMP Execution on Software Distributed Shared Memory Systems
ISHPC '02 Proceedings of the 4th International Symposium on High Performance Computing
Smooth and Efficient Integration of High-Availability in a Parallel Single Level Store System
Euro-Par '01 Proceedings of the 7th International Euro-Par Conference Manchester on Parallel Processing
Dynamic Resource Management in a Cluster for High-Availability (Research Note)
Euro-Par '02 Proceedings of the 8th International Euro-Par Conference on Parallel Processing
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
DAOS - Scalable And-Or Parallelism
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
A Node Count-Independent Logical Clock for Scaling Lazy Release Consistency Protocol
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
The Mobile Object Layer: A Run-Time Substrate for Mobile Adaptive Computations
ISCOPE '98 Proceedings of the Second International Symposium on Computing in Object-Oriented Parallel Environments
OMPC++ - A Portable High-Performance Implementation of DSM using OpenC++ Reflection
Reflection '99 Proceedings of the Second International Conference on Meta-Level Architectures and Reflection
Performance Oriented Programming for NUMA Architectures
WOMPAT '01 Proceedings of the International Workshop on OpenMP Applications and Tools: OpenMP Shared Memory Parallel Programming
Multilayer Online-Monitoring for Hybrid DSM Systems on Top of PC Clusters with a SMiLE
TOOLS '00 Proceedings of the 11th International Conference on Computer Performance Evaluation: Modelling Techniques and Tools
RHiNET/NI: A Reconfigurable Network Interface for Cluster Computing
FPL '02 Proceedings of the Reconfigurable Computing Is Going Mainstream, 12th International Conference on Field-Programmable Logic and Applications
InterAct: Virtual Sharing for Interactive Client-Server Applications
LCR '98 Selected Papers from the 4th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
LCR '98 Selected Papers from the 4th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Locality Enhancement for Large-Scale Shared-Memory Multiprocessors
LCR '98 Selected Papers from the 4th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Compilation and Runtime-Optimizations for Software Distributed Shared Memory
LCR '00 Selected Papers from the 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Adaptive Parallelism for OpenMP Task Parallel Programs
LCR '00 Selected Papers from the 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Measuring Consistency Costs for Distributed Shared Data
LCR '00 Selected Papers from the 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Run-Time Support for Distributed Sharing in Typed Languages
LCR '00 Selected Papers from the 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Facilitating Parallel Programming in PVM Using Condensed Graphs
Proceedings of the 6th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Program Development Environment for OpenMP Programs on ccNUMA Architectures
LSSC '01 Proceedings of the Third International Conference on Large-Scale Scientific Computing-Revised Papers
View consistencies and exact implementations
Parallel Computing
Preface: how to approach the virtual shared memory paradigm
Virtual shared memory for distributed architectures
Performance visualization for distributed shared memory systems
Virtual shared memory for distributed architectures
High-performance implementation and analysis of the linkmap program
Computers and Biomedical Research
Overview of high performance computers
Handbook of massive data sets
SMiLE: an integrated, multi-paradigm software infrastructure for SCI-based clusters
Future Generation Computer Systems - Selected papers from CCGRID 2002
Exploiting high-level coherence information to optimize distributed shared state
Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming
A high performance get-put interface for ATM communications
CASCON '97 Proceedings of the 1997 conference of the Centre for Advanced Studies on Collaborative research
CASCON '96 Proceedings of the 1996 conference of the Centre for Advanced Studies on Collaborative research
Lock improvement technique for release consistency in distributed shared memory systems
FRONTIERS '96 Proceedings of the 6th Symposium on the Frontiers of Massively Parallel Computation
Shared Memory NUMA Programming on I-WAY
HPDC '96 Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing
Logging and Recovery in Adaptive Software Distributed Shared Memory Systems
SRDS '99 Proceedings of the 18th IEEE Symposium on Reliable Distributed Systems
An Effective Selection Policy for Load Balancing in Software DSM
ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
Laurasia and Wind: A Compiler-Controlled Software DSM for WS/PC Clusters
IWIA '99 Proceedings of the 1999 International Workshop on Innovative Architecture
Locality and Performance of Page- and Object-Based DSMs
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
Update Protocols and Iterative Scientific Applications
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
Sourcebook of parallel computing
A Load Balancing Framework for Adaptive and Asynchronous Applications
IEEE Transactions on Parallel and Distributed Systems
Programming Environment Based on Distributed Shared Arrays for High-Performance Scientific Computing
SAINT-W '04 Proceedings of the 2004 Symposium on Applications and the Internet-Workshops (SAINT 2004 Workshops)
A Group-Based Load Balance Scheme for Software Distributed Shared Memory Systems
The Journal of Supercomputing
Homeless and home-based Lazy Release Consistency protocols on Distributed Shared Memory
ACSC '04 Proceedings of the 27th Australasian conference on Computer science - Volume 26
Merging, sorting and matrix operations on the SOME-bus multiprocessor architecture
Future Generation Computer Systems - Special issue: Advanced services for clusters and internet computing
Optimizing OpenMP programs on software distributed shared memory systems
International Journal of Parallel Programming - Special issue: OpenMP: Experiences and implementations
CAS-DSM: a compiler assisted software distributed shared memory
International Journal of Parallel Programming
A unified theory of shared memory consistency
Journal of the ACM (JACM)
Resource Policing to Support Fine-Grain Cycle Stealing in Networks of Workstations
IEEE Transactions on Parallel and Distributed Systems
Application-level checkpointing for shared memory programs
ASPLOS XI Proceedings of the 11th international conference on Architectural support for programming languages and operating systems
A page-coherent, causally consistent protocol for distributed shared memory
Journal of Systems and Software
A computation offloading scheme on handheld devices
Journal of Parallel and Distributed Computing - Special issue on middleware
Performance analysis of methods that overcome false sharing effects in software DSMs
Journal of Parallel and Distributed Computing
Survey of Architectures of Parallel Database Systems
Programming and Computing Software
Distributed Shared Arrays: An Integration of Message Passing and Multithreading on SMP Clusters
The Journal of Supercomputing
International Journal of Systems Science
Combined compile-time and runtime-driven, pro-active data movement in software DSM systems
LCR '04 Proceedings of the 7th workshop on Workshop on languages, compilers, and run-time support for scalable systems
Improving the Performance of Software Distributed Shared Memory with Speculation
IEEE Transactions on Parallel and Distributed Systems
Metacomputing with the ICET System
International Journal of High Performance Computing Applications
International Journal of High Performance Computing Applications
Shared memory computing on clusters with symmetric multiprocessors and system area networks
ACM Transactions on Computer Systems (TOCS)
Towards automatic translation of OpenMP to MPI
Proceedings of the 19th annual international conference on Supercomputing
Webcom-G: grid enabled metacomputing
Neural, Parallel & Scientific Computations - Special issue: Grid computing
Scalable database replication through dynamic multiversioning
CASCON '05 Proceedings of the 2005 conference of the Centre for Advanced Studies on Collaborative research
Note: The distributed virtual shared-memory system based on the InfiniBand architecture
Journal of Parallel and Distributed Computing - Special issue: Design and performance of networks for super-, cluster-, and grid-computing: Part I
Exploiting distributed version concurrency in a transactional memory cluster
Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming
Study of OpenMP applications on the InfiniBand-based software distributed shared-memory system
Parallel Computing - OpenMp
Exploiting NIC architectural support for enhancing IP-based protocols on high-performance networks
Journal of Parallel and Distributed Computing - Special issue: Design and performance of networks for super-, cluster-, and grid-computing: Part II
Proceedings of the 33rd annual international symposium on Computer Architecture
Running OpenMP applications efficiently on an everything-shared SDSM
Journal of Parallel and Distributed Computing - Special issue: 18th International parallel and distributed processing symposium
A Transparent Distributed Shared Memory for Clustered Symmetric Multiprocessors
The Journal of Supercomputing
Scalability issues in urban traffic systems
InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
TMA: a trap-based memory architecture
Proceedings of the 20th annual international conference on Supercomputing
Distributed Shared Memory for Roaming Large Volumes
IEEE Transactions on Visualization and Computer Graphics
CycleMeter: detecting fraudulent peers in internet cycle sharing
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
The leganet system: Freshness-aware transaction routing in a database cluster
Information Systems
A grid-enabled software distributed shared memory system on a wide area network
Future Generation Computer Systems
Execution model of three parallel languages: OpenMP, UPC and CAF
Scientific Programming - International Symposium of Parallel and Distributed Computing & International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogenous Networks
Parallel processing with windows NT networks
NT'97 Proceedings of the USENIX Windows NT Workshop on The USENIX Windows NT Workshop 1997
Implementing optimized distributed data sharing using scoped behaviour and a class library
COOTS'97 Proceedings of the 3rd conference on USENIX Conference on Object-Oriented Technologies (COOTS) - Volume 3
SLIC: an extensibility system for commodity operating systems
ATEC '98 Proceedings of the annual conference on USENIX Annual Technical Conference
Adaptive and reliable parallel computing on networks of workstations
ATEC '97 Proceedings of the annual conference on USENIX Annual Technical Conference
Thread clustering: sharing-aware scheduling on SMP-CMP-SMT multiprocessors
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
TupleDSM: an educational tool for software distributed shared memory
WCAE-5 '99 Proceedings of the 1999 workshop on Computer architecture education
Software transactional memory for large scale clusters
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Optimizing irregular shared-memory applications for clusters
Proceedings of the 22nd annual international conference on Supercomputing
MemX: supporting large memory workloads in Xen virtual machines
VTDC '07 Proceedings of the 2nd international workshop on Virtualization technology in distributed computing
Global trees: a framework for linked data structures on distributed memory parallel systems
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
CORAL - online monitoring in distributed applications: issues and solutions
WSEAS Transactions on Computers
ACM Transactions on Programming Languages and Systems (TOPLAS)
Programming model for a heterogeneous x86 platform
Proceedings of the 2009 ACM SIGPLAN conference on Programming language design and implementation
Disaggregated memory for expansion and sharing in blade servers
Proceedings of the 36th annual international symposium on Computer architecture
A Software Transactional Memory Service for Grids
ICA3PP '09 Proceedings of the 9th International Conference on Algorithms and Architectures for Parallel Processing
Sinfonia: A new paradigm for building scalable distributed systems
ACM Transactions on Computer Systems (TOCS)
The multikernel: a new OS architecture for scalable multicore systems
Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles
Engineering Distributed Shared Memory Middleware for Java
OTM '09 Proceedings of the Confederated International Conferences, CoopIS, DOA, IS, and ODBASE 2009 on On the Move to Meaningful Internet Systems: Part I
Implicit and explicit transactions in a distributed transactional memory system
PDCN '08 Proceedings of the IASTED International Conference on Parallel and Distributed Computing and Networks
Algorithms for memory hierarchies: advanced lectures
Algorithms for memory hierarchies: advanced lectures
A characterization of shared data access patterns in UPC programs
LCPC'06 Proceedings of the 19th international conference on Languages and compilers for parallel computing
VECPAR'06 Proceedings of the 7th international conference on High performance computing for computational science
Supporting realistic OpenMP applications on a commodity cluster of workstations
WOMPAT'03 Proceedings of the OpenMP applications and tools 2003 international conference on OpenMP shared memory parallel programming
Experiences using OpenMP based on compiler directed software DSM on a PC cluster
WOMPAT'03 Proceedings of the OpenMP applications and tools 2003 international conference on OpenMP shared memory parallel programming
Compiling for a hybrid programming model using the LMAD representation
LCPC'01 Proceedings of the 14th international conference on Languages and compilers for parallel computing
Cloud-TM: harnessing the cloud with distributed transactional memories
ACM SIGOPS Operating Systems Review
Cohesion: a hybrid memory model for accelerators
Proceedings of the 37th annual international symposium on Computer architecture
Your computer is already a distributed system. why isn't your OS?
HotOS'09 Proceedings of the 12th conference on Hot topics in operating systems
DASH: a Recipe for a Flash-based Data Intensive Supercomputer
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
A framework for process migration in software DSM environments
EURO-PDP'00 Proceedings of the 8th Euromicro conference on Parallel and distributed processing
Evaluation of a virtual shared memory machine by the compilation of data parallel loops
EURO-PDP'00 Proceedings of the 8th Euromicro conference on Parallel and distributed processing
EURO-PDP'00 Proceedings of the 8th Euromicro conference on Parallel and distributed processing
Monitoring remotely executing shared memory programs in software DSMs
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Reducing data access latency in SDSM systems using runtime optimizations
Proceedings of the 2010 Conference of the Center for Advanced Studies on Collaborative Research
Efficient system-enforced deterministic parallelism
OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Scalable Speculative Parallelization on Commodity Clusters
MICRO '43 Proceedings of the 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture
Resource optimization in a LAN environment using SMIG-shared memory integrated with grid
Proceedings of the International Conference & Workshop on Emerging Trends in Technology
Snake: control flow distributed software transactional memory
SSS'11 Proceedings of the 13th international conference on Stabilization, safety, and security of distributed systems
Distributed anemone: transparent low-latency access to remote memory
HiPC'06 Proceedings of the 13th international conference on High Performance Computing
Simple, list-based parallel programming with transparent load balancing
PPAM'05 Proceedings of the 6th international conference on Parallel Processing and Applied Mathematics
View-Oriented parallel programming and view-based consistency
PDCAT'04 Proceedings of the 5th international conference on Parallel and Distributed Computing: applications and Technologies
Analysis of the abortion rate on lazy replication protocols
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
HiPC'04 Proceedings of the 11th international conference on High Performance Computing
Efficient system-enforced deterministic parallelism
Communications of the ACM
Supporting the OpenMP programming interface on teamster-g
GPC'06 Proceedings of the First international conference on Advances in Grid and Pervasive Computing
On paged distributed virtual memory algorithms in a broadcasting environment
Computer Communications
Finding, expressing and managing parallelism in programs executed on clusters of workstations
Computer Communications
Update protocols and cluster-based shared memory
Computer Communications
SnuCL: an OpenCL framework for heterogeneous CPU/GPU clusters
Proceedings of the 26th ACM international conference on Supercomputing
Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems
Performance evaluation of view-oriented parallel programming on cluster of computers
HPCC'07 Proceedings of the Third international conference on High Performance Computing and Communications
CUDA-for-clusters: a system for efficient execution of CUDA kernels on multi-core clusters
Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
Computers and Electrical Engineering
SemCache: semantics-aware caching for efficient GPU offloading
Proceedings of the 27th international ACM conference on International conference on supercomputing
Proceedings of the 19th international conference on Architectural support for programming languages and operating systems
GPUfs: Integrating a file system with GPUs
ACM Transactions on Computer Systems (TOCS)
A compound OpenMP/MPI program development toolkit for hybrid CPU/GPU clusters
The Journal of Supercomputing
NSDI'14 Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation
Hi-index | 4.16 |
High-speed networks and improved microprocessor performance are making networks of workstations an appealing, low-cost vehicle for parallel computing. The hardware cost is essentially zero, since many organizations already have extensive workstation networks. In terms of performance, networked workstations can approach or exceed supercomputer performance for some applications, especially those without stringent synchronization and communication requirements. The authors' implementation, called TreadMarks, supports parallel computing on networks of workstations by providing the application with a shared memory abstraction. Shared memory facilitates the transition from sequential to parallel programs. After identifying possible sources of parallelism in the code, most of the data structures can be retained without change, and only synchronization needs to be added to achieve a correct shared-memory parallel program. TreadMarks is implemented entirely as a user-level library on top of Unix. Kernel modifications are unnecessary because modern Unix implementations provide all required communication and memory management functions. Programs written in C, C++, or Fortran are compiled and linked with the TreadMarks library using any standard compiler for that language. As a result, the system is fairly portable, and it has been ported to a number of platforms. These include IBM RS-6000, SP-1, and SP-2; DEC Alpha and DECStation; and Hewlett-Packard, Silicon Graphics, and Sun systems.