Inspired by the attractive Flops/dollar ratio and the rapid growth in the speed of modern graphics processing units (GPUs), we propose using a cluster of GPUs for high performance scientific computing. As an example application, we have developed a parallel flow simulation using the lattice Boltzmann model (LBM) on a GPU cluster and have simulated the dispersion of airborne contaminants in the Times Square area of New York City. Using 30 GPU nodes, our simulation computes a 480 x 400 x 80 LBM lattice at 0.31 seconds per step, which is 4.6 times faster than our CPU cluster implementation. Besides the LBM, we also discuss other potential applications of the GPU cluster, such as cellular automata, PDE solvers, and the finite element method (FEM).
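To put the quoted figures in perspective, the short sketch below derives the throughput they imply. It uses only the numbers stated in the abstract. The function and variable names are illustrative, the per-node figure assumes the work divides evenly across the 30 nodes, and the CPU time per step assumes the 4.6x speedup refers to the same lattice size.

```python
# Figures quoted in the abstract.
LATTICE = (480, 400, 80)        # LBM lattice dimensions
SECONDS_PER_STEP_GPU = 0.31     # GPU cluster time per simulation step
GPU_NODES = 30                  # number of GPU nodes used
SPEEDUP_OVER_CPU = 4.6          # GPU cluster speed relative to CPU cluster


def lattice_cells(dims):
    """Return the total number of lattice cells for the given dimensions."""
    n = 1
    for d in dims:
        n *= d
    return n


def throughput(dims, seconds_per_step):
    """Return cell updates per second for one full-lattice step."""
    return lattice_cells(dims) / seconds_per_step


cells = lattice_cells(LATTICE)
total_rate = throughput(LATTICE, SECONDS_PER_STEP_GPU)

# Assumption: work is split evenly across nodes (not stated in the abstract).
per_node_rate = total_rate / GPU_NODES

# Assumption: the speedup compares the same lattice on both clusters.
cpu_seconds_per_step = SECONDS_PER_STEP_GPU * SPEEDUP_OVER_CPU

print(f"cells per step:        {cells:,}")
print(f"cell updates / second: {total_rate:,.0f}")
print(f"per node / second:     {per_node_rate:,.0f}")
print(f"implied CPU s/step:    {cpu_seconds_per_step:.3f}")
```

Under these assumptions the lattice holds 15,360,000 cells, so the GPU cluster sustains roughly 49.5 million cell updates per second (about 1.65 million per node), and the CPU cluster would need about 1.43 seconds per step.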