Distributed time, conservative parallel logic simulation on GPUs

Authors:
Bo Wang;Yuhao Zhu;Yangdong Deng
Affiliations:
Tsinghua University;Beihang University;Tsinghua University
Venue:
Proceedings of the 47th Design Automation Conference
Year:
2010

Citing 10
Cited 3

Introduction to algorithms

Introduction to algorithms
An evaluation of the Chandy-Misra-Bryant algorithm for digital logic simulation

ACM Transactions on Modeling and Computer Simulation (TOMACS) - Special issue on parallel and distributed systems performance
Parallel logic simulation of VLSI systems

ACM Computing Surveys (CSUR)
System-on-a-chip verification: methodology and techniques

System-on-a-chip verification: methodology and techniques
Parallel and Distribution Simulation Systems

Parallel and Distribution Simulation Systems
SIMULATION OF PACKET COMMUNICATION ARCHITECTURE COMPUTER SYSTEMS

SIMULATION OF PACKET COMMUNICATION ARCHITECTURE COMPUTER SYSTEMS
Distributed Simulation: A Case Study in Design and Verification of Distributed Programs

IEEE Transactions on Software Engineering
Towards acceleration of fault simulation using graphics processing units

Proceedings of the 45th annual Design Automation Conference
Event-driven gate-level simulation with GP-GPUs

Proceedings of the 46th Annual Design Automation Conference
GCS: high-performance gate-level simulation with GP-GPUs

Proceedings of the Conference on Design, Automation and Test in Europe

Towards accelerating irregular EDA applications with GPUs

Integration, the VLSI Journal
A high-throughput, high-accuracy system-level simulation framework for system on chips

VLSI Design
Accelerating RTL simulation with GPUs

Proceedings of the International Conference on Computer-Aided Design

Quantified Score

Hi-index	0.00

Visualization

Abstract

Logical simulation is the primary method to verify the correctness of IC designs. However, today's complex VLSI designs pose ever higher demand for the throughput of logic simulators. In this work, a parallel logic simulator was developed by leveraging the computing power of modern graphics processing units (GPUs). To expose more parallelism, we implemented a conservative parallel simulation approach, the CMB algorithm, on NVidia GPUs. The simulation processing is mapped to GPU hardware at the finest granularity. With carefully designed data structures and data flow organizations, our GPU based simulator could overcome many problems that hindered efficient implementations of the CMB algorithm on traditional parallel computers. In order to efficiently use the relatively limited capacity of GPU memory, a novel memory management mechanism was proposed to dynamically allocate and recycle GPU memory during simulation. We also introduced a CPU/GPU co-processing strategy for the best usage of computing resources. Experimental results showed that our GPU based simulator could outperform a CPU baseline event driven simulator by a factor of 29.2.