A Coarse-Grain Phased Logic CPU

Authors:
Robert B. Reese;Mitchell A. Thornton;Cherrice Traver
Affiliations:
-;-;-
Venue:
ASYNC '03 Proceedings of the 9th International Symposium on Asynchronous Circuits and Systems
Year:
2003

Citing 11
Cited 6

Micropipelines

Communications of the ACM
Efficient self-timing with level-encoded 2-phase dual-rail (LEDR)

Proceedings of the 1991 University of California/Santa Cruz conference on Advanced research in VLSI
Phased Logic: Supporting the Synchronous Design Paradigm with Delay-Insensitive Circuitry

IEEE Transactions on Computers
Statistical clock skew modeling with data delay variations

IEEE Transactions on Very Large Scale Integration (VLSI) Systems - System Level Design
An Asynchronous Low-Power 80C51 Microcontroller

ASYNC '98 Proceedings of the 4th International Symposium on Advanced Research in Asynchronous Circuits and Systems
AMULET3 Revealed

ASYNC '99 Proceedings of the 5th International Symposium on Advanced Research in Asynchronous Circuits and Systems
Asynchronous Design Using Commercial HDL Synthesis Tools

ASYNC '00 Proceedings of the 6th International Symposium on Advanced Research in Asynchronous Circuits and Systems
The Design of an Asynchronous MIPS R3000 Microprocessor

ARVLSI '97 Proceedings of the 17th Conference on Advanced Research in VLSI (ARVLSI '97)
Generalized Early Evaluation in Self-Timed Circuits

Proceedings of the conference on Design, automation and test in Europe
Arithmetic Logic Circuits using Self-Timed Bit Level Dataflow and Early Evaluation

ICCD '01 Proceedings of the International Conference on Computer Design: VLSI in Computers & Processors
Marked directed graphs

Journal of Computer and System Sciences

Performance enhancement in phased logic circuits using automatic slack-matching buffer insertion

Proceedings of the 14th ACM Great Lakes symposium on VLSI
A Coarse-Grain Phased Logic CPU

IEEE Transactions on Computers
Design automation of real-life asynchronous devices and systems

Foundations and Trends in Electronic Design Automation
Using functional independence conditions to optimize the performance of latency-insensitive systems

Proceedings of the 2007 IEEE/ACM international conference on Computer-aided design
Automated Synchronous-to-Asynchronous Circuits Conversion: A Survey

Integrated Circuit and System Design. Power and Timing Modeling, Optimization and Simulation
Leveraging local intracore information to increase global performance in block-based design of systems-on-chip

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

A five-stage pipelined CPU based on the MIPs ISA is mapped to a self-timed implementation scheme known as Phased Logic (PL). The mapping is performed automatically from a netlist of D-Flip-Flops and 4-input Lookup Tables (LUT4s) to a netlist of PL blocks. Each PL block is composed of control logic wrapped around a collection of DFFs and LUT4s to form a multi-input/output PL gate. PL offers a speedup technique known as early evaluation that can be used to boost performance at the cost of additional logic within each block. In addition to early evaluation, this implementation uses bypass paths in the ALU for shift and logical instructions and buffering stages for increased dataflow to further improve performance. Additional speedup is gained by reordering instructions to provide more opportunity for early evaluation. Simulation results show an average speedup of 41% compared to the clocked netlist over a suite of five benchmark programs.