Tight integration of timing-driven synthesis and placement of parallel multiplier circuits

Authors:
Keoncheol Shin;Taewhan Kim
Affiliations:
Department of Electrical Engineering and Computer Science, Korea Advanced Institute of Science and Technology, Daejon, 305-701, Korea;School of Electrical Engineering and Computer Science, Seoul National University, Seoul 151-744 Korea
Venue:
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Year:
2004

Citing 18
Cited 2

Incremental tree height reduction for high level synthesis

DAC '91 Proceedings of the 28th ACM/IEEE Design Automation Conference
Redundant operator creation: a scheduling optimization technique

DAC '91 Proceedings of the 28th ACM/IEEE Design Automation Conference
Improving multiplier design by using improved column compression tree and optimized final adder in CMOS technology

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
High-level algorithm and architecture transformations for DSP synthesis

Journal of VLSI Signal Processing Systems - Special issue on design environments for DSP
Design strategies for optimal hybrid final adders in a parallel multiplier

Journal of VLSI Signal Processing Systems - Special issue on VLSI arithmetic and implementations
A specification invariant technique for operation cost minimisation in flow-graphs

ISSS '94 Proceedings of the 7th international symposium on High-level synthesis
Optimal Circuits for Parallel Multipliers

IEEE Transactions on Computers
Maximally fast and arbitrarily fast implementation of linear computations

ICCAD '92 Proceedings of the 1992 IEEE/ACM international conference on Computer-aided design
An Optimal Allocation of Carry-Save-Adders in Arithmetic Circuits

IEEE Transactions on Computers
Signal representation guided synthesis using carry-save adders for synchronous data-path circuits

Proceedings of the 38th annual Design Automation Conference
Improved merging of datapath operators using information content and required precision analysis

Proceedings of the 38th annual Design Automation Conference
Computer Arithmetic

Computer Arithmetic
Computer Arithmetic: Principles, Architecture and Design

Computer Arithmetic: Principles, Architecture and Design
A Method for Speed Optimized Partial Product Reduction and Generation of Fast Parallel Multipliers Using an Algorithmic Approach

IEEE Transactions on Computers
Implementing Multiply-Accumulate Operation in Multiplication Time

ARITH '97 Proceedings of the 13th Symposium on Computer Arithmetic (ARITH '97)
A Specification Invariant Technique for Regularity Improvement between Flow-Graph Clusters

EDTC '96 Proceedings of the 1996 European conference on Design and Test
A linear-time heuristic for improving network partitions

DAC '82 Proceedings of the 19th Design Automation Conference
Circuit optimization using carry-save-adder cells

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

Integrated algorithmic logical and physical design of integer multiplier

Proceedings of the 2005 Asia and South Pacific Design Automation Conference
Placement based multiplier rewiring for cell-based designs

Proceedings of the 2008 IEEE/ACM International Conference on Computer-Aided Design

Quantified Score

Hi-index	0.00

Visualization

Abstract

In deep submicrometer (DSM) design, the interconnect delay becomes equally as or more important than that of logic gates. In particular, to achieve timing closure in DSM design, it is essential to consider the interconnect delay at an early stage of the synthesis process. Unfortunately, few successes of achieving a tight link of front-end synthesis to back-end layout have been reported, from a practical point of view, mainly due to the inaccuracy of predicting the layout effects during the synthesis. In this brief, we address a new approach to the problem of synthesis of parallel multiplier circuits combined with the consideration of layout effects. The approach is intended to overcome some of the limitations of the previous works, in which the effects of layout on the synthesis have either not been taken into account or considered only in local and limited ways, or the computation time is extremely large. The proposed approach refines the structure and placement of the circuit by iteratively performing two tasks. Task 1: timing-driven relocation. For a parallel multiplier circuit that was restructured at the prior iteration, we attempt to replace the modules in the structure while retaining the interconnects to find a placement with shorter timing. Task 2: timing-driven resynthesis. We attempt to restructure the interconnect topology of the placement obtained from Task 1 to further reduce the circuit timing, employing two heuristics: a modified version of timing-optimal FA-tree allocation by Stelling et al. (1996), considering interconnect delay, and a critical path-based local interconnect refinement. The iterative mechanism of the two tasks practically tightly integrates the synthesis and placement tasks so that both of the effects of placement on the results of synthesis and the effects of synthesis on the results of placement are taken into account. From experiments using a set of benchmark designs, it is shown that the approach is quite effective and efficient, producing designs with less interconnect delay over the sequential method of synthesis and placement.