Optimal Systolic Design for the Transitive Closure and the Shortest Path Problems
IEEE Transactions on Computers
Matching algorithms to array processors
ACM '87 Proceedings of the 1987 Fall Joint Computer Conference on Exploring technology: today and tomorrow
Perfect Latin squares and parallel array access
ISCA '89 Proceedings of the 16th annual international symposium on Computer architecture
Microprogramming instruction systolic arrays
MICRO 22 Proceedings of the 22nd annual workshop on Microprogramming and microarchitecture
Efficient Algorithms for Reconfiguration in VLSI/WSI Arrays
IEEE Transactions on Computers
Study of parallelism in regular iterative algorithms
SPAA '90 Proceedings of the second annual ACM symposium on Parallel algorithms and architectures
A systolic array processor for biological information signal processing
ICS '91 Proceedings of the 5th international conference on Supercomputing
Time Optimal Linear Schedules for Algorithms with Uniform Dependencies
IEEE Transactions on Computers
An Optimal Systolic Array for the Algebraic Path Problem
IEEE Transactions on Computers
Detecting static algorithms by partial evaluation
PEPM '91 Proceedings of the 1991 ACM SIGPLAN symposium on Partial evaluation and semantics-based program manipulation
Compile-Time Scheduling and Assignment of Data-Flow Program Graphs with Data-Dependent Iteration
IEEE Transactions on Computers
A fast static scheduling algorithm for DAGs on an unbounded number of processors
Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Three-Dimensional Structured Networks for Matrix Equation Solving
IEEE Transactions on Computers - Special issue on artificial neural networks
Executing DSP Applications in a Fine-Grained Dataflow Environment
IEEE Transactions on Software Engineering
Minimizing the number of delay buffers in the synchronization of pipelined systems
DAC '91 Proceedings of the 28th ACM/IEEE Design Automation Conference
Parallel solution of Hough transform and convolution problems—a novel multimodal approach
SAC '92 Proceedings of the 1992 ACM/SIGAPP symposium on Applied computing: technological challenges of the 1990's
Some Combinatorial Aspects of Parallel Algorithm Design for Matrix Multiplication
IEEE Transactions on Computers
Design of Optimal Systolic Algorithms for the Transitive Closure Problem
IEEE Transactions on Computers
Decomposition of Complex Multipliers Using Polynomial Encoding
IEEE Transactions on Computers
A partial evaluator for data flow graphs
PEPM '93 Proceedings of the 1993 ACM SIGPLAN symposium on Partial evaluation and semantics-based program manipulation
A multi level testability assistant for VLSI design
EURO-DAC '92 Proceedings of the conference on European design automation
Design of Efficient Regular Arrays for Matrix Multiplication by Two-Step Regularization
IEEE Transactions on Parallel and Distributed Systems
Some New Designs of 2-D Array for Matrix Multiplication and Transitive Closure
IEEE Transactions on Parallel and Distributed Systems
Algorithm-Based Fault Tolerant Synthesis for Linear Operations
IEEE Transactions on Computers
IEEE Transactions on Parallel and Distributed Systems
From VHDL to efficient and first-time-right designs: a formal approach
ACM Transactions on Design Automation of Electronic Systems (TODAES)
Optimizing systems for effective block-processing: the k-delay problem
DAC '96 Proceedings of the 33rd annual Design Automation Conference
Achieving Full Parallelism Using Multidimensional Retiming
IEEE Transactions on Parallel and Distributed Systems
Optimal Data Scheduling for Uniform Multidimensional Applications
IEEE Transactions on Computers
On the Scalability of 2-D Discrete Wavelet Transform Algorithms
Multidimensional Systems and Signal Processing
Fast Implementation of 3-D Digital Filters Via SystolicArray Processors
Multidimensional Systems and Signal Processing
Architectural Synthesis of Digital Signal ProcessingAlgorithms Using “IRIS”
Journal of VLSI Signal Processing Systems - Special issue on the 1995 VLSI signal processing workshop
Journal of VLSI Signal Processing Systems
An Efficient VLSI Architecture for Full-Search Block MatchingAlgorithms
Journal of VLSI Signal Processing Systems
High throughput pipelined data path synthesis by conserving the regularity of nested loops
ICCAD '93 Proceedings of the 1993 IEEE/ACM international conference on Computer-aided design
A Self-Testing Nonincreasing Order Checker
IEEE Transactions on Computers
Node-covering, Error-correcting Codes and Multiprocessors with Very High Average Fault Tolerance
IEEE Transactions on Computers
Performance optimization of wireless local area networks through VLSI data compression
Wireless Networks - Special issue VLSI in wireless networks
A Unifying Lattice-Based Approach for the Partitioning of Systolic Arrays via LPGS and LSGP
Journal of VLSI Signal Processing Systems
A Linear Systolic Array for Real-Time Morphological Image Processing
Journal of VLSI Signal Processing Systems
Vlsi Array Architectures for Pyramid Vector Quantization
Journal of VLSI Signal Processing Systems - Special issue on future directions in the design and implementations of DSP systems
Efficient Partitioning of Algorithms for Long Convolutions and their Mapping onto Architectures
Journal of VLSI Signal Processing Systems - Special issue on systematic trade-off analysis in signal processing systems design
Systolic Array Implementation of Euclid's Algorithm for Inversion and Division in GF (2m)
IEEE Transactions on Computers
Automatic Generation of Modular Time-Space Mappings and Data Alignments
Journal of VLSI Signal Processing Systems - Special issue on application specific systems, architectures and processors
Journal of VLSI Signal Processing Systems
A Systolic Design Methodology with Application toFull-Search Block-Matching Architectures
Journal of VLSI Signal Processing Systems
Data-Driven Control Scheme for Linear Arrays: Application to a Stable Insertion Sorter
IEEE Transactions on Parallel and Distributed Systems
Multimedia Signal Processors: An Architectural Platform with Algorithmic Compilation
Journal of VLSI Signal Processing Systems - special issue on multimedia signal processing
ICCAD '92 Proceedings of the 1992 IEEE/ACM international conference on Computer-aided design
Advances in computer simulation
ANSS '91 Proceedings of the 24th annual symposium on Simulation
Parallel Implementation of Multidimensional Transforms without Interprocessor Communication
IEEE Transactions on Computers
Alpha du centaur: a prototype environment for the design of parallel regular alorithms
ICS '89 Proceedings of the 3rd international conference on Supercomputing
A Systolic Array Implementation of the Feng-Rao Algorithm
IEEE Transactions on Computers
A novel high throughput reconfigurable FPGA architecture
FPGA '00 Proceedings of the 2000 ACM/SIGDA eighth international symposium on Field programmable gate arrays
Buffer Assignment Algorithms on Data Driven ASICs
IEEE Transactions on Computers
Using Data Flow Information to Obtain Efficient Check Sets for Algorithm-Based Fault Tolerance
International Journal of Parallel Programming
Compaan: deriving process networks from Matlab for embedded signal processing architectures
CODES '00 Proceedings of the eighth international workshop on Hardware/software codesign
A Space-Time Representation Method of Iterative Algorithms for the Design of Processor Arrays
Journal of VLSI Signal Processing Systems
Architectural Synthesis of Computational Engines for Subband Adaptive Filtering
Journal of VLSI Signal Processing Systems
Fast MPEG-4 Motion Estimation: Processor Based and Flexible VLSI Implementations
Journal of VLSI Signal Processing Systems - Special issue on implementation of MPEG-4 multimedia codecs
Synthesizing transformations for locality enhancement of imperfectly-nested loop nests
Proceedings of the 14th international conference on Supercomputing
Linear QR Architecture for a Single Chip Adaptive Beamformer
Journal of VLSI Signal Processing Systems - Special issue on recent advances in the design and implementation of signal processing systems
DG2VHDL: A Tool to Facilitate the High Level Synthesisof Parallel Processing Array Architectures
Journal of VLSI Signal Processing Systems - Special issue on recent advances in the design and implementation of signal processing systems
DATE '00 Proceedings of the conference on Design, automation and test in Europe
Journal of VLSI Signal Processing Systems - Special issue on VLSI on custom computing technology
Dedicated Circuits for the Generation of Windows in Image Processing Architectures
Journal of VLSI Signal Processing Systems
Optimizing computations for effective block-processing
ACM Transactions on Design Automation of Electronic Systems (TODAES)
A brief history of cellular automata
ACM Computing Surveys (CSUR)
Finding Quadratic Schedules for Affine Recurrence Equations Via Nonsmooth Optimization
Journal of VLSI Signal Processing Systems
New Systolic Arrays for C + AB2, Inversion, and Division in GF(2m)
IEEE Transactions on Computers
IEEE Transactions on Computers
Interconnect pipelining in a throughput-intensive FPGA architecture
FPGA '01 Proceedings of the 2001 ACM/SIGDA ninth international symposium on Field programmable gate arrays
Reconfigurable Filter Coprocessor Architecture for DSP Applications
Journal of VLSI Signal Processing Systems
Radix-4 modular multiplication and exponentiation algorithms for the RSA public-key cryptosystem
ASP-DAC '00 Proceedings of the 2000 Asia and South Pacific Design Automation Conference
A CORDIC based array architecture for complex discrete wavelet transform
GLSVLSI '01 Proceedings of the 11th Great Lakes symposium on VLSI
Transformations for imperfectly nested loops
Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
RSA cryptosystem design based on the Chinese remainder theorem
Proceedings of the 2001 Asia and South Pacific Design Automation Conference
An Efficient Algorithm-Based Fault Tolerance Design Using the Weighted Data-Check Relationship
IEEE Transactions on Computers
Systolic Opportunities for Multidimensional Data Streams
IEEE Transactions on Parallel and Distributed Systems
Journal of VLSI Signal Processing Systems - Parallel VLSI architectures for image and video processing
Scalable Linear Array Architecture with Data-Driven Control for Ultrahigh-Speed Vector Quantization
Journal of VLSI Signal Processing Systems - Parallel VLSI architectures for image and video processing
Loop re-ordering and pre-fetching at run-time
SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
Synthesizing Transformations for Locality Enhancement of Imperfectly-Nested Loop Nests
International Journal of Parallel Programming
Ptolemy: a framework for simulating and prototyping heterogeneous systems
Readings in hardware/software co-design
Automated design synthesis and partitioning for adaptive reconfigurable hardware
Hardware implementation of intelligent systems
Digit-serial-in-serial-out systolic multiplier for Montgomery algorithm
Information Processing Letters
Synthesis of Embedded Software from Synchronous Dataflow Specifications
Journal of VLSI Signal Processing Systems
Development of a Run-Time Reconfiguration System with Low Reconfiguration Overhead
Journal of VLSI Signal Processing Systems
Performance-Scalable Array Architectures for Modular Multiplication
Journal of VLSI Signal Processing Systems
Efficient VLSI Architectures for Multiuser Channel Estimation in Wireless Base-Station Receivers
Journal of VLSI Signal Processing Systems
Journal of VLSI Signal Processing Systems
Design of Processor Arrays for Reconfigurable Architectures
The Journal of Supercomputing
Processor Array Synthesis from Shift-Variant Deep Nested Do Loops
The Journal of Supercomputing
Parallel Processing for Biomedical Signal Processing
Computer - Special issue on computer-based medical systems
On the Relationship Between Two Systolic Array Design Methodologies
IEEE Transactions on Computers
Performance Tradeoffs in Rings of Data-Driven Elements
IEEE Transactions on Computers
A Hybrid Neural Network Model for Solving Optimization Problems
IEEE Transactions on Computers
Reconfigurability and Reliability of Systolic/Wavefront Arrays
IEEE Transactions on Computers
A Systolic Architecture for Computing Inverses and Divisions in Finite Fields GF(2/sup m/)
IEEE Transactions on Computers
Reconfiguring Processor Arrays Using Multiple-Track Models: The 3Track-Spare-Approach
IEEE Transactions on Computers
Diagnosis by Signature Analysis of Test Responses
IEEE Transactions on Computers
Design of Space-Optimal Regular Arrays for Algorithms with Linear Schedules
IEEE Transactions on Computers
VLSI Architectures for High-Speed Range Estimation
IEEE Transactions on Pattern Analysis and Machine Intelligence
On Mapping Systolic Algorithms onto the Hypercube
IEEE Transactions on Parallel and Distributed Systems
Consistency in Dataflow Graphs
IEEE Transactions on Parallel and Distributed Systems
A VLSI Constant Geometry Architecture for the Fast Hartley and Fourier Transforms
IEEE Transactions on Parallel and Distributed Systems
On Time Mapping of Uniform Dependence Algorithms into Lower Dimensional Processor Arrays
IEEE Transactions on Parallel and Distributed Systems
Parallel Implementation of the Extended Square-Root Covariance Filter for Tracking Applications
IEEE Transactions on Parallel and Distributed Systems
On the Granularity and Clustering of Directed Acyclic Task Graphs
IEEE Transactions on Parallel and Distributed Systems
Synthesis of Algorithm-Based Fault-Tolerant Systems from Dependence Graphs
IEEE Transactions on Parallel and Distributed Systems
HARP: An Open Architecture for Parallel Matrix and Signal Processing
IEEE Transactions on Parallel and Distributed Systems
A General Methodology of Partitioning and Mapping for Given Regular Arrays
IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Parallel and Distributed Systems
Generation of Injective and Reversible Modular Mappings
IEEE Transactions on Parallel and Distributed Systems
Systolic multiplier for Montgomery's algorithm
Integration, the VLSI Journal
Two-Dimensional Scheduling of Algorithms with Uniform Dependencies
PaCT '999 Proceedings of the 5th International Conference on Parallel Computing Technologies
PPAM '01 Proceedings of the th International Conference on Parallel Processing and Applied Mathematics-Revised Papers
A Power-Sum Systolic Architecture in GF(2m)
ICOIN '02 Revised Papers from the International Conference on Information Networking, Wireless Communications Technologies and Network Applications-Part II
Localization of Data Transfer in Processor Arrays
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
Area Efficient Exponentiation Using Modular Multiplier/Squarer in GF(2m
COCOON '01 Proceedings of the 7th Annual International Conference on Computing and Combinatorics
Efficient Power-Sum Systolic Architectures for Public-Key Cryptosystems in GF(2m)
COCOON '02 Proceedings of the 8th Annual International Conference on Computing and Combinatorics
High Level Simulation & Modeling for Medical Applications - Ultrasound Case
MICCAI '02 Proceedings of the 5th International Conference on Medical Image Computing and Computer-Assisted Intervention-Part II
Data-Reuse and Parallel Embedded Architectures for Low-Power, Real-Time Multimedia Applications
PATMOS '00 Proceedings of the 10th International Workshop on Integrated Circuit Design, Power and Timing Modeling, Optimization and Simulation
Mulitdimensional Streams Rooted in Dataflow
PACT '93 Proceedings of the IFIP WG10.3. Working Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism
Custom Coprocessor Based Matrix Algorithms for Image and Signal Processing
FPL '02 Proceedings of the Reconfigurable Computing Is Going Mainstream, 12th International Conference on Field-Programmable Logic and Applications
Inversion/Division Systolic Architecture for Public-Key Cryptosystems in GF(2m)
ISC '02 Proceedings of the 5th International Conference on Information Security
Automatic Synthesis of Motion Estimation Processors Based on a New Class of Hardware Architectures
Journal of VLSI Signal Processing Systems
Generation of distributed loop control
Embedded processor design challenges
A new 2-D systolic digital filter architecture without global broadcast
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Efficient Systolic Array Mapping of FIR Filters Used in PAM-QAM Modulators
Journal of VLSI Signal Processing Systems
Architectural design of array processors for multi-dimensional discrete Fourier transform
Highly parallel computaions
Hexagonal systolic arrays for matrix multiplication
Highly parallel computaions
Parallel ray tracing on a chip
Practical parallel rendering
Interconnect-aware high-level synthesis for low power
Proceedings of the 2002 IEEE/ACM international conference on Computer-aided design
Buffer size optimization for full-search block matching algorithms
ASAP '97 Proceedings of the IEEE International Conference on Application-Specific Systems, Architectures and Processors
A strategy for determining a Jacobi specific dataflow processor
ASAP '97 Proceedings of the IEEE International Conference on Application-Specific Systems, Architectures and Processors
Architectural approaches for video compression
ASAP '97 Proceedings of the IEEE International Conference on Application-Specific Systems, Architectures and Processors
Optimizing synchronous systems for multi-dimensional applications
EDTC '95 Proceedings of the 1995 European conference on Design and Test
Optimal Scheduling for Fast Systolic Array Implementations
EDTC '97 Proceedings of the 1997 European conference on Design and Test
Fully Parallel Hardware/Software Codesign for Multi-Dimensional DSP Applications
CODES '96 Proceedings of the 4th International Workshop on Hardware/Software Co-Design
The importance of interfaces: a HW/SW codesign case study
CODES '97 Proceedings of the 5th International Workshop on Hardware/Software Co-Design
FTCS '96 Proceedings of the The Twenty-Sixth Annual International Symposium on Fault-Tolerant Computing (FTCS '96)
Novel CORDIC-Based Systolic Arrays for the DFT and the DHT
HPC-ASIA '97 Proceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '97
Comparisons of fast 2D-DCT algorithms for parallel programmable digital signal processors
HPC-ASIA '97 Proceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '97
Design of a 3-bit Booth recoded novel VLSI concurrent multiplier-accumulator architecture
VLSID '95 Proceedings of the 8th International Conference on VLSI Design
Retiming of synchronous circuits with variable topology
VLSID '95 Proceedings of the 8th International Conference on VLSI Design
Architectures for Arithmetic over GF(2^m)
VLSID '97 Proceedings of the Tenth International Conference on VLSI Design: VLSI in Multimedia Applications
A New Systolic Array Algorithm for Memory-Based VLSI Array Implementation of DCT
ISCC '97 Proceedings of the 2nd IEEE Symposium on Computers and Communications (ISCC '97)
Memory Hierarchy Optimization of Multimedia Applications on Programmable Embedded Cores 1
ISQED '01 Proceedings of the 2nd International Symposium on Quality Electronic Design
Mapping deep nested do-loop DSP algorithms to large scale FPGA array structures
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
PITIA: an FPGA for throughput-intensive applications
IEEE Transactions on Very Large Scale Integration (VLSI) Systems - Special section on the 2001 international conference on computer design (ICCD)
IEEE Transactions on Very Large Scale Integration (VLSI) Systems - Special section on the 2001 international conference on computer design (ICCD)
Towards a declarative framework for hardware-software codesign
CODES '94 Proceedings of the 3rd international workshop on Hardware/software co-design
Highly pipelined asynchronous FPGAs
FPGA '04 Proceedings of the 2004 ACM/SIGDA 12th international symposium on Field programmable gate arrays
High-Speed, Low-Complexity Systolic Designs of Novel Iterative Division Algorithms in GF(2^m)
IEEE Transactions on Computers
Power Analysis of a General Convolution Algorithm Mapped on a Linear Processor Array
Journal of VLSI Signal Processing Systems
Real-Time Imaging - Special issue on software engineering
Journal of VLSI Signal Processing Systems - Special issue on signal processing and neural networks for bioinformatics
SBCCI '04 Proceedings of the 17th symposium on Integrated circuits and system design
An Asynchronous Dataflow FPGA Architecture
IEEE Transactions on Computers
High Speed FPGA-Based Implementations of Delayed-LMS Filters
Journal of VLSI Signal Processing Systems
Joint channel estimation and data detection under fading on reconfigurable fabric
Integration, the VLSI Journal
New Processor Array Architectures for the Longest Common Subsequence Problem
The Journal of Supercomputing
JPEG, MPEG-4, and H.264 Codec IP Development
Proceedings of the conference on Design, Automation and Test in Europe - Volume 2
Area-efficient two-dimensional architectures for finite field inversion and division
GLSVLSI '05 Proceedings of the 15th ACM Great Lakes symposium on VLSI
Journal of Systems and Software - Special issue: Software engineering education and training
Rank order filters and priority queues
Distributed Computing
A fault-tolerant message passing algorithm and its hardware implementation
Advances in Engineering Software
A digit-serial multiplier for finite field GF(2m)
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
A Threshold Neuron Model Based on the Processing of Difference Slices
Cybernetics and Systems Analysis
Processor Array Architectures for Deep Packet Classification
IEEE Transactions on Parallel and Distributed Systems
Toward a Realistic Task Scheduling Model
IEEE Transactions on Parallel and Distributed Systems
Implementation of the super-systolic array for convolution
ASP-DAC '03 Proceedings of the 2003 Asia and South Pacific Design Automation Conference
Design of an application-specific PLD architecture
Proceedings of the 2005 Asia and South Pacific Design Automation Conference
A fast digit-serial systolic multiplier for finite field GF(2m)
Proceedings of the 2005 Asia and South Pacific Design Automation Conference
Survey on Block Matching Motion Estimation Algorithms and Architectures with New Results
Journal of VLSI Signal Processing Systems
Low power synthesizable register files for processor and IP cores
Integration, the VLSI Journal - Special issue: Low-power design techniques
ACM Transactions on Design Automation of Electronic Systems (TODAES)
A hierarchical design methodology for full-search block matching motion estimation
Multidimensional Systems and Signal Processing
Parallel Asynchronous Watershed Algorithm-Architecture
IEEE Transactions on Parallel and Distributed Systems
An architecture for the estimation of higher order cumulants
ICASSP '93 Proceedings of the Acoustics, Speech, and Signal Processing, 1993. ICASSP-93 Vol 4., 1993 IEEE International Conference on - Volume 04
On-Chip Communication Architectures: System on Chip Interconnect
On-Chip Communication Architectures: System on Chip Interconnect
A power-based adaptive method for eigenanalysis without square-root operations
Digital Signal Processing
A programmable array processor architecture for flexible approximate string matching algorithms
Journal of Parallel and Distributed Computing
A Scalable Configurable Architecture for Advanced Wireless Communication Algorithms
Journal of VLSI Signal Processing Systems
The Journal of Supercomputing
Design of efficient architectures for 1-D and 2-D DLMS adaptive filters
Integration, the VLSI Journal
Journal of VLSI Signal Processing Systems
Implementing fine grain processor arrays on field-programmable logic
Integrated Computer-Aided Engineering
Efficient implementation of nested-loop multimedia algorithms
EURASIP Journal on Applied Signal Processing
Power modeling and efficient FPGA implementation of FHT for signal processing
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
A new strategy for multiprocessor scheduling of cyclic task graphs
International Journal of High Performance Computing and Networking
Processor array architectures for flexible approximate string matching
Journal of Systems Architecture: the EUROMICRO Journal
Analyzing concurrency in streaming applications
Journal of Systems Architecture: the EUROMICRO Journal
Multiplexer-based bit-parallel systolic multipliers over GF(2m)
Computers and Electrical Engineering
Transforming dependence graphs into signal flow graphs during systolic array processors design
EHAC'08 Proceedings of the 7th WSEAS International Conference on Electronics, Hardware, Wireless and Optical Communications
Semi-systolic Modular Multiplier over GF(2m)
ICCSA '08 Proceedings of the international conference on Computational Science and Its Applications, Part II
Trend and Challenge on System-on-a-Chip Designs
Journal of Signal Processing Systems
Adaptive Duplicated Filters and Interference Canceller for DS-CDMA Systems
Journal of Signal Processing Systems
Proceedings of the 2009 Asia and South Pacific Design Automation Conference
Reconfigurable Computing: The Theory and Practice of FPGA-Based Computation
Reconfigurable Computing: The Theory and Practice of FPGA-Based Computation
Automated Design Space Exploration for DSP Applications
Journal of Signal Processing Systems
A fault-tolerant message passing algorithm and its hardware implementation
Advances in Engineering Software
Parallel image processing with the block data parallel architecture
IBM Journal of Research and Development
Near Real Time Enhancement of Remote Sensing Imagery Based on a Network of Systolic Arrays
CIARP '09 Proceedings of the 14th Iberoamerican Conference on Pattern Recognition: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Low power synthesizable register files for processor and IP cores
Integration, the VLSI Journal - Special issue: Low-power design techniques
Systolic array for string matching in NIDS
AsiaCSN '07 Proceedings of the Fourth IASTED Asian Conference on Communication Systems and Networks
WSEAS Transactions on Signal Processing
IEEE Transactions on Circuits and Systems for Video Technology
Scheduling task graphs optimally with A*
The Journal of Supercomputing
Paper: Program compression on the instruction systolic array
Parallel Computing
Paper: An FP-based tool for the synthesis of regular array algorithms
Parallel Computing
Paper: Systolic algorithm for polynomial interpolation and related problems
Parallel Computing
Efficient fixed-size systolic arrays for the modular multiplication
COCOON'99 Proceedings of the 5th annual international conference on Computing and combinatorics
A parallel architecture for motion estimation and DCT computation in MPEG-2 encoder
ICA3PP'07 Proceedings of the 7th international conference on Algorithms and architectures for parallel processing
Journal of Signal Processing Systems
Computationally efficient parallel matrix-matrix multiplication on the torus
ISHPC'05/ALPS'06 Proceedings of the 6th international symposium on high-performance computing and 1st international conference on Advanced low power systems
Parallel implementation of Cholesky LLT-algorithm in FPGA-based processor
PPAM'07 Proceedings of the 7th international conference on Parallel processing and applied mathematics
Performance evaluation of basic linear algebra subroutines on a matrix co-processor
PPAM'07 Proceedings of the 7th international conference on Parallel processing and applied mathematics
EURASIP Journal on Advances in Signal Processing
Journal of Signal Processing Systems
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
The general matrix multiply-add operation on 2D torus
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Reconfigurable architecture of systolic array processors for remote sensing applications
SSIP '09/MIV'09 Proceedings of the 9th WSEAS international conference on signal, speech and image processing, and 9th WSEAS international conference on Multimedia, internet & video technologies
Block processing for rank order filtering using the rank order state machine architecture
ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: plenary, special, audio, underwater acoustics, VLSI, neural networks - Volume I
ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: plenary, special, audio, underwater acoustics, VLSI, neural networks - Volume I
Systematic design of full adder-based architectures for convolution
ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: plenary, special, audio, underwater acoustics, VLSI, neural networks - Volume I
Representing and exploiting data parallelism using multidimensional dataflow diagrams
ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: plenary, special, audio, underwater acoustics, VLSI, neural networks - Volume I
A highly parameterized and efficient FPGA-based skeleton for pairwise biological sequence alignment
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
High performance phylogenetic analysis with maximum parsimony on reconfigurable hardware
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
VLSI systolic binary tree-searched vector quantizer for image compression
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
ALADIN: a multilevel testability analyzer for VLSI system design
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
A dynamically reconfigurable interconnect for array processors
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Efficient VLSI for Lempel-Ziv compression in wireless data communication networks
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Topological Patterns for Scalable Representation and Analysis of Dataflow Graphs
Journal of Signal Processing Systems
Semi-systolic architecture for modular multiplication over GF(2m)
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part III
A new digit-serial systolic mulitplier for high performance GF(2m) applications
HPCC'05 Proceedings of the First international conference on High Performance Computing and Communications
Low complexity systolic architecture for modular multiplication over GF(2m)
ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part I
Area-Time efficient systolic architecture for the DCT
ACSAC'05 Proceedings of the 10th Asia-Pacific conference on Advances in Computer Systems Architecture
2-D discrete cosine transform (DCT) on meshes with hierarchical control modes
IbPRIA'05 Proceedings of the Second Iberian conference on Pattern Recognition and Image Analysis - Volume Part I
Reducing the number of processors elements in systolic arrays for matrix multiplication
SEPADS'12/EDUCATION'12 Proceedings of the 11th WSEAS international conference on Software Engineering, Parallel and Distributed Systems, and proceedings of the 9th WSEAS international conference on Engineering Education
CP-based SBHT-RLS algorithms for tracking channel estimates in multicarrier modulation systems
Journal of Electrical and Computer Engineering - Special issue on Implementations of Signal-Processing Algorithms for OFDM Systems
International Journal of Circuit Theory and Applications
Synthesis of a unidirectional systolic array for matrix-vector multiplication
Mathematical and Computer Modelling: An International Journal
Scalable Gaussian Normal Basis Multipliers over GF(2m) Using Hankel Matrix-Vector Representation
Journal of Signal Processing Systems
Processor array design with the use of genetic algorithm
LSSC'11 Proceedings of the 8th international conference on Large-Scale Scientific Computing
Two-level pipelining of systolic array graphics engines
EGGH'89 Proceedings of the Fourth Eurographics conference on Advances in Computer Graphics Hardware
Full-Hardware Architectures for Data-Dependent Superimposed Training Channel Estimation
Journal of Signal Processing Systems
International Journal of Reconfigurable Computing - Special issue on Selected Papers from the 2011 International Conference on Reconfigurable Computing and FPGAs (ReConFig 2011)
Synthesis and array processor realization of a 2-D IIR beam filter for wireless applications
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Parallel architectures for the kNN classifier -- design of soft IP cores and FPGA implementations
ACM Transactions on Embedded Computing Systems (TECS) - Special issue on application-specific processors
Hi-index | 0.09 |