Graphics Gems III
MorphoSys: case study of a reconfigurable computing system targeting multimedia applications
Proceedings of the 37th Annual Design Automation Conference
IEEE Transactions on Computers
Internet Streaming SIMD Extensions
Computer
Subword Parallelism with MAX-2
IEEE Micro
MorphoSys: A Reconfigurable Processor Trageted to High Performance Image Application
Proceedings of the 11 IPPS/SPDP'99 Workshops Held in Conjunction with the 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Interactive Ray Tracing Using a SIMD Reconfigurable Architecture
SBAC-PAD '02 Proceedings of the 14th Symposium on Computer Architecture and High Performance Computing
A fast parallel reed-solomon decoder on a reconfigurable architecture
Proceedings of the 1st IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
Fast Parallel FFT on a Reconfigurable Computation Platform
SBAC-PAD '03 Proceedings of the 15th Symposium on Computer Architecture and High Performance Computing
Design and Analysis of a Programmable Single-Chip Architecture for DVB-T Base-Band Receiver
DATE '03 Proceedings of the conference on Design, Automation and Test in Europe - Volume 1
Interactive Ray Tracing on Reconfigurable SIMD MorphoSys
DATE '03 Proceedings of the conference on Design, Automation and Test in Europe: Designers' Forum - Volume 2
A generic network interface architecture for a networked processor array (NePA)
ARCS'08 Proceedings of the 21st international conference on Architecture of computing systems
Design of a router for network-on-chip
International Journal of High Performance Systems Architecture
Hi-index | 0.00 |
We introduce MaRS, a reconfigurable, parallel computing engine with special emphasis on scalability, lending itself to the computation-/data-intensive multimedia data processing and wireless communication. Global communication between the processing elements (PEs) in MaRS is performed through a 2D-mesh deadlock-free network, avoiding any concerns due to non-scalable bus-based communication. Additionally, we have developed a second layer of inter-PE connection realized by distributed shared register files and conditional operands, to enhance the performance of MaRS for those applications demanding a tightly coupled PE array. We have modeled and verified a major part of MaRS. The promising results of our preliminary analyses show that MaRS can efficiently be tailored to different applications offering flexible data communication, and high performance.