A VLIW architecture for a trace Scheduling Compiler
IEEE Transactions on Computers - Special issue on architectural support for programming languages and operating systems
ISCA '85 Proceedings of the 12th annual international symposium on Computer architecture
Allowing for ILP in an embedded Java processor
Proceedings of the 27th annual international symposium on Computer architecture
IEEE Transactions on Computers
Automatic Architectural Synthesis of VLIW and EPIC Processors
Proceedings of the 12th international symposium on System synthesis
Complexity-effective superscalar processors
Complexity-effective superscalar processors
High-Level Modeling and FPGA Prototyping of Produced Order Parallel Queue Processor Core
The Journal of Supercomputing
The QC-2 parallel Queue processor architecture
Journal of Parallel and Distributed Computing
A new code generation algorithm for 2-offset producer order queue computation model
Computer Languages, Systems and Structures
Dual-execution mode processor architecture
The Journal of Supercomputing
Design and architecture for an embedded 32-bit QueueCore
Journal of Embedded Computing - Issues in embedded single-chip multicore architectures
Design and implementation of a queue compiler
Microprocessors & Microsystems
Efficient compilation for queue size constrained queue processors
Parallel Computing
Compiler Support for Code Size Reduction Using a Queue-Based Processor
Transactions on High-Performance Embedded Architectures and Compilers II
Using NoC routers as processing elements
Proceedings of the 22nd Annual Symposium on Integrated Circuits and System Design: Chip on the Dunes
Compiling for Reduced Bit-Width Queue Processors
Journal of Signal Processing Systems
EUC'07 Proceedings of the 2007 international conference on Embedded and ubiquitous computing
Dual-execution mode processor architecture for embedded applications
Journal of Mobile Multimedia
On the design of a dual-execution modes processor: architecture and preliminary evaluation
ISPA'06 Proceedings of the 2006 international conference on Frontiers of High Performance Computing and Networking
Modular design structure and high-level prototyping for novel embedded processor core
EUC'05 Proceedings of the 2005 international conference on Embedded and Ubiquitous Computing
Hi-index | 0.00 |
This paper proposes novel produced order parallel queue processor architecture. To store intermediate results, the proposed system uses a first-in-first-out (FIFO) circular queue-registers instead of random access registers. Datum is inserted in the queue-registers in produced order scheme and can be reused. We show that this feature has profound implications in the areas of parallel execution, programs compactness, hardware simplicity and high execution speed.Our performance evaluations show a significant performance improvement (e.g., 10 to 26% decrease in program size and 6 to 46% decrease in execution time over a range of benchmark programs) when compared with the earlier proposed architecture.