Tuning the Pentium Pro Microarchitecture
IEEE Micro
Trident: a scalable architecture for scalar, vector, and matrix operations
CRPIT '02 Proceedings of the seventh Asia-Pacific conference on Computer systems architecture
Real-time stereo within the VIDET Project
Real-Time Imaging
Using Intel Streaming SIMD Extensions for 3D Geometry Processing
PCM '02 Proceedings of the Third IEEE Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
MaRS: a macro-pipelined reconfigurable system
Proceedings of the 1st conference on Computing frontiers
Retargeting Sequential Image-Processing Programs for Data Parallel Execution
IEEE Transactions on Software Engineering
Reconfigurable universal SAD-multiplier array
Proceedings of the 2nd conference on Computing frontiers
Matrix register file and extended subwords: two techniques for embedded media processors
Proceedings of the 2nd conference on Computing frontiers
A PC-based real-time stereo vision system
Machine Graphics & Vision International Journal
Exploiting Vector Parallelism in Software Pipelined Loops
Proceedings of the 38th annual IEEE/ACM International Symposium on Microarchitecture
ASP-DAC '06 Proceedings of the 2006 Asia and South Pacific Design Automation Conference
Avoiding conversion and rearrangement overhead in SIMD architectures
International Journal of Parallel Programming
Quantized color instruction set for media-on-demand applications
ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 2
Limitations of special-purpose instructions for similarity measurements in media SIMD extensions
CASES '06 Proceedings of the 2006 international conference on Compilers, architecture and synthesis for embedded systems
Versatility of extended subwords and the matrix register file
ACM Transactions on Architecture and Code Optimization (TACO)
The Impact of Multimedia Extensions for Multimedia Applications on Mobile Computing Systems
APCHI '08 Proceedings of the 8th Asia-Pacific conference on Computer-Human Interaction
AnySP: anytime anywhere anyway signal processing
Proceedings of the 36th annual international symposium on Computer architecture
Implementation of the DWT using intel IA-32 SIMD extensions
MAMECTIS'08 Proceedings of the 10th WSEAS international conference on Mathematical methods, computational techniques and intelligent systems
Applying Data Mapping Techniques to Vector DSPs
Journal of Signal Processing Systems
Performance Improvement of Multimedia Kernels by Alleviating Overhead Instructions on SIMD Devices
APPT '09 Proceedings of the 8th International Symposium on Advanced Parallel Processing Technologies
Multiplication acceleration through twin precision
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Proceedings of the 19th international symposium on Software testing and analysis
Color-Aware Instructions for Embedded Superscalar Processors
Journal of Signal Processing Systems
ICA3PP'10 Proceedings of the 10th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
Automatic detection of saturation and clipping idioms
LCPC'02 Proceedings of the 15th international conference on Languages and Compilers for Parallel Computing
Algorithms and architectures for 2D discrete wavelet transform
The Journal of Supercomputing
Vector Extensions for Decision Support DBMS Acceleration
MICRO-45 Proceedings of the 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture
SimPL: an algorithm for placing VLSI circuits
Communications of the ACM
Taming the complexity of coordinated place and route
Proceedings of the 50th Annual Design Automation Conference
Exploring the Tradeoffs between Programmability and Efficiency in Data-Parallel Accelerators
ACM Transactions on Computer Systems (TOCS)
Ultra-low-power adder stage design for exascale floating point units
ACM Transactions on Embedded Computing Systems (TECS) - Special Issue on Design Challenges for Many-Core Processors, Special Section on ESTIMedia'13 and Regular Papers
Hi-index | 0.02 |
The SSE provides a rich set of instructions to meet the requirements of demanding multimedia and Internet applications. In implementing the SSE, the Pentium III developers made a number of design trade-offs to satisfy tight die size constraints and attain frequency goals.