Parallel, Pipelined and Folded Architectures for Computation of 1-D and 2-D DCT in Image and Video Codec

  • Authors:
  • Shen-Fu Hsiao;Jian-Ming Tseng

  • Affiliations:
  • Institute of Computer and Information Engineering, National Sun Yat-Sen University, No. 70, Lien-Hai Road, Kaohsiung 804, Taiwan, R.O.C;Institute of Computer and Information Engineering, National Sun Yat-Sen University, No. 70, Lien-Hai Road, Kaohsiung 804, Taiwan, R.O.C

  • Venue:
  • Journal of VLSI Signal Processing Systems - Parallel VLSI architectures for image and video processing
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Several parallel, pipelined and folded architectures with different throughput rates are presented for computation of DCT, one of the fundamental operations in image/video coding. This paper begins with a new decomposition algorithm for the 1-D DCT coefficient matrix. Then the 2-D DCT problem is converted into the corresponding 1-D counterpart through a regular index mapping technique. Afterward, depending on the trade-off between hardware complexity and speed performance, the derived decomposition algorithm is transformed into different parallel-pipelined and folded architectures that realize the butterfly operations and the post-processing operations. Compared to other DCT processor, our proposed parallel-pipelined architectures, without any intermediate transpose memory, have the features of modularity, regularity, locality, scalability, and pipelinability, with arithmetic hardware cost proportional to the logarithm of the transform length.