Journal of VLSI Signal Processing Systems - Special issue on recent development in video: algorithms, implementation and applications
Perceptual optimization for scalable video compression based on visual masking principles
IEEE Transactions on Circuits and Systems for Video Technology
Hi-index | 0.00 |
The discrete wavelet transform (DWT) provides an advantageous framework of multiresolution space-frequency representation with promising applications in image processing. The challenge as well as the opportunity in wavelet-based compression is to exploit the characteristics of the subband coefficients with respect to both spectral and spatial localities. A common problem with many existing quantization methods is that the inherent image structures are severely distorted with coarse quantization. Observation shows that subband coefficients with the same magnitude generally do not have the same perceptual importance. We propose in this paper a scene adaptive and signal adaptive quantization scheme capable of exploiting the spectral and spatial localization properties resulting from the wavelet transform. The quantization is implemented as maximum a posteriori probability estimation-based clustering in which subband coefficients are quantized to their cluster means, subject to local spatial constraints. The intensity distribution of each cluster within a subband is modeled by an optimal Laplacian source to achieve signal adaptivity, while spatial constraints are enforced by appropriate Gibbs random fields (GRF) to achieve scene adaptivity. With spatially isolated coefficients removed and clustered coefficients retained at the same time, the available bits are allocated to visually important scene structures so that the information loss is least perceptible. Furthermore, the reconstruction noise in the decompressed image can be suppressed using another GRF-based enhancement algorithm