A multimedia application: spatial perceptual entropy of multichannel audio signals

  • Authors:
  • Shuixian Chen;Ruimin Hu;Naixue Xiong

  • Affiliations:
  • Computer School, Wuhan University, Wuhan, China;Computer School, Wuhan University, Wuhan, China and National Engineering Research Center for Multimedia Software, Wuhan University, Wuhan, China;Computer School, Wuhan University, Wuhan, China

  • Venue:
  • EURASIP Journal on Wireless Communications and Networking - Special issue on multimedia communications over next generation wireless networks
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Usually multimedia data have to be compressed before transmitting, and higher compression rate, or equivalently lower bitrate, relieves the load of communication channels but impacts negatively the quality. We investigate the bitrate lower bound for perceptually lossless compression of a major type of multimedia--multichannel audio signals. This bound equals to the perceptible information rate of the signals. Traditionally, Perceptual Entropy (PE), based primarily on monaural hearing measures the perceptual information rate of individual channels. But PE cannot measure the spatial information captured by binaural hearing, thus is not suitable for estimating Spatial Audio Coding (SAC) bitrate bound. To measure this spatial information, we build a Binaural Cue Physiological Perception Model (BCPPM) on the ground of binaural hearing, which represents spatial information in the physical and physiological layers. This model enables computing Spatial Perceptual Entropy (SPE), the lower bitrate bound for SAC. For real-world stereo audio signals of various types, our experiments indicate that SPE reliably estimates their spatial information rate. Therefore, "SPE plus PE" gives lower bitrate bounds for communicating multichannel audio signals with transparent quality.