Speech authentication by semi-fragile speech watermarking utilizing analysis by synthesis and spectral distortion optimization

  • Authors:
  • Bin Yan;Yin-Jing Guo

  • Affiliations:
  • Department of Communication Engineering, School of Information and Electrical Engineering, Shandong University of Science and Technology, Qingdao, People's Republic of China 266510 (both authors)

  • Venue:
  • Multimedia Tools and Applications
  • Year:
  • 2013

Abstract

This paper proposes an improved semi-fragile speech watermarking scheme based on quantization of linear prediction (LP) parameters, specifically the inverse sine (IS) parameters. The spectral distortion caused by watermark embedding is controlled to meet the 'transparency' criterion used in speech coding. A modified bit allocation algorithm, combined with watermarking, is developed to determine the quantization step so that the transparency requirement is satisfied. Because LP estimation is statistical in nature, the LP coefficients estimated from the watermarked speech signal differ from the watermarked LP coefficients even in the absence of attacks. This mismatch increases both the decoding error and the minimum authentication length. To tackle this problem, an analysis-by-synthesis (AbS) scheme is developed to reduce the difference between the estimated LP coefficients and the watermarked ones. The watermark detection threshold and the minimum authentication length are then derived from the probability-of-error and signal-to-noise ratio (SNR) requirements. Experimental results show that the proposed AbS-based method effectively reduces the difference between the watermarked IS parameters and the extracted IS parameters when there are no attacks. In addition, the modified bit allocation algorithm automatically finds an appropriate quantization step for the odd-even modulation so that the transparency requirement is satisfied.
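
The odd-even modulation mentioned in the abstract can be illustrated with a minimal Python sketch. This is only a generic parity-based quantizer applied to a single IS parameter (the arcsine of a reflection coefficient), not the authors' implementation: the function names `embed_bit` and `extract_bit`, the fixed `step` value, and the example coefficient are assumptions made for illustration. The paper's bit allocation algorithm, which chooses the step, and its AbS loop are not reproduced here.

```python
import math


def embed_bit(value, bit, step):
    """Move `value` to the nearest multiple of `step` whose index parity equals `bit`.

    Hypothetical helper: even indices carry bit 0, odd indices carry bit 1.
    """
    scaled = value / step
    index = round(scaled)
    if index % 2 != bit:
        # Parity mismatch: step to the neighbouring index closer to the input.
        index += 1 if scaled >= index else -1
    return index * step


def extract_bit(value, step):
    """Recover the bit as the parity of the nearest quantization index."""
    return round(value / step) % 2


# Illustrative values only (not taken from the paper).
reflection_coeff = 0.5
is_param = math.asin(reflection_coeff)  # inverse sine (IS) parameter
step = 0.05                             # assumed quantization step

for bit in (0, 1):
    marked = embed_bit(is_param, bit, step)
    # The embedding distortion never exceeds one quantization step.
    assert abs(marked - is_param) <= step + 1e-12
    # A perturbation well below step/2 leaves the decoded bit unchanged.
    assert extract_bit(marked + 0.2 * step, step) == bit
    # A shift of one full step flips the decoded bit.
    assert extract_bit(marked + step, step) == 1 - bit
```

In this sketch a larger `step` tolerates larger perturbations before the decoded bit flips, but it also moves the parameter further from its original value. This is the trade-off that the transparency constraint on spectral distortion is meant to bound.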