Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown

  • Authors:
  • H. Kawahara;R. Nisimura;T. Irino;M. Morise;T. Takahashi;H. Banno

  • Affiliations:
  • Design Information Sciences Department, Faculty of Systems Eng., Wakayama Univ., 930 Sakaedani, 640-8510, Japan; Design Information Sciences Department, Faculty of Systems Eng., Wakayama Univ., 930 Sakaedani, 640-8510, Japan; Design Information Sciences Department, Faculty of Systems Eng., Wakayama Univ., 930 Sakaedani, 640-8510, Japan; Kwansei Gakuin University, Japan; Kyoto University, Japan; Meijo University, Japan

  • Venue:
  • ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
  • Year:
  • 2009

Abstract

A generalized framework of auditory morphing based on the speech analysis, modification and resynthesis system STRAIGHT is proposed that enables each morphing rate of representational aspects to be a function of time, including the temporal axis itself. Two types of algorithms were derived: an incremental algorithm for real-time manipulation of morphing rates and a batch processing algorithm for off-line post-production applications. By defining morphing in terms of the derivative of mapping functions in the logarithmic domain, breakdown of morphing resynthesis found in the previous formulation in the case of extrapolations was eliminated. A method to alleviate perceptual defects in extrapolation is also introduced.
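
The remark about defining morphing through the derivative of mapping functions in the logarithmic domain can be illustrated with a small sketch. This is not the STRAIGHT implementation or the authors' algorithm; it is one reading of the idea, applied to a time-axis mapping. It assumes that each source is described by the positive local slope of its time mapping, and all names and values (`slope_a`, `slope_b`, `rate`, the helper functions) are hypothetical. Blending the slopes linearly can produce a negative slope once the morphing rate leaves the range 0 to 1, so the integrated time axis runs backwards. Blending the logarithms of the slopes keeps every slope positive, so the integrated mapping stays monotonic.

```python
import math
from itertools import accumulate


def blend_slopes_linear(slope_a, slope_b, rate):
    """Blend slopes directly: (1 - r) * a + r * b. Can go negative when r < 0 or r > 1."""
    return [(1.0 - r) * a + r * b for a, b, r in zip(slope_a, slope_b, rate)]


def blend_slopes_log(slope_a, slope_b, rate):
    """Blend log-slopes, then exponentiate. The result is positive for any rate."""
    return [
        math.exp((1.0 - r) * math.log(a) + r * math.log(b))
        for a, b, r in zip(slope_a, slope_b, rate)
    ]


def integrate(slopes, step=1.0):
    """Accumulate slopes into a time mapping (simple rectangular rule)."""
    return list(accumulate(s * step for s in slopes))


def is_increasing(values):
    """True if every value is strictly larger than the one before it."""
    return all(later > earlier for earlier, later in zip(values, values[1:]))


# Hypothetical per-frame slopes of the two time mappings (both strictly positive).
slope_a = [1.0, 1.0, 1.0, 1.0]
slope_b = [2.0, 0.5, 2.0, 0.5]

# Hypothetical time-varying morphing rate; 2.5 is an extrapolation beyond source B.
rate = [0.0, 1.0, 2.5, 2.5]

time_linear = integrate(blend_slopes_linear(slope_a, slope_b, rate))
time_log = integrate(blend_slopes_log(slope_a, slope_b, rate))

print("linear-domain mapping monotonic:", is_increasing(time_linear))
print("log-domain mapping monotonic:   ", is_increasing(time_log))
```

In this toy setting the morphing rate is a per-frame list, which loosely mirrors the abstract's point that each morphing rate may be a function of time.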