From acoustics to Vocal Tract time functions

Authors:
Vikramjit Mitra;Yucel Ozbek;Hosung Nam;Xinhui Zhou;Carol Y. Espy-Wilson
Affiliations:
Department of Electrical and Computer Engineering, University of Maryland, College Park, USA;Department of Electrical and Computer Engineering, Middle East Technical University, Turkey;Haskins Laboratories, New Haven, CT, USA;Department of Electrical and Computer Engineering, University of Maryland, College Park, USA;Department of Electrical and Computer Engineering, University of Maryland, College Park, USA
Venue:
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Year:
2009

Citing 0
Cited 2

Statistical methods for estimation of direct and differential kinematics of the vocal tract

Speech Communication
Research on the distal supervised learning model of speech inversion

ICICA'12 Proceedings of the Third international conference on Information Computing and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we present a technique for obtaining Vocal Tract (VT) time functions from the acoustic speech signal. Knowledge-based Acoustic Parameters (APs) are extracted from the speech signal and a pertinent subset is used to obtain the mapping between them and the VT time functions. Eight different vocal tract constriction variables consisting of five constriction degree variables, lip aperture (LA), tongue body (TBCD), tongue tip (TTCD), velum (VEL), and glottis (GLO); and three constriction location variables, lip protrusion (LP), tongue tip (TTCL), tongue body (TBCL) were considered in this study. The TAsk Dynamics Application model (TADA [1]) is used to create a synthetic speech dataset along with its corresponding VT time functions. We explore Support Vector Regression (SVR) followed by Kalman smoothing to achieve mapping between the APs and the VT time functions.