Estimation of relevant time-frequency features using Kendall coefficient for articulator position inference

  • Authors:
  • Alexander SepúLveda;Rodrigo Capobianco Guido;G. Castellanos-Dominguez

  • Affiliations:
  • Universidad Nacional de Colombia, Signal Processing and Recognition Group, Km. 9, Vía al aeropuerto, Campus La Nubia, Manizales, Colombia;University of São Paulo (USP), Institute of Physics at São Carlos (IFSC), Department of Physics and Informatics, Avenida Trabalhador Sãocarlense 400, 13566-590 São Carlos, SP, ...;Universidad Nacional de Colombia, Signal Processing and Recognition Group, Km. 9, Vía al aeropuerto, Campus La Nubia, Manizales, Colombia

  • Venue:
  • Speech Communication
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

The determination of relevant acoustic information for the inference of articulators position is an open issue. This paper presents a method to estimate those acoustic features better related to articulators movement. The input feature set is based on time-frequency representation calculated from the speech signal, whose parametrization is achieved using the wavelet-packet transform. The main focus is on measuring the relevant acoustic information, in terms of statistical association, for the inference of articulator positions. The rank correlation Kendall coefficient is used as the relevance measure. Attained statistical association is validated using the @g^2 information measure. The maps of relevant time-frequency features are calculated for the MOCHA-TIMIT database, where the articulatory information is represented by trajectories of specific positions in the vocal tract. Relevant maps are estimated over the whole speech signal as well as on specific phones, for which a given articulator is known to be critical. The usefulness of the relevant maps is tested in an acoustic-to-articulatory mapping system based on gaussian mixture models.