Pathological voice analysis and classification based on empirical mode decomposition

  • Authors:
  • Gastón Schlotthauer;María E. Torres;Hugo L. Rufiner

  • Affiliations:
  • Lab. de Señales y Dinámicas no Lineales, Fac. de Ingeniería, Universidad Nacional de Entre Ríos, Oro Verde, Entre Ríos, Argentina;Lab. de Señales y Dinámicas no Lineales, Fac. de Ingeniería, Universidad Nacional de Entre Ríos, Oro Verde, Entre Ríos, Argentina;Lab. de Cibernética, Fac. de Ingeniería, UNER, Oro Verde, Entre Ríos, Argentina

  • Venue:
  • COST'09 Proceedings of the Second international conference on Development of Multimodal Interfaces: active Listening and Synchrony
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Empirical mode decomposition (EMD) is an algorithm for signal analysis recently introduced by Huang. It is a completely data-driven non-linear method for the decomposition of a signal into AM - FM components. In this paper two new EMD-based methods for the analysis and classification of pathological voices are presented. They are applied to speech signals corresponding to real and simulated sustained vowels. We first introduce a method that allows the robust extraction of the fundamental frequency of sustained vowels. Its determination is crucial for pathological voice analysis and diagnosis. This new method is based on the ensemble empirical mode decomposition (EEMD) algorithm and its performance is compared with others from the state of the art. As a second EMD-based tool, we explore spectral properties of the intrinsic mode functions and apply them to the classification of normal and pathological sustained vowels. We show that just using a basic pattern classification algorithm, the selected spectral features of only three modes are enough to discriminate between normal and pathological voices.