Underpinning /nailon/: Automatic Estimation of Pitch Range and Speaker Relative Pitch

  • Authors:
  • Jens Edlund;Mattias Heldner

  • Affiliations:
  • KTH Speech, Music and Hearing, Stockholm, Sweden;KTH Speech, Music and Hearing, Stockholm, Sweden

  • Venue:
  • Speaker Classification II
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this study, we explore what is needed to get an automatic estimation of speaker relative pitch that is good enough for many practical tasks in speech technology. We present analyses of fundamental frequency (F0) distributions from eight speakers with a view to examine (i) the effect of semitone transform on the shape of these distributions; (ii) the errors resulting from calculation of percentiles from the means and standard deviations of the distributions; and (iii) the amount of voiced speech required to obtain a robust estimation of speaker relative pitch. In addition, we provide a hands-on description of how such an estimation can be obtained under real-time online conditions using /nailon/ --- our software for online analysis of prosody.