Acoustic variability and automatic recognition of children's speech

  • Authors:
  • Matteo Gerosa;Diego Giuliani;Fabio Brugnara

  • Affiliations:
  • ITC-IRST, Centro per la Ricerca Scientifica e Tecnologica, I-38050 Povo, Trento, Italy;ITC-IRST, Centro per la Ricerca Scientifica e Tecnologica, I-38050 Povo, Trento, Italy;ITC-IRST, Centro per la Ricerca Scientifica e Tecnologica, I-38050 Povo, Trento, Italy

  • Venue:
  • Speech Communication
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents several acoustic analyses carried out on read speech collected from Italian children aged from 7 to 13 years and North American children aged from 5 to 17 years. These analyses aimed at achieving a better understanding of spectral and temporal changes in speech produced by children of various ages in view of the development of automatic speech recognition applications. The results of these analyses confirm and complement the results reported in the literature, showing that characteristics of children's speech change with age and that spectral and temporal variability decrease as age increases. In fact, younger children show a substantially higher intra- and inter-speaker variability with respect to older children and adults. We investigated the use of several methods for speaker adaptive acoustic modeling to cope with inter-speaker spectral variability and to improve recognition performance for children. These methods proved to be effective in recognition of read speech with a vocabulary of about 11k words.