On mispronunciation analysis of individual foreign speakers using auditory periphery models

  • Authors:
  • Christos Koniaris;Giampiero Salvi;Olov Engwall

  • Affiliations:
  • Centre for Speech Technology, School of Computer Science & Communication, KTH - Royal Institute of Technology, Lindstedtsväägen 24, SE-100 44 Stockholm, Sweden;Centre for Speech Technology, School of Computer Science & Communication, KTH - Royal Institute of Technology, Lindstedtsväägen 24, SE-100 44 Stockholm, Sweden;Centre for Speech Technology, School of Computer Science & Communication, KTH - Royal Institute of Technology, Lindstedtsväägen 24, SE-100 44 Stockholm, Sweden

  • Venue:
  • Speech Communication
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In second language (L2) learning, a major difficulty is to discriminate between the acoustic diversity within an L2 phoneme category and that between different categories. We propose a general method for automatic diagnostic assessment of the pronunciation of non-native speakers based on models of the human auditory periphery. Considering each phoneme class separately, the geometric shape similarity between the native auditory domain and the non-native speech domain is measured. The phonemes that deviate the most from the native pronunciation for a set of L2 speakers are detected by comparing the geometric shape similarity measure with that calculated for native speakers on the same phonemes. To evaluate the system, we have tested it with different non-native speaker groups from various language backgrounds. The experimental results are in accordance with linguistic findings and human listeners' ratings, particularly when both the spectral and temporal cues of the speech signal are utilized in the pronunciation analysis.