Analysis of inconsistencies in cross-lingual automatic ToBI tonal accent labeling

  • Authors: David Escudero-Mancebo; Carlos Vivaracho Pascual; César González Ferreras; Valentín Cardeñoso-Payo; Lourdes Aguilar

  • Affiliations: Dpt. of Computer Sciences, Universidad de Valladolid, Spain (Escudero-Mancebo, Vivaracho Pascual, González Ferreras, Cardeñoso-Payo); Dpt. of Spanish Philology, Universidad Autónoma de Barcelona, Spain (Aguilar)

  • Venue: TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
  • Year: 2011

Abstract

This paper presents an experimental study of how corpus-based automatic prosodic labeling can be transferred from a source language to a different target language. Tonal accent identification models trained for Spanish on the ESMA corpus are used to automatically assign ToBI tonal accent labels to the (English) Boston Radio News corpus, and vice versa. Using only local raw prosodic acoustic features, we obtained correct annotation rates of about 75%, which provides a good starting point for speeding up the automatic prosodic labeling of new unlabeled corpora. Although the acoustic input features differ in range and relevance across the two corpora, a comparison of the results against manual labeling profiles indicates the potential of the procedure.
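
As a rough illustration of the cross-corpus setup described in the abstract, the following minimal sketch trains a model on one corpus and scores it on another. It is not the authors' method: the nearest-centroid classifier, the per-corpus z-score normalization (one possible way to cope with differing feature ranges), the two feature columns, the binary accented/unaccented labels, and all numeric values are assumptions made up for this example.

```python
from statistics import mean, pstdev


def zscore_per_corpus(rows):
    """Normalize each feature column with this corpus's own mean and std."""
    cols = list(zip(*rows))
    mus = [mean(c) for c in cols]
    sds = [pstdev(c) or 1.0 for c in cols]  # guard against constant columns
    return [[(x - m) / s for x, m, s in zip(row, mus, sds)] for row in rows]


def train_centroids(rows, labels):
    """Nearest-centroid model: one mean feature vector per label."""
    grouped = {}
    for row, lab in zip(rows, labels):
        grouped.setdefault(lab, []).append(row)
    return {lab: [mean(c) for c in zip(*grp)] for lab, grp in grouped.items()}


def predict(centroids, row):
    """Return the label whose centroid is closest (squared Euclidean)."""
    def dist2(centroid):
        return sum((a - b) ** 2 for a, b in zip(row, centroid))
    return min(centroids, key=lambda lab: dist2(centroids[lab]))


def correct_rate(centroids, rows, labels):
    """Fraction of rows whose predicted label matches the manual label."""
    hits = sum(predict(centroids, r) == lab for r, lab in zip(rows, labels))
    return hits / len(labels)


# Hypothetical toy data, NOT taken from ESMA or the Boston corpus:
# each row is (f0 range, energy) for one syllable.
source_X = [[40.0, 70.0], [45.0, 72.0], [5.0, 60.0], [8.0, 61.0]]
source_y = ["accented", "accented", "unaccented", "unaccented"]
target_X = [[80.0, 50.0], [90.0, 52.0], [20.0, 40.0], [15.0, 41.0]]
target_y = ["accented", "accented", "unaccented", "unaccented"]

# Train on the source language, evaluate on the target language.
model = train_centroids(zscore_per_corpus(source_X), source_y)
rate = correct_rate(model, zscore_per_corpus(target_X), target_y)
print(f"cross-corpus correct annotation rate: {rate:.2f}")
```

Swapping the roles of `source_X`/`source_y` and `target_X`/`target_y` gives the "vice versa" direction. Normalizing each corpus separately is a design choice of this sketch only; the abstract states that raw local features were used.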