Design and evaluation of shared prosodic annotation for spontaneous French speech: from expert knowledge to non-expert annotation

  • Authors:
  • Anne Lacheret; Nicolas Obin; Mathieu Avanzi

  • Affiliations:
  • Paris Ouest University, Nanterre, France; Paris Ouest University, Nanterre, France and Analysis-Synthesis team, Ircam, Paris, France; Paris Ouest University, Nanterre, France and Neuchâtel University, Neuchâtel, Switzerland

  • Venue:
  • LAW IV '10 Proceedings of the Fourth Linguistic Annotation Workshop
  • Year:
  • 2010

Abstract

In the area of large French speech corpora, there is a demonstrated need for a common prosodic notation system allowing for easy data exchange, comparison, and automatic annotation. The major questions are: (1) how to develop a single, simple scheme of prosodic transcription which could form the basis of guidelines for non-expert manual annotation (NEMA), used for linguistic teaching and research; (2) based on this NEMA, how to establish reference prosodic corpora (RPC) for different discourse genres (Cresti and Moneglia, 2005); (3) how to use the RPC to develop corpus-based learning methods for automatic prosodic labelling in spontaneous speech (Buhman et al., 2002; Tamburini and Caini, 2005; Avanzi et al., 2010). This paper presents two pilot experiments conducted with a consortium of 15 French experts in prosody in order to provide a prosodic transcription framework (transcription methodology and transcription reliability measures) and to establish reference prosodic corpora in French.