Analysis of interrogatives in different domains

  • Authors:
  • Helena Moniz; Fernando Batista; Isabel Trancoso; Ana Isabel Mata

  • Affiliations:
  • Faculdade de Letras da Universidade de Lisboa, Centro de Linguística da Universidade de Lisboa, Alameda da Universidade, Portugal and INESC-ID, Lisboa, Portugal; INESC-ID, Lisboa, Portugal and ISCTE, Instituto Universitário de Lisboa, Lisboa, Portugal; INESC-ID, Lisboa, Portugal and Instituto Superior Técnico, Universidade Técnica de Lisboa, Lisboa, Portugal; Faculdade de Letras da Universidade de Lisboa, Centro de Linguística da Universidade de Lisboa, Alameda da Universidade, Portugal

  • Venue:
  • Proceedings of the Third COST 2102 international training school conference on Toward autonomous, adaptive, and context-aware multimodal interfaces: theoretical and practical issues
  • Year:
  • 2010

Abstract

The aim of this work is twofold: to quantify the distinct interrogative types in different domains for European Portuguese, and to discuss the weight of the linguistic features that best describe these structures, in order to model interrogatives in speech. We analyzed spoken dialogue, university lecture, and broadcast news corpora and, for the sake of comparison, newspaper texts. The statistical analysis confirms that the proportion of each interrogative type is highly dependent on the nature of the corpus. Experiments on the automatic detection of interrogatives for European Portuguese using only lexical cues show results that are strongly correlated with the detection of one specific type of interrogative, namely wh- questions. When acoustic and prosodic features (pitch, energy, and duration) are added, yes/no and tag questions are increasingly identified, showing the advantage of combining lexical, acoustic, and prosodic information.
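
To illustrate why a lexical-only detector would tend to track wh- questions, the following is a minimal sketch and not the authors' system. It assumes unpunctuated transcripts, as is typical of speech recognition output. The cue list `WH_WORDS`, the helper names, and the four example utterances are all hypothetical and chosen only for illustration. A yes/no question in European Portuguese can be word-for-word identical to a declarative, so a detector that inspects only the words has nothing to key on.

```python
import re
from collections import defaultdict

# Hypothetical cue list for illustration only; not taken from the paper.
WH_WORDS = {"quem", "qual", "quais", "quando", "onde", "como", "porque", "quanto"}


def lexical_is_question(utterance: str) -> bool:
    """Flag an utterance as interrogative if it opens with a wh- word."""
    tokens = re.findall(r"\w+", utterance.lower())
    return bool(tokens) and tokens[0] in WH_WORDS


def recall_by_type(labelled):
    """Return, per interrogative type, the fraction of utterances flagged."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for utterance, qtype in labelled:
        totals[qtype] += 1
        if lexical_is_question(utterance):
            hits[qtype] += 1
    return {qtype: hits[qtype] / totals[qtype] for qtype in totals}


# Invented toy data: every item is a question, labelled with its type.
EXAMPLES = [
    ("onde fica a estação", "wh"),
    ("quando começa a aula", "wh"),
    ("a aula começa às dez", "yes/no"),  # same words as the declarative
    ("já chegaste", "yes/no"),
]

if __name__ == "__main__":
    for qtype, recall in recall_by_type(EXAMPLES).items():
        print(f"{qtype}: {recall:.2f}")
```

On this toy data the lexical rule flags both wh- questions and neither yes/no question, which mirrors the pattern the abstract reports. Recovering the yes/no cases would require additional evidence such as the pitch, energy, and duration features mentioned above.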