Analysis of interrogatives in different domains

Authors:
Helena Moniz;Fernando Batista;Isabel Trancoso;Ana Isabel Mata
Affiliations:
Faculdade de Letras da Universidade de Lisboa, Centro de Linguística da Universidade de Lisboa, Alameda da Universidade, Portugal and INESC-ID, Lisboa, Portugal;INESC-ID, Lisboa, Portugal and ISCTE, Instituto Universitário de Lisboa, Lisboa, Portugal;INESC-ID, Lisboa, Portugal and Instituto Superior Técnico, Universidade Técnica de Lisboa, Lisboa, Portugal;Faculdade de Letras da Universidade de Lisboa, Centro de Linguística da Universidade de Lisboa, Alameda da Universidade, Portugal
Venue:
Proceedings of the Third COST 2102 international training school conference on Toward autonomous, adaptive, and context-aware multimodal interfaces: theoretical and practical issues
Year:
2010

Abstract

The aim of this work is twofold: to quantify the distinct interrogative types in different domains for European Portuguese, and to discuss the weight of the linguistic features that best describe these structures, in order to model interrogatives in speech. We analyzed spoken dialogue, university lecture, and broadcast news corpora and, for the sake of comparison, newspaper texts. The statistical analysis confirms that the proportion of each interrogative type depends heavily on the nature of the corpus. Experiments on the automatic detection of interrogatives for European Portuguese using only lexical cues show results that are strongly correlated with the detection of one specific type of interrogative, namely wh-questions. When acoustic and prosodic features (pitch, energy, and duration) are added, yes/no and tag questions are increasingly identified, showing the advantage of combining lexical, acoustic, and prosodic information.
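
To illustrate why a purely lexical detector tends to favor wh-questions, here is a minimal sketch in Python. It is not the authors' system: the word list `WH_WORDS`, the function `has_wh_cue`, and the example utterances are all hypothetical, chosen only to show that yes/no and tag questions may carry no distinctive lexical marker once punctuation is unavailable, as in speech transcripts.

```python
import re

# Hypothetical, incomplete list of European Portuguese interrogative words.
# This is an assumption for illustration, not a list taken from the paper.
WH_WORDS = {"quem", "qual", "quais", "quando", "onde", "como", "porque", "quanto", "quantos"}


def has_wh_cue(utterance: str) -> bool:
    """Return True if the first word of the utterance is in WH_WORDS."""
    words = re.findall(r"\w+", utterance.lower())
    return bool(words) and words[0] in WH_WORDS


# Unpunctuated utterances, as an automatic speech recognizer might output them.
examples = [
    ("Onde fica a estação", "wh-question"),
    ("Vais ao cinema hoje", "yes/no question"),
    ("Está frio, não está", "tag question"),
]

for text, question_type in examples:
    print(f"{question_type}: lexical cue found = {has_wh_cue(text)}")
```

In this toy setting only the wh-question is flagged. The other two are lexically indistinguishable from declaratives, which is consistent with the abstract's observation that acoustic and prosodic features are needed to identify them.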