Towards automatic classification of speech styles

  • Authors:
  • Arlindo Veiga;Sara Candeias;Dirce Celorico;Jorge Proença;Fernando Perdigão

  • Affiliations:
  • DEEC, Instituto de Telecomunicações, pole of Coimbra, Coimbra, Portugal;DEEC, Instituto de Telecomunicações, pole of Coimbra, Coimbra, Portugal;DEEC, Instituto de Telecomunicações, pole of Coimbra, Coimbra, Portugal;DEEC, Instituto de Telecomunicações, pole of Coimbra, Coimbra, Portugal;DEEC, Instituto de Telecomunicações, pole of Coimbra, Coimbra, Portugal and DEEC/FCTUC, Universidade de Coimbra, Coimbra, Portugal

  • Venue:
  • PROPOR'12 Proceedings of the 10th international conference on Computational Processing of the Portuguese Language
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we present results from a study seeking to distinguish "unprepared" from "prepared" speech in broadcast news media. The idea is to explore the results from a previous experiment concerning the characterization of filled pauses and extensions, extending the analysis of such hesitation phenomena to large audio corpus. Daily news broadcasts of Portuguese television were segmented and labeled manually in terms of several speech styles, over a range of background environments. An automatic detection of filled pauses and extensions in this audio data allowed us to correlate the presence of hesitation events with segments of unprepared speech. Distinguishing unprepared speech from prepared speech is of considerable practical interest for audio segmentation, speech processing and linguistic research. The long-term objective of this work is to automatically segment all audio genres and speaking styles as well as identify prosodic and linguistic features of the speech segments.