Characterizing and detecting spontaneous speech: Application to speaker role recognition

Authors:
Richard Dufour;Yannick Estève;Paul Deléglise
Affiliations:
-;-;-
Venue:
Speech Communication
Year:
2014

Citing 15
Cited 0

BoosTexter: A Boosting-based Systemfor Text Categorization

Machine Learning - Special issue on information retrieval
The Rules Behind Roles: Identifying Speaker Role in Radio Broadcasts

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
The kappa statistic: a second look

Computational Linguistics
Topic and speaker identification via large vocabulary continuous speech recognition

HLT '93 Proceedings of the workshop on Human Language Technology
Role recognition for meeting participants: an approach based on lexical information and social network analysis

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Automatic named identification of speakers using diarization and ASR systems

ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Initial study on automatic identification of speaker role in broadcast news speech

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Automatic role recognition in multiparty recordings: using social affiliation networks for feature extraction

IEEE Transactions on Multimedia
Speaker role recognition to help spontaneous conversational speech detection

Proceedings of the 2010 international workshop on Searching spontaneous conversational speech
Automatic indexing of speech segments with spontaneity levels on large audio database

Proceedings of the 2010 international workshop on Searching spontaneous conversational speech
Analysis and automatic recognition of false starts in spontaneous speech

ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
Enriching speech recognition with automatic detection of sentence boundaries and disfluencies

IEEE Transactions on Audio, Speech, and Language Processing
Recognizing disfluencies in conversational speech

IEEE Transactions on Audio, Speech, and Language Processing
Edit disfluency detection and correction using a cleanup language model and an alignment model

IEEE Transactions on Audio, Speech, and Language Processing
Speakers Role Recognition in Multiparty Audio Recordings Using Social Network Analysis and Duration Distribution Modeling

IEEE Transactions on Multimedia

Quantified Score

Hi-index	0.00

Visualization

Abstract

Processing spontaneous speech is one of the many challenges that automatic speech recognition systems have to deal with. The main characteristics of this kind of speech are disfluencies (filled pause, repetition, false start, etc.) and many studies have focused on their detection and correction. Spontaneous speech is defined in opposition to prepared speech, where utterances contain well-formed sentences close to those found in written documents. Acoustic and linguistic features made available by the use of an automatic speech recognition system are proposed to characterize and detect spontaneous speech segments from large audio databases. To better define this notion of spontaneous speech, segments of an 11-hour corpus (French Broadcast News) had been manually labeled according to three classes of spontaneity. Firstly, we present a study of these features. We then propose a two-level strategy to automatically assign a class of spontaneity to each speech segment. The proposed system reaches a 73.0% precision and a 73.5% recall on high spontaneous speech segments, and a 66.8% precision and a 69.6% recall on prepared speech segments. A quantitative study shows that the classes of spontaneity are useful information to characterize the speaker roles. This is confirmed by extending the speech spontaneity characterization approach to build an efficient automatic speaker role recognition system.