Fundamentals of speech recognition
Fundamentals of speech recognition
A practical handbook of speech coders
A practical handbook of speech coders
Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
Discrete-time speech signal processing: principles and practice
Discrete-time speech signal processing: principles and practice
Children's organization of discourse structure through pausing means
NOLISP'05 Proceedings of the 3rd international conference on Non-Linear Analyses and Algorithms for Speech Processing
Application of Expressive Speech in TTS System with Cepstral Description
Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction
Automatic meeting participant role detection by dialogue patterns
COST'09 Proceedings of the Second international conference on Development of Multimodal Interfaces: active Listening and Synchrony
Multimodal behavior and interaction as indicators of cognitive load
ACM Transactions on Interactive Intelligent Systems (TiiS) - Special issue on highlights of the decade in interactive intelligent systems
Hi-index | 0.00 |
This study investigates pausing strategies, focusing attention on empty speech pauses. A cross-modal analysis (video and audio) of spontaneous narratives produced by male and female children (9 years old ± 3 months) and adults showed that a remarkable amount of empty speech pauses (91% in male and 84% in female children, and 95% in adults of both sexes) was related to the amount of added information conveyed in the speech flow. Both adults and children consistently exploited pausing strategies to signal discourse boundaries such as clauses (marked by empty speech pauses for 73% and 70% of cases in male and female children, respectively, and 56% in adults) and paragraphs (97% and 96% in male and female children, respectively, and 94% in adults). The high consistency, among subjects, in the distribution of speech pauses suggests that, at least in the Italian context, the speaker in narration makes use of an intrinsic timing behavior, probably a general pattern of rules, to control speech flow for discourse organization. The implications of these findings for the development of improved speech recognition and speech synthesis systems are discussed and procedures for the automatic detection of speech pauses are proposed.