Effects of automated transcription delay on non-native speakers' comprehension in real-time computer-mediated communication

Authors:
Lin Yao;Ying-xin Pan;Dan-ning Jiang
Affiliations:
Institute of Psychology, Chinese Academy of Sciences, Beijing, China;IBM China Research Lab, Beijing, China;IBM China Research Lab, Beijing, China
Venue:
INTERACT'11 Proceedings of the 13th IFIP TC 13 international conference on Human-computer interaction - Volume Part I
Year:
2011

Citing 3
Cited 0

Time-Compressing Speech: ASR Transcripts Are an Effective Way to Support Gist Extraction

MLMI '08 Proceedings of the 5th international workshop on Machine Learning for Multimodal Interaction
Effects of real-time transcription on non-native speaker's comprehension in computer-mediated communications

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Effects of automated transcription quality on non-native speakers' comprehension in real-time computer-mediated communication

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Real-time transcription generated by automated speech recognition (ASR) technologies with a reasonably high accuracy has been demonstrated to be valuable in facilitating non-native speakers' comprehension in real-time communication. Besides errors, time delay often exists due to technical problems in automated transcription as well. This study focuses on how the time delay of transcription impacts non-native speakers' comprehension performance and user experience. The experiment design simulated a one-way computermediated communication scenario, where comprehension performance and user experiences in 3 transcription conditions (no transcript; perfect transcripts with a 2-second delay; and transcripts with a 10% word-error-rate and a 2-second delay) were compared. The results showed that the participants can benefit from the transcription with a 2-second time delay, as their comprehension performance in this condition was improved compared with the no-transcript condition. However, the transcription presented with delay was found to have negative effects on user experience. In the final part of the paper, implications for further system development and design are discussed.