The simple act of listening or taking notes while attending a lesson may be an insuperable burden for millions of people with some form of disability or language barrier (e.g., hearing-impaired, dyslexic and ESL students). In this paper, we propose an architecture that automatically creates captions for video lessons by exploiting advances in speech recognition technologies. Our approach couples off-the-shelf ASR (Automatic Speech Recognition) software with a novel caption alignment mechanism that inserts unique audio markups into the audio stream before passing it to the ASR and then transforms the plain transcript produced by the ASR into a timecoded transcript.
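
As an illustration only, the final step (turning a plain transcript into a timecoded one) might look like the sketch below. It is not the mechanism described in the paper: it assumes, hypothetically, that the ASR renders each inserted audio markup as a textual token of the form `[M0]`, `[M1]`, ..., and that the position of each markup on the original lesson timeline is known. The names `MARKER`, `timecode_transcript` and `marker_times` are invented for this example.

```python
import re
from typing import List, Tuple

# Hypothetical textual form of a recognized audio markup, e.g. "[M3]".
MARKER = re.compile(r"\[M(\d+)\]")

def timecode_transcript(
    transcript: str, marker_times: List[float]
) -> List[Tuple[float, float, str]]:
    """Split a plain transcript at markup tokens and attach time spans.

    marker_times[k] is the assumed time (seconds, on the original lesson
    timeline) at which markup k was inserted; the last entry is the end
    of the audio, so it needs one more entry than there are markups.
    """
    # With one capture group, re.split yields:
    # [text_before_first_marker, id0, text0, id1, text1, ...]
    parts = MARKER.split(transcript)
    captions = []
    for i in range(1, len(parts), 2):
        k = int(parts[i])
        text = parts[i + 1].strip()
        if text:  # skip empty segments between adjacent markups
            captions.append((marker_times[k], marker_times[k + 1], text))
    return captions

captions = timecode_transcript(
    "[M0] hello world [M1] this is [M2] a test",
    [0.0, 2.5, 4.0, 6.0],
)
for start, end, text in captions:
    print(f"{start:.1f} --> {end:.1f}  {text}")
```

Each caption inherits its start and end times from the markups that bracket it, so the ASR itself does not need to report any timing information.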