Enhancing learning accessibility through fully automatic captioning

Authors:
Maria Federico;Marco Furini
Affiliations:
Università di Modena e Reggio Emilia, Modena, Italy;Università di Modena e Reggio Emilia, Reggio Emilia, Italy
Venue:
Proceedings of the International Cross-Disciplinary Conference on Web Accessibility
Year:
2012

Citing 5
Cited 0

The effect of speech recognition accuracy rates on the usefulness and usability of webcast archives

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Automatically generated captions: will they help non-native speakers communicate in english?

Proceedings of the 3rd international conference on Intercultural collaboration
Dynamic captioning: video accessibility enhancement for hearing impairment

Proceedings of the international conference on Multimedia
Crowdsourcing correction of speech recognition captioning errors

Proceedings of the International Cross-Disciplinary Conference on Web Accessibility
Fast Caption Alignment for Automatic Indexing of Audio

International Journal of Multimedia Data Engineering & Management

Quantified Score

Hi-index	0.00

Visualization

Abstract

The simple act of listening or of taking notes while attending a lesson may represent an insuperable burden for millions of people with some form of disabilities (e.g., hearing impaired, dyslexic and ESL students). In this paper, we propose an architecture that aims at automatically creating captions for video lessons by exploiting advances in speech recognition technologies. Our approach couples the usage of off-the-shelf ASR (Automatic Speech Recognition) software with a novel caption alignment mechanism that smartly introduces unique audio markups into the audio stream before giving it to the ASR and transforms the plain transcript produced by the ASR into a timecoded transcript.