Dynamic captioning: video accessibility enhancement for hearing impairment
Proceedings of the international conference on Multimedia
More than 66 million people have a hearing impairment, which makes video content hard to understand because the audio information is lost. When scripts are available, captioning helps to some degree by displaying them in sync with video playback. However, we show that existing captioning techniques are far from satisfactory in helping hearing-impaired viewers enjoy videos. In this article, we introduce a scheme that enhances video accessibility through a Dynamic Captioning approach, which draws on a rich set of technologies such as face detection and recognition, visual saliency analysis, and text-speech alignment. Unlike existing methods, which we categorize as static captioning, dynamic captioning places scripts at suitable positions to help hearing-impaired viewers recognize which character is speaking. It also progressively highlights the scripts word by word by aligning them with the speech signal, and it illustrates variations in voice volume. In this way, viewers can better track the scripts and perceive the moods conveyed by changes in volume. We applied the technology to 20 video clips and conducted an in-depth study with 60 hearing-impaired users. The results demonstrate the effectiveness and usefulness of the video accessibility enhancement scheme.
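The word-by-word highlighting and volume illustration described in the abstract can be pictured with a minimal Python sketch. It is not the authors' implementation. It assumes that a text-speech alignment step has already produced a start and end time for each word, and that the audio is available as a plain list of samples. The names `AlignedWord`, `highlighted_index`, `rms_volume`, and `render_caption` are hypothetical, and root-mean-square amplitude is used here only as one plausible proxy for voice volume.

```python
import math
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class AlignedWord:
    """One script word with its (assumed) aligned time span in seconds."""
    text: str
    start: float
    end: float


def highlighted_index(words: Sequence[AlignedWord], t: float) -> Optional[int]:
    """Return the index of the last word that has started by time t.

    Returns None if playback has not yet reached the first word.
    Words are assumed to be sorted by start time.
    """
    idx = None
    for i, word in enumerate(words):
        if word.start <= t:
            idx = i
        else:
            break
    return idx


def rms_volume(samples: Sequence[float], sample_rate: int,
               start: float, end: float) -> float:
    """Root-mean-square amplitude of the samples between start and end."""
    lo = max(0, int(start * sample_rate))
    hi = min(len(samples), int(end * sample_rate))
    window = samples[lo:hi]
    if len(window) == 0:
        return 0.0
    return math.sqrt(sum(s * s for s in window) / len(window))


def render_caption(words: Sequence[AlignedWord], t: float) -> str:
    """Render the caption as text, bracketing every word spoken so far."""
    idx = highlighted_index(words, t)
    parts: List[str] = []
    for i, word in enumerate(words):
        spoken = idx is not None and i <= idx
        parts.append(f"[{word.text}]" if spoken else word.text)
    return " ".join(parts)


# Hypothetical alignment output for a three-word script line.
line = [
    AlignedWord("How", 0.0, 0.3),
    AlignedWord("are", 0.3, 0.5),
    AlignedWord("you", 0.5, 0.9),
]
caption_at_400ms = render_caption(line, 0.4)
```

In a real player, the brackets would be replaced by a visual highlight, and the per-word volume could drive an indicator shown next to the caption. Where the caption is placed on screen, which the abstract ties to face recognition and saliency analysis, is outside the scope of this sketch.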