Effects of audio and visual surrogates for making sense of digital video

Authors:
Yaxiao Song;Gary Marchionini
Affiliations:
University of North Carolina at Chapel Hill, Chapel Hill, NC;University of North Carolina at Chapel Hill, Chapel Hill, NC
Venue:
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Year:
2007

Citing 13
Cited 13

The experience of flow in computer-mediated and in face-to-face groups

ICIS '91 Proceedings of the twelfth international conference on Information systems
The cost structure of sensemaking

CHI '93 Proceedings of the INTERACT '93 and CHI '93 Conference on Human Factors in Computing Systems
Evolving video skims into useful multimedia abstractions

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Multimodal surrogates for video browsing

Proceedings of the fourth ACM conference on Digital libraries
Browsing digital video

Proceedings of the SIGCHI conference on Human Factors in Computing Systems
Collages as dynamic summaries for news video

Proceedings of the tenth ACM international conference on Multimedia
Alternative Surrogates for Video Objects in a Digital Library: Users' Perspectives on Their Relative Usability

ECDL '02 Proceedings of the 6th European Conference on Research and Advanced Technology for Digital Libraries
How fast is too fast?: evaluating fast forward surrogates for digital video

Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries
Deciphering visual gist and its implications for video retrieval and interface design

CHI '05 Extended Abstracts on Human Factors in Computing Systems
Creating MAGIC: system for generating learning object metadata for instructional content

Proceedings of the 13th annual ACM international conference on Multimedia
The Open Video Digital Library: A Möbius strip of research and practice

Journal of the American Society for Information Science and Technology
Usage-oriented multimedia information retrieval technological evaluation

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
The effect of text in storyboards for video navigation

ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 2001. on IEEE International Conference - Volume 03

Evaluating audio skimming and frame rate acceleration for summarizing BBC rushes

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Exploring the utility of fast-forward surrogates for bbc rushes

TVS '08 Proceedings of the 2nd ACM TRECVid Video Summarization Workshop
An evaluation framework of user interaction with metadata surrogates

Journal of Information Science
Multimedia surrogates for video gisting: Toward combining spoken words and imagery

Information Processing and Management: an International Journal
Vlogging: A survey of videoblogging technology on the web

ACM Computing Surveys (CSUR)
Temporal hybridity: footage with instant replay in real time

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
What are the most eye-catching and ear-catching features in the video?: implications for video summarization

Proceedings of the 19th international conference on World wide web
Factors affecting click-through behavior in aggregated search interfaces

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Understanding how webcasts are used as sources of information

Journal of the American Society for Information Science and Technology
Webpage-based and video summarization-based learning platform for online multimedia learning

Edutainment'11 Proceedings of the 6th international conference on E-learning and games, edutainment technologies
"You've got video": increasing clickthrough when sharing enterprise video with email

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Panopticon: a parallel video overview system

Proceedings of the 26th annual ACM symposium on User interface software and technology
Video interaction: a research agenda

Personal and Ubiquitous Computing

Quantified Score

Hi-index	0.01

Visualization

Abstract

Video surrogates are meant to help people quickly make sense of the content of a video before downloading or seeking more detailed information. In this paper we present the results of a study comparing the effectiveness of three different surrogates for objects in digital video libraries. Thirty-six people participated in a within subjects user study in which they did five tasks for each of three surrogate alternatives: visual alone (a storyboard), audio alone (spoken description), and combined visual and audio (a storyboard augmented with spoken description). The results show that combined surrogates are more effective, strongly preferred, and do not penalize efficiency. The results also demonstrate that spoken descriptions alone lead to better understanding of the video segments than do visual storyboards alone, although people like to have visual surrogates and use them to confirm interpretations and add context. Participants were able to easily use the combined surrogates even though they were not synchronized, suggesting that synchronization of different media channels may not be necessary in surrogates as it is in full video. The results suggest that multimodal surrogates should be incorporated into video retrieval user interfaces and audio surrogates should be used in small display interfaces. The study also raises questions about the need to synchronize different information channels in multimedia surrogates.