VoiceNotes: a speech interface for a hand-held voice notetaker
CHI '93 Proceedings of the INTERACT '93 and CHI '93 Conference on Human Factors in Computing Systems
Content-based browsing of video sequences
MULTIMEDIA '94 Proceedings of the second ACM international conference on Multimedia
Video parsing, retrieval and browsing: an integrated and content-based solution
Proceedings of the third ACM international conference on Multimedia
A shot classification method of selecting effective key-frames for video browsing
MULTIMEDIA '96 Proceedings of the fourth ACM international conference on Multimedia
SpeechSkimmer: a system for interactively skimming recorded speech
ACM Transactions on Computer-Human Interaction (TOCHI) - Special issue on speech as data
Communications of the ACM
Broadcast news navigation using story segmentation
MULTIMEDIA '97 Proceedings of the fifth ACM international conference on Multimedia
Evolving video skims into useful multimedia abstractions
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Key to effective video retrieval: effective cataloging and browsing
MULTIMEDIA '98 Proceedings of the sixth ACM international conference on Multimedia
An intelligent media browser using automatic multimodal analysis
MULTIMEDIA '98 Proceedings of the sixth ACM international conference on Multimedia
Time-compression: systems concerns, usage, and benefits
Proceedings of the SIGCHI conference on Human Factors in Computing Systems
Auto-summarization of audio-video presentations
MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
Proceedings of the SIGCHI conference on Human Factors in Computing Systems
Design lessons from deployment of on-demand video
CHI '99 Extended Abstracts on Human Factors in Computing Systems
CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
The audio notebook: paper and pen interaction with structured speech
The audio notebook: paper and pen interaction with structured speech
Designing presentations for on-demand viewing
CSCW '00 Proceedings of the 2000 ACM conference on Computer supported cooperative work
Viewing meeting captured by an omni-directional camera
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Design of a virtual auditorium
MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
Integrating Meeting Capture within a Collaborative Team Environment
UbiComp '01 Proceedings of the 3rd international conference on Ubiquitous Computing
Assessing tools for use with webcasts
Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
Summarizing multiple spoken documents: finding evidence from untranscribed audio
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Semantic keyword extraction via adaptive text binarization of unstructured unsourced video
ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
A rhetorical syntax-driven model for speech summarization
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts of ACL 2011
An ecologically valid evaluation of speech summarization
CHI '12 Extended Abstracts on Human Factors in Computing Systems
Ecological validity and the evaluation of speech summarization quality
Proceedings of Workshop on Evaluation Metrics and System Comparison for Automatic Summarization
Hi-index | 0.01 |
As more audio and video technical presentations go online, it becomes imperative to give users effective summarization and skimming tools so that they can find the presentation they want and browse through it quickly. In a previous study, we reported three automated methods for generating audio-video summaries and a user evaluation of those methods. An open question remained about how well various text/image only techniques will compare to the audio-video summarizations. This study attempts to fill that gap.This paper reports a user study that compares four possible ways of allowing a user to skim a presentation: 1) PowerPoint slides used by the speaker during the presentation, 2) the text transcript created by professional transcribers from the presentation, 3) the transcript with important points highlighted by the speaker, and 4) a audio-video summary created by the speaker. Results show that although some text-only conditions can match the audio-video summary, users have a marginal preference for audio-video (ANOVA f=3.067, p=0.087). Furthermore, different styles of slide-authoring (e.g., detailed vs. big-points only) can have a big impact on their effectiveness as summaries, raising a dilemma for some speakers in authoring for on-demand previewing versus that for live audiences.