Content-based tools for editing audio stories

Authors:
Steve Rubin;Floraine Berthouzoz;Gautham J. Mysore;Wilmot Li;Maneesh Agrawala
Affiliations:
University of California, Berkeley, Berkeley, USA;University of California, Berkeley, Berkeley, USA;Adobe Research, San Francisco, USA;Adobe Systems, San Francisco, USA;University of California, Berkeley, Berkeley, USA
Venue:
Proceedings of the 26th annual ACM symposium on User interface software and technology
Year:
2013

Citing 13
Cited 0

Browsing digital video

Proceedings of the SIGCHI conference on Human Factors in Computing Systems
A semi-automatic approach to home video editing

UIST '00 Proceedings of the 13th annual ACM symposium on User interface software and technology
Editing out Video Editing

IEEE MultiMedia
Simplifying video editing using metadata

DIS '02 Proceedings of the 4th conference on Designing interactive systems: processes, practices, methods, and techniques
Semantic speech editing

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
The Cuidado music browser: an end-to-end electronic music distribution system

Multimedia Tools and Applications
Opinion Mining and Sentiment Analysis

Foundations and Trends in Information Retrieval
Dialogue Editing for Motion Pictures: A Guide to the Invisible Art

Dialogue Editing for Motion Pictures: A Guide to the Invisible Art
DAFX: Digital Audio Effects

DAFX: Digital Audio Effects
Speech/music discrimination in audio podcast using structural segmentation and timbre recognition

CMMR'10 Proceedings of the 7th international conference on Exploring music contents
Tools for placing cuts and transitions in interview video

ACM Transactions on Graphics (TOG) - SIGGRAPH 2012 Conference Proceedings
UnderScore: musical underlays for audio stories

Proceedings of the 25th annual ACM symposium on User interface software and technology
Speech Enhancement: Theory and Practice

Speech Enhancement: Theory and Practice

Quantified Score

Hi-index	0.00

Visualization

Abstract

Audio stories are an engaging form of communication that combine speech and music into compelling narratives. Existing audio editing tools force story producers to manipulate speech and music tracks via tedious, low-level waveform editing. In contrast, we present a set of tools that analyze the audio content of the speech and music and thereby allow producers to work at much higher level. Our tools address several challenges in creating audio stories, including (1) navigating and editing speech, (2) selecting appropriate music for the score, and (3) editing the music to complement the speech. Key features include a transcript-based speech editing tool that automatically propagates edits in the transcript text to the corresponding speech track; a music browser that supports searching based on emotion, tempo, key, or timbral similarity to other songs; and music retargeting tools that make it easy to combine sections of music with the speech. We have used our tools to create audio stories from a variety of raw speech sources, including scripted narratives, interviews and political speeches. Informal feedback from first-time users suggests that our tools are easy to learn and greatly facilitate the process of editing raw footage into a final story.