Proceedings of the SIGCHI conference on Human Factors in Computing Systems
A semi-automatic approach to home video editing
UIST '00 Proceedings of the 13th annual ACM symposium on User interface software and technology
IEEE MultiMedia
Simplifying video editing using metadata
DIS '02 Proceedings of the 4th conference on Designing interactive systems: processes, practices, methods, and techniques
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
The Cuidado music browser: an end-to-end electronic music distribution system
Multimedia Tools and Applications
Opinion Mining and Sentiment Analysis
Foundations and Trends in Information Retrieval
Dialogue Editing for Motion Pictures: A Guide to the Invisible Art
Dialogue Editing for Motion Pictures: A Guide to the Invisible Art
DAFX: Digital Audio Effects
Speech/music discrimination in audio podcast using structural segmentation and timbre recognition
CMMR'10 Proceedings of the 7th international conference on Exploring music contents
Tools for placing cuts and transitions in interview video
ACM Transactions on Graphics (TOG) - SIGGRAPH 2012 Conference Proceedings
UnderScore: musical underlays for audio stories
Proceedings of the 25th annual ACM symposium on User interface software and technology
Speech Enhancement: Theory and Practice
Speech Enhancement: Theory and Practice
Hi-index | 0.00 |
Audio stories are an engaging form of communication that combine speech and music into compelling narratives. Existing audio editing tools force story producers to manipulate speech and music tracks via tedious, low-level waveform editing. In contrast, we present a set of tools that analyze the audio content of the speech and music and thereby allow producers to work at much higher level. Our tools address several challenges in creating audio stories, including (1) navigating and editing speech, (2) selecting appropriate music for the score, and (3) editing the music to complement the speech. Key features include a transcript-based speech editing tool that automatically propagates edits in the transcript text to the corresponding speech track; a music browser that supports searching based on emotion, tempo, key, or timbral similarity to other songs; and music retargeting tools that make it easy to combine sections of music with the speech. We have used our tools to create audio stories from a variety of raw speech sources, including scripted narratives, interviews and political speeches. Informal feedback from first-time users suggests that our tools are easy to learn and greatly facilitate the process of editing raw footage into a final story.