Active capture: integrating human-computer interaction and computer vision/audition to automate media capture

Authors:
M. Davis
Affiliations:
Sch. of Inf. Manage. & Syst., California Univ., Berkeley, CA, USA
Venue:
ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 1
Year:
2003

Citing 0
Cited 9

Active capture: automatic direction for automatic movies

MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Active capture: automatic direction for automatic movies

MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Presiding over accidents: system direction of human action

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
The Future in Digital Media Computing is Meta

IEEE MultiMedia
From context to content: leveraging context to infer media metadata

Proceedings of the 12th annual ACM international conference on Multimedia
Designing systems that direct human action

CHI '05 Extended Abstracts on Human Factors in Computing Systems
Canonical processes of media production

Proceedings of the ACM workshop on Multimedia for human communication: from capture to convey
Active capture design case study: SIMS faces

DUX '05 Proceedings of the 2005 conference on Designing for User eXperience
Effective multimedia surveillance using a human-centric approach

Multimedia Tools and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

While the devices for media capture have advanced from mechanical to computational since the invention of photography and motion pictures in the 19th century, their underlying user interaction paradigms have remained largely unchanged. Current interaction techniques for media capture do not leverage computation to solve key problems: the skill required to capture high quality media assets; the effort required to select useable assets from captured assets; and the lack of metadata describing the content and structure of media assets that could enable them to be retrieved and (re)used. We describe a new interaction and processing paradigm for media capture that redefines capture as a control process with feedback. By integrating human-computer interaction and computer vision and audition into an "active capture" process, we overcome the limitations of current media capture devices, algorithms, and interaction techniques. Active capture leverages media production knowledge to automate direction and cinematography and thus enables the automated production of annotated, high quality, reusable media assets.