ViRoom - Low Cost Synchronized Multicamera System and Its Self-calibration
Proceedings of the 24th DAGM Symposium on Pattern Recognition
MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
High-quality video view interpolation using a layered representation
ACM SIGGRAPH 2004 Papers
High performance imaging using large camera arrays
ACM SIGGRAPH 2005 Papers
AHA: An Easily Extendible High-Resolution Camera Array
DMAMH '07 Proceedings of the Second Workshop on Digital Media and its Application in Museum & Heritage
EURASIP Journal on Applied Signal Processing - 3DTV: Capture, Transmission, and Display of 3D Video
Audiovisual Synchronization and Fusion Using Canonical Correlation Analysis
IEEE Transactions on Multimedia
A multimodal database for mimicry analysis
ACII'11 Proceedings of the 4th international conference on Affective computing and intelligent interaction - Volume Part I
AVEC 2011-the first international audio/visual emotion challenge
ACII'11 Proceedings of the 4th international conference on Affective computing and intelligent interaction - Volume Part II
AVEC 2012: the continuous audio/visual emotion challenge
Proceedings of the 14th ACM international conference on Multimodal interaction
Hi-index | 0.00 |
Applications such as surveillance and human motion capture require high-bandwidth recording from multiple cameras. Furthermore, the recent increase in research on sensor fusion has raised the demand on synchronization accuracy between video, audio and other sensor modalities. Previously, capturing synchronized, high resolution video from multiple cameras required complex, inflexible and expensive solutions. Our experiments show that a single PC, built from contemporary low-cost computer hardware, could currently handle up to 470MB/s of input data. This allows capturing from 18 cameras of 780x580pixels at 60fps each, or 36 cameras at 30fps. Furthermore, we achieve accurate synchronization between audio, video and additional sensors, by recording audio together with sensor trigger- or timestamp signals, using a multi-channel audio input. In this way, each sensor modality can be captured with separate software and hardware, allowing maximal flexibility with minimal cost.