SEVA: sensor-enhanced video annotation

  • Authors: Xiaotao Liu, Mark Corner, Prashant Shenoy
  • Affiliation: University of Massachusetts, Amherst, MA

  • Venue: Proceedings of the 13th annual ACM international conference on Multimedia
  • Year: 2005

Abstract

In this paper, we study how a sensor-rich world can be exploited by digital recording devices such as cameras and camcorders to improve a user's ability to search through a large repository of image and video files. We design and implement a digital recording system that records the identities and locations of objects (as advertised by their sensors) along with visual images (as recorded by a camera). The process, which we refer to as sensor-enhanced video annotation (SEVA), combines a series of correlation, interpolation, and extrapolation techniques. It produces a tagged stream that can later be used to efficiently search for videos or frames containing particular objects or people. We present detailed experiments with a prototype of our system using both stationary and mobile objects as well as GPS and ultrasound localization. Our experiments show that: (i) SEVA has a zero error rate for static objects, except very close to the boundary of the viewable area; (ii) for moving objects or a moving camera, SEVA misses objects leaving or entering the viewable area by only 1-2 frames; (iii) SEVA can scale to 10 fast-moving objects using current sensor technology; and (iv) SEVA runs online using relatively inexpensive hardware.
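
To make the correlation and interpolation steps concrete, the sketch below is a minimal illustration, not the authors' implementation: it assumes a simplified 2D horizontal viewing sector in place of a full camera frustum, linear interpolation/extrapolation of sensor tracks, and hypothetical names (Reading, position_at, in_view, tag_frames) invented for this example. Each object's sensor-reported positions are interpolated to a frame's timestamp, and the frame is tagged with the objects whose positions fall inside the camera's viewable area.

    # Hypothetical sketch of SEVA-style frame tagging: interpolate each
    # object's sensor track to the frame timestamp, then keep objects whose
    # position falls inside a simplified 2D camera viewing sector.
    # All names and the viewing-sector model are illustrative assumptions.
    import math
    from dataclasses import dataclass

    @dataclass
    class Reading:
        t: float   # sensor timestamp (s)
        x: float   # position (m), e.g. from GPS or ultrasound localization
        y: float

    def position_at(readings, t):
        """Linearly interpolate (or extrapolate past the ends) a track to time t.
        Assumes readings are sorted by strictly increasing timestamp."""
        if len(readings) == 1:
            return (readings[0].x, readings[0].y)
        for a, b in zip(readings, readings[1:]):
            if a.t <= t <= b.t:
                w = (t - a.t) / (b.t - a.t)
                return (a.x + w * (b.x - a.x), a.y + w * (b.y - a.y))
        # Extrapolate from the two nearest samples at either end.
        a, b = (readings[0], readings[1]) if t < readings[0].t else (readings[-2], readings[-1])
        w = (t - a.t) / (b.t - a.t)
        return (a.x + w * (b.x - a.x), a.y + w * (b.y - a.y))

    def in_view(cam_xy, heading_deg, half_fov_deg, max_range, obj_xy):
        """True if obj_xy lies within the camera's horizontal viewing sector."""
        dx, dy = obj_xy[0] - cam_xy[0], obj_xy[1] - cam_xy[1]
        if math.hypot(dx, dy) > max_range:
            return False
        # Angular offset from the camera heading, wrapped to [-180, 180).
        off = (math.degrees(math.atan2(dy, dx)) - heading_deg + 180.0) % 360.0 - 180.0
        return abs(off) <= half_fov_deg

    def tag_frames(frame_times, tracks, cam_xy, heading_deg, half_fov_deg, max_range):
        """Return {frame_time: [object ids visible in that frame]}."""
        return {
            t: [oid for oid, rs in tracks.items()
                if rs and in_view(cam_xy, heading_deg, half_fov_deg, max_range,
                                  position_at(rs, t))]
            for t in frame_times
        }

For instance, with a camera at the origin facing east (heading 0 degrees), a 30-degree half-angle, and a 20 m range, an object tracked near (10, 2) would be tagged in the frame while one near (10, 15) would not. A real system along the paper's lines would additionally handle camera motion, 3D geometry, and sensor-to-frame clock synchronization.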