SEVA: Sensor-enhanced video annotation

  • Authors: Xiaotao Liu, Mark Corner, Prashant Shenoy
  • Affiliations: University of Massachusetts, Amherst, MA, USA (all authors)

  • Venue: ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
  • Year: 2009

Abstract

In this article, we study how a sensor-rich world can be exploited by digital recording devices such as cameras and camcorders to improve a user's ability to search through a large repository of image and video files. We design and implement a digital recording system that records the identities and locations of objects (as advertised by their sensors) along with visual images (as recorded by a camera). The process, which we refer to as Sensor-Enhanced Video Annotation (SEVA), combines a series of correlation, interpolation, and extrapolation techniques to produce a tagged stream that can later be used to efficiently search for videos or frames containing particular objects or people. We present detailed experiments with a prototype of our system using both stationary and mobile objects as well as GPS and ultrasound. Our experiments show that: (i) SEVA has a zero error rate for static objects, except very close to the boundary of the viewable area; (ii) for moving objects or a moving camera, SEVA misses objects leaving or entering the viewable area by only 1--2 frames; (iii) SEVA can scale to 10 fast-moving objects using current sensor technology; and (iv) SEVA runs online on relatively inexpensive hardware.
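
The sketch below illustrates the correlation-and-interpolation idea the abstract describes: sensor readings are timestamped object positions, each video frame's timestamp is matched against them, an object's position at frame time is linearly interpolated (or held at the last reading past the log's end), and the frame is tagged with every object that falls inside the camera's field of view. All function names, the 2-D wedge geometry, and the linear motion model are illustrative assumptions for this sketch, not the authors' implementation.

```python
import math
from bisect import bisect_left

def interpolate_position(readings, t):
    """Linearly interpolate an object's (x, y) position at time t from
    timestamped sensor readings sorted as [(time, x, y), ...]."""
    times = [r[0] for r in readings]
    i = bisect_left(times, t)
    if i == 0:
        return readings[0][1:]
    if i == len(readings):
        return readings[-1][1:]  # past the log: hold the last reading
    (t0, x0, y0), (t1, x1, y1) = readings[i - 1], readings[i]
    a = (t - t0) / (t1 - t0)
    return (x0 + a * (x1 - x0), y0 + a * (y1 - y0))

def in_view(cam_pos, cam_heading_deg, fov_deg, max_range, obj_pos):
    """Return True if obj_pos lies inside the camera's 2-D viewing wedge."""
    dx, dy = obj_pos[0] - cam_pos[0], obj_pos[1] - cam_pos[1]
    dist = math.hypot(dx, dy)
    if dist == 0 or dist > max_range:
        return dist == 0
    bearing = math.degrees(math.atan2(dy, dx))
    off = (bearing - cam_heading_deg + 180) % 360 - 180  # wrap to [-180, 180)
    return abs(off) <= fov_deg / 2

def tag_frames(frame_times, sensor_log, cam_pos, cam_heading_deg,
               fov_deg=60.0, max_range=30.0):
    """Produce {frame_time: [object_ids]} by correlating each frame's
    timestamp with the interpolated sensor positions."""
    tags = {}
    for t in frame_times:
        tags[t] = [obj_id for obj_id, readings in sensor_log.items()
                   if in_view(cam_pos, cam_heading_deg, fov_deg, max_range,
                              interpolate_position(readings, t))]
    return tags

if __name__ == "__main__":
    # Hypothetical data: one tagged object walking past a camera at the
    # origin that faces east; it enters and then leaves the viewable area.
    log = {"badge-17": [(0.0, 20.0, -20.0), (4.0, 20.0, 20.0)]}
    print(tag_frames([0.0, 1.0, 2.0, 3.0, 4.0], log, (0.0, 0.0), 0.0))
```

In this toy run the object is outside the 60-degree wedge at the first and last frames and tagged in the middle three, mirroring the abstract's observation that errors concentrate at the boundary of the viewable area as objects enter or leave it.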