Multi-camera spatio-temporal fusion and biased sequence-data learning for security surveillance

  • Authors:
  • Gang Wu; Yi Wu; Long Jiao; Yuan-Fang Wang; Edward Y. Chang

  • Affiliations:
  • University of California, Santa Barbara, CA (all authors)

  • Venue:
  • MULTIMEDIA '03: Proceedings of the Eleventh ACM International Conference on Multimedia
  • Year:
  • 2003

Abstract

We present a framework for multi-camera video surveillance. The framework consists of three phases: detection, representation, and recognition. The detection phase handles multi-source spatio-temporal data fusion for efficiently and reliably extracting motion trajectories from video. The representation phase summarizes raw trajectory data to construct hierarchical, invariant, and content-rich descriptions of the motion events. Finally, the recognition phase deals with event classification and identification on the data descriptors. Because of space limits, we describe only briefly how we detect and represent events, but we provide in-depth treatment of the third phase: event recognition. For effective recognition, we devise a sequence-alignment kernel function to perform sequence-data learning for identifying suspicious events. We show that when the positive training instances (i.e., suspicious events) are significantly outnumbered by the negative training instances (i.e., benign events), SVMs (or any other learning method) can suffer a high incidence of errors. To remedy this problem, we propose the kernel boundary alignment (KBA) algorithm to work with the sequence-alignment kernel. Through an empirical study in a parking-lot surveillance setting, we show that our spatio-temporal fusion scheme and biased sequence-data learning method are highly effective in identifying suspicious events.
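
The paper's own implementation is not reproduced here. As a rough illustration of the two ideas the abstract names, a sequence-alignment kernel over motion trajectories and compensation for a scarce positive (suspicious) class, the sketch below uses an alignment-distance kernel with a class-weighted SVM in scikit-learn. The alignment cost, the Gaussian kernel map, the toy trajectories, and the use of class_weight="balanced" are all assumptions for illustration; the authors' KBA algorithm adjusts the kernel (and thus the decision boundary) itself rather than reweighting misclassification penalties.

```python
# Minimal sketch (not the authors' implementation): an alignment-based
# kernel over trajectories plus a class-weighted SVM as a simple stand-in
# for biased sequence-data learning. All names and data are hypothetical.
import numpy as np
from sklearn.svm import SVC

def align_distance(a, b):
    """Edit-distance-style alignment cost between two trajectories,
    each given as a sequence of feature vectors (here, 2-D positions)."""
    n, m = len(a), len(b)
    D = np.zeros((n + 1, m + 1))
    D[1:, 0] = np.arange(1, n + 1)   # gap cost of 1 per skipped sample
    D[0, 1:] = np.arange(1, m + 1)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = np.linalg.norm(np.asarray(a[i - 1]) - np.asarray(b[j - 1]))
            D[i, j] = min(D[i - 1, j] + 1,
                          D[i, j - 1] + 1,
                          D[i - 1, j - 1] + cost)
    return D[n, m]

def alignment_kernel(X, Y, gamma=0.1):
    """Map alignment distances to similarities with a Gaussian kernel."""
    K = np.zeros((len(X), len(Y)))
    for i, a in enumerate(X):
        for j, b in enumerate(Y):
            K[i, j] = np.exp(-gamma * align_distance(a, b) ** 2)
    return K

# Toy trajectories: many benign (flat) tracks, few suspicious (ramp) tracks.
benign = [[(t, 0.0) for t in range(5)] for _ in range(8)]
suspicious = [[(t, 0.5 * t) for t in range(5)] for _ in range(2)]
X_train = benign + suspicious
y_train = [0] * len(benign) + [1] * len(suspicious)

K_train = alignment_kernel(X_train, X_train)
# class_weight="balanced" penalizes errors on the rare positive class more
# heavily; it is a simpler surrogate for the paper's KBA idea, not KBA itself.
clf = SVC(kernel="precomputed", class_weight="balanced").fit(K_train, y_train)

X_test = [[(t, 0.4 * t) for t in range(5)]]          # ramp-like test track
K_test = alignment_kernel(X_test, X_train)            # rows: test, cols: train
print(clf.predict(K_test))                             # ramp track should lean toward the suspicious class
```

The exponential of an alignment distance is a common heuristic for sequence kernels but is not guaranteed to be positive semi-definite; the paper's sequence-alignment kernel and KBA correction should be consulted for the principled construction.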