Fall detection in multi-camera surveillance videos: experimentations and observations

  • Authors:
  • Sen Wang;Zhongwen Xu;Yi Yang;Xue Li;Chaoyi Pang;Alexander G. Hauptmann

  • Affiliations:
  • School of ITEE, EAIT, University of Queensland, Brisbane, Australia;Zhejiang University, Zhejiang, China;School of ITEE, EAIT, University of Queensland, Brisbane, Australia;School of ITEE, EAIT, University of Queensland, Brisbane, Australia;eHealth Center, CSIRO, Brisbane, Australia;Carnegie Mellon University, Pittsburgh, USA

  • Venue:
  • Proceedings of the 1st ACM international workshop on Multimedia indexing and information retrieval for healthcare
  • Year:
  • 2013

Abstract

This paper presents our study on fall detection for ageing-care monitoring. We collected a choreographed multi-camera dataset that contains fall actions as well as other actions such as walking, standing up and sitting down. In our work, MoSIFT features are extracted from the videos recorded by each camera. We conduct a series of experiments to show how fall-detection performance varies across different methods. We first compare the performance of the standard Bag-of-Words and spatial Bag-of-Words representations with different codebook sizes. Then, we test different fusion methods that combine the information from the videos recorded by two orthogonally deployed cameras, where a non-linear χ2-kernel Support Vector Machine (SVM) is trained to detect fall actions. In addition, we use explicit feature maps with a linear kernel for fall detection and compare them to the standard Bag-of-Words representation with a non-linear χ2 kernel. Our experimental results show that late fusion of Bag-of-Words with a 1000-center codebook obtains the best performance, reaching 90.46% average precision. Such a system may in turn provide a more independent and safer living environment for the elderly.
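The pipeline the abstract describes — per-camera Bag-of-Words histograms, a non-linear χ2-kernel SVM per camera, score-level (late) fusion, and an explicit feature map paired with a linear SVM — can be sketched with scikit-learn. This is only an illustrative toy under loud assumptions: the histograms and labels below are synthetic, the codebook is shrunk from 1000 centers to 40 bins, and the authors' actual MoSIFT extraction, annotation, and training protocol are not reproduced here.

```python
import numpy as np
from sklearn.svm import SVC, LinearSVC
from sklearn.metrics.pairwise import chi2_kernel
from sklearn.kernel_approximation import AdditiveChi2Sampler

n, k = 150, 40  # toy stand-in: 40-bin histograms instead of a 1000-center codebook

def toy_histograms(seed):
    # Hypothetical L1-normalised Bag-of-Words histograms for one camera
    # (in the paper these would come from quantised MoSIFT descriptors).
    r = np.random.default_rng(seed)
    H = r.random((n, k))
    return H / H.sum(axis=1, keepdims=True)

X_cam1, X_cam2 = toy_histograms(1), toy_histograms(2)
# Synthetic "fall / no-fall" labels; real labels come from annotation.
y = (X_cam1[:, 0] + X_cam2[:, 0] > X_cam1[:, 1] + X_cam2[:, 1]).astype(int)

# Non-linear chi2-kernel SVM, one per camera, via a precomputed Gram matrix.
clf1 = SVC(kernel="precomputed", probability=True, random_state=0)
clf1.fit(chi2_kernel(X_cam1, X_cam1), y)
clf2 = SVC(kernel="precomputed", probability=True, random_state=0)
clf2.fit(chi2_kernel(X_cam2, X_cam2), y)

# Late fusion: average the per-camera fall probabilities into one score.
p1 = clf1.predict_proba(chi2_kernel(X_cam1, X_cam1))[:, 1]
p2 = clf2.predict_proba(chi2_kernel(X_cam2, X_cam2))[:, 1]
fused_scores = 0.5 * (p1 + p2)

# Explicit feature map + linear SVM: AdditiveChi2Sampler approximates the
# additive chi2 kernel so a fast linear solver can replace the non-linear SVM.
mapped = AdditiveChi2Sampler(sample_steps=2).fit_transform(X_cam1)
lin = LinearSVC(max_iter=10000).fit(mapped, y)
```

Late fusion here averages classifier scores rather than concatenating features (early fusion); both variants are among the fusion methods the experiments compare.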