A decision-theoretic generalization of on-line learning and an application to boosting
EuroCOLT '95 Proceedings of the Second European Conference on Computational Learning Theory
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Multi-probe LSH: efficient indexing for high-dimensional similarity search
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Speeded-Up Robust Features (SURF)
Computer Vision and Image Understanding
Fast Keypoint Recognition Using Random Ferns
IEEE Transactions on Pattern Analysis and Machine Intelligence
BRIEF: binary robust independent elementary features
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Forward-Backward Error: Automatic Detection of Tracking Failures
ICPR '10 Proceedings of the 2010 20th International Conference on Pattern Recognition
Machine learning for high-speed corner detection
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part I
Context tracker: Exploring supporters and distracters in unconstrained environments
CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
ORB: An efficient alternative to SIFT or SURF
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Struck: Structured output tracking with kernels
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Hi-index | 0.00 |
In this paper we propose using invariant local features and a global appearance validation for assisting a robust object tracker initialized by a single example. Although local features have been used before for several object detection and tracking applications, these approaches often model an object as a collection of key-points and descriptors, which involves constructing a set of correspondences between object and image key-points via descriptor matching or key-point classification. However, these algorithms cannot properly adapt to long video sequences due to their limited capacity for incremental update. We differentiate from these approaches in that we obtain key-point-to-object correspondences instead of key-point-to-key-point correspondences converting the problem into an easier binary classification problem, which allows us to use a state-of-the-art algorithm to incrementally update our classifier. Our approach is embedded into the Tracking-Learning-Detection (TLD) framework by performing a set of changes in the detection stage. We show how measuring the density of positive local features given by a binary classifier trained on-line is a good signal of the object's presence, and in combination with a global appearance validation it yields a strong object detector for assisting a tracking algorithm. In order to validate our approach we compare the tracking results against the original TLD approach on a set of 10 videos.