Struck: Structured output tracking with kernels

Authors:
Sam Hare;Amir Saffari;Philip H. S. Torr
Affiliations:
Oxford Brookes University, UK;Oxford Brookes University, UK;Oxford Brookes University, UK
Venue:
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Year:
2011

Citing 0
Cited 11

Robust tracking with weighted online structured learning

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Real-time compressive tracking

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Exploiting the circulant structure of tracking-by-detection with kernels

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part IV
Robust visual tracking using discriminative stable regions and K-means clustering

Neurocomputing
Local features classification for adaptive tracking

MICAI'12 Proceedings of the 11th Mexican international conference on Advances in Artificial Intelligence - Volume Part I
Robust visual tracking using dynamic classifier selection with sparse representation of label noise

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part III
A multiple face detection and tracking system based on TLD

Proceedings of the Fifth International Conference on Internet Multimedia Computing and Service
A survey of appearance models in visual object tracking

ACM Transactions on Intelligent Systems and Technology (TIST) - Survey papers, special sections on the semantic adaptive social web, intelligent systems for health informatics, regular papers
Combining histogram-wise and pixel-wise matchings for kernel tracking through constrained optimization

Computer Vision and Image Understanding
Co-trained generative and discriminative trackers with cascade particle filter

Computer Vision and Image Understanding
Structured partial least squares for simultaneous object tracking and segmentation

Neurocomputing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Adaptive tracking-by-detection methods are widely used in computer vision for tracking arbitrary objects. Current approaches treat the tracking problem as a classification task and use online learning techniques to update the object model. However, for these updates to happen one needs to convert the estimated object position into a set of labelled training examples, and it is not clear how best to perform this intermediate step. Furthermore, the objective for the classifier (label prediction) is not explicitly coupled to the objective for the tracker (accurate estimation of object position). In this paper, we present a framework for adaptive visual object tracking based on structured output prediction. By explicitly allowing the output space to express the needs of the tracker, we are able to avoid the need for an intermediate classification step. Our method uses a kernelized structured output support vector machine (SVM), which is learned online to provide adaptive tracking. To allow for real-time application, we introduce a budgeting mechanism which prevents the unbounded growth in the number of support vectors which would otherwise occur during tracking. Experimentally, we show that our algorithm is able to outperform state-of-the-art trackers on various benchmark videos. Additionally, we show that we can easily incorporate additional features and kernels into our framework, which results in increased performance.