This paper investigates long-term tracking of unknown objects in a video stream. The object is defined by its location and extent in a single frame. In every frame that follows, the task is to determine the object's location and extent or indicate that the object is not present. We propose a novel tracking framework (TLD) that explicitly decomposes the long-term tracking task into tracking, learning, and detection. The tracker follows the object from frame to frame. The detector localizes all appearances that have been observed so far and corrects the tracker if necessary. The learning estimates the detector's errors and updates it to avoid these errors in the future. We study how to identify the detector's errors and learn from them. We develop a novel learning method (P-N learning) which estimates the errors by a pair of “experts”: 1) P-expert estimates missed detections, and 2) N-expert estimates false alarms. The learning process is modeled as a discrete dynamical system and the conditions under which the learning guarantees improvement are found. We describe our real-time implementation of the TLD framework and the P-N learning. We carry out an extensive quantitative evaluation which shows a significant improvement over state-of-the-art approaches.
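The role of the two experts can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not say how the experts decide, so the overlap-based rule here is an assumption made for illustration, and the names `overlap`, `pn_experts`, `near`, and `far` are hypothetical. The sketch assumes a single location in the frame has already been validated as the object, for example by the tracker, and compares the detector's per-candidate decisions against it.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x, y, width, height)


def overlap(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    iw = max(0.0, min(ax + aw, bx + bw) - max(ax, bx))
    ih = max(0.0, min(ay + ah, by + bh) - max(ay, by))
    inter = iw * ih
    union = aw * ah + bw * bh - inter
    return inter / union if union > 0 else 0.0


def pn_experts(
    candidates: Sequence[Box],
    detector_labels: Sequence[bool],
    validated_box: Box,
    near: float = 0.6,  # assumed threshold, not taken from the paper
    far: float = 0.2,   # assumed threshold, not taken from the paper
) -> Tuple[List[int], List[int]]:
    """Estimate the detector's errors in one frame.

    Returns (missed, false_alarms) as lists of candidate indices:
    - P-expert: candidates the detector rejected although they lie
      close to the validated location (estimated missed detections).
    - N-expert: candidates the detector accepted although they lie
      far from the validated location (estimated false alarms).
    """
    missed: List[int] = []
    false_alarms: List[int] = []
    for i, (box, is_object) in enumerate(zip(candidates, detector_labels)):
        iou = overlap(box, validated_box)
        if not is_object and iou >= near:
            missed.append(i)
        elif is_object and iou <= far:
            false_alarms.append(i)
    return missed, false_alarms


# Toy frame: three candidate boxes and the detector's decision for each.
candidates = [(10, 10, 20, 20), (11, 10, 20, 20), (100, 100, 20, 20)]
detector_labels = [True, False, True]
validated_box = (10, 10, 20, 20)

missed, false_alarms = pn_experts(candidates, detector_labels, validated_box)
print(missed, false_alarms)
```

In a learning step along these lines, the candidates flagged by the P-expert would be added to the detector's training set as positive examples and those flagged by the N-expert as negative examples, which is how the detector is updated to avoid the same errors in later frames.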