Handwritten digit recognition with a back-propagation network
Advances in neural information processing systems 2
Object Recognition from Local Scale-Invariant Features
ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Video Google: A Text Retrieval Approach to Object Matching in Videos
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 12 - Volume 12
Histograms of Oriented Gradients for Human Detection
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Robust Face Recognition via Sparse Representation
IEEE Transactions on Pattern Analysis and Machine Intelligence
SURF: speeded up robust features
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part I
Adaptive deconvolutional networks for mid and high level feature learning
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Hi-index | 0.00 |
Recent researches in cognitive science and document recognition have been applied to deal with the problem of categorizing object. Bag-of-Features (BoF) and its extension Spatial Pyramid Matching (SPM) have made a breakthrough in resolving this kind of challenges. Many methods followed this guideline really enhance the recognition accuracy but still have drawbacks in developing a real-world application whose data size is many times bigger. In this paper we propose two kinds of strategy include five criteria to evaluate and select the most appropriate training samples using for building a high performance classifier. We also suggest a method called reinforcement codebook learning to make the codebook training process not only purpose-built to best fits with the most suitable criteria but also much more efficient by reducing significantly its complexity of computation. Experiments on benchmark object dataset demonstrate that our proposed framework outperforms remarkable results and is comparable with the state-of-the-art in spite of using just 20% of 9 · 106 descriptors for training the dictionary. These results give a promise of building a efficient and feasible object categorization system for practical application as so as suggest some ideas to improve the visual feature representation in future.