Toward a practical visual object recognition system

Authors:
Mao Nguyen;Minh-Triet Tran
Affiliations:
John von Neumann Institute and University of Science, VNU-HCM;University of Science, VNU-HCM
Venue:
Proceedings of the Fourth Symposium on Information and Communication Technology
Year:
2013

Citing 9
Cited 0

Handwritten digit recognition with a back-propagation network

Advances in neural information processing systems 2
Object Recognition from Local Scale-Invariant Features

ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Video Google: A Text Retrieval Approach to Object Matching in Videos

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories

CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 12 - Volume 12
Histograms of Oriented Gradients for Human Detection

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Robust Face Recognition via Sparse Representation

IEEE Transactions on Pattern Analysis and Machine Intelligence
SURF: speeded up robust features

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part I
Adaptive deconvolutional networks for mid and high level feature learning

ICCV '11 Proceedings of the 2011 International Conference on Computer Vision

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recent researches in cognitive science and document recognition have been applied to deal with the problem of categorizing object. Bag-of-Features (BoF) and its extension Spatial Pyramid Matching (SPM) have made a breakthrough in resolving this kind of challenges. Many methods followed this guideline really enhance the recognition accuracy but still have drawbacks in developing a real-world application whose data size is many times bigger. In this paper we propose two kinds of strategy include five criteria to evaluate and select the most appropriate training samples using for building a high performance classifier. We also suggest a method called reinforcement codebook learning to make the codebook training process not only purpose-built to best fits with the most suitable criteria but also much more efficient by reducing significantly its complexity of computation. Experiments on benchmark object dataset demonstrate that our proposed framework outperforms remarkable results and is comparable with the state-of-the-art in spite of using just 20% of 9 · 106 descriptors for training the dictionary. These results give a promise of building a efficient and feasible object categorization system for practical application as so as suggest some ideas to improve the visual feature representation in future.