Multimodal feature generation framework for semantic image classification
Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Hybrid classifiers for object classification with a rich background
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
Local hypersphere coding based on edges between visual words
ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
Traffic sign recognition using group sparse coding
Information Sciences: an International Journal
Hi-index | 0.00 |
The codebook based (bag-of-words) model is a widely applied model for image classification. We analyze recent coding strategies in this model, and find that saliency is the fundamental characteristic of coding. The saliency in coding means that if a visual code is much closer to a descriptor than other codes, it will obtain a very strong response. The salient representation under maximum pooling operation leads to the state-of-the-art performance on many databases and competitions. However, most current coding schemes do not recognize the role of salient representation, so that they may lead to large deviations in representing local descriptors. In this paper, we propose "salient coding", which employs the ratio between descriptors' nearest code and other codes to describe descriptors. This approach can guarantee salient representation without deviations. We study salient coding on two sets of image classification databases (15-Scenes and PASCAL VOC2007). The experimental results demonstrate that our approach outperforms all other coding methods in image classification.