In defense of soft-assignment coding

Authors:
Lingqiao Liu; Lei Wang; Xinwang Liu
Affiliations:
School of Engineering, Australian National University, ACT 0200, Australia;School of Engineering, Australian National University, ACT 0200, Australia;School of Engineering, Australian National University, ACT 0200, Australia
Venue:
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Year:
2011

Citing 0
Cited 10

Multimodal feature generation framework for semantic image classification

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Spherical soft assignment: improving image representation in content-based image retrieval

PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
Comparison of mid-level feature coding approaches and pooling strategies in visual concept detection

Computer Vision and Image Understanding
Local hypersphere coding based on edges between visual words

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
A comparative study of encoding, pooling and normalization methods for action recognition

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part III
Exploring dense trajectory feature and encoding methods for human interaction recognition

Proceedings of the Fifth International Conference on Internet Multimedia Computing and Service
A local descriptor based on Laplacian pyramid coding for action recognition

Pattern Recognition Letters
Background subtraction using hybrid feature coding in the bag-of-features framework

Pattern Recognition Letters
Bilevel visual words coding for image classification

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Towards large-scale geometry indexing by feature selection

Computer Vision and Image Understanding

Quantified Score

Hi-index	0.00

Visualization

Abstract

In object recognition, soft-assignment coding enjoys computational efficiency and conceptual simplicity. However, its classification performance is inferior to the newly developed sparse or local coding schemes. It would be highly desirable if its classification performance could become comparable to the state-of-the-art, leading to a coding scheme which perfectly combines computational efficiency and classification performance. To achieve this, we revisit soft-assignment coding from two key aspects: classification performance and probabilistic interpretation. For the first aspect, we argue that the inferiority of soft-assignment coding is due to its neglect of the underlying manifold structure of local features. To remedy this, we propose a simple modification to localize the soft-assignment coding, which surprisingly achieves comparable or even better performance than existing sparse or local coding schemes while maintaining its computational advantage. For the second aspect, based on our probabilistic interpretation of the soft-assignment coding, we give a probabilistic explanation to the magic max-pooling operation, which has successfully been used by sparse or local coding schemes but still poorly understood. This probability explanation motivates us to develop a new mix-order max-pooling operation which further improves the classification performance of the proposed coding scheme. As experimentally demonstrated, the localized soft-assignment coding achieves the state-of-the-art classification performance with the highest computational efficiency among the existing coding schemes.