Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words
International Journal of Computer Vision
Image near-duplicate retrieval using local dependencies in spatial-scale space
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Descriptive visual words and visual phrases for image applications
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Coboost learning of visual categories with 1st and 2nd order features from Google images
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Scale-invariant visual language modeling for object categorization
IEEE Transactions on Multimedia - Special issue on integration of context and content
Structural Context for Object Categorization
PCM '09 Proceedings of the 10th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Coherent phrase model for efficient image near-duplicate retrieval
IEEE Transactions on Multimedia
Embedding spatial information into image content description for scene retrieval
Pattern Recognition
Building contextual visual vocabulary for large-scale image applications
Proceedings of the international conference on Multimedia
Scene categorization using boosted back-propagation neural networks
PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
Building compact local pairwise codebook with joint feature space clustering
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Spatial statistics of visual keypoints for texture recognition
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Event detection and recognition for semantic annotation of video
Multimedia Tools and Applications
Personalization in multimedia retrieval: A survey
Multimedia Tools and Applications
Building descriptive and discriminative visual codebook for large-scale image applications
Multimedia Tools and Applications
Modeling spatial and semantic cues for large-scale near-duplicated image retrieval
Computer Vision and Image Understanding
A BOVW based query generative model
MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part I
Exploiting Textons distributions on spatial hierarchy for scene classification
Journal on Image and Video Processing - Special issue on selected papers from multimedia modeling conference 2009
Exploiting local dependencies with spatial-scale space (S-Cube) for near-duplicate retrieval
Computer Vision and Image Understanding
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Semantics extraction from images
Knowledge-driven multimedia information extraction and ontology evolution
Implicit scene context for object segmentation and classification
DAGM'11 Proceedings of the 33rd international conference on Pattern recognition
Image classification using probability higher-order local auto-correlations
ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part III
Encoding spatial arrangement of visual words
CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Spatial feature interdependence matrix (SFIM): a robust descriptor for face recognition
PSIVT'11 Proceedings of the 5th Pacific Rim conference on Advances in Image and Video Technology - Volume Part I
Semantic parsing of street scenes from video
International Journal of Robotics Research
Comparing image classification methods: K-nearest-neighbor and support-vector-machines
AMERICAN-MATH'12/CEA'12 Proceedings of the 6th WSEAS international conference on Computer Engineering and Applications, and Proceedings of the 2012 American conference on Applied Mathematics
Intelligent multi-camera video surveillance: A review
Pattern Recognition Letters
Improving bag-of-visual-words model with spatial-temporal correlation for video retrieval
Proceedings of the 21st ACM international conference on Information and knowledge management
Topic based pose relevance learning in dance archives
Proceedings of the 21st ACM international conference on Information and knowledge management
Bag of spatio-visual words for context inference in scene classification
Pattern Recognition
Segmentation and classification of objects with implicit scene context
Proceedings of the 15th international conference on Theoretical Foundations of Computer Vision: outdoor and large-scale real-world scene analysis
Text extraction from scene images by character appearance and structure modeling
Computer Vision and Image Understanding
Part-based object detection into a hierarchy of image segmentations combining color and topology
Pattern Recognition Letters
Action disambiguation analysis using normalized google-like distance correlogram
ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part III
Object class detection: A survey
ACM Computing Surveys (CSUR)
Learning group-based dictionaries for discriminative image representation
Pattern Recognition
Visual word spatial arrangement for image retrieval and classification
Pattern Recognition
Learning structured visual dictionary for object tracking
Image and Vision Computing
The Visual Computer: International Journal of Computer Graphics
Reading the legends of Roman Republican coins
Journal on Computing and Cultural Heritage (JOCCH)
Hi-index | 0.00 |
This paper presents a new model of object classes which incorporates appearance and shape information jointly. Modeling objects appearance by distributions of visual words has recently proven successful. Here appearancebased models are augmented by capturing the spatial arrangement of visual words. Compact spatial modeling without loss of discrimination is achieved through the introduction of adaptive vector quantized correlograms, which we call correlatons. Efficiency is further improved by means of integral images. The robustness of our new models to geometric transformations, severe occlusions and missing information is also demonstrated. The accuracy of discrimination of the proposed models is assessed with respect to existing databases with large numbers of object classes viewed under general conditions, and shown to outperform appearance-only models.