Discriminative Object Class Models of Appearance and Shape by Correlatons

Authors:
S. Savarese;J. Winn;A. Criminisi
Affiliations:
University of Illinois at Urbana-Champaign;Microsoft Research Ltd., Cambridge, CB3 0FB, United Kingdom;Microsoft Research Ltd., Cambridge, CB3 0FB, United Kingdom
Venue:
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Year:
2006

Citing 0
Cited 44

Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words

International Journal of Computer Vision
Image near-duplicate retrieval using local dependencies in spatial-scale space

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Descriptive visual words and visual phrases for image applications

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Coboost learning of visual categories with 1st and 2nd order features from Google images

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Scale-invariant visual language modeling for object categorization

IEEE Transactions on Multimedia - Special issue on integration of context and content
Structural Context for Object Categorization

PCM '09 Proceedings of the 10th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Coherent phrase model for efficient image near-duplicate retrieval

IEEE Transactions on Multimedia
Embedding spatial information into image content description for scene retrieval

Pattern Recognition
Building contextual visual vocabulary for large-scale image applications

Proceedings of the international conference on Multimedia
Scene categorization using boosted back-propagation neural networks

PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
Building compact local pairwise codebook with joint feature space clustering

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Spatial statistics of visual keypoints for texture recognition

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Event detection and recognition for semantic annotation of video

Multimedia Tools and Applications
Personalization in multimedia retrieval: A survey

Multimedia Tools and Applications
Building descriptive and discriminative visual codebook for large-scale image applications

Multimedia Tools and Applications
Modeling spatial and semantic cues for large-scale near-duplicated image retrieval

Computer Vision and Image Understanding
Transform based spatio-temporal descriptors for human action recognition

Neurocomputing
A BOVW based query generative model

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part I
Exploiting Textons distributions on spatial hierarchy for scene classification

Journal on Image and Video Processing - Special issue on selected papers from multimedia modeling conference 2009
Exploiting local dependencies with spatial-scale space (S-Cube) for near-duplicate retrieval

Computer Vision and Image Understanding
Exploiting photographic style for category-level image classification by generalizing the spatial pyramid

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Semantics extraction from images

Knowledge-driven multimedia information extraction and ontology evolution
Implicit scene context for object segmentation and classification

DAGM'11 Proceedings of the 33rd international conference on Pattern recognition
Image classification using probability higher-order local auto-correlations

ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part III
Encoding spatial arrangement of visual words

CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Spatial feature interdependence matrix (SFIM): a robust descriptor for face recognition

PSIVT'11 Proceedings of the 5th Pacific Rim conference on Advances in Image and Video Technology - Volume Part I
Semantic parsing of street scenes from video

International Journal of Robotics Research
Comparing image classification methods: K-nearest-neighbor and support-vector-machines

AMERICAN-MATH'12/CEA'12 Proceedings of the 6th WSEAS international conference on Computer Engineering and Applications, and Proceedings of the 2012 American conference on Applied Mathematics
Image representation for generic object recognition using higher-order local autocorrelation features on posterior probability images

Pattern Recognition
Intelligent multi-camera video surveillance: A review

Pattern Recognition Letters
Improving bag-of-visual-words model with spatial-temporal correlation for video retrieval

Proceedings of the 21st ACM international conference on Information and knowledge management
Topic based pose relevance learning in dance archives

Proceedings of the 21st ACM international conference on Information and knowledge management
Bag of spatio-visual words for context inference in scene classification

Pattern Recognition
Segmentation and classification of objects with implicit scene context

Proceedings of the 15th international conference on Theoretical Foundations of Computer Vision: outdoor and large-scale real-world scene analysis
ISABoost: A weak classifier inner structure adjusting based AdaBoost algorithm-ISABoost based application in scene categorization

Neurocomputing
Text extraction from scene images by character appearance and structure modeling

Computer Vision and Image Understanding
Part-based object detection into a hierarchy of image segmentations combining color and topology

Pattern Recognition Letters
Action disambiguation analysis using normalized google-like distance correlogram

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part III
Object class detection: A survey

ACM Computing Surveys (CSUR)
Learning group-based dictionaries for discriminative image representation

Pattern Recognition
Visual word spatial arrangement for image retrieval and classification

Pattern Recognition
Learning structured visual dictionary for object tracking

Image and Vision Computing
A co-boost framework for learning object categories from Google Images with 1st and 2nd order features

The Visual Computer: International Journal of Computer Graphics
Reading the legends of Roman Republican coins

Journal on Computing and Cultural Heritage (JOCCH)

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a new model of object classes which incorporates appearance and shape information jointly. Modeling objects appearance by distributions of visual words has recently proven successful. Here appearancebased models are augmented by capturing the spatial arrangement of visual words. Compact spatial modeling without loss of discrimination is achieved through the introduction of adaptive vector quantized correlograms, which we call correlatons. Efficiency is further improved by means of integral images. The robustness of our new models to geometric transformations, severe occlusions and missing information is also demonstrated. The accuracy of discrimination of the proposed models is assessed with respect to existing databases with large numbers of object classes viewed under general conditions, and shown to outperform appearance-only models.