Classification by pairwise coupling
NIPS '97 Proceedings of the 1997 conference on Advances in neural information processing systems 10
Perceptual Organization and Visual Recognition
Perceptual Organization and Visual Recognition
Distortion Invariant Object Recognition in the Dynamic Link Architecture
IEEE Transactions on Computers
Unsupervised Learning of Models for Recognition
ECCV '00 Proceedings of the 6th European Conference on Computer Vision-Part I
Scale & Affine Invariant Interest Point Detectors
International Journal of Computer Vision
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
Pictorial Structures for Object Recognition
International Journal of Computer Vision
Learning to Detect Objects in Images via a Sparse, Part-Based Representation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Combining Top-Down and Bottom-Up Segmentation
CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 4 - Volume 04
CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 12 - Volume 12
Shape Matching and Object Recognition Using Low Distortion Correspondences
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Combining Generative Models and Fisher Kernels for Object Recognition
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Image Analysis, Random Fields and Markov Chain Monte Carlo Methods: A Mathematical Introduction (Stochastic Modelling and Applied Probability)
The Representation and Matching of Pictorial Structures
IEEE Transactions on Computers
Object categorization by compositional graphical models
EMMCVPR'05 Proceedings of the 5th international conference on Energy Minimization Methods in Computer Vision and Pattern Recognition
A stochastic grammar of images
Foundations and Trends® in Computer Graphics and Vision
Object retrieval using configurations of salient regions
CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection
International Journal of Computer Vision
Object Tracking Using Grayscale Appearance Models and Swarm Based Particle Filter
HAIS '08 Proceedings of the 3rd international workshop on Hybrid Artificial Intelligence Systems
Seeing the Objects Behind the Dots: Recognition in Videos from a Moving Camera
International Journal of Computer Vision
Object segmentation in video via graph cut built on superpixels
Fundamenta Informaticae - Cognitive Informatics, Cognitive Computing, and Their Denotational Mathematical Foundations (II)
A new compositional technique for hand posture recognition
ICCOMP'09 Proceedings of the WSEAES 13th international conference on Computers
A compositional technique for hand posture recognition: new results
WSEAS TRANSACTIONS on COMMUNICATIONS
Compositional object recognition, segmentation, and tracking in video
EMMCVPR'07 Proceedings of the 6th international conference on Energy minimization methods in computer vision and pattern recognition
Exploiting low-level image segmentation for object recognition
DAGM'06 Proceedings of the 28th conference on Pattern Recognition
Object segmentation in video via graph cut built on superpixels
Fundamenta Informaticae - Cognitive Informatics, Cognitive Computing, and Their Denotational Mathematical Foundations (II)
Hi-index | 0.01 |
This contribution proposes a compositional approach to visual object categorization of scenes. Compositions are learned from the Caltech 101 database and form intermediate abstractions of images that are semantically situated between low-level representations and the high-level categorization. Salient regions, which are described by localized feature histograms, are detected as image parts. Subsequently compositions are formed as bags of parts with a locality constraint. After performing a spatial binding of compositions by means of a shape model, coupled probabilistic kernel classifiers are applied thereupon to establish the final image categorization. In contrast to the discriminative training of the categorizer, intermediate compositions are learned in a generative manner yielding relevant part agglomerations, i.e. groupings which are frequently appearing in the dataset while simultaneously supporting the discrimination between sets of categories. Consequently, compositionality simplifies the learning of a complex categorization model for complete scenes by splitting it up into simpler, sharable compositions. The architecture is evaluated on the highly challenging Caltech 101 database which exhibits large intra-category variations. Our compositional approach shows competitive retrieval rates in the range of 53.6 ± 0.88% or, with a multi-scale feature set, rates of 57.8 ± 0.79%.