Relative attributes

Authors:
Devi Parikh;Kristen Grauman
Affiliations:
Toyota Technological Institute Chicago (TTIC), USA;University of Texas at Austin, USA
Venue:
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Year:
2011

Citing 0
Cited 21

Attribute feedback

Proceedings of the 20th ACM international conference on Multimedia
Distributional semantics with eyes: using image analysis to improve computational representations of word meaning

Proceedings of the 20th ACM international conference on Multimedia
Supervised earth mover's distance learning and its computer vision applications

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part I
Attributes for classifier feedback

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Constrained semi-supervised learning using attributes and comparative attributes

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Learning hybrid part filters for scene recognition

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
View-Invariant action recognition using latent kernelized structural SVM

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
Attribute learning for understanding unstructured social activity

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part IV
Unsupervised learning of discriminative relative visual attributes

ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part III
Human age estimation using ranking SVM

CCBR'12 Proceedings of the 7th Chinese conference on Biometric Recognition
Learning attribute-aware dictionary for image classification and search

Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
NuActiv: recognizing unseen new activities using semantic attribute-based learning

Proceeding of the 11th annual international conference on Mobile systems, applications, and services
Relative forest for attribute prediction

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
AfNet: the affordance network

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
Finding happiest moments in a social context

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
MLRank: Multi-correlation Learning to Rank for image annotation

Pattern Recognition
Towards decrypting attractiveness via multi-modality cues

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Attribit: content creation with semantic attributes

Proceedings of the 26th annual ACM symposium on User interface software and technology
GIANT: geo-informative attributes for location recognition and exploration

Proceedings of the 21st ACM international conference on Multimedia
Scene image retrieval via re-ranking semantic and packed dense interestpoints

Neurocomputing
Editor's Choice Article: A survey of approaches and trends in person re-identification

Image and Vision Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Human-nameable visual "attributes" can benefit various recognition tasks. However, existing techniques restrict these properties to categorical labels (for example, a person is 'smiling' or not, a scene is 'dry' or not), and thus fail to capture more general semantic relationships. We propose to model relative attributes. Given training data stating how object/scene categories relate according to different attributes, we learn a ranking function per attribute. The learned ranking functions predict the relative strength of each property in novel images. We then build a generative model over the joint space of attribute ranking outputs, and propose a novel form of zero-shot learning in which the supervisor relates the unseen object category to previously seen objects via attributes (for example, 'bears are furrier than giraffes'). We further show how the proposed relative attributes enable richer textual descriptions for new images, which in practice are more precise for human interpretation. We demonstrate the approach on datasets of faces and natural scenes, and show its clear advantages over traditional binary attribute prediction for these new tasks.