Joint people, event, and location recognition in personal photo collections using cross-domain context

Authors:
Dahua Lin;Ashish Kapoor;Gang Hua;Simon Baker
Affiliations:
Computer Science and Artificial Intelligence Laboratory, MIT and Microsoft Research;Microsoft Research;Nokia Research Center Hollywood;Microsoft Research
Venue:
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Year:
2010

Citing 11
Cited 8

A limited memory algorithm for bound constrained optimization

SIAM Journal on Scientific Computing
The Earth Mover's Distance as a Metric for Image Retrieval

International Journal of Computer Vision
Contextual Priming for Object Detection

International Journal of Computer Vision
Context-based vision system for place and object recognition

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Automated annotation of human faces in family albums

MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Leveraging context to resolve identity in photo albums

Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
Towards context-aware face recognition

Proceedings of the 13th annual ACM international conference on Multimedia
EasyAlbum: an interactive photo annotation system based on face clustering and re-ranking

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Graphical Models, Exponential Families, and Variational Inference

Foundations and Trends® in Machine Learning
Context-aided human recognition – clustering

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part III
A new class of upper bounds on the log partition function

IEEE Transactions on Information Theory

Personalized portraits ranking

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Graph-based recognition in photo collections using social semantics

SBNMA '11 Proceedings of the 2011 ACM workshop on Social and behavioural networked media access
Facing scalability: Naming faces in an online social network

Pattern Recognition
Discovering inherent event taxonomies from social media collections

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Who is here: location aware face recognition

Proceedings of the Third International Workshop on Sensing Applications on Mobile Phones
Describing clothing by semantic attributes

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Object class detection: A survey

ACM Computing Surveys (CSUR)
Concurrent photo sequence organization

Multimedia Tools and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present a framework for vision-assisted tagging of personal photo collections using context. Whereas previous efforts mainly focus on tagging people, we develop a unified approach to jointly tag across multiple domains (specifically people, events, and locations). The heart of our approach is a generic probabilistic model of context that couples the domains through a set of cross-domain relations. Each relation models how likely the instances in two domains are to co-occur. Based on this model, we derive an algorithm that simultaneously estimates the cross-domain relations and infers the unknown tags in a semi-supervised manner. We conducted experiments on two well-known datasets and obtained significant performance improvements in both people and location recognition. We also demonstrated the ability to infer event labels with missing timestamps (i.e. with no event features).