This work clusters faces in consumer photos, where lighting, pose, and expression vary widely. After matching face images by local feature points, we transform the matching results into a novel representation called visual sentences. We then construct visual language models to describe the dependencies among image patches on faces. Within this probabilistic framework, we develop a clustering algorithm that groups face images of the same individual into the same cluster. We also report an observation about how face clustering performance is evaluated, and we demonstrate the superiority of the proposed visual language model approach.
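To make the idea of clustering with a language model over visual sentences concrete, the following is a minimal sketch and not the authors' method. The abstract does not state the model order, the smoothing scheme, or the clustering procedure, so everything here is an assumption chosen for illustration: a visual sentence is taken to be a list of integer visual-word IDs, the language model is a bigram model with add-one smoothing, and clustering is a greedy single pass that assigns each sentence to the best-scoring cluster model or opens a new cluster. The names BigramModel, cluster_sentences, vocab_size, and threshold are hypothetical.

```python
import math
from collections import Counter


class BigramModel:
    """Add-one smoothed bigram model over visual-word IDs (illustrative assumption)."""

    def __init__(self, vocab_size):
        self.vocab_size = vocab_size
        self.bigrams = Counter()   # counts of (previous word, current word)
        self.contexts = Counter()  # counts of previous word

    def update(self, sentence):
        """Add the bigram counts of one visual sentence to the model."""
        for prev, cur in zip(sentence, sentence[1:]):
            self.bigrams[(prev, cur)] += 1
            self.contexts[prev] += 1

    def avg_log_prob(self, sentence):
        """Mean log-probability per bigram, so sentences of different length compare fairly."""
        pairs = list(zip(sentence, sentence[1:]))
        if not pairs:
            return float("-inf")
        total = 0.0
        for prev, cur in pairs:
            num = self.bigrams[(prev, cur)] + 1
            den = self.contexts[prev] + self.vocab_size
            total += math.log(num / den)
        return total / len(pairs)


def cluster_sentences(sentences, vocab_size, threshold):
    """Greedy one-pass clustering (assumed procedure, not taken from the paper).

    Each sentence joins the cluster whose model scores it highest, provided
    that score reaches `threshold`; otherwise it starts a new cluster.
    Returns one integer cluster label per input sentence.
    """
    models, labels = [], []
    for sentence in sentences:
        scores = [m.avg_log_prob(sentence) for m in models]
        best = max(range(len(models)), key=scores.__getitem__, default=None)
        if best is None or scores[best] < threshold:
            models.append(BigramModel(vocab_size))
            best = len(models) - 1
        models[best].update(sentence)
        labels.append(best)
    return labels


if __name__ == "__main__":
    # Toy visual sentences: two "individuals" with different patch patterns.
    toy = [[0, 1, 0, 1, 0, 1], [0, 1, 0, 1], [2, 3, 2, 3, 2, 3], [2, 3, 2, 3]]
    print(cluster_sentences(toy, vocab_size=4, threshold=-1.0))
```

In this sketch a model that has never seen a bigram assigns it log(1/vocab_size), so a threshold slightly above that baseline separates sentences a cluster explains from those it does not. The greedy pass is order-dependent; it is used here only because it is short, and an agglomerative or iterative scheme could be substituted under the same scoring function.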