Gathering and ranking photos of named entities with high precision, high recall, and diversity

Authors:
Bilyana Taneva;Mouna Kacimi;Gerhard Weikum
Affiliations:
Max-Planck Institute for Informatics, Saarbrücken, Germany;Free University of Bozen-Bolzano, Bolzano, Italy;Max-Planck Institute for Informatics, Saarbrücken, Germany
Venue:
Proceedings of the third ACM international conference on Web search and data mining
Year:
2010

Citing 23
Cited 12

Data mining: practical machine learning tools and techniques with Java implementations

Data mining: practical machine learning tools and techniques with Java implementations
Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

Communications of the ACM
Multidimensional binary search trees used for associative searching

Communications of the ACM
Shape Indexing Using Approximate Nearest-Neighbour Search in High-Dimensional Spaces

CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Composite Templates for Cloth Modeling and Sketching

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1
Yago: a core of semantic knowledge

Proceedings of the 16th international conference on World Wide Web
Ontology driven content based image retrieval

Proceedings of the 6th ACM international conference on Image and video retrieval
Learning people annotation from the web via consistency learning

Proceedings of the international workshop on Workshop on multimedia information retrieval
LabelMe: A Database and Web-Based Tool for Image Annotation

International Journal of Computer Vision
Image retrieval: Ideas, influences, and trends of the new age

ACM Computing Surveys (CSUR)
Generating diverse and representative image search results for landmarks

Proceedings of the 17th international conference on World Wide Web
World-scale mining of objects and events from community photo collections

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Learning to reduce the semantic gap in web image retrieval and annotation

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
YAGO: A Large Ontology from Wikipedia and WordNet

Web Semantics: Science, Services and Agents on the World Wide Web
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Visual diversification of image search results

Proceedings of the 18th international conference on World wide web
Mapping the world's photos

Proceedings of the 18th international conference on World wide web
MEDIALIFE: from images to a life chronicle

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Open information extraction from the web

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Introduction to a large-scale general purpose ground truth database: methodology, annotation tool and benchmarks

EMMCVPR'07 Proceedings of the 6th international conference on Energy minimization methods in computer vision and pattern recognition
DBpedia: a nucleus for a web of open data

ISWC'07/ASWC'07 Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference
MPEG-7 multimedia description schemes

IEEE Transactions on Circuits and Systems for Video Technology

Database researchers: plumbers or thinkers?

Proceedings of the 14th International Conference on Extending Database Technology
SocialSearch: enhancing entity search with social network matching

Proceedings of the 14th International Conference on Extending Database Technology
CATE: context-aware timeline for entity illustration

Proceedings of the 20th international conference companion on World wide web
What have fruits to do with technology?: the case of Orange, Blackberry and Apple

Proceedings of the International Conference on Web Intelligence, Mining and Semantics
Enishi: searching knowledge about relations by complementarily utilizing wikipedia and the web

WISE'10 Proceedings of the 11th international conference on Web information systems engineering
Multipedia: enriching DBpedia with multimedia information

Proceedings of the sixth international conference on Knowledge capture
Finding images of difficult entities in the long tail

Proceedings of the 20th ACM international conference on Information and knowledge management
Chapter 3: search for knowledge

Search Computing
Knowledge harvesting in the big-data era

Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Aggregated search: A new information retrieval paradigm

ACM Computing Surveys (CSUR)
SocialSearch+: enriching social network with web evidences

World Wide Web
Div400: a social image retrieval result diversification dataset

Proceedings of the 5th ACM Multimedia Systems Conference

Quantified Score

Hi-index	0.00

Visualization

Abstract

Knowledge-sharing communities like Wikipedia and automated extraction methods like those of DBpedia enable the construction of large machine-processible knowledge bases with relational facts about entities. These endeavors lack multimodal data like photos and videos of people and places. While photos of famous entities are abundant on the Internet, they are much harder to retrieve for less popular entities such as notable computer scientists or regionally interesting churches. Querying the entity names in image search engines yields large candidate lists, but they often have low precision and unsatisfactory recall. Our goal is to populate a knowledge base with photos of named entities, with high precision, high recall, and diversity of photos for a given entity. We harness relational facts about entities for generating expanded queries to retrieve different candidate lists from image search engines. We use a weighted voting method to determine better rankings of an entity's photos. Appropriate weights are dependent on the type of entity (e.g., scientist vs. politician) and automatically computed from a small set of training entities. We also exploit visual similarity measures based on SIFT features, for higher diversity in the final rankings. Our experiments with photos of persons and landmarks show significant improvements of ranking measures like MAP and NDCG, and also for diversity-aware ranking.