Given a collection of images of a static scene taken by many different people, we identify and segment interesting objects. To solve this problem, we use the distribution of images in the collection along with a new field-of-view cue, which leverages the observation that people tend to take photos that frame an object of interest within the field of view. Hence, image features that appear together in many images are likely to be part of the same object. We evaluate the effectiveness of this cue by comparing the segmentations computed by our method against hand-labeled ones for several different models. We also show how the results of our segmentations can be used to highlight important objects in the scene and label them using noisy user-specified textual tag data. These methods are demonstrated on photos of several popular tourist sites downloaded from the Internet.
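The co-occurrence idea behind the field-of-view cue can be illustrated with a minimal sketch. This is not the method described above, only a simplified stand-in: it assumes each scene feature is already associated with the set of images in which it appears, links two features when the Jaccard overlap of those image sets reaches a threshold, and reports the connected groups as candidate objects. The function name `cooccurrence_groups`, the `threshold` parameter, and the feature names in the example are all hypothetical.

```python
from itertools import combinations


def cooccurrence_groups(visibility, threshold=0.5):
    """Group features whose sets of observing images overlap strongly.

    visibility: dict mapping feature id -> set of image ids showing it.
    threshold: minimum Jaccard overlap needed to link two features
               (an illustrative assumption, not a value from the source).
    """
    # Union-find structure: every feature starts in its own group.
    parent = {f: f for f in visibility}

    def find(f):
        # Follow parent links to the group root, compressing the path.
        while parent[f] != f:
            parent[f] = parent[parent[f]]
            f = parent[f]
        return f

    for a, b in combinations(visibility, 2):
        union = visibility[a] | visibility[b]
        if not union:
            continue  # neither feature is seen in any image
        jaccard = len(visibility[a] & visibility[b]) / len(union)
        if jaccard >= threshold:
            parent[find(a)] = find(b)  # merge the two groups

    groups = {}
    for f in visibility:
        groups.setdefault(find(f), set()).add(f)
    # Sort for a deterministic result.
    return sorted((sorted(g) for g in groups.values()), key=lambda g: g[0])


# Hypothetical toy data: two objects photographed by disjoint sets of images.
example = {
    "statue_head": {1, 2, 3, 4},
    "statue_base": {1, 2, 3},
    "fountain_rim": {5, 6, 7},
    "fountain_jet": {5, 6, 7, 8},
}
print(cooccurrence_groups(example))
```

In this toy input the two statue features share three of four images, as do the two fountain features, so each pair ends up in its own group. A real pipeline would work with matched image features from a reconstruction and would likely need a more robust clustering step than simple thresholded linking.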