OPTIMOL: Automatic Online Picture Collection via Incremental Model Learning

Authors:
Li-Jia Li;Li Fei-Fei
Affiliations:
Dept. of Computer Science, Princeton University, Princeton, USA;Dept. of Computer Science, Princeton University, Princeton, USA and Dept. of Computer Science, Stanford University, Stanford, USA
Venue:
International Journal of Computer Vision
Year:
2010

Citing 37
Cited 19

A training algorithm for optimal margin classifiers

COLT '92 Proceedings of the fifth annual workshop on Computational learning theory
WordNet: a lexical database for English

Communications of the ACM
Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection

IEEE Transactions on Pattern Analysis and Machine Intelligence
A view of the EM algorithm that justifies incremental, sparse, and other variants

Proceedings of the NATO Advanced Study Institute on Learning in graphical models
Probabilistic latent semantic indexing

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
IRM: integrated region matching for image retrieval

MULTIMEDIA '00 Proceedings of the eighth ACM international conference on Multimedia
Saliency, Scale and Image Description

International Journal of Computer Vision
Unifying Keywords and Visual Contents in Image Retrieval

IEEE MultiMedia
Maximum Entropy Markov Models for Information Extraction and Segmentation

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
A decision-theoretic generalization of on-line learning and an application to boosting

EuroCOLT '95 Proceedings of the Second European Conference on Computational Learning Theory
Object Recognition from Local Scale-Invariant Features

ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Automatic image annotation and retrieval using cross-media relevance models

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Latent dirichlet allocation

The Journal of Machine Learning Research
Matching words and pictures

The Journal of Machine Learning Research
Video Google: A Text Retrieval Approach to Object Matching in Videos

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
A Bayesian Approach to Unsupervised One-Shot Learning of Object Categories

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
A bootstrapping approach to annotating large image collection

MIR '03 Proceedings of the 5th ACM SIGMM international workshop on Multimedia information retrieval
Content-based image retrieval by clustering

MIR '03 Proceedings of the 5th ACM SIGMM international workshop on Multimedia information retrieval
Pictorial Structures for Object Recognition

International Journal of Computer Vision
Learning to Detect Objects in Images via a Sparse, Part-Based Representation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories

CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 12 - Volume 12
Semi-Supervised Self-Training of Object Detection Models

WACV-MOTION '05 Proceedings of the Seventh IEEE Workshops on Application of Computer Vision (WACV/MOTION'05) - Volume 1 - Volume 01
OBJ CUT

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
A Sparse Object Category Model for Efficient Learning and Exhaustive Recognition

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Discovering Objects and their Localization in Images

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Learning Hierarchical Models of Scenes, Objects, and Parts

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Learning Object Categories from Google"s Image Search

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Probabilistic web image gathering

Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
One-Shot Learning of Object Categories

IEEE Transactions on Pattern Analysis and Machine Intelligence
Animals on the Web

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Using Dependent Regions for Object Categorization in a Generative Framework

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Pattern Recognition and Machine Learning (Information Science and Statistics)

Pattern Recognition and Machine Learning (Information Science and Statistics)
Effective self-training for parsing

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Introduction to a large-scale general purpose ground truth database: methodology, annotation tool and benchmarks

EMMCVPR'07 Proceedings of the 6th international conference on Energy minimization methods in computer vision and pattern recognition
Learning methods for generic object recognition with invariance to pose and lighting

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Scene classification via pLSA

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part IV
An efficient color representation for image retrieval

IEEE Transactions on Image Processing

Automatic online labeling images via co-active-learning

Proceedings of the First International Conference on Internet Multimedia Computing and Service
On the sampling of web images for learning visual concept classifiers

Proceedings of the ACM International Conference on Image and Video Retrieval
Extracting structures in image collections for object recognition

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Text mining for automatic image tagging

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Active learning through notes data in Flickr: an effortless training data acquisition approach for object localization

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Special Issue on Probabilistic Models for Image Understanding, Part II

International Journal of Computer Vision
Retrieving and ranking unannotated images through collaboratively mining online search results

Proceedings of the 20th ACM international conference on Information and knowledge management
On the pooling of positive examples with ontology for visual concept learning

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Learning from search engine and human supervision for web image search

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Learning to summarize web image and text mutually

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Joint-rerank: a novel method for image search reranking

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
SIR: the smart image retrieval engine

SISAP'12 Proceedings of the 5th international conference on Similarity Search and Applications
An incremental structured part model for image classification

SSPR'12/SPR'12 Proceedings of the 2012 Joint IAPR international conference on Structural, Syntactic, and Statistical Pattern Recognition
A data-driven detection optimization framework

Neurocomputing
Learning realistic facial expressions from web images

Pattern Recognition
"Tell me more": how semantic technologies can help refining internet image search

Proceedings of the International Workshop on Video and Image Ground Truth in Computer Vision Applications
A boosting approach for the simultaneous detection and segmentation of generic objects

Pattern Recognition Letters
Object class detection: A survey

ACM Computing Surveys (CSUR)
A co-boost framework for learning object categories from Google Images with 1st and 2nd order features

The Visual Computer: International Journal of Computer Graphics

Quantified Score

Hi-index	0.00

Visualization

Abstract

The explosion of the Internet provides us with a tremendous resource of images shared online. It also confronts vision researchers the problem of finding effective methods to navigate the vast amount of visual information. Semantic image understanding plays a vital role towards solving this problem. One important task in image understanding is object recognition, in particular, generic object categorization. Critical to this problem are the issues of learning and dataset. Abundant data helps to train a robust recognition system, while a good object classifier can help to collect a large amount of images. This paper presents a novel object recognition algorithm that performs automatic dataset collecting and incremental model learning simultaneously. The goal of this work is to use the tremendous resources of the web to learn robust object category models for detecting and searching for objects in real-world cluttered scenes. Humans contiguously update the knowledge of objects when new examples are observed. Our framework emulates this human learning process by iteratively accumulating model knowledge and image examples. We adapt a non-parametric latent topic model and propose an incremental learning framework. Our algorithm is capable of automatically collecting much larger object category datasets for 22 randomly selected classes from the Caltech 101 dataset. Furthermore, our system offers not only more images in each object category but also a robust object category model and meaningful image annotation. Our experiments show that OPTIMOL is capable of collecting image datasets that are superior to the well known manually collected object datasets Caltech 101 and LabelMe.