A bootstrapping approach to annotating large image collection

Authors:
HuaMin Feng;Tat-Seng Chua
Affiliations:
National University of Singapore, Singapore, Republic of Singapore;National University of Singapore, Singapore, Republic of Singapore
Venue:
MIR '03 Proceedings of the 5th ACM SIGMM international workshop on Multimedia information retrieval
Year:
2003

Citing 16
Cited 18

A sequential algorithm for training text classifiers

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
The nature of statistical learning theory

The nature of statistical learning theory
VisualSEEk: a fully automated content-based image query system

MULTIMEDIA '96 Proceedings of the fourth ACM international conference on Multimedia
Combining labeled and unlabeled data with co-training

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Analyzing the effectiveness and applicability of co-training

Proceedings of the ninth international conference on Information and knowledge management
Unsupervised Segmentation of Color-Texture Regions in Images and Video

IEEE Transactions on Pattern Analysis and Machine Intelligence
Learning-based linguistic indexing of pictures with 2--d MHMMs

Proceedings of the tenth ACM international conference on Multimedia
Learning pattern rules for Chinese named entity extraction

Eighteenth national conference on Artificial intelligence
Recognition of Images in Large Databases Using a Learning Framework

Recognition of Images in Large Databases Using a Learning Framework
Bayes point machines

The Journal of Machine Learning Research
Bootstrapping

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Uncertainty reduction in collaborative bootstrapping: measure and algorithm

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Multimedia semantic indexing using model vectors

ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 1
Active learning with statistical models

Journal of Artificial Intelligence Research
An active learning framework for content-based information retrieval

IEEE Transactions on Multimedia
CBSA: content-based soft annotation for multimodal image retrieval using Bayes point machines

IEEE Transactions on Circuits and Systems for Video Technology

A bootstrapping framework for annotating and retrieving WWW images

Proceedings of the 12th annual ACM international conference on Multimedia
A survey of content-based image retrieval with high-level semantics

Pattern Recognition
Automatic object model acquisition and object recognition by integrating linguistic and visual information

Proceedings of the 15th international conference on Multimedia
Deriving semantic terms for images by mining the web

Proceedings of the 11th International Conference on Electronic Commerce
Building an automatic annotate image system by using bootstrapping

CATE '07 Proceedings of the 10th IASTED International Conference on Computers and Advanced Technology in Education
Improving keyword based web image search with visual feature distribution and term expansion

Knowledge and Information Systems
Automatic online labeling images via co-active-learning

Proceedings of the First International Conference on Internet Multimedia Computing and Service
OPTIMOL: Automatic Online Picture Collection via Incremental Model Learning

International Journal of Computer Vision
Dynamic Adaboost learning with feature selection based on parallel genetic algorithm for image annotation

Knowledge-Based Systems
A method for processing the natural language query in ontology-based image retrieval system

AMR'06 Proceedings of the 4th international conference on Adaptive multimedia retrieval: user, context, and feedback
SAFIRE: towards standardized semantic rich image annotation

AMR'06 Proceedings of the 4th international conference on Adaptive multimedia retrieval: user, context, and feedback
Multi-modal multi-label semantic indexing of images based on hybrid ensemble learning

PCM'07 Proceedings of the multimedia 8th Pacific Rim conference on Advances in multimedia information processing
A semantic content-based retrieval method for histopathology images

AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
Composition based semantic scene retrieval for ancient murals

PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
Two-stage localization for image labeling

PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
Weakly supervised landmark labeling in searched data

ICIMCS '10 Proceedings of the Second International Conference on Internet Multimedia Computing and Service
A review on automatic image annotation techniques

Pattern Recognition
Automatic image annotation by mining the web

DaWaK'06 Proceedings of the 8th international conference on Data Warehousing and Knowledge Discovery

Quantified Score

Hi-index	0.01

Visualization

Abstract

Huge amount of manual efforts are required to annotate large image/video archives with text annotations. Several recent works attempted to automate this task by employing supervised learning approaches to associate visual information extracted in segmented images with semantic concepts provided by associated text. The main limitation of such approaches, however, is that large labeled training corpus is still needed for effective learning, and semantically meaningful segmentation for images is in general unavailable. This paper explores the use of bootstrapping approach to tackle this problem. The idea is to start from a small set of labeled training examples, and successively annotate a larger set of unlabeled examples. This is done using the cotraining approach, in which two "statistically independent" classifiers are used to co-train and co-annotate the unlabeled examples. An active learning approach is used to select the best examples to label at each stage of learning in order to maximize the learning objective. To accomplish this, we break the task of annotating images into the sub-tasks of: (a) segmenting images into meaningful units, (b) extracting appropriate features for the units, and (c) associating these features with text. Because of the uncertainty in sub-tasks (a) and (b), we adopt two independent segmentation methods (task a) and two independent sets of features (task b) to support co-training. We carried out experiments using a mid-sized image collection (comprising about 6,000 images from CorelCD, PhotoCD and Web) and demonstrated that our bootstrapping approach significantly improve the performance of annotation by about 10% in terms of F1 measure as compared to the best results obtained from the traditional supervised learning approach. Moreover, the bootstrapping approach has the key advantage of requiring much fewer labeled examples in training.