A sequential algorithm for training text classifiers
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
The nature of statistical learning theory
The nature of statistical learning theory
VisualSEEk: a fully automated content-based image query system
MULTIMEDIA '96 Proceedings of the fourth ACM international conference on Multimedia
Combining labeled and unlabeled data with co-training
COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Analyzing the effectiveness and applicability of co-training
Proceedings of the ninth international conference on Information and knowledge management
Unsupervised Segmentation of Color-Texture Regions in Images and Video
IEEE Transactions on Pattern Analysis and Machine Intelligence
Learning-based linguistic indexing of pictures with 2--d MHMMs
Proceedings of the tenth ACM international conference on Multimedia
Learning pattern rules for Chinese named entity extraction
Eighteenth national conference on Artificial intelligence
Recognition of Images in Large Databases Using a Learning Framework
Recognition of Images in Large Databases Using a Learning Framework
The Journal of Machine Learning Research
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Uncertainty reduction in collaborative bootstrapping: measure and algorithm
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Multimedia semantic indexing using model vectors
ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 1
Active learning with statistical models
Journal of Artificial Intelligence Research
An active learning framework for content-based information retrieval
IEEE Transactions on Multimedia
CBSA: content-based soft annotation for multimodal image retrieval using Bayes point machines
IEEE Transactions on Circuits and Systems for Video Technology
A bootstrapping framework for annotating and retrieving WWW images
Proceedings of the 12th annual ACM international conference on Multimedia
A survey of content-based image retrieval with high-level semantics
Pattern Recognition
Proceedings of the 15th international conference on Multimedia
Deriving semantic terms for images by mining the web
Proceedings of the 11th International Conference on Electronic Commerce
Building an automatic annotate image system by using bootstrapping
CATE '07 Proceedings of the 10th IASTED International Conference on Computers and Advanced Technology in Education
Improving keyword based web image search with visual feature distribution and term expansion
Knowledge and Information Systems
Automatic online labeling images via co-active-learning
Proceedings of the First International Conference on Internet Multimedia Computing and Service
OPTIMOL: Automatic Online Picture Collection via Incremental Model Learning
International Journal of Computer Vision
A method for processing the natural language query in ontology-based image retrieval system
AMR'06 Proceedings of the 4th international conference on Adaptive multimedia retrieval: user, context, and feedback
SAFIRE: towards standardized semantic rich image annotation
AMR'06 Proceedings of the 4th international conference on Adaptive multimedia retrieval: user, context, and feedback
Multi-modal multi-label semantic indexing of images based on hybrid ensemble learning
PCM'07 Proceedings of the multimedia 8th Pacific Rim conference on Advances in multimedia information processing
A semantic content-based retrieval method for histopathology images
AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
Composition based semantic scene retrieval for ancient murals
PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
Two-stage localization for image labeling
PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
Weakly supervised landmark labeling in searched data
ICIMCS '10 Proceedings of the Second International Conference on Internet Multimedia Computing and Service
A review on automatic image annotation techniques
Pattern Recognition
Automatic image annotation by mining the web
DaWaK'06 Proceedings of the 8th international conference on Data Warehousing and Knowledge Discovery
Hi-index | 0.01 |
Huge amount of manual efforts are required to annotate large image/video archives with text annotations. Several recent works attempted to automate this task by employing supervised learning approaches to associate visual information extracted in segmented images with semantic concepts provided by associated text. The main limitation of such approaches, however, is that large labeled training corpus is still needed for effective learning, and semantically meaningful segmentation for images is in general unavailable. This paper explores the use of bootstrapping approach to tackle this problem. The idea is to start from a small set of labeled training examples, and successively annotate a larger set of unlabeled examples. This is done using the cotraining approach, in which two "statistically independent" classifiers are used to co-train and co-annotate the unlabeled examples. An active learning approach is used to select the best examples to label at each stage of learning in order to maximize the learning objective. To accomplish this, we break the task of annotating images into the sub-tasks of: (a) segmenting images into meaningful units, (b) extracting appropriate features for the units, and (c) associating these features with text. Because of the uncertainty in sub-tasks (a) and (b), we adopt two independent segmentation methods (task a) and two independent sets of features (task b) to support co-training. We carried out experiments using a mid-sized image collection (comprising about 6,000 images from CorelCD, PhotoCD and Web) and demonstrated that our bootstrapping approach significantly improve the performance of annotation by about 10% in terms of F1 measure as compared to the best results obtained from the traditional supervised learning approach. Moreover, the bootstrapping approach has the key advantage of requiring much fewer labeled examples in training.