An active learning framework for content-based information retrieval

Authors:
Cha Zhang;Tsuhan Chen
Affiliations:
Dept. of Electr. & Eng., Carnegie Mellon Univ., Pittsburgh, PA;-
Venue:
IEEE Transactions on Multimedia
Year:
2002

Citing 0
Cited 32

A bootstrapping approach to annotating large image collection

MIR '03 Proceedings of the 5th ACM SIGMM international workshop on Multimedia information retrieval
Active learning using pre-clustering

ICML '04 Proceedings of the twenty-first international conference on Machine learning
A bootstrapping framework for annotating and retrieving WWW images

Proceedings of the 12th annual ACM international conference on Multimedia
Active feedback in ad hoc information retrieval

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Enhancing relevance feedback in image retrieval using unlabeled data

ACM Transactions on Information Systems (TOIS)
Real-time computerized annotation of pictures

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Region-based image retrieval using an object ontology and relevance feedback

EURASIP Journal on Applied Signal Processing
Calligraphic Interfaces: Classifier combination for sketch-based 3D part retrieval

Computers and Graphics
Content-based object movie retrieval and relevance feedbacks

EURASIP Journal on Advances in Signal Processing
Active learning for constructing transliteration lexicons from the Web

Journal of the American Society for Information Science and Technology
Semantic force relevance feedback, content-free 3D object retrieval and annotation propagation: bridging the gap and beyond

Multimedia Tools and Applications
Content-based image retrieval with the normalized information distance

Computer Vision and Image Understanding
Exploring multimedia in a keyword space

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Learning to segment from a few well-selected training images

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Semi-automatic dynamic auxiliary-tag-aided image annotation

Pattern Recognition
Optimistic active learning using mutual information

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Three-dimensional shape searching: state-of-the-art review and future trends

Computer-Aided Design
Unsupervised active learning based on hierarchical graph-theoretic clustering

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Shape-Based Autotagging of 3D Models for Retrieval

SAMT '09 Proceedings of the 4th International Conference on Semantic and Digital Media Technologies: Semantic Multimedia
A semantic learning for content-based image retrieval using analytical hierarchy process

Expert Systems with Applications: An International Journal
A novel traffic analysis for identifying search fields in the long tail of web sites

Proceedings of the 19th international conference on World wide web
Active learning for regression based on query by committee

IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
An effective procedure exploiting unlabeled data to build monitoring system

Expert Systems with Applications: An International Journal
Active learning and subspace clustering for anomaly detection

Intelligent Data Analysis
Evaluating the impact of coder errors on active learning

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Coached active learning for interactive video search

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Human activity language: grounding concepts with a linguistic framework

SAMT'06 Proceedings of the First international conference on Semantic and Digital Media Technologies
A probability-based unified 3d shape search

SAMT'06 Proceedings of the First international conference on Semantic and Digital Media Technologies
An image retrieval scheme using multi-instance and pseudo image concepts

PCM'04 Proceedings of the 5th Pacific Rim conference on Advances in Multimedia Information Processing - Volume Part I
Biased minimax probability machine active learning for relevance feedback in content-based image retrieval

IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
Sketch-based 3D engineering part class browsing and retrieval

SBM'06 Proceedings of the Third Eurographics conference on Sketch-Based Interfaces and Modeling
Content-based retrieval of human actions from realistic video databases

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a general active learning framework for content-based information retrieval. We use this framework to guide hidden annotations in order to improve the retrieval performance. For each object in the database, we maintain a list of probabilities, each indicating the probability of this object having one of the attributes. During training, the learning algorithm samples objects in the database and presents them to the annotator to assign attributes. For each sampled object, each probability is set to be one or zero depending on whether or not the corresponding attribute is assigned by the annotator. For objects that have not been annotated, the learning algorithm estimates their probabilities with biased kernel regression. Knowledge gain is then defined to determine, among the objects that have not been annotated, which one the system is the most uncertain. The system then presents it as the next sample to the annotator to which it is assigned attributes. During retrieval, the list of probabilities works as a feature vector for us to calculate the semantic distance between two objects, or between the user query and an object in the database. The overall distance between two objects is determined by a weighted sum of the semantic distance and the low-level feature distance. The algorithm is tested on both synthetic databases and real databases of 3D models. In both cases, the retrieval performance of the system improves rapidly with the number of annotated samples. Furthermore, we show that active learning outperforms learning based on random sampling.