Using tagged images of low visual ambiguity to boost the learning efficiency of object detectors

  • Authors:
  • Elisavet Chatzilari

  • Affiliations:
  • ITI-CERTH, Thessaloniki, Greece & CVSSP, University of Surrey, Guildford, UK

  • Venue:
  • Proceedings of the 21st ACM international conference on Multimedia
  • Year:
  • 2013

Abstract

Motivated by the abundance of user-generated multimedia content, this work presents a data augmentation approach that enhances an initial, manually labelled training set with regions from user-tagged images. First, object detection classifiers are trained on a small number of manually labelled regions. Then, a set of positive regions is automatically selected from a large pool of loosely tagged images, pre-segmented by an automatic segmentation algorithm, and used to enhance the initial training set. To overcome the noisy nature of user-tagged images and the lack of pixel-level annotations, the main contribution of this work is the introduction of a visual ambiguity term. Visual ambiguity is caused by the visual similarity of semantically dissimilar concepts with respect to the employed visual representation and analysis system (i.e. segmentation, feature space, classifier), and it is modelled here so that images in which ambiguous concepts co-exist are penalized. Preliminary experimental results show that the visual ambiguity term steers the selection process away from ambiguous images and, as a result, allows better separation between the targeted true positive regions and the undesired negative ones.
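
Since the abstract only sketches the augmentation pipeline, the following Python snippet gives a minimal, illustrative reconstruction of the selection loop under stated assumptions: train an initial classifier on manually labelled regions, score pre-segmented regions from images tagged with the target concept, down-weight regions from images whose tags include concepts ambiguous with the target, and retrain on the top-scoring regions. The feature dimensionality, the SVC classifier, the ambiguity table, and helpers such as image_penalty are assumptions made for the sketch, not the paper's actual components.

```python
# Minimal sketch of the ambiguity-penalized region-selection loop described
# in the abstract. All concrete names and values are illustrative assumptions.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Manually labelled seed regions: feature vectors + binary labels for one concept.
X_seed = rng.normal(size=(40, 16))
y_seed = rng.integers(0, 2, size=40)

# Pre-segmented regions from loosely tagged images: one feature matrix per
# image, plus the set of user tags attached to that image.
tagged_images = [
    {"regions": rng.normal(size=(5, 16)), "tags": {"sea", "sky"}},
    {"regions": rng.normal(size=(4, 16)), "tags": {"sea"}},
]

# Pairwise visual-ambiguity scores between concepts (assumed given here; the
# paper derives them from the visual similarity of semantically dissimilar
# concepts under the employed segmentation/feature space/classifier).
ambiguity = {frozenset({"sea", "sky"}): 0.9}

TARGET = "sea"

def image_penalty(tags):
    """Penalty for images where concepts ambiguous with the target co-exist."""
    return max((ambiguity.get(frozenset({TARGET, t}), 0.0)
                for t in tags if t != TARGET), default=0.0)

# 1) Train an initial classifier on the manually labelled regions.
clf = SVC(probability=True).fit(X_seed, y_seed)

# 2) Score every region of every image tagged with the target concept,
#    down-weighting regions that come from visually ambiguous images.
candidates = []
for img in tagged_images:
    if TARGET not in img["tags"]:
        continue
    scores = clf.predict_proba(img["regions"])[:, 1] * (1.0 - image_penalty(img["tags"]))
    for region, s in zip(img["regions"], scores):
        candidates.append((s, region))

# 3) Keep the highest-scoring regions as new positives and retrain.
candidates.sort(key=lambda c: c[0], reverse=True)
new_pos = np.array([r for _, r in candidates[:3]])
X_aug = np.vstack([X_seed, new_pos])
y_aug = np.concatenate([y_seed, np.ones(len(new_pos), dtype=int)])
clf = SVC(probability=True).fit(X_aug, y_aug)
```

In this sketch the penalty multiplies the classifier score, so regions from an image whose tags pair the target with a highly ambiguous concept are pushed down the ranking and rarely enter the augmented training set, mirroring the penalization behaviour the abstract describes.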