Enhanced representation and multi-task learning for image annotation

Authors:
Alexander Binder;Wojciech Samek;Klaus-Robert MüLler;Motoaki Kawanabe
Affiliations:
Machine Learning Group, Berlin Institute of Technology (TU Berlin), Marchstrasse 23, 10587 Berlin, Germany and Fraunhofer Institute FIRST, Kekuléstr. 7, 12489 Berlin, Germany;Machine Learning Group, Berlin Institute of Technology (TU Berlin), Marchstrasse 23, 10587 Berlin, Germany and Fraunhofer Institute FIRST, Kekuléstr. 7, 12489 Berlin, Germany;Machine Learning Group, Berlin Institute of Technology (TU Berlin), Marchstrasse 23, 10587 Berlin, Germany and Bernstein Focus: Neurotechnology Berlin, 10587 Berlin, Germany and Department of Brai ...;ATR Brain Information Communication Research Laboratory Group, 2-2-2 Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-0288, Japan and Fraunhofer Institute FIRST, Kekuléstr. 7, 12489 Berlin, German ...
Venue:
Computer Vision and Image Understanding
Year:
2013

Citing 28
Cited 0

A Model of Saliency-Based Visual Attention for Rapid Scene Analysis

IEEE Transactions on Pattern Analysis and Machine Intelligence
Saliency, Scale and Image Description

International Journal of Computer Vision
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Learning Multiple Tasks with Kernel Methods

The Journal of Machine Learning Research
Large Margin Methods for Structured and Interdependent Output Variables

The Journal of Machine Learning Research
Large Scale Multiple Kernel Learning

The Journal of Machine Learning Research
Sharing Visual Features for Multiclass and Multiview Object Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence
Speeded-Up Robust Features (SURF)

Computer Vision and Image Understanding
A Multiple Kernel Learning Approach to Joint Multi-class Object Detection

Proceedings of the 30th DAGM symposium on Pattern Recognition
Randomized Clustering Forests for Image Classification

IEEE Transactions on Pattern Analysis and Machine Intelligence
Multi-cue fusion for semantic video indexing

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Kernel Codebooks for Scene Categorization

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part III
New trends and ideas in visual concept detection: the MIR flickr retrieval evaluation initiative

Proceedings of the international conference on Multimedia information retrieval
Evaluating Color Descriptors for Object and Scene Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
The SHOGUN Machine Learning Toolbox

The Journal of Machine Learning Research
Efficient object category recognition using classemes

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
A discriminative latent model of object classes and attributes

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
lp-Norm Multiple Kernel Learning

The Journal of Machine Learning Research
Multi-task learning via non-sparse multiple kernel learning

CAIP'11 Proceedings of the 14th international conference on Computer analysis of images and patterns - Volume Part I
Images as sets of locally weighted features

Computer Vision and Image Understanding
Sampling strategies for bag-of-features image classification

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part IV
Efficient Additive Kernels via Explicit Feature Maps

IEEE Transactions on Pattern Analysis and Machine Intelligence
Evaluating knowledge transfer and zero-shot learning in a large-scale setting

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Contextualizing object detection and classification

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Real-Time Visual Concept Classification

IEEE Transactions on Multimedia
Empowering Visual Categorization With the GPU

IEEE Transactions on Multimedia
An introduction to kernel-based learning algorithms

IEEE Transactions on Neural Networks
On Taxonomies for Multi-class Image Categorization

International Journal of Computer Vision

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we propose a novel biased random sampling strategy for image representation in Bag-of-Words models. We evaluate its impact on the feature properties and the ranking quality for a set of semantic concepts and show that it improves performance of classifiers in image annotation tasks and increases the correlation between kernels and labels. As second contribution we propose a method called Output Kernel Multi-Task Learning (MTL) to improve ranking performance by transfer information between classes. The main advantages of output kernel MTL are that it permits asymmetric information transfer between tasks and scales to training sets of several thousand images. We give a theoretical interpretation of the method and show that the learned contributions of source tasks to target tasks are semantically consistent. Both strategies are evaluated on the ImageCLEF PhotoAnnotation dataset. Our best visual result which used the MTL method was ranked first according to mean Average Precision (mAP) within the purely visual submissions in the ImageCLEF 2011 PhotoAnnotation Challenge. Our multi-modal submission achieved the first rank by mAP among all submissions in the same competition.