Detection of visual concepts and annotation of images using ensembles of trees for hierarchical multi-label classification

  • Authors:
  • Ivica Dimitrovski;Dragi Kocev;Suzana Loskovska;Sašo Džeroski

  • Affiliations:
  • Department of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia and Department of Computer Science, Faculty of Electrical Engineering and Information Technology, Skopje, Mac ...;Department of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia;Department of Computer Science, Faculty of Electrical Engineering and,Information Technology, Skopje, Macedonia;Department of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia

  • Venue:
  • ICPR'10 Proceedings of the 20th International conference on Recognizing patterns in signals, speech, images, and videos
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we present a hierarchical multi-label classification system for visual concepts detection and image annotation. Hierarchical multi-label classification (HMLC) is a variant of classification where an instance may belong to multiple classes at the same time and these classes/labels are organized in a hierarchy. The system is composed of two parts: feature extraction and classification/annotation. The feature extraction part provides global and local descriptions of the images. These descriptions are then used to learn a classifier and to annotate an image with the corresponding concepts. To this end, we use predictive clustering trees (PCTs), which are able to classify target concepts that are organized in a hierarchy. Our approach to HMLC exploits the annotation hierarchy by building a single predictive clustering tree that can simultaneously predict all of the labels used to annotate an image. Moreover, we constructed ensembles (random forests) of PCTs, to improve the predictive performance. We tested our system on the image database from the ImageCLEF@ICPR 2010 photo annotation task. The extensive experiments conducted on the benchmark database show that our system has very high predictive performance and can be easily scaled to large number of visual concepts and large amounts of data.