Fashion-focused creative commons social dataset

  • Authors:
  • Babak Loni;Maria Menendez;Mihai Georgescu;Luca Galli;Claudio Massari;Ismail Sengor Altingovde;Davide Martinenghi;Mark Melenhorst;Raynor Vliegendhart;Martha Larson

  • Affiliations:
  • Delft University of Technology, The Netherlands;University of Trento, Italy;L3S Research Center, Germany;Polytechnic of Milan, Italy;Innovation Engineering, Italy;L3S Research Center, Germany;Polytechnic of Milan, Italy;Novay, The Netherlands;Delft University of Technology, The Netherlands;Delft University of Technology, The Netherlands

  • Venue:
  • Proceedings of the 4th ACM Multimedia Systems Conference
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this work, we present a fashion-focused Creative Commons dataset, which is designed to contain a mix of general images as well as a large component of images that are focused on fashion (i.e., relevant to particular clothing items or fashion accessories). The dataset contains 4810 images and related metadata. Furthermore, a ground truth on image's tags is presented. Ground truth generation for large-scale datasets is a necessary but expensive task. Traditional expert based approaches have become an expensive and non-scalable solution. For this reason, we turn to crowdsourcing techniques in order to collect ground truth labels; in particular we make use of the commercial crowdsourcing platform, Amazon Mechanical Turk (AMT). Two different groups of annotators (i.e., trusted annotators known to the authors and crowdsourcing workers on AMT) participated in the ground truth creation. Annotation agreement between the two groups is analyzed. Applications of the dataset in different contexts are discussed. This dataset contributes to research areas such as crowdsourcing for multimedia, multimedia content analysis, and design of systems that can elicit fashion preferences from users.