Fashion-focused creative commons social dataset

Authors:
Babak Loni;Maria Menendez;Mihai Georgescu;Luca Galli;Claudio Massari;Ismail Sengor Altingovde;Davide Martinenghi;Mark Melenhorst;Raynor Vliegendhart;Martha Larson
Affiliations:
Delft University of Technology, The Netherlands;University of Trento, Italy;L3S Research Center, Germany;Polytechnic of Milan, Italy;Innovation Engineering, Italy;L3S Research Center, Germany;Polytechnic of Milan, Italy;Novay, The Netherlands;Delft University of Technology, The Netherlands;Delft University of Technology, The Netherlands
Venue:
Proceedings of the 4th ACM Multimedia Systems Conference
Year:
2013

Citing 18
Cited 1

Labeling images with a computer game
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems

Peekaboom: a game for locating objects in images
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems

Games with a Purpose
Computer

How reliable are annotations via crowdsourcing: a study about inter-annotator agreement for multi-label image annotation
Proceedings of the international conference on Multimedia information retrieval

Developing metrics to characterize Flickr groups
Journal of the American Society for Information Science and Technology

Human computation: a survey and taxonomy of a growing field
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems

Content redundancy in YouTube and its application to video tagging
ACM Transactions on Information Systems (TOIS)

Improved video categorization from text metadata and user comments
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval

Minority voices of crowdsourcing: why we should pay attention to every member of the crowd
Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work Companion

Leveraging user comments for aesthetic aware image search reranking
Proceedings of the 21st international conference on World Wide Web

The CUBRIK project: human-enhanced time-aware multimedia search
Proceedings of the 21st international conference companion on World Wide Web

Fashion coordinates recommender system using photographs from fashion magazines
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Three

Parsing clothing in fashion photographs
CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The Community and the Crowd: Multimedia Benchmark Dataset Development
IEEE MultiMedia

Hi, magic closet, tell me what to wear!
Proceedings of the 20th ACM international conference on Multimedia

A Draw-and-Guess Game to Segment Images
SOCIALCOM-PASSAT '12 Proceedings of the 2012 ASE/IEEE International Conference on Social Computing and 2012 ASE/IEEE International Conference on Privacy, Security, Risk and Trust

Can social features help learning to rank youtube videos?
WISE'12 Proceedings of the 13th international conference on Web Information Systems Engineering

Increasing cheat robustness of crowdsourcing tasks
Information Retrieval

Fashion 10000: an enriched social image dataset for fashion and clothing
Proceedings of the 5th ACM Multimedia Systems Conference

Abstract

In this work, we present a fashion-focused Creative Commons dataset, which is designed to contain a mix of general images as well as a large component of images that are focused on fashion (i.e., relevant to particular clothing items or fashion accessories). The dataset contains 4810 images and related metadata. Furthermore, ground truth for the images' tags is presented. Ground truth generation for large-scale datasets is a necessary but expensive task. Traditional expert-based approaches are costly and do not scale. For this reason, we turn to crowdsourcing techniques to collect ground truth labels; in particular, we make use of the commercial crowdsourcing platform Amazon Mechanical Turk (AMT). Two different groups of annotators (i.e., trusted annotators known to the authors and crowdsourcing workers on AMT) participated in the ground truth creation. Annotation agreement between the two groups is analyzed. Applications of the dataset in different contexts are discussed. This dataset contributes to research areas such as crowdsourcing for multimedia, multimedia content analysis, and the design of systems that can elicit fashion preferences from users.
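
The agreement analysis between the two annotator groups can be illustrated with a minimal sketch. The abstract does not state which agreement measure or aggregation rule was used, so the choices here are assumptions made purely for illustration: AMT worker labels are aggregated per image by majority vote, and agreement with the trusted annotators is measured with Cohen's kappa. The function names and the toy labels are hypothetical and are not taken from the dataset.

```python
from collections import Counter


def majority_vote(labels):
    """Return the most frequent label among one image's worker labels.

    Assumption: simple majority voting; ties are resolved arbitrarily.
    """
    return Counter(labels).most_common(1)[0][0]


def cohen_kappa(a, b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(a) != len(b) or not a:
        raise ValueError("label sequences must be non-empty and equal in length")
    n = len(a)
    # Observed agreement: fraction of items on which both raters agree.
    observed = sum(x == y for x, y in zip(a, b)) / n
    # Chance agreement: computed from each rater's label frequencies.
    count_a, count_b = Counter(a), Counter(b)
    expected = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (observed - expected) / (1.0 - expected)


# Hypothetical toy data: "is this image fashion-related?" for six images.
trusted = ["yes", "yes", "no", "no", "yes", "no"]
amt_votes = [
    ["yes", "yes", "no"],
    ["yes", "no", "yes"],
    ["no", "no", "no"],
    ["yes", "yes", "no"],
    ["yes", "yes", "yes"],
    ["no", "no", "yes"],
]
amt = [majority_vote(votes) for votes in amt_votes]
kappa = cohen_kappa(trusted, amt)
print(f"kappa = {kappa:.3f}")
```

In this toy example the two groups agree on five of six images, and kappa corrects that raw agreement for the agreement expected by chance. Other measures (for example, Fleiss' kappa across individual workers) would be equally plausible choices.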