Compressing tags to find interesting media groups

Authors:
Matthijs van Leeuwen;Francesco Bonchi;Börkur Sigurbjörnsson;Arno Siebes
Affiliations:
Universiteit Utrecht, Utrecht, Netherlands;Yahoo! Research, Barcelona, Spain;Yahoo! Research, Barcelona, Spain;Universiteit Utrecht, Utrecht, Netherlands
Venue:
Proceedings of the 18th ACM conference on Information and knowledge management
Year:
2009

Abstract

On photo sharing websites like Flickr and Zooomr, users can assign tags to their uploaded pictures. Using these tags to find interesting groups of semantically related pictures in the result set of a given query is a problem with obvious applications. We analyse this problem from a Minimum Description Length (MDL) perspective and develop an algorithm that finds the most interesting groups. The method is based on Krimp, which uses compression to find small sets of patterns that characterise the data. These patterns are sets of tags that are often assigned together to photos. The better a database compresses, the more structure it contains and thus the more homogeneous it is. Following this observation, we devise a compression-based measure. Our experiments on Flickr data show that the method finds the most interesting and homogeneous groups. We show extensive examples and compare the results to the clusterings on the Flickr website.
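The link between compression and homogeneity can be illustrated with a minimal sketch. It is not the Krimp algorithm or the measure defined in the paper: the function names (`cover`, `encoded_size`, `compression_ratio`), the greedy cover order, and the example tags are all assumptions made for illustration, and the cost of storing the pattern table itself, which an MDL approach such as Krimp also counts, is ignored here. The sketch encodes each photo's tag set with patterns from a table, assigns Shannon-optimal code lengths based on how often each pattern is used, and compares the result to encoding every tag on its own.

```python
from collections import Counter
from math import log2


def cover(transaction, code_table):
    """Greedily cover a tag set with code-table patterns, in table order."""
    remaining = set(transaction)
    used = []
    for pattern in code_table:
        if pattern <= remaining:
            used.append(pattern)
            remaining -= pattern
    # Tags not covered by any pattern fall back to singleton codes.
    used.extend(frozenset([tag]) for tag in remaining)
    return used


def encoded_size(transactions, code_table):
    """Bits needed to encode all tag sets, given usage-based code lengths."""
    usage = Counter()
    for transaction in transactions:
        usage.update(cover(transaction, code_table))
    total = sum(usage.values())
    return -sum(n * log2(n / total) for n in usage.values())


def compression_ratio(transactions, patterns):
    """Encoded size with patterns, relative to a singleton-only baseline.

    Lower values mean the group compresses better (illustrative measure only).
    """
    singles = sorted(
        {frozenset([tag]) for t in transactions for tag in t}, key=sorted
    )
    baseline = encoded_size(transactions, singles)
    table = sorted(patterns, key=len, reverse=True) + singles
    return encoded_size(transactions, table) / baseline


# Hypothetical example data: one group of near-identical photos, one mixed group.
homogeneous = [{"paris", "eiffel", "tower"}] * 4
mixed = [{"paris", "eiffel"}, {"dog", "park"}, {"sunset", "beach"}, {"car", "road"}]

ratio_homogeneous = compression_ratio(homogeneous, [frozenset({"paris", "eiffel", "tower"})])
ratio_mixed = compression_ratio(mixed, [frozenset({"paris", "eiffel"})])
```

Under these assumptions, the group in which the same tags recur together compresses far better than the mixed group, which is the intuition behind treating compressibility as a sign of homogeneity.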