Improved video categorization from text metadata and user comments

Authors:
Katja Filippova;Keith B. Hall
Affiliations:
Google Inc., Zurich, Switzerland;Google Inc., Zurich, Switzerland
Venue:
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Year:
2011

Citing 15
Cited 10

Automatic recognition of film genres

Proceedings of the third ACM international conference on Multimedia
Combining labeled and unlabeled data with co-training

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
News video classification using SVM-based multimodal classifiers and combination strategies

Proceedings of the tenth ACM international conference on Multimedia
Latent dirichlet allocation

The Journal of Machine Learning Research
Bridging the Gap: A Genre Analysis of Weblogs

HICSS '04 Proceedings of the Proceedings of the 37th Annual Hawaii International Conference on System Sciences (HICSS'04) - Track 4 - Volume 4
Hierarchical clustering of WWW image search results using visual, textual and link information

Proceedings of the 12th annual ACM international conference on Multimedia
A bootstrapping framework for annotating and retrieving WWW images

Proceedings of the 12th annual ACM international conference on Multimedia
Joint categorization of queries and clips for web-based video search

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Multi-modality web video categorization

Proceedings of the international workshop on Workshop on multimedia information retrieval
Predicting response to political blog posts with topic models

NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Video2Text: Learning to Annotate Video Content

ICDMW '09 Proceedings of the 2009 IEEE International Conference on Data Mining Workshops
Text-based video content classification for online video-sharing sites

Journal of the American Society for Information Science and Technology
Content-enriched classifier for web video classification

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Distributed training strategies for the structured perceptron

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Detecting controversial events from twitter

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management

Model news relatedness through user comments

Proceedings of the 21st international conference companion on World Wide Web
Effective web video clustering using playlist information

Proceedings of the 27th Annual ACM Symposium on Applied Computing
SocialTransfer: cross-domain transfer learning from social streams for media applications

Proceedings of the 20th ACM international conference on Multimedia
On the prediction of popularity of trends and hits for user generated videos

Proceedings of the sixth ACM international conference on Web search and data mining
Exploiting user comments for audio-visual content indexing and retrieval

ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Features with feelings: incorporating user preferences in video categorization

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part III
Fashion-focused creative commons social dataset

Proceedings of the 4th ACM Multimedia Systems Conference
Leveraging viewer comments for mood classification of music video clips

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Enriching media fragments with named entities for video classification

Proceedings of the 22nd international conference on World Wide Web companion
Comment-based multi-view clustering of web 2.0 items

Proceedings of the 23rd international conference on World wide web

Quantified Score

Hi-index	0.00

Visualization

Abstract

We consider the task of assigning categories (e.g., howto/cooking, sports/basketball, pet/dogs) to YouTube videos from video and text signals. We show that two complementary views on the data -- from the video and text perspectives -- complement each other and refine predictions. The contributions of the paper are threefold: (1) we show that a text-based classifier trained on imperfect predictions of the weakly supervised video content-based classifier is not redundant; (2) we demonstrate that a simple model which combines the predictions made by the two classifiers outperforms each of them taken independently; (3) we analyse such sources of text information as video title, description, user tags and viewers' comments and show that each of them provides valuable clues to the topic of the video.