Automatic tagging and geotagging in video collections and communities

Authors:
Martha Larson;Mohammad Soleymani;Pavel Serdyukov;Stevan Rudinac;Christian Wartena;Vanessa Murdock;Gerald Friedland;Roeland Ordelman;Gareth J. F. Jones
Affiliations:
Delft University of Technology, Delft, the Netherlands;University of Geneva, Geneva, Switzerland;Yandex Moscow, Russia;Delft University of Technology;Novay, Enschede, Netherlands;Yahoo! Research Barcelona, Barcelona, Spain;International Computer Science Institute, Berkeley, CA;Netherlands Institute for Sound and Vision, and University of Twente;Dublin City University, Dublin, Ireland
Venue:
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Year:
2011

Citing 30
Cited 23

Inductive learning algorithms and representations for text categorization

Proceedings of the seventh international conference on Information and knowledge management
Document language models, query models, and risk minimization for information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Model-based feedback in the language modeling approach to information retrieval

Proceedings of the tenth international conference on Information and knowledge management
SVM Classification Using Sequences of Phonemes and Syllables

PKDD '02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery
Topic detection and tracking: event-based information organization

Topic detection and tracking: event-based information organization
Dialogue act modeling for automatic tagging and recognition of conversational speech

Computational Linguistics
Web-a-where: geotagging web content

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing)

Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing)
Techniques for information retrieval from voice messages

ICASSP '91 Proceedings of the Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference
Evaluation campaigns and TRECVid

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Improving text classification for oral history archives with temporal domain knowledge

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Flickr tag recommendation based on collective knowledge

Proceedings of the 17th international conference on World Wide Web
Social tag prediction

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Overview of the CLEF-2007 Cross-Language Speech Retrieval Track

Advances in Multilingual and Multimodal Information Retrieval
Speech Processing for Audio Indexing

GoTAL '08 Proceedings of the 6th international conference on Advances in Natural Language Processing
Topic Detection by Clustering Keywords

DEXA '08 Proceedings of the 2008 19th International Conference on Database and Expert Systems Application
Methods for extracting place semantics from Flickr tags

ACM Transactions on the Web (TWEB)
Parallel neural networks for multimodal video genre classification

Multimedia Tools and Applications
Mapping the world's photos

Proceedings of the 18th international conference on World wide web
Placing flickr photos on a map

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Domain-specific keyphrase extraction

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Pairwise interaction tensor factorization for personalized tag recommendation

Proceedings of the third ACM international conference on Web search and data mining
Annotation of heterogeneous multimedia content using automatic speech recognition

SAMT'07 Proceedings of the semantic and digital media technologies 2nd international conference on Semantic Multimedia
DCU at VideoClef 2008

CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
You are where you tweet: a content-based approach to geo-locating twitter users

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Impact of spontaneous speech features on business concept detection: a study of call-centre data.

Proceedings of the 2010 international workshop on Searching spontaneous conversational speech
Keyword Extraction Using Word Co-occurrence

DEXA '10 Proceedings of the 2010 Workshops on Database and Expert Systems Applications
Finding locations of flickr resources using language models and similarity search

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Multi-modal, multi-resource methods for placing Flickr videos on the map

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Methods for extracting place semantics from Flickr tags

ACM Transactions on the Web (TWEB)

Finding locations of flickr resources using language models and similarity search

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Multi-modal, multi-resource methods for placing Flickr videos on the map

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Video2GPS: a demo of multimodal location estimation on flickr videos

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Multimodal location estimation on Flickr videos

WSM '11 Proceedings of the 3rd ACM SIGMM international workshop on Social media
A hierarchical, multi-modal approach for placing videos on the map using millions of Flickr photographs

SBNMA '11 Proceedings of the 2011 ACM workshop on Social and behavioural networked media access
Cross-modal categorisation of user-generated video sequences

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
A visual approach for video geocoding using bag-of-scenes

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Spoken Content Retrieval: A Survey of Techniques and Technologies

Foundations and Trends in Information Retrieval
State of the Geotag: where are we?

Proceedings of the ACM multimedia 2012 workshop on Geotagging and its applications in multimedia
Pushing the limits of mechanical turk: qualifying the crowd for video geo-location

Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia
Intent and its discontents: the user at the wheel of the online video search engine

Proceedings of the 20th ACM international conference on Multimedia
GeoMM'12: ACM international workshop on geotagging and its applications in multimedia

Proceedings of the 20th ACM international conference on Multimedia
Georeferencing Flickr photos using language models at different levels of granularity: An evidence based approach

Web Semantics: Science, Services and Agents on the World Wide Web
Multimedia multimodal geocoding

Proceedings of the 20th International Conference on Advances in Geographic Information Systems
Automatic image tagging using two-layered Bayesian networks and mobile data from smart phones

Proceedings of the 10th International Conference on Advances in Mobile Computing & Multimedia
@Phillies Tweeting from Philly? Predicting Twitter User Locations with Spatial Word Usage

ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)
Exploiting user comments for audio-visual content indexing and retrieval

ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Geo-visual ranking for location prediction of social images

Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
Boosting retrieval of digital spoken content

KES'12 Proceedings of the 16th international conference on Knowledge Engineering, Machine Learning and Lattice Computing with Applications
Blip10000: a social video dataset containing SPUG content for tagging and retrieval

Proceedings of the 4th ACM Multimedia Systems Conference
Human vs machine: establishing a human baseline for multimodal location estimation

Proceedings of the 21st ACM international conference on Multimedia
A novel fusion method for integrating multiple modalities and knowledge for multimodal location estimation

Proceedings of the 2nd ACM international workshop on Geotagging and its applications in multimedia
A mobile picture tagging system using tree-structured layered Bayesian networks

Mobile Information Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Automatically generated tags and geotags hold great promise to improve access to video collections and online communities. We overview three tasks offered in the MediaEval 2010 benchmarking initiative, for each, describing its use scenario, definition and the data set released. For each task, a reference algorithm is presented that was used within MediaEval 2010 and comments are included on lessons learned. The Tagging Task, Professional involves automatically matching episodes in a collection of Dutch television with subject labels drawn from the keyword thesaurus used by the archive staff. The Tagging Task, Wild Wild Web involves automatically predicting the tags that are assigned by users to their online videos. Finally, the Placing Task requires automatically assigning geo-coordinates to videos. The specification of each task admits the use of the full range of available information including user-generated metadata, speech recognition transcripts, audio, and visual features.