Learning concept bundles for video search with complex queries

  • Authors:
  • Jin Yuan; Zheng-Jun Zha; Yan-Tao Zheng; Meng Wang; Xiangdong Zhou; Tat-Seng Chua

  • Affiliations:
  • School of Computing, National University of Singapore, Singapore; School of Computing, National University of Singapore, Singapore; Institute for Infocomm Research, Singapore; School of Computing, National University of Singapore, Singapore; Department of Computing, Fudan University, Shanghai, China; School of Computing, National University of Singapore, Singapore

  • Venue:
  • MM '11 Proceedings of the 19th ACM international conference on Multimedia
  • Year:
  • 2011

Abstract

Classifiers for primitive visual concepts such as "car" and "sky" have been well developed and are widely used to support video search with simple queries. However, they are usually ineffective for complex queries such as "one or more people at a table or desk with a computer visible", whose semantics are far richer than a simple aggregation of the meanings of their constituent primitive concepts. To facilitate video search with complex queries, we propose a higher-level semantic descriptor named the "concept bundle", which integrates multiple primitive concepts, e.g., "(soccer, fighting)" or "(lion, hunting, zebra)", to describe the visual representation of complex semantics. The proposed approach first selects informative concept bundles automatically. It then builds a novel concept bundle classifier based on multi-task learning, exploiting the relatedness between a concept bundle and its primitive concepts. To model a complex query, it applies an optimal selection strategy that chooses related primitive concepts and concept bundles by considering both their classifier performance and their semantic relatedness to the query. The final results are generated by fusing the individual results from the selected primitive concepts and concept bundles. Extensive experiments are conducted on two video datasets, TRECVID 2008 and YouTube. The results indicate that: (a) our concept bundle learning approach outperforms state-of-the-art methods by at least 19% on TRECVID 2008 and 29% on YouTube; and (b) the use of concept bundles improves search performance for complex queries by at least 37.5% on TRECVID 2008 and 52% on YouTube.
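The query-modeling step described above — scoring candidate descriptors by a combination of classifier performance and semantic relatedness, then fusing the results of the selected ones — can be sketched roughly as follows. This is an illustrative toy, not the paper's implementation: the linear weighting `alpha`, the averaging fusion, and all names, numbers, and scores below are hypothetical.

```python
# Hypothetical sketch of descriptor selection + score fusion for a
# complex query. Not the authors' method: the linear combination and
# average fusion are simplifying assumptions, and the data is made up.

def select_descriptors(candidates, alpha=0.5, top_k=2):
    """Rank candidate primitive concepts / concept bundles by a weighted
    combination of (a) semantic relatedness to the query and
    (b) classifier performance (e.g. average precision); keep top_k."""
    ranked = sorted(
        candidates,
        key=lambda c: alpha * c["relatedness"] + (1 - alpha) * c["performance"],
        reverse=True,
    )
    return ranked[:top_k]

def fuse_scores(selected, video_ids):
    """Fuse per-video detector scores from the selected descriptors
    with a simple average (one of many possible fusion schemes)."""
    return {
        v: sum(c["scores"][v] for c in selected) / len(selected)
        for v in video_ids
    }

# Toy candidates for a query like "people at a desk with a computer":
candidates = [
    {"name": "computer",       "relatedness": 0.9, "performance": 0.6,
     "scores": {"v1": 0.8, "v2": 0.3}},
    {"name": "(people, desk)", "relatedness": 0.8, "performance": 0.7,
     "scores": {"v1": 0.7, "v2": 0.5}},
    {"name": "sky",            "relatedness": 0.1, "performance": 0.9,
     "scores": {"v1": 0.2, "v2": 0.9}},
]

selected = select_descriptors(candidates, alpha=0.7, top_k=2)
fused = fuse_scores(selected, ["v1", "v2"])
# "sky" is dropped despite its strong classifier because it is
# unrelated to the query; v1 ranks above v2 in the fused result.
```

In this toy, weighting relatedness at 0.7 keeps the query-relevant "computer" and "(people, desk)" descriptors and discards "sky", so the fused ranking follows query semantics rather than raw classifier strength.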