Can High-Level Concepts Fill the Semantic Gap in Video Retrieval? A Case Study With Broadcast News

Authors:
A. Hauptmann;Rong Yan;Wei-Hao Lin;M. Christel;H. Wactlar
Affiliations:
Carnegie Mellon Univ., Pittsburgh;-;-;-;-
Venue:
IEEE Transactions on Multimedia
Year:
2007

Citing 0
Cited 44

Correlative multilabel video annotation with temporal kernels
  ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)

Fusing semantics, observability, reliability and diversity of concept detectors for video search
  MM '08 Proceedings of the 16th ACM international conference on Multimedia

Study on the combination of video concept detectors
  MM '08 Proceedings of the 16th ACM international conference on Multimedia

Assessing concept selection for video retrieval
  MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval

I-Quest: an intelligent query structuring based on user browsing feedback for semantic retrieval of video data
  Multimedia Tools and Applications

Concept-Based Video Retrieval
  Foundations and Trends in Information Retrieval

Inferring semantic concepts from community-contributed images and noisy tags
  MM '09 Proceedings of the 17th ACM international conference on Multimedia

Concept detectors: how good is good enough?
  MM '09 Proceedings of the 17th ACM international conference on Multimedia

Distribution-based concept selection for concept-based video retrieval
  MM '09 Proceedings of the 17th ACM international conference on Multimedia

Unified video annotation via multigraph learning
  IEEE Transactions on Circuits and Systems for Video Technology

Reusing annotation labor for concept selection
  Proceedings of the ACM International Conference on Image and Video Retrieval

NUS-WIDE: a real-world web image database from National University of Singapore
  Proceedings of the ACM International Conference on Image and Video Retrieval

Beyond distance measurement: constructing neighborhood similarity for video annotation
  IEEE Transactions on Multimedia - Special section on communities and media computing

A lexica family with small semantic GAP
  ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo

Learning social tag relevance by neighbor voting
  IEEE Transactions on Multimedia

A Multi-Pronged Approach to Improving Semantic Extraction of News Video
  Journal of Signal Processing Systems

Content-based story segmentation of news video by multimodal analysis
  FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 7

Explicit and implicit concept-based video retrieval with bipartite graph propagation model
  Proceedings of the international conference on Multimedia

Shiatsu: semantic-based hierarchical automatic tagging of videos by segmentation using cuts
  Proceedings of the 3rd international workshop on Automated information extraction in media production

Efficient large-scale image data set exploration: visual concept network and image summarization
  MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part II

Bimodal fusion of low-level visual features and high-level semantic features for near-duplicate video clip detection
  Image Communication

Learning concept bundles for video search with complex queries
  MM '11 Proceedings of the 19th ACM international conference on Multimedia

Towards textually describing complex video contents with audio-visual concept classifiers
  MM '11 Proceedings of the 19th ACM international conference on Multimedia

Generating visual concept network from large-scale weakly-tagged images
  MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling

Double fusion for multimedia event detection
  MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling

Short communication: Towards a universal detector by mining concepts with small semantic gaps
  Expert Systems with Applications: An International Journal

Halfway through the semantic gap: Prosemantic features for image retrieval
  Information Sciences: an International Journal

Fusing concept detection and geo context for visual search
  Proceedings of the 2nd ACM International Conference on Multimedia Retrieval

Classifier-specific intermediate representation for multimedia tasks
  Proceedings of the 2nd ACM International Conference on Multimedia Retrieval

Multimodal knowledge-based analysis in multimedia event detection
  Proceedings of the 2nd ACM International Conference on Multimedia Retrieval

Simulating the future of concept-based video retrieval under improved detector performance
  Multimedia Tools and Applications

Knowledge adaptation for ad hoc multimedia event detection with few exemplars
  Proceedings of the 20th ACM international conference on Multimedia

Detection bank: an object detection based video representation for multimedia event recognition
  Proceedings of the 20th ACM international conference on Multimedia

Learning hybrid part filters for scene recognition
  ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V

Objects as attributes for scene classification
  ECCV'10 Proceedings of the 11th European conference on Trends and Topics in Computer Vision - Volume Part I

SHIATSU: tagging and retrieving videos without worries
  Multimedia Tools and Applications

An integrated semantic-based approach in concept based video retrieval
  Multimedia Tools and Applications

Recommendations for video event recognition using concept vocabularies
  Proceedings of the 3rd ACM conference on International conference on multimedia retrieval

We are not equally negative: fine-grained labeling for multimedia event detection
  Proceedings of the 21st ACM international conference on Multimedia

Beyond bag of words: image representation in sub-semantic space
  Proceedings of the 21st ACM international conference on Multimedia

Multimedia search reranking: A literature survey
  ACM Computing Surveys (CSUR)

The uncertain representation ranking framework for concept-based video retrieval
  Information Retrieval

Evaluating multimedia features and fusion for example-based event detection
  Machine Vision and Applications

Object Bank: An Object-Level Image Representation for High-Level Visual Recognition
  International Journal of Computer Vision

Abstract

A number of researchers have been building high-level semantic concept detectors, such as outdoors, face, and building, to help with semantic video retrieval. Our goal is to examine how many concepts would be needed and how they should be selected and used. Simulating the performance of video retrieval under different assumptions of concept detection accuracy, we find that good retrieval can be achieved even when detection accuracy is low, provided that sufficiently many concepts are combined. We also derive suggestions regarding the types of concepts that would be most helpful for a large concept lexicon. Since our user study finds that people cannot predict which concepts will help their query, we also suggest ways to find the best concepts to use. Ultimately, this paper concludes that "concept-based" video retrieval with fewer than 5000 concepts, detected with a minimal accuracy of 10% mean average precision, is likely to provide high-accuracy results in broadcast news retrieval.
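
The intuition that many weak detectors can still yield good retrieval when combined can be illustrated with a toy simulation. The sketch below is not the authors' experimental setup: it assumes, purely for illustration, that every hypothetical detector emits a score equal to the shot's true relevance plus independent Gaussian noise, and that detector scores are combined by a plain unweighted sum. The function names, the 5% relevance rate, and the noise level are all assumptions introduced here.

```python
import random


def average_precision(ranked_relevance):
    """Average precision of a ranked list of 0/1 (or bool) relevance labels."""
    hits = 0
    precision_sum = 0.0
    for rank, rel in enumerate(ranked_relevance, start=1):
        if rel:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0


def simulate(num_concepts, num_shots=2000, noise=2.0, seed=0):
    """Rank shots by the summed scores of noisy detectors and return the AP.

    Toy assumption: each detector score is the shot's true relevance (0 or 1)
    plus independent Gaussian noise with standard deviation `noise`.
    """
    rng = random.Random(seed)
    # Assumed ground truth: roughly 5% of shots are relevant to the query.
    relevant = [rng.random() < 0.05 for _ in range(num_shots)]
    scores = [0.0] * num_shots
    for _ in range(num_concepts):
        for i in range(num_shots):
            signal = 1.0 if relevant[i] else 0.0
            scores[i] += signal + rng.gauss(0.0, noise)
    # Higher combined score means the shot is ranked earlier.
    ranking = sorted(range(num_shots), key=lambda i: scores[i], reverse=True)
    return average_precision([relevant[i] for i in ranking])


if __name__ == "__main__":
    for k in (1, 10, 100):
        print(k, "detectors -> AP", round(simulate(k), 3))
```

Under these assumptions, the noise in the summed score grows more slowly than the signal as detectors are added, so the ranking improves with the number of detectors. Real concept detectors are neither independent nor equally related to every query, so this only illustrates the averaging effect, not the paper's quantitative findings.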