How many high-level concepts will fill the semantic gap in news video retrieval?

Authors:
Alexander Hauptmann;Rong Yan;Wei-Hao Lin
Affiliations:
School of Computer Science, Pittsburgh, PA;School of Computer Science, Pittsburgh, PA;School of Computer Science, Pittsburgh, PA
Venue:
Proceedings of the 6th ACM international conference on Image and video retrieval
Year:
2007

Citing 20
Cited 31

Foundations of statistical natural language processing

Foundations of statistical natural language processing
Content-Based Image Retrieval at the End of the Early Years

IEEE Transactions on Pattern Analysis and Machine Intelligence
Does organisation by similarity assist image browsing?

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
End-User Searching Challenges Indexing Practices inthe Digital Newspaper Photo Archive

Information Retrieval
Global Optimization by Multilevel Coordinate Search

Journal of Global Optimization
Lessons Learned from Building a Terabyte Digital Video Library

Computer
News video classification using SVM-based multimodal classifiers and combination strategies

Proceedings of the tenth ACM international conference on Multimedia
Proceedings of the International Conference on Image and Video Retrieval

CIVR '02 Proceedings of the International Conference on Image and Video Retrieval
Automatic image annotation and retrieval using cross-media relevance models

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Matching words and pictures

The Journal of Machine Learning Research
The combination limit in multimedia retrieval

MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Optimal multimodal fusion for multimedia data analysis

Proceedings of the 12th annual ACM international conference on Multimedia
On the detection of semantic concepts at TRECVID

Proceedings of the 12th annual ACM international conference on Multimedia
Visual Concepts for News Story Tracking: Analyzing and Exploiting the NIST TRECVID Video Annotation Experiment

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Surrogate scoring for improved metasearch precision

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Learning the semantics of multimedia queries and concepts from a small number of examples

Proceedings of the 13th annual ACM international conference on Multimedia
Large-Scale Concept Ontology for Multimedia

IEEE MultiMedia
The challenge problem for automated detection of 101 semantic concepts in multimedia

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Probabilistic models for combining diverse knowledge sources in multimedia retrieval

Probabilistic models for combining diverse knowledge sources in multimedia retrieval
TRECVID: benchmarking the effectiveness of information retrieval tasks on digital video

CIVR'03 Proceedings of the 2nd international conference on Image and video retrieval

Ontology-enriched semantic space for video search

Proceedings of the 15th international conference on Multimedia
The evolution of visual information retrieval

Journal of Information Science
Exploring multimedia in a keyword space

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Boosting image retrieval through aggregating search results based on visual annotations

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Using Second Order Statistics to Enhance Automated Image Annotation

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Image categorization combining neighborhood methods and boosting

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Identifying news videos' ideological perspectives using emphatic patterns of visual concepts

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Image annotation using clickthrough data

Proceedings of the ACM International Conference on Image and Video Retrieval
Why meaningful automatic tagging of images is very hard

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Multigraph-based query-independent learning for video search

IEEE Transactions on Circuits and Systems for Video Technology
A Multi-Pronged Approach to Improving Semantic Extraction of News Video

Journal of Signal Processing Systems
Learning automatic concept detectors from online video

Computer Vision and Image Understanding
Today's and tomorrow's retrieval practice in the audiovisual archive

Proceedings of the ACM International Conference on Image and Video Retrieval
Everyday concept detection in visual lifelogs: validation, relationships and trends

Multimedia Tools and Applications
Investigating fuzzy DLs-based reasoning in semantic image analysis

Multimedia Tools and Applications
Supervised reranking for web image search

Proceedings of the international conference on Multimedia
Instant customized summaries streaming: a service for immediate awareness of new video content

AMR'09 Proceedings of the 7th international conference on Adaptive multimedia retrieval: understanding media and adapting to the user
Development and evaluation of a multifaceted magazine image categorization model

Journal of the American Society for Information Science and Technology
A survey of semantic image and video annotation tools

Knowledge-driven multimedia information extraction and ontology evolution
Passively recognising human activities through lifelogging

Computers in Human Behavior
VisionGo: Towards video retrieval with joint exploration of human and computer

Information Sciences: an International Journal
Reliability and effectiveness of clickthrough data for automatic image annotation

Multimedia Tools and Applications
On the spatial extents of SIFT descriptors for visual concept detection

ICVS'11 Proceedings of the 8th international conference on Computer vision systems
Classification of semantic concepts to support the analysis of the inter-cultural visual repertoires of TV news reviews

KI'11 Proceedings of the 34th Annual German conference on Advances in artificial intelligence
Why did the prime minister resign?: generation of event explanations from large news repositories

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Using manual and automated annotations to search images by semantic similarity

Multimedia Tools and Applications
Building semantic hierarchies faithful to image semantics

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Multimodal video concept detection via bag of auditory words and multiple kernel learning

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Halfway through the semantic gap: Prosemantic features for image retrieval

Information Sciences: an International Journal
Assistive tagging: A survey of multimedia tagging with human-computer joint exploration

ACM Computing Surveys (CSUR)
There is no data like less data: percepts for video concept detection on consumer-produced media

Proceedings of the 2012 ACM international workshop on Audio and multimedia methods for large-scale video analysis

Quantified Score

Hi-index	0.00

Visualization

Abstract

A number of researchers have been building high-level semantic concept detectors such as outdoors, face, building, etc., to help with semantic video retrieval. Using the TRECVID video collection and LSCOM truth annotations from 300 concepts, we simulate performance of video retrieval under different assumptions of concept detection accuracy. Even low detection accuracy provides good retrieval results, when sufficiently many concepts are used. Considering this extrapolation under reasonable assumptions, this paper arrives at the conclusion that "concept-based" video retrieval with fewer than 5000 concepts, detected with minimal accuracy of 10% mean average precision is likely to provide high accuracy results, comparable to text retrieval on the web, in a typical broadcast news collection. We also derive evidence that it should be feasible to find sufficiently many new, useful concepts that would be helpful for retrieval.