Cluster-based data modeling for semantic video search

  • Authors:
  • Jelena Tešić; Apostol (Paul) Natsev; John R. Smith

  • Affiliation:
  • IBM Watson Research Center, Hawthorne, NY (all authors)

  • Venue:
  • Proceedings of the 6th ACM International Conference on Image and Video Retrieval (CIVR '07)
  • Year:
  • 2007


Abstract

In this paper we present a novel approach to query-by-example that exploits the high-level semantics already present in the dataset. With visual topics, the provided examples are typically not diverse enough to build a robust model of the user's need in the descriptor space, so direct modeling using the topic examples as training data is inadequate. Alternatively, systems resort to multiple content-based searches using each example in turn, which typically yields poor results. We explore the relevance of visual concept models and how they help refine query topics, and we propose a new technique that leverages the underlying semantics of the visual query topic examples to improve the search. We treat the semantic space as the descriptor space and intelligently model the query in that space. We use unlabeled data both to expand the diversity of the topic examples and to provide a robust set of negative examples that permits direct modeling. The approach intelligently models a positive and a pseudo-negative space using unbiased and biased methods for data sampling and data selection, and improves semantic retrieval by 12% on the TRECVID 2006 topics. Moreover, we explore the visual context in fusion with text and visual search baselines and examine how this component can improve baseline retrieval results by expanding and re-ranking them. We apply the proposed methods in a multimodal video search system and show that the underlying semantics of the queries can significantly improve the overall visual search results, improving the baseline by over 46% and enhancing the performance of other modalities by at least 10%. We also demonstrate improved robustness over a range of query topic training examples and over query topics with varying visual support in TRECVID.
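
The core technique described in the abstract, modeling a query directly in the space of concept-detector scores with pseudo-negatives sampled from unlabeled data, then fusing with a baseline run, can be illustrated with a short sketch. The Python code below is a minimal illustration under stated assumptions only: the corpus data, the pseudo-negative sampling strategies, the SVM classifier, and the fusion weights are placeholders, not the paper's actual implementation.

# Minimal sketch of query modeling in a semantic concept space.
# All names, data, and parameters here are illustrative assumptions,
# not taken from the paper's implementation.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Assume each video shot is represented by a vector of confidence scores
# from a bank of pretrained visual concept detectors (the "semantic space").
n_concepts = 39                                # hypothetical concept lexicon size
corpus = rng.random((10_000, n_concepts))      # unlabeled corpus (placeholder)
topic_examples = rng.random((5, n_concepts))   # the few positive query examples

# Pseudo-negatives: sample from the unlabeled corpus. An unbiased sampler
# draws uniformly; a biased one here oversamples shots far from the positives.
def sample_pseudo_negatives(corpus, positives, n, biased=True):
    if not biased:
        idx = rng.choice(len(corpus), size=n, replace=False)
        return corpus[idx]
    # Biased variant: prefer shots most distant from the positive centroid.
    centroid = positives.mean(axis=0)
    dist = np.linalg.norm(corpus - centroid, axis=1)
    idx = np.argsort(dist)[-n:]
    return corpus[idx]

negatives = sample_pseudo_negatives(corpus, topic_examples, n=100)

# Directly model the query: positives vs. pseudo-negatives in concept space.
X = np.vstack([topic_examples, negatives])
y = np.concatenate([np.ones(len(topic_examples)), np.zeros(len(negatives))])
model = SVC(kernel="rbf", probability=True).fit(X, y)

# Score every shot; higher positive-class probability ranks higher.
semantic_scores = model.predict_proba(corpus)[:, 1]

# Late fusion with a baseline run (e.g., text search) by normalized
# weighted sum, then re-rank the result list by the fused score.
baseline_scores = rng.random(len(corpus))      # placeholder baseline run
def normalize(s):
    return (s - s.min()) / (s.max() - s.min() + 1e-9)
fused = 0.5 * normalize(semantic_scores) + 0.5 * normalize(baseline_scores)
ranking = np.argsort(-fused)

The biased sampler above simply prefers shots far from the positive centroid, and the fusion uses fixed equal weights; the paper's actual biased/unbiased selection methods and fusion scheme may differ.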