LDA-Based Retrieval Framework for Semantic News Video Retrieval

Authors:
Juan Cao;Jintao Li;Yongdong Zhang;Sheng Tang
Affiliations:
Chinese Academy of Sciences, China/ Graduate University of the Chinese Academy of Sciences, China;Chinese Academy of Sciences, China;Chinese Academy of Sciences, China;Chinese Academy of Sciences, China
Venue:
ICSC '07 Proceedings of the International Conference on Semantic Computing
Year:
2007

Citing 0
Cited 5

A density-based method for adaptive LDA model selection

Neurocomputing
A perceptual hashing algorithm using latent dirichlet allocation

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Video fingerprinting using Latent Dirichlet Allocation and facial images

Pattern Recognition
Topic based query suggestions for video search

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Latent topic model for audio retrieval

Pattern Recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

Topic-based language model has attracted much attention as the propounding of semantic retrieval in recent years. Especially for the ASR text with errors, the topic representation is more reasonable than the exact term representation. Among these models, Latent Dirichlet Allocation(LDA) has been noted for its ability to discover the latent topic structure, and is broadly applied in many text-related tasks. But up to now its application in information retrieval(IR) is still limited to be a supplement to the standard document models, and furthermore, it has been pointed out that directly employing the basic LDA model will hurt retrieval performance. In this paper, we propose a lexicon-guided two-level LDA retrieval framework. It uses the HowNet to guide the first-level LDA model's parameter estimation, and further construct the second-level LDA models based on the first-level's inference results. We use TRECVID 2005 ASR collection to evaluate it, and compare it with the vector space model(VSM) and latent semantic Indexing(LSI). Our experiments show the proposed method is very competitive.