Near-duplicate video retrieval based on clustering by multiple sequence alignment

  • Authors:
  • Yandan Wang;Mohammed Belkhatir;Bashar Tahayna

  • Affiliations:
  • Monash University, Bandar Sunway, Malaysia;University of Lyon, Villeurbanne, France;Monash University, Bandar Sunway, Malaysia

  • Venue:
  • Proceedings of the 20th ACM international conference on Multimedia
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

In Near-Duplicate Video Retrieval (NDVR), recent works have focused on bettering index structures and matching schemes not only to improve retrieval accuracy but also to enforce scalability in an effort to keep up with the ever-growing size of video collections. In this paper, we propose a framework for the retrieval of Near-Duplicate Videos (NDVs) based on a pre-processing step of clustering inspired by Multiple Sequence Alignment (MSA) of DNA sequences. In our technique, we represent videos as alphabetical genomes and MSA is employed to automatically cluster a video collection. NDVR is then conducted on these formed clusters instead of the original video collection. Experimentally, we show that our clustering-based approach, while being significantly faster than state-of-the-art techniques that are not based on a pre-processing clustering step, i.e. the n-gram and Dynamic Time Warping (DTW) techniques, yields equivalent results in a precision/recall evaluation framework.