CRIM's content-based audio copy detection system for TRECVID 2009

  • Authors:
  • Vishwa Nath Gupta;Gilles Boulianne;Patrick Cardinal

  • Affiliations:
  • Centre de recherche informatique de Montréal (CRIM), Montréal, Canada H3N 1M3;Centre de recherche informatique de Montréal (CRIM), Montréal, Canada H3N 1M3;Centre de recherche informatique de Montréal (CRIM), Montréal, Canada H3N 1M3

  • Venue:
  • Multimedia Tools and Applications
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

We report results on audio copy detection for TRECVID 2009 copy detection task. This task involves searching for transformed audio queries in over 385 h of test audio. The queries were transformed in seven different ways, three of them involved mixing unrelated speech to the original query, making it a much more difficult task. We give results with two different audio fingerprints and show that mapping each test frame to the nearest query frame (nearest-neighbor fingerprint) results in robust audio copy detection. The most difficult task in TRECVID 2009 was to detect audio copies using predetermined thresholds computed from 2008 data. We show that the nearest-neighbor fingerprints were robust to even this task and gave actual minimal normalized detection cost rate (NDCR) of around 0.06 for all the transformations. These results are close to those obtained by using the optimal threshold for each transform. This result shows the robustness of the nearest-neighbor fingerprints. These nearest-neighbor fingerprints can be efficiently computed on a graphics processing unit, leading to a very fast search.