Audio-Visual Event Detection using Duration dependent input output Markov models

  • Authors:
  • Milind R. Naphade;Ashutosh Garg;Thomas S. Huang

  • Affiliations:
  • -;-;-

  • Venue:
  • CBAIVL '01 Proceedings of the IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL'01)
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Analysis of audio-visual data and detection of semanticevents with spatio-temporal support is a challenging multimedia understanding problem. The difficulty lies in the gap that exists between low level media features and high levelsemantic concept. We introduce a duration dependent input output Marko model (DDIOMM)to detect events basedon multiple modalities. The DDIOMM combines the abilityto model non-exponential duration densities with the mapping of input sequences to output sequences. We test theDDIOMM by modeling the audio-visual event explosion.We compare the detection performance of the DDIOMMwith the IOMM as well as the HMM. Experiments revealthat modeling of duration improves detection performance.