Audio-Visual Event Detection using Duration dependent input output Markov models

Authors:
Milind R. Naphade;Ashutosh Garg;Thomas S. Huang
Affiliations:
-;-;-
Venue:
CBAIVL '01 Proceedings of the IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL'01)
Year:
2001

Citing 0
Cited 3

Affective content detection using HMMs

MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Semantic context detection using audio event fusion: camera-ready version

EURASIP Journal on Applied Signal Processing
Video Affective Content Representation and Recognition Using Video Affective Tree and Hidden Markov Models

ACII '07 Proceedings of the 2nd international conference on Affective Computing and Intelligent Interaction

Quantified Score

Hi-index	0.00

Visualization

Abstract

Analysis of audio-visual data and detection of semanticevents with spatio-temporal support is a challenging multimedia understanding problem. The difficulty lies in the gap that exists between low level media features and high levelsemantic concept. We introduce a duration dependent input output Marko model (DDIOMM)to detect events basedon multiple modalities. The DDIOMM combines the abilityto model non-exponential duration densities with the mapping of input sequences to output sequences. We test theDDIOMM by modeling the audio-visual event explosion.We compare the detection performance of the DDIOMMwith the IOMM as well as the HMM. Experiments revealthat modeling of duration improves detection performance.