Extracting semantics from audio-visual content: the final frontier in multimedia retrieval

Authors:
M. R. Naphade; T. S. Huang
Affiliations:
IBM Thomas J. Watson Res. Center, Hawthorne, NY;-
Venue:
IEEE Transactions on Neural Networks
Year:
2002

Citing 0
Cited 36

Supporting timeliness and accuracy in distributed real-time content-based video analysis
MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia

Semantic context detection based on hierarchical audio models
MIR '03 Proceedings of the 5th ACM SIGMM international workshop on Multimedia information retrieval

Discovering Document Semantics QBYS: A System for Querying the WWW by Semantics
Multimedia Tools and Applications

An audio/video analysis mechanism for web indexing
Proceedings of the 15th international conference on World Wide Web

Real-time video content analysis: QoS-aware application composition and parallel processing
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)

The Semantic Pathfinder: Using an Authoring Metaphor for Generic Multimedia Indexing
IEEE Transactions on Pattern Analysis and Machine Intelligence

Deploying personalized mobile services in an agent-based environment
Expert Systems with Applications: An International Journal

Modeling human-like intelligent image processing: An information processing perspective and approach
Image Communication

Inexpensive fusion methods for enhancing feature detection
Image Communication

Semantic context detection using audio event fusion: camera-ready version
EURASIP Journal on Applied Signal Processing

Region-based image retrieval using an object ontology and relevance feedback
EURASIP Journal on Applied Signal Processing

Human-centered multimedia systems: tutorial overview
Proceedings of the 15th international conference on Multimedia

Web search engine multimedia functionality
Information Processing and Management: an International Journal

A fuzzy extension in ALC description logics
AIKED'05 Proceedings of the 4th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering Data Bases

Fuzzy color-based approach for understanding animated movies content in the indexing task
Journal on Image and Video Processing - Color in Image and Video Processing

Automatic creation and evaluation of MPEG-7 compliant summary descriptions for generic audiovisual content
Image Communication

Concept-Based Video Retrieval
Foundations and Trends in Information Retrieval

Ice hockey shot event modeling with mixture hidden Markov model
EiMM '09 Proceedings of the 1st ACM international workshop on Events in multimedia

Image Annotation Using Sub-block Energy of Color Correlograms
AICI '09 Proceedings of the International Conference on Artificial Intelligence and Computational Intelligence

A Neural Network Based Framework for Audio Scene Analysis in Audio Sensor Networks
PCM '09 Proceedings of the 10th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing

Multiple features in temporal models for the representation of visual contents in video
CIVR'03 Proceedings of the 2nd international conference on Image and video retrieval

Techniques for parallel execution of the particle filter
SCIA'03 Proceedings of the 13th Scandinavian conference on Image analysis

Modeling visual information processing in brain: a computer vision point of view and approach
BVAI'07 Proceedings of the 2nd international conference on Advances in brain, vision and artificial intelligence

A semantic framework for video genre classification and event analysis
Image Communication

Investigating fuzzy DLs-based reasoning in semantic image analysis
Multimedia Tools and Applications

Perceptual-based quality assessment for audio-visual services: A survey
Image Communication

IPSILON: incremental parsing for semantic indexing of latent concepts
IEEE Transactions on Image Processing

Analysing multimedia content in social networking environments
Proceedings of the 2010 ACM workshop on Social, adaptive and personalized multimedia interaction and access

Personalization in multimedia retrieval: A survey
Multimedia Tools and Applications

An efficient image classifier using discrete cosine transform
Proceedings of the International Conference & Workshop on Emerging Trends in Technology

Semantic context inference in multimedia search
The future internet

Image annotation based on central region features reduction
AICI'11 Proceedings of the Third international conference on Artificial intelligence and computational intelligence - Volume Part II

Semantic image segmentation with a multidimensional hidden markov model
MMM'07 Proceedings of the 13th international conference on Multimedia Modeling - Volume Part I

Audio visual cues for video indexing and retrieval
PCM'04 Proceedings of the 5th Pacific Rim conference on Advances in Multimedia Information Processing - Volume Part I

Ice hockey shooting event modeling with mixture hidden Markov model
Multimedia Tools and Applications

Audio and video feature fusion for activity recognition in unconstrained videos
IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning


Abstract

Multimedia understanding is a fast-emerging interdisciplinary research area. There is tremendous potential for effective use of multimedia content through intelligent analysis, and diverse application areas increasingly rely on multimedia understanding systems. Advances in multimedia understanding are directly related to advances in signal processing, computer vision, pattern recognition, multimedia databases, and smart sensors. We review state-of-the-art techniques in multimedia retrieval. In particular, we discuss how multimedia retrieval can be viewed as a pattern recognition problem and how the field increasingly relies on powerful pattern recognition and machine learning techniques. We also review state-of-the-art multimedia understanding systems, with particular emphasis on a system for semantic video indexing centered around multijects and multinets. Finally, we discuss how semantic retrieval is centered around concepts and context, and the various mechanisms for modeling them.