Multimodal metadata fusion using causal strength

Authors:
Yi Wu;Edward Y. Chang;Belle L. Tseng
Affiliations:
University of California, Santa Barbara, CA;University of California, Santa Barbara, CA;NEC Labs America, Cupertino, CA
Venue:
Proceedings of the 13th annual ACM international conference on Multimedia
Year:
2005

Citing 17
Cited 13

Knowledge representation and inference in similarity networks and Bayesian multinets

Artificial Intelligence
Texture Features for Browsing and Retrieval of Image Data

IEEE Transactions on Pattern Analysis and Machine Intelligence
Bayesian Network Classifiers

Machine Learning - Special issue on learning with probabilistic representations
Causality: models, reasoning, and inference

Causality: models, reasoning, and inference
Support vector machine active learning for image retrieval

MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
Understanding and Using Context

Personal and Ubiquitous Computing
Visually Searching the Web for Content

IEEE MultiMedia
Object Recognition from Local Scale-Invariant Features

ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Position-Annotated Photographs: A Geotemporal Web

IEEE Pervasive Computing
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
From context to content: leveraging context to infer media metadata

Proceedings of the 12th annual ACM international conference on Multimedia
Context data in geo-referenced digital photo collections

Proceedings of the 12th annual ACM international conference on Multimedia
EXTENT: fusing context, content, and semantic ontology for photo annotation

Proceedings of the 2nd international workshop on Computer vision meets databases
Solving multiclass learning problems via error-correcting output codes

Journal of Artificial Intelligence Research
PCA-SIFT: a more distinctive representation for local image descriptors

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Bayesian fusion of camera metadata cues in semantic scene classification

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
A Bayesian approach to learning causal networks

UAI'95 Proceedings of the Eleventh conference on Uncertainty in artificial intelligence

Multimodal fusion using learned text concepts for image categorization

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
A scalable service for photo annotation, sharing, and search

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
MultiTube--Where Web 2.0 and Multimedia Could Meet

IEEE MultiMedia
Enhanced max margin learning on multimodal data mining in a multimedia database

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Refining image annotation using contextual relations between words

Proceedings of the 6th ACM international conference on Image and video retrieval
How flickr helps us make sense of the world: context and content in community-contributed media collections

Proceedings of the 15th international conference on Multimedia
Generating diverse and representative image search results for landmarks

Proceedings of the 17th international conference on World Wide Web
Construction of extended geographical database based on photo shooting history

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Web-Scale Image Annotation

PCM '08 Proceedings of the 9th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Multimedia data mining: state of the art and challenges

Multimedia Tools and Applications
Context-based support vector machines for interconnected image annotation

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part I
Social multimedia: highlighting opportunities for search and mining of multimedia data in social media applications

Multimedia Tools and Applications
Multimedia retrieval and classification for web content

FDIA'07 Proceedings of the 1st BCS IRSG conference on Future Directions in Information Access

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a probabilistic framework that uses influence diagrams to fuse metadata of multiple modalities for photo annotation. We fuse contextual information (location, time, and camera parameters), visual content (holistic and local perceptual features), and semantic ontology in a synergistic way. We use causal strengths to encode causalities between variables, and between variables and semantic labels. Through analytical and empirical studies, we demonstrate that our fusion approach can achieve high-quality photo annotation and good interpretability, substantially better than traditional methods.