Automatic multimedia cross-modal correlation discovery

Authors:
Jia-Yu Pan;Hyung-Jeong Yang;Christos Faloutsos;Pinar Duygulu
Affiliations:
Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA
Venue:
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Year:
2004

Citing 19
Cited 93

Latent semantic indexing: a probabilistic analysis

PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
A semidiscrete matrix decomposition for latent semantic indexing information retrieval

ACM Transactions on Information Systems (TOIS)
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Authoritative sources in a hyperlinked environment

Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Normalized Cuts and Image Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Topic-sensitive PageRank

Proceedings of the 11th international conference on World Wide Web
Searching Multimedia Databases by Content

Searching Multimedia Databases by Content
Name-It: Naming and Detecting Faces in News Videos

IEEE MultiMedia
Lessons Learned from Building a Terabyte Digital Video Library

Computer
Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Multiple-Instance Learning for Natural Scene Classification

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
The R+-Tree: A Dynamic Index for Multi-Dimensional Objects

VLDB '87 Proceedings of the 13th International Conference on Very Large Data Bases
VideoCube: A Novel Tool for Video Mining and Classification

ICADL '02 Proceedings of the 5th International Conference on Asian Digital Libraries: Digital Libraries: People, Knowledge, and Technology
Automatic image annotation and retrieval using cross-media relevance models

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach

IEEE Transactions on Pattern Analysis and Machine Intelligence
Matching words and pictures

The Journal of Machine Learning Research
MARSYAS: a framework for audio analysis

Organised Sound
MARSYAS: a framework for audio analysis

Organised Sound
Electricity based external similarity of categorical attributes

PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining

STRG-Index: spatio-temporal region graph indexing for large video databases

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Mining images on semantics via statistical learning

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Emotion-based music recommendation by association discovery from film music

Proceedings of the 13th annual ACM international conference on Multimedia
Image annotations by combining multiple evidence & wordNet

Proceedings of the 13th annual ACM international conference on Multimedia
Neighborhood Formation and Anomaly Detection in Bipartite Graphs

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Relevance search and anomaly detection in bipartite graphs

ACM SIGKDD Explorations Newsletter
Center-piece subgraphs: problem definition and fast solutions

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
An adaptive graph model for automatic image annotation

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Diversifying the image retrieval results

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Multimedia simplification for optimized MMS synthesis

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Web image annotation by fusing visual features and textual information

Proceedings of the 2007 ACM symposium on Applied computing
Enhanced max margin learning on multimodal data mining in a multimedia database

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Correlation search in graph databases

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Fast best-effort pattern matching in large attributed graphs

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Fast direction-aware proximity for graph mining

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Cross-modal correlation learning for clustering on image-audio dataset

Proceedings of the 15th international conference on Multimedia
Emotion-based impressionism slideshow with automatic music accompaniment

Proceedings of the 15th international conference on Multimedia
A graph-based image annotation framework

Pattern Recognition Letters
DBconnect: mining research community on DBLP data

Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis
Annotation suggestion and search for personal multimedia objects on the web

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Colibri: fast mining of large static and dynamic graphs

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
A family of dissimilarity measures between nodes generalizing both the shortest-path and the commute-time distances

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Imagination: Exploiting Link Analysis for Accurate Image Annotation

Adaptive Multimedial Retrieval: Retrieval, User, and Semantics
C-DEM: a multi-modal query system for Drosophila Embryo databases

Proceedings of the VLDB Endowment
Fast mining of complex time-stamped events

Proceedings of the 17th ACM conference on Information and knowledge management
Emotion-based music recommendation by affinity discovery from film music

Expert Systems with Applications: An International Journal
Graph nodes clustering with the sigmoid commute-time kernel: A comparative study

Data & Knowledge Engineering
Mining Research Communities in Bibliographical Data

Advances in Web Mining and Web Usage Analysis
Top-K Correlation Sub-graph Search in Graph Databases

DASFAA '09 Proceedings of the 14th International Conference on Database Systems for Advanced Applications
Grocery shopping recommendations based on basket-sensitive random walk

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Tagging and retrieving images with co-occurrence models: from corel to flickr

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Ontology-Based Semantic Web Image Retrieval by Utilizing Textual and Visual Annotations

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
iPoG: fast interactive proximity querying on graphs

Proceedings of the 18th ACM conference on Information and knowledge management
Community mining on dynamic weighted directed graphs

Proceedings of the 1st ACM international workshop on Complex networks meet information & knowledge management
Multigraph-based query-independent learning for video search

IEEE Transactions on Circuits and Systems for Video Technology
Knowledge Based Image Annotation Refinement

Journal of Signal Processing Systems
Fast computation of SimRank for static and dynamic information networks

Proceedings of the 13th International Conference on Extending Database Technology
Image annotation with tagprop on the MIRFLICKR set

Proceedings of the international conference on Multimedia information retrieval
Tracking the random surfer: empirically measured teleportation parameters in PageRank

Proceedings of the 19th international conference on World wide web
Parallelizing Random Walk with Restart for large-scale query recommendation

Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud
ALLRIGHT: automatic ontology instantiation from tabular web documents

ISWC'07/ASWC'07 Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference
A spectral method for context based disambiguation of image annotations

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Edge-preserving colorization using data-driven random walks with restart

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Recognition of attentive objects with a concept association network for image annotation

Pattern Recognition
Crowdsourcing and service delivery

IBM Journal of Research and Development
Transitive node similarity for link prediction in social networks with positive and negative links

Proceedings of the fourth ACM conference on Recommender systems
Spatial outlier detection: random walk based approaches

Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems
Improving graph-walk-based similarity with reranking: Case studies for personal information management

ACM Transactions on Information Systems (TOIS)
Visual query expansion via incremental hypernetwork models of image and text

PRICAI'10 Proceedings of the 11th Pacific Rim international conference on Trends in artificial intelligence
Probabilistic temporal multimedia data mining

ACM Transactions on Intelligent Systems and Technology (TIST)
Multimedia data mining: state of the art and challenges

Multimedia Tools and Applications
Semi-supervised classification and betweenness computation on large, sparse, directed graphs

Pattern Recognition
Index design and query processing for graph conductance search

The VLDB Journal — The International Journal on Very Large Data Bases
Combining visual and textual modalities for multimedia ontology matching

SAMT'10 Proceedings of the 5th international conference on Semantic and digital media technologies
Evaluation of image segmentation algorithms from the perspective of salient region detection

ACIVS'11 Proceedings of the 13th international conference on Advanced concepts for intelligent vision systems
Learning protein functions from bi-relational graph of proteins and function annotations

WABI'11 Proceedings of the 11th international conference on Algorithms in bioinformatics
A generalized stochastic block model for recommendation in social rating networks

Proceedings of the fifth ACM conference on Recommender systems
"Tell me more": finding related items from user provided feedback

DS'11 Proceedings of the 14th international conference on Discovery science
Finding representative and diverse community contributed images to create visual summaries of geographic areas

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Fusion of region and image-based techniques for automatic image annotation

MMM'07 Proceedings of the 13th international conference on Multimedia Modeling - Volume Part I
Evaluating the length of virtual horizontal bar chart columns augmented with wrench and sound feedback

ICCHP'06 Proceedings of the 10th international conference on Computers Helping People with Special Needs
Improving the image retrieval results via topic coverage graph

PCM'06 Proceedings of the 7th Pacific Rim conference on Advances in Multimedia Information Processing
Fast and exact top-k search for random walk with restart

Proceedings of the VLDB Endowment
BASSET: scalable gateway finder in large graphs

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
A probabilistic model for correspondence problems using random walks with restart

ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part III
Improving image annotations using wordnet

MIS'05 Proceedings of the 11th international conference on Advances in Multimedia Information Systems
Effective heterogeneous similarity measure with nearest neighbors for cross-media retrieval

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
An experimental investigation of kernels on graphs for collaborative recommendation and semisupervised classification

Neural Networks
Taxonomy-Oriented recommendation towards recommendation with stage

APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications
Fine-grained access control of personal data

Proceedings of the 17th ACM symposium on Access Control Models and Technologies
A cross-modal method of labeling music tags

Multimedia Tools and Applications
Gateway finder in large graphs: problem definitions and fast solutions

Information Retrieval
Fast and accurate link prediction in social networking systems

Journal of Systems and Software
Product recommendation with temporal dynamics

Expert Systems with Applications: An International Journal
Co-transfer learning via joint transition probability graph based method

Proceedings of the 1st International Workshop on Cross Domain Knowledge Discovery in Web and Social Network Mining
Multi-label image annotation based on neighbor pair correlation chain

MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
When a friend in Twitter is a friend in life

Proceedings of the 3rd Annual ACM Web Science Conference
PathRank: Ranking nodes on a heterogeneous graph for flexible hybrid recommender systems

Expert Systems with Applications: An International Journal
Multimedia ontology matching by using visual and textual modalities

Multimedia Tools and Applications
Graph-based semi-supervised learning with multi-modality propagation for large-scale image datasets

Journal of Visual Communication and Image Representation
Fast Recommendation on Bibliographic Networks

ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)
E-rank: A Structural-Based Similarity Measure in Social Networks

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Efficient ad-hoc search for personalized PageRank

Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Big graph mining: algorithms and discoveries

ACM SIGKDD Explorations Newsletter
Learning latent friendship propagation networks with interest awareness for link prediction

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Exploiting user clicks for automatic seed set generation for entity matching

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
LAFT-Explorer: inferring, visualizing and predicting how your social network expands

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
MLRank: Multi-correlation Learning to Rank for image annotation

Pattern Recognition
Overlapping community detection using seed set expansion

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Utilizing social and behavioral neighbors for personalized recommendation

ISNN'13 Proceedings of the 10th international conference on Advances in Neural Networks - Volume Part II
From biological to social networks: Link prediction based on multi-way spectral clustering

Data & Knowledge Engineering
Cross domain recommendation based on multi-type media fusion

Neurocomputing
Random walks based modularity: application to semi-supervised learning

Proceedings of the 23rd international conference on World wide web

Quantified Score

Hi-index	0.00

Visualization

Abstract

Given an image (or video clip, or audio song), how do we automatically assign keywords to it? The general problem is to find correlations across the media in a collection of multimedia objects like video clips, with colors, and/or motion, and/or audio, and/or text scripts. We propose a novel, graph-based approach, "MMG", to discover such cross-modal correlations.Our "MMG" method requires no tuning, no clustering, no user-determined constants; it can be applied to any multimedia collection, as long as we have a similarity function for each medium; and it scales linearly with the database size. We report auto-captioning experiments on the "standard" Corel image database of 680 MB, where it outperforms domain specific, fine-tuned methods by up to 10 percentage points in captioning accuracy (50% relative improvement).