A Theory for Multiresolution Signal Decomposition: The Wavelet Representation
IEEE Transactions on Pattern Analysis and Machine Intelligence
International Journal of Computer Vision
Combining labeled and unlabeled data with co-training
COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Analyzing the effectiveness and applicability of co-training
Proceedings of the ninth international conference on Information and knowledge management
Combining Labeled and Unlabeled Data for MultiClass Text Categorization
ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Combining Textual and Visual Cues for Content-Based Image Retrieval on the World Wide Web
CBAIVL '98 Proceedings of the IEEE Workshop on Content - Based Access of Image and Video Libraries
Image Indexing Using Color Correlograms
CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Laplacian Eigenmaps for dimensionality reduction and data representation
Neural Computation
ReCoM: reinforcement clustering of multi-type interrelated data objects
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
The combination limit in multimedia retrieval
MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Manifold-ranking based image retrieval
Proceedings of the 12th annual ACM international conference on Multimedia
Optimal multimodal fusion for multimedia data analysis
Proceedings of the 12th annual ACM international conference on Multimedia
Locality preserving clustering for image database
Proceedings of the 12th annual ACM international conference on Multimedia
Hierarchical clustering of WWW image search results using visual, textual and link information
Proceedings of the 12th annual ACM international conference on Multimedia
A bootstrapping framework for annotating and retrieving WWW images
Proceedings of the 12th annual ACM international conference on Multimedia
ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
ICPR '96 Proceedings of the 13th International Conference on Pattern Recognition - Volume 2
Noise adaptive stream weighting in audio-visual speech recognition
EURASIP Journal on Applied Signal Processing
Audio-visual speech modeling for continuous speech recognition
IEEE Transactions on Multimedia
An adaptive graph model for automatic image annotation
MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Adaptive image retrieval using a Graph model for semantic feature integration
MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Bipartite graph reinforcement model for web image annotation
Proceedings of the 15th international conference on Multimedia
Structure-sensitive manifold ranking for video concept detection
Proceedings of the 15th international conference on Multimedia
Optimizing multi-graph learning: towards a unified video annotation scheme
Proceedings of the 15th international conference on Multimedia
Image annotation via graph learning
Pattern Recognition
Annotating photo collections by label propagation according to multiple similarity cues
MM '08 Proceedings of the 16th ACM international conference on Multimedia
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Emerging Trends in Visual Computing
Unified video annotation via multigraph learning
IEEE Transactions on Circuits and Systems for Video Technology
Topic analysis for topic-focused multi-document summarization
Proceedings of the 18th ACM conference on Information and knowledge management
Graph-based multi-modality learning for topic-focused multi-document summarization
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Multigraph-based query-independent learning for video search
IEEE Transactions on Circuits and Systems for Video Technology
Build Chinese emotion lexicons using a graph-based algorithm and multiple resources
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Fusing heterogeneous modalities for video and image re-ranking
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Multimodal search and retrieval using manifold learning and query formulation
Proceedings of the 16th International Conference on 3D Web Technology
Discovering multirelational structure in social media streams
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Automatic refinement of keyword annotations for web image search
MMM'07 Proceedings of the 13th international conference on Multimedia Modeling - Volume Part I
Boosting cross-media retrieval by learning with positive and negative examples
MMM'07 Proceedings of the 13th International conference on Multimedia Modeling - Volume Part II
Multi-graph multi-instance learning for object-based image and video retrieval
Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
K-Nearest Neighbors Relevance Annotation Model for Distance Education
International Journal of Distance Education Technologies
Graph-based semi-supervised learning with multi-modality propagation for large-scale image datasets
Journal of Visual Communication and Image Representation
Using Multiple Resources in Graph-Based Semi-supervised Sentiment Classification
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
A heterogenous automatic feedback semi-supervised method for image reranking
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Video key frame extraction through dynamic Delaunay clustering with a structural constraint
Journal of Visual Communication and Image Representation
Multimodal retrieval with relevance feedback based on genetic programming
Multimedia Tools and Applications
Hi-index | 0.00 |
To better understand the content of multimedia, a lot of research efforts have been made on how to learn from multi-modal feature. In this paper, it is studied from a graph point of view: each kind of feature from one modality is represented as one independent graph; and the learning task is formulated as inferring from the constraints in every graph as well as supervision information (if available). For semi-supervised learning, two different fusion schemes, namely linear form and sequential form, are proposed. For each scheme, it is derived from optimization point of view; and further justified from two sides: similarity propagation and Bayesian interpretation. By doing so, we reveal the regular optimization nature, transductive learning nature as well as prior fusion nature of the proposed schemes, respectively. Moreover, the proposed method can be easily extended to unsupervised learning, including clustering and embedding. Systematic experimental results validate the effectiveness of the proposed method.