Learning-based linguistic indexing of pictures with 2-D MHMMs

  • Authors:
  • James Z. Wang; Jia Li

  • Affiliations:
  • The Pennsylvania State University, University Park, PA; The Pennsylvania State University, University Park, PA

  • Venue:
  • Proceedings of the tenth ACM international conference on Multimedia
  • Year:
  • 2002

Abstract

Automatic linguistic indexing of pictures is an important but highly challenging problem for researchers in computer vision and content-based image retrieval. In this paper, we introduce a statistical modeling approach to this problem. Categorized images are used to train a dictionary of hundreds of concepts automatically based on statistical modeling. Images of any given concept category are regarded as instances of a stochastic process that characterizes the category. To measure the extent of association between an image and the textual description of a category of images, the likelihood of the occurrence of the image under the stochastic process derived from the category is computed. A high likelihood indicates a strong association. In our experimental implementation, the ALIP (Automatic Linguistic Indexing of Pictures) system, we focus on a particular group of stochastic processes for describing images, namely the two-dimensional multiresolution hidden Markov models (2-D MHMMs). We implemented and tested the system on a photographic image database of 600 different semantic categories, each with about 40 training images. Tested using 3,000 images outside the training database, the system demonstrated good accuracy and high potential for linguistic indexing of these test images.
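
The likelihood-based indexing step described in the abstract can be summarized with a short sketch. The Python code below is illustrative only: the names CategoryModel, log_likelihood, and annotate are assumptions introduced here, and the toy Gaussian scorers stand in for ALIP's actual 2-D MHMM likelihood evaluation, which is not reproduced. The sketch shows the overall flow: each trained category model scores the image's features, the highest-likelihood categories are selected, and their descriptive words are pooled into the linguistic index.

```python
# Hedged sketch of likelihood-based linguistic indexing in the style of ALIP.
# Assumption: per-category 2-D MHMMs are represented by a hypothetical
# CategoryModel with a log_likelihood() callable; the real system evaluates
# the likelihood of multiresolution block features under each trained 2-D MHMM.

from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

import numpy as np


@dataclass
class CategoryModel:
    """Hypothetical stand-in for a trained 2-D MHMM of one image category."""
    name: str
    words: List[str]                                # textual description of the category
    log_likelihood: Callable[[np.ndarray], float]   # log P(features | category model)


def annotate(features: np.ndarray,
             models: Sequence[CategoryModel],
             top_k: int = 5) -> Tuple[List[str], List[Tuple[str, float]]]:
    """Rank categories by log-likelihood and pool their words as annotations."""
    scored = sorted(
        ((m, m.log_likelihood(features)) for m in models),
        key=lambda pair: pair[1],
        reverse=True,
    )
    best = scored[:top_k]
    # A high likelihood indicates a strong association, so the words of the
    # best-matching categories become the linguistic index of the image.
    words: List[str] = []
    for model, _ in best:
        for w in model.words:
            if w not in words:
                words.append(w)
    return words, [(m.name, score) for m, score in best]


if __name__ == "__main__":
    rng = np.random.default_rng(0)

    # Toy Gaussian scorers standing in for 2-D MHMMs, for illustration only.
    def make_gaussian(mean: float) -> Callable[[np.ndarray], float]:
        return lambda x: float(-np.sum((x - mean) ** 2) / 2.0)

    models = [
        CategoryModel("beach", ["sand", "sea", "sky"], make_gaussian(0.2)),
        CategoryModel("mountain", ["rock", "snow", "sky"], make_gaussian(0.8)),
    ]
    image_features = rng.random((4, 4))  # placeholder block features
    words, ranking = annotate(image_features, models, top_k=1)
    print(words, ranking)
```

The design choice mirrored here is that annotation reduces to model selection: once a 2-D MHMM has been fitted per category, indexing a new image only requires evaluating its likelihood under each stored model and reporting the words attached to the best-scoring categories.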