Categorizing Visual Contents by Matching Visual ``Keywords''

Authors:
Joo-Hwee Lim
Affiliations:
-
Venue:
VISUAL '99 Proceedings of the Third International Conference on Visual Information and Information Systems
Year:
1999

Citing 8
Cited 2

Fast multiresolution image querying

SIGGRAPH '95 Proceedings of the 22nd annual conference on Computer graphics and interactive techniques
Photobook: content-based manipulation of image databases

International Journal of Computer Vision
Combining classifiers in text categorization

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
The Handbook of Brain Theory and Neural Networks

The Handbook of Brain Theory and Neural Networks
Training Templates for Scene Classification using a Few Examples

CAIVL '97 Proceedings of the 1997 Workshop on Content-Based Access of Image and Video Libraries (CBAIVL '97)
Configuration based scene classification and image indexing

CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
A General Framework for Object Detection

ICCV '98 Proceedings of the Sixth International Conference on Computer Vision
Video query: research directions

IBM Journal of Research and Development - Papers on mustimedia systems

ViVo: Visual Vocabulary Construction for Mining Biomedical Images

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
A fast visual word frequency - inverse image frequency for detector of rare concepts

CLEF'09 Proceedings of the 10th international conference on Cross-language evaluation forum: multimedia experiments

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we propose a three-layer visual information processing architecture for extracting concise non-textual descriptions from visual contents. These coded descriptions capture both local saliencies and spatial configurations present in visual contents via prototypical visual tokens called visual "keywords". Categorization of images and video shots represented by keyframes can be performed by comparing their coded descriptions. We demonstrate our proposed architecture in natural scene image categorization that outperforms methods which use aggregate measures of low-level features.