Fast multiresolution image querying
SIGGRAPH '95 Proceedings of the 22nd annual conference on Computer graphics and interactive techniques
Photobook: content-based manipulation of image databases
International Journal of Computer Vision
Combining classifiers in text categorization
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
The Handbook of Brain Theory and Neural Networks
The Handbook of Brain Theory and Neural Networks
Training Templates for Scene Classification using a Few Examples
CAIVL '97 Proceedings of the 1997 Workshop on Content-Based Access of Image and Video Libraries (CBAIVL '97)
Configuration based scene classification and image indexing
CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
A General Framework for Object Detection
ICCV '98 Proceedings of the Sixth International Conference on Computer Vision
Video query: research directions
IBM Journal of Research and Development - Papers on mustimedia systems
ViVo: Visual Vocabulary Construction for Mining Biomedical Images
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
A fast visual word frequency - inverse image frequency for detector of rare concepts
CLEF'09 Proceedings of the 10th international conference on Cross-language evaluation forum: multimedia experiments
Hi-index | 0.00 |
In this paper, we propose a three-layer visual information processing architecture for extracting concise non-textual descriptions from visual contents. These coded descriptions capture both local saliencies and spatial configurations present in visual contents via prototypical visual tokens called visual "keywords". Categorization of images and video shots represented by keyframes can be performed by comparing their coded descriptions. We demonstrate our proposed architecture in natural scene image categorization that outperforms methods which use aggregate measures of low-level features.