Web image indexing by using associated texts

Authors:
Zhiguo Gong;Leong Hou U.;Chan Wa Cheang
Affiliations:
Faculty of Science and Technology, University of Macau, Macao, P.R. China;Faculty of Science and Technology, University of Macau, Macao, P.R. China;Faculty of Science and Technology, University of Macau, Macao, P.R. China
Venue:
Knowledge and Information Systems
Year:
2006

Citing 0
Cited 7

A language-independent, open-vocabulary system based on HMMs for recognition of ultra low resolution words

Proceedings of the 2008 ACM symposium on Applied computing
Webpage segmentation for extracting images and their surrounding contextual information

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Improving keyword based web image search with visual feature distribution and term expansion

Knowledge and Information Systems
A Conceptual Model for Publishing Multimedia Content on the Semantic Web

SAMT '09 Proceedings of the 4th International Conference on Semantic and Digital Media Technologies: Semantic Multimedia
A user study to investigate semantically relevant contextual information of WWW images

International Journal of Human-Computer Studies
Going beyond the surrounding text to semantically annotate and search digital images

ACIIDS'10 Proceedings of the Second international conference on Intelligent information and database systems: Part I
Slash-based relevance propagation model for topic distillation

Journal of Web Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

In order to index Web images, the whole associated texts are partitioned into a sequence of text blocks, then the local relevance of a term to the corresponding image is calculated with respect to both its local occurrence in the block and the distance of the block to the image. Thus, the overall relevance of a term is determined as the sum of all its local weight values multiplied by the corresponding distance factors of the text blocks. In the present approach, the associated text of a Web image is firstly partitioned into three parts, including a page-oriented text (TM), a link-oriented text (LT), and a caption-oriented text (BT). Since the big size and semantic divergence, the caption-oriented text is further partitioned into finer blocks based on the tree structure of the tag elements within the BT text. During the processing, all heading nodes are pulled up in order to correlate with their semantic scopes, and a collapse algorithm is also exploited to remove the empty blocks. In our system, the relevant factors of the text blocks are determined by using a greedy Two-Way-Merging algorithm.