A user study to investigate semantically relevant contextual information of WWW images

  • Authors:
  • Fariza Fauzi; Mohammed Belkhatir

  • Affiliations:
  • Faculty of IT, Monash University, University Street, 46150 Sunway, Malaysia; Lyon Institute of Technology, Université de Lyon I, Campus de la Doua, France

  • Venue:
  • International Journal of Human-Computer Studies
  • Year:
  • 2010

Abstract

The contextual information of Web images is investigated to enrich their index characterizations with semantic descriptors and thereby bridge the semantic gap (i.e. the gap between the low-level content-based description of images and their semantic interpretation). Although we are highly motivated by the availability of rich knowledge on the Web and the relative success achieved by commercial search engines in indexing images using the surrounding text-based information in webpages, we are aware that the unpredictable quality of this surrounding text is a major limiting factor. To improve its quality, we identify contextual information that is relevant to the semantic characterization of Web images and study its statistical properties in terms of location and nature, using a classification into five semantic concept classes: signal, object, scene, abstract and relational. A user study is conducted to validate the results. The results suggest that several locations consistently contain textual information relevant to the image. The importance of each location is influenced by the type of webpage, as the distribution of relevant contextual information across locations differs between webpage types. The most frequently found semantic concept classes are object and abstract. Another important outcome of the user study is that a webpage is not an atomic unit and can be further partitioned into smaller segments. Segments containing images are of particular interest and are termed image segments. We observe that users typically single out the textual information they consider relevant to an image from the text bounded within its image segment. Hence, our second contribution is a DOM Tree-based webpage segmentation algorithm that automatically partitions webpages into image segments. We use the resulting human-labeled dataset to validate the effectiveness of our segmentation method, and experiments demonstrate that it outperforms an existing segmentation algorithm.
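To make the notion of an image segment concrete, the following is a minimal sketch of DOM Tree-based partitioning: each image is grouped with the text bounded by its nearest block-level ancestor, and that bounded text (together with the alt attribute) serves as candidate contextual information. This is not the paper's algorithm; the block-tag heuristic, function names, and the BeautifulSoup-based implementation are assumptions made purely for illustration.

```python
# Hypothetical sketch of DOM-tree-based image segmentation (not the authors' method).
from bs4 import BeautifulSoup

# Assumed heuristic: these tags delimit an "image segment".
BLOCK_TAGS = {"div", "section", "article", "figure", "td", "li"}

def image_segments(html):
    """Return (img_src, contextual_text) pairs, one per image in the page."""
    soup = BeautifulSoup(html, "html.parser")
    segments = []
    for img in soup.find_all("img"):
        # Climb to the nearest block-level ancestor bounding the image.
        container = img
        for ancestor in img.parents:
            if ancestor.name in BLOCK_TAGS:
                container = ancestor
                break
        # Candidate contextual text: alt attribute plus the segment's visible text.
        alt = img.get("alt", "").strip()
        text = " ".join(container.get_text(separator=" ").split())
        segments.append((img.get("src", ""), (alt + " " + text).strip()))
    return segments

if __name__ == "__main__":
    page = "<div><img src='cat.jpg' alt='a cat'><p>A tabby cat on a sofa.</p></div>"
    for src, ctx in image_segments(page):
        print(src, "->", ctx)
```

In this sketch the segment boundary is purely tag-based; the paper's DOM Tree-based algorithm and the user-study findings on relevant text locations would inform a more refined boundary and weighting of the extracted text.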