Integrating text retrieval and image retrieval in XML document searching

Authors:
D. Tjondronegoro;J. Zhang;J. Gu;A. Nguyen;S. Geva
Affiliations:
Queensland University of Technology, Brisbane, Australia;Queensland University of Technology, Brisbane, Australia;Queensland University of Technology, Brisbane, Australia;Queensland University of Technology, Brisbane, Australia;Queensland University of Technology, Brisbane, Australia
Venue:
INEX'05 Proceedings of the 4th international conference on Initiative for the Evaluation of XML Retrieval
Year:
2005

Citing 3
Cited 7

Querying and ranking XML documents

Journal of the American Society for Information Science and Technology - XML
Searching structured documents

Information Processing and Management: an International Journal
GPX: gardens point XML information retrieval at INEX 2004

INEX'04 Proceedings of the Third international conference on Initiative for the Evaluation of XML Retrieval

Benchmarking multimedia search in structured collections

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Towards a structure-based multimedia retrieval model

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
XML Multimedia Retrieval: From Relevant Textual Information to Relevant Multimedia Fragments

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Characterizing image properties for digital mammograms

HIKM '09 Proceedings of the Third Australasian Workshop on Health Informatics and Knowledge Management - Volume 97
Using textual and structural context for searching Multimedia Elements

International Journal of Business Intelligence and Data Mining
Contextual image annotation via projection and quantum theory inspired measurement for integration of text and visual features

QI'11 Proceedings of the 5th international conference on Quantum interaction
Using XML logical structure to retrieve (multimedia)

ECDL'07 Proceedings of the 11th European conference on Research and Advanced Technology for Digital Libraries

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many XML documents contain a mixture of text and images. Images play an important role in webpage or article presentation. However, popular Information Retrieval systems still largely depend on pure text retrieval as it is believed that text descriptions including body text and the caption of images contain precise information. On the other hand, images are more attractive and easier to understand than pure text. We assume that if the image content is used in addition to the pure text-based retrieval, the retrieval result should be better than text-only or image-only retrieval. We test this hypothesis by doing a series of experiments using the Lonely Planet XML document collection. Two search engines, an XML document search engine using both content and structure based on text, and a content-based image search engine were used at the same time. The results generated by these two search engines were merged together to form a new result. This paper presents our current work, initial results and vision into future work.