An Investigation of Mixed-Media Information Retrieval

  • Authors:
  • Gareth J. F. Jones; Adenike M. Lam-Adesina

  • Venue:
  • ECDL '02 Proceedings of the 6th European Conference on Research and Advanced Technology for Digital Libraries
  • Year:
  • 2002

Abstract

Digital document archives are increasingly derived from a variety of media sources. At present such archives are stored and searched independently. The Information Retrieval from Mixed-Media Collections (IRMMC) project is investigating retrieval from combined document collections composed of items originating from differing media forms. An experimental investigation of a "mixed-media" retrieval task, based on the existing TREC Spoken Document Retrieval task and combining text, spoken, and scanned-image documents, is described. Results show that non-text media perform well within the mixed-media collection. Also, while pseudo relevance feedback is extremely effective for spoken documents, its behaviour for document image retrieval is more complex.