Natural-language retrieval of images based on descriptive captions

  • Authors:
  • Eugene J. Guglielmo;Neil C. Rowe

  • Affiliations:
  • Monterey Bay Aquarium Research Institute, Moss Landing, CA;Naval Postgraduate School, Monterey, CA

  • Venue:
  • ACM Transactions on Information Systems (TOIS)
  • Year:
  • 1996

Quantified Score

Hi-index 0.00

Visualization

Abstract

We describe a prototype intelligent information retrieval system that uses natural-language understanding to efficiently locate captioned data. Multimedia data generally require captions to explain their features and significance. Such descriptive captions often rely on long nominal compounds (strings of consecutive nouns) which create problems of disambiguating word sence. In our system, captions and user queries are parsed and interpreted to produce a logical form using a detailed theory of the meaning of nominal compounds. A fine-grain match can then compare the logical form of the query to the logical forms for each caption. To improve system efficiency, we first perform a coarse-grain match with index files, using nouns and verbs extracted from the query. Our experiments with randomly selected queries and captions from an existing image library show an increase of 30% in precision and 50% in recall over the keyphrase approach currently used. Our processing times have a median of seven seconds as compared to eight minutes for the existing system, and our system is much easier to use.