Unifying textual and visual cues for content-based image retrieval on the World Wide Web
Computer Vision and Image Understanding - Special issue on content-based access for image and video libraries
Marie-4: A High-Recall, Self-Improving Web Crawler That Finds Images Using Captions
IEEE Intelligent Systems
Hi-index | 0.01 |
We have developed a tool MARIE-4 for building virtual libraries of multimedia (images, video, and audio) by automatically exploring (crawling) a specified subdomain of the World Wide Web to create an index based on caption keywords. Our approach uses carefully-researched criteria to identify and rate caption text, and employs both an expert system and a neural network. We have used it to create a keyword-based interface to nearly all nontrivial captioned publicly-accessible U.S. Navy images (667,573), video (8,290), and audio (2,499), called the Navy Virtual Multimedia Library (NAVMULIB).