Intelligent Indexing and Semantic Retrieval of Multimodal Documents

  • Authors:
  • Rohini K. Srihari, Zhongfei Zhang, Aibing Rao

  • Affiliations:
  • Center of Excellence for Document Analysis and Recognition (CEDAR), UB Commons, 520 Lee Entrance, Suite 202, State University of New York at Buffalo, Buffalo, NY 14228-2583, USA. rohini@cedar.buffalo.edu, zhongfei@cedar.buffalo.edu, arao@cedar.buffalo.edu

  • Venue:
  • Information Retrieval
  • Year:
  • 2000

Abstract

Finding useful information in large multimodal document collections such as the WWW without encountering numerous false positives poses a challenge to multimedia information retrieval (MMIR) systems. This research addresses the problem of finding pictures. It exploits the fact that images do not appear in isolation, but rather with accompanying, collateral text. Taken independently, existing techniques for picture retrieval using (i) text-based and (ii) image-based methods have several limitations. This research presents a general model for multimodal information retrieval that addresses the following issues: (i) users' information need, (ii) expressing information need through composite, multimodal queries, and (iii) determining the most appropriate weighted combination of indexing techniques in order to best satisfy information need. A machine learning approach is proposed for the latter. The focus is on improving precision and recall in an MMIR system by optimally combining text and image similarity. Experiments are presented that demonstrate the utility of individual indexing systems in improving overall average precision.
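
The abstract's central technical step, learning a weighted combination of text and image similarity, can be illustrated with a small sketch. The code below is not the authors' implementation; the linear combination, the grid search over a single weight, and the toy data are illustrative assumptions showing how such a weight could be tuned to maximize average precision over a set of labeled queries.

    import numpy as np

    def average_precision(relevant, scores):
        """Average precision of the ranking induced by descending scores.
        `relevant` is a boolean array; `scores` are combined similarities."""
        order = np.argsort(-scores)
        rel = relevant[order]
        hits = np.cumsum(rel)
        ranks = np.arange(1, len(rel) + 1)
        precisions = hits / ranks
        return precisions[rel].mean() if rel.any() else 0.0

    def combined_scores(text_sim, image_sim, w):
        """Weighted combination of per-document text and image similarity."""
        return w * text_sim + (1.0 - w) * image_sim

    def learn_weight(queries, grid=np.linspace(0.0, 1.0, 101)):
        """Pick the weight w maximizing mean average precision over
        `queries`, a list of (text_sim, image_sim, relevant) triples."""
        def mean_ap(w):
            return np.mean([
                average_precision(rel, combined_scores(t, i, w))
                for t, i, rel in queries
            ])
        return max(grid, key=mean_ap)

    # Toy usage: two hypothetical queries over a five-document collection.
    rng = np.random.default_rng(0)
    queries = [
        (rng.random(5), rng.random(5), rng.random(5) > 0.5)
        for _ in range(2)
    ]
    w = learn_weight(queries)
    print(f"learned text weight: {w:.2f}")

A single scalar weight is the simplest instance of the weighted-combination idea; the paper's model generalizes this to a weighted combination over multiple indexing techniques, with the weights chosen to best satisfy the user's information need.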