IEEE Transactions on Pattern Analysis and Machine Intelligence
Evaluation of model-based retrieval effectiveness with OCR text
ACM Transactions on Information Systems (TOIS)
Effects of OCR errors on ranking and feedback using the vector space model
Information Processing and Management: an International Journal
Large-Scale Simulation Studies in Image Pattern Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Automatic subject indexing using an associative neural network
Proceedings of the third ACM conference on Digital libraries
The indexing and retrieval of document images: a survey
Computer Vision and Image Understanding - Special issue on document image understanding and retrieval
The anatomy of a large-scale hypertextual Web search engine
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Proceedings of the third annual conference on Autonomous Agents
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
A Survey on Content-Based Retrieval for Multimedia Databases
IEEE Transactions on Knowledge and Data Engineering
ICDAR '97 Proceedings of the 4th International Conference on Document Analysis and Recognition
Robust Retrieval of Noisy Text
ADL '96 Proceedings of the 3rd International Forum on Research and Technology Advances in Digital Libraries
ICPR '02 Proceedings of the 16 th International Conference on Pattern Recognition (ICPR'02) Volume 1 - Volume 1
Document Image Decoding Using Iterated Complete Path Search with Subsampled Heuristic Scoring
ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Style consistency in pattern fields
Style consistency in pattern fields
Cut-and-paste text summarization
Cut-and-paste text summarization
Training on Severely Degraded Text-Line Images
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Features for Word Spotting in Historical Manuscripts
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Document Transformation System from Papers to XML Data Based on Pivot XML Document Method
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Adaptive multilingual sentence boundary disambiguation
Computational Linguistics
A maximum entropy approach to identifying sentence boundaries
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Journal of the American Society for Information Science and Technology - Intelligence and Security Informatics
Some applications of tree-based modelling to speech and language
HLT '89 Proceedings of the workshop on Speech and Natural Language
Decoding of text lines in grayscale document images
ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 2001. on IEEE International Conference - Volume 03
Color Text Extraction from Camera-based Images the Impact of the Choice of the Clustering Distance
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Automatic categorization of figures in scientific documents
Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries
Logical document conversion: combining functional and formal knowledge
Proceedings of the 2007 ACM symposium on Document engineering
Digitizing a million books: challenges for document analysis
DAS'06 Proceedings of the 7th international conference on Document Analysis Systems
Hi-index | 0.00 |
No existing document image understanding technology, whether experimental or commercially available, can guarantee high accuracy across the full range of documents of interest to industrial and government agency users. Ideally, users should be able to search, access, examine, and navigate among document images as effectively as they can among encoded data files, using familiar interfaces and tools as fully as possible. We are investigating novel algorithms and software tools at the frontiers of document image analysis, information retrieval, text mining, and visualization that will assist in the full integration of such documents into collections of textual document images as well as "born digital" documents. Our approaches emphasize versatility first: that is, methods which work reliably across the broadest possible range of documents.