Man-machine interface issues in the construction and use of an expert system
International Journal of Man-Machine Studies - Ellis Horwood series in artificial intelligence
Pattern recognition: human and mechanical
Pattern recognition: human and mechanical
Whether software engineering needs to be artificially intelligent
IEEE Transactions on Software Engineering
Digital image processing (2nd ed.)
Digital image processing (2nd ed.)
International Journal of Man-Machine Studies - Knowledge acquisition for knowledge-based systems, part 1. Based on an AAAI work
International Journal of Man-Machine Studies - Knowledge acquisition for knowledge-based systems, part 1. Based on an AAAI work
An overview of knowledge-acquisition and transfer
International Journal of Man-Machine Studies
Pattern Recognition
Recognizing address blocks on mail pieces
AI Magazine
Perspectives on imperfect information processing
IEEE Transactions on Systems, Man and Cybernetics
Knowledge elicitation using discourse analysis
International Journal of Man-Machine Studies - Special Issue: Knowledge Acquisition for Knowledge-based Systems. Part 4
International Journal of Man-Machine Studies - Special Issue: Knowledge Acquisition for Knowledge-based Systems. Part 4
ASTEK: A multi-paradigm knowledge acquisition tool for complex structured knowledge
International Journal of Man-Machine Studies
Acquiring strategic knowledge from experts
International Journal of Man-Machine Studies
A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images
IEEE Transactions on Pattern Analysis and Machine Intelligence
Two complementary techniques for digitized document analysis
DOCPROCS '88 Proceedings of the ACM conference on Document processing systems
Tracking text in mixed-mode documents
DOCPROCS '88 Proceedings of the ACM conference on Document processing systems
Classification of newspaper image blocks using texture analysis
Computer Vision, Graphics, and Image Processing
A survey of knowledge acquisition techniques and tools
Knowledge Acquisition
A review of segmentation and contextual analysis techniques for text recognition
Pattern Recognition
Preprocessing and presorting of envelope images for automatic sorting using OCR
Pattern Recognition
Automated entry system for printed documents
Pattern Recognition
IBM Systems Journal
Preliminary investigation of techniques for automated reading of unformatted text
Communications of the ACM
Algorithms for Graphics and Imag
Algorithms for Graphics and Imag
Digital Document Processing
Knowledge Acquisition for Knowledge-Based Systems
Knowledge Acquisition for Knowledge-Based Systems
A Foreword to Knowledge and Data Engineering
IEEE Transactions on Knowledge and Data Engineering
Knowledge and Data Engineering
IEEE Transactions on Knowledge and Data Engineering
PM: A System to Support the Automatic Acquisition of Programming Knowledge
IEEE Transactions on Knowledge and Data Engineering
Merkmale für die Segmentation von Dokumenten zur automatischen Textverarbeitung
Modelle und Strukturen, DAGM Symposium
A Fast Algorithm for Bottom-Up Document Layout Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence
Multiresolution Analysis in Extraction of Reference Lines from Documents with Gray Level Background
IEEE Transactions on Pattern Analysis and Machine Intelligence
INFORMys: A Flexible Invoice-Like Form-Reader System
IEEE Transactions on Pattern Analysis and Machine Intelligence
A hierarchy-aware approach to faceted classification of objected-oriented components
ACM Transactions on Software Engineering and Methodology (TOSEM)
Corrigenda: a hierarchy-aware approach to faceted classification of object-oriented components
ACM Transactions on Software Engineering and Methodology (TOSEM)
Machine Learning for Intelligent Processing of Printed Documents
Journal of Intelligent Information Systems - Special issue on methodologies for intelligent information systems
IEEE Transactions on Knowledge and Data Engineering
Symbolic Learning Techniques in Paper Document Processing
MLDM '99 Proceedings of the First International Workshop on Machine Learning and Data Mining in Pattern Recognition
A Layout-Free Method for Extracting Elements from Document Images
DAS '98 Selected Papers from the Third IAPR Workshop on Document Analysis Systems: Theory and Practice
Information Retrieval in Document Image Databases
IEEE Transactions on Knowledge and Data Engineering
Rule identification using ontology while acquiring rules from Web pages
International Journal of Human-Computer Studies
RELATIONAL DATA MINING AND ILP FOR DOCUMENT IMAGE UNDERSTANDING
Applied Artificial Intelligence
Feature string-based intelligent information retrieval from Tamil document images
International Journal of Computer Applications in Technology
Text line segmentation in handwritten documents using Mumford-Shah model
Pattern Recognition
Multi-page document analysis based on format consistency and clustering
International Journal of Computer Applications in Technology
Template-based information mining from HTML documents
AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
FAETON: Form Analysis and Extraction Tool for ONtology construction
International Journal of Computer Applications in Technology
Structure extraction from PDF-based book documents
Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
Rule-based personalized comparison shopping including delivery cost
Electronic Commerce Research and Applications
KSEM'06 Proceedings of the First international conference on Knowledge Science, Engineering and Management
Automatically structuring domain knowledge from text: An overview of current research
Information Processing and Management: an International Journal
Hi-index | 0.01 |
The knowledge acquisition bottleneck has become the major impediment to the development and application of effective information systems. To remove this bottleneck, new document processing techniques must be introduced to automatically acquire knowledge from various types of documents. By presenting a survey on the techniques and problems involved, this paper aims at serving as a catalyst to stimulate research in automatic knowledge acquisition through document processing. In this study, a document is considered to have two structures: geometric structure and logical structure. These play a key role in the process of the knowledge acquisition, which can be viewed as a process of acquiring the above structures. Extracting the geometric structure from a document refers to document analysis; mapping the geometric structure into logical structure is regarded as document understanding. Both areas are described in this paper, and the basic concept of document structure and its measurement based on entropy analysis is introduced. Logical structure and geometric models are proposed. Both top-down and bottom-up approaches and their entropy analyses are presented. Different techniques are discussed with practical examples. Mapping methods, such as tree transformation, document formatting knowledge and document format description language, are described.