Proceedings of the 2012 ACM symposium on Document engineering
Most existing human language technology (HLT) pipelines assume that the input is pure text or, at most, HTML, and either ignore the logical structure of the document or remove it. We argue that identifying the structure of documents is essential in digital library and other types of applications, and show that it is relatively straightforward to extend existing pipelines so that the structure of a document is preserved.