Automated entry system for printed documents
Pattern Recognition
Pattern Classification (2nd Edition)
Pattern Classification (2nd Edition)
A Fast Algorithm for Bottom-Up Document Layout Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence
DL '97 Proceedings of the second ACM international conference on Digital libraries
Document Representation and Its Application to Page Decomposition
IEEE Transactions on Pattern Analysis and Machine Intelligence
INFORMys: A Flexible Invoice-Like Form-Reader System
IEEE Transactions on Pattern Analysis and Machine Intelligence
Twenty Years of Document Image Analysis in PAMI
IEEE Transactions on Pattern Analysis and Machine Intelligence
Geometric Structure Analysis of Document Images: A Knowledge-Based Approach
IEEE Transactions on Pattern Analysis and Machine Intelligence
Empirical Performance Evaluation Methodology and Its Application to Page Segmentation Algorithms
IEEE Transactions on Pattern Analysis and Machine Intelligence
IEEE Transactions on Pattern Analysis and Machine Intelligence
Text-Line Extraction as Selection of Paths in the Neighbor Graph
DAS '98 Selected Papers from the Third IAPR Workshop on Document Analysis Systems: Theory and Practice
The Segmentation and Identification of Handwriting in Noisy Document Images
DAS '02 Proceedings of the 5th International Workshop on Document Analysis Systems V
Skew detection and correction in document images based on straight-line fitting
Pattern Recognition Letters
Retrieving information from data flow diagrams
WCRE '95 Proceedings of the Second Working Conference on Reverse Engineering
Document Skew Detection Using Minimum-Area Bounding Rectangle
ITCC '00 Proceedings of the The International Conference on Information Technology: Coding and Computing (ITCC'00)
A nearest-neighbor chain based approach to skew estimation in document images
Pattern Recognition Letters
An Approach to Extracting the Target Text Line from a Document Image Captured by a Pen Scanner
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Automated Detection and Segmentation of Table of Contents Page from Document Images
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Improved Nearest Neighbor Based Approach to Accurate Document Skew Estimation
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Gabor Filter Based Multi-class Classifier for Scanned Document Images
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Consensus-Based Table Form Recognition
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Machine Printed Text and Handwriting Identification in Noisy Document Images
IEEE Transactions on Pattern Analysis and Machine Intelligence
Adaptive Hindi OCR using generalized Hausdorff image comparison
ACM Transactions on Asian Language Information Processing (TALIP)
A Framework for Detecting and Selecting Text Line Candidates of Correct Orientation
ICPR '98 Proceedings of the 14th International Conference on Pattern Recognition-Volume 2 - Volume 2
A new algorithm for skew detection and correction
Pattern Recognition Letters
A fast orientation and skew detection algorithm for monochromatic document images
Proceedings of the 2005 ACM symposium on Document engineering
Page Segmentation for Manhattan and Non-Manhattan Layout Documents via Selective CRLA
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
A Generic Method for Determining the Up/Down Orientation of Text in Roman and Non-roman Scripts
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Robust Skew Detection in mixed Text/Graphics Documents
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Skew Estimation for Scanned Documents from "Noises"
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Identifying Script onWord-Level with Informational Confidenc
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Text Extraction from Gray Scale Historical Document Images Using Adaptive Local Connectivity Map
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Text/Graphic labelling of Ancient Printed Documents
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Multi-Level Component Grouping Algorithm and Its Applications
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Structuralizing digital ink for efficient selection
Proceedings of the 11th international conference on Intelligent user interfaces
Combining DOM tree and geometric layout analysis for online medical journal article segmentation
Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries
Recognition of perspectively distorted planar grids
Pattern Recognition Letters
Convex hull based skew estimation
Pattern Recognition
Extracting relevant named entities for automated expense reimbursement
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Document page segmentation using neuro-fuzzy approach
Applied Soft Computing
Character recognition using statistical parameters
SSIP'06 Proceedings of the 6th WSEAS International Conference on Signal, Speech and Image Processing
A word shape coding method for camera-based document images
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Pattern Recognition Letters
Hough transform based fast skew detection and accurate skew correction methods
Pattern Recognition
A Figure Image Processing System
Graphics Recognition. Recent Advances and New Opportunities
The Diagonal Split: A Pre-segmentation Step for Page Layout Analysis and Classification
IbPRIA '09 Proceedings of the 4th Iberian Conference on Pattern Recognition and Image Analysis
Text line segmentation in handwritten documents using Mumford-Shah model
Pattern Recognition
Perspective rectification of document images using fuzzy set and morphological operations
Image and Vision Computing
Page segmentation using texture analysis
Pattern Recognition
Fiducial line based skew estimation
Pattern Recognition
Multi-oriented english text line identification
SCIA'03 Proceedings of the 13th Scandinavian conference on Image analysis
Page frame detection for marginal noise removal from scanned documents
SCIA'07 Proceedings of the 15th Scandinavian conference on Image analysis
Proceedings of the 2010 ACM Symposium on Applied Computing
A new technique for global and local skew correction in binary documents
ACIVS'07 Proceedings of the 9th international conference on Advanced concepts for intelligent vision systems
Decomposing document images by heuristic search
EMMCVPR'07 Proceedings of the 6th international conference on Energy minimization methods in computer vision and pattern recognition
Algorithm of document skew detection based on character vertices
IITA'09 Proceedings of the 3rd international conference on Intelligent information technology application
An adaptive technique for global and local skew correction in color documents
Expert Systems with Applications: An International Journal
Semi-supervised learning for text-line detection
Pattern Recognition Letters
Context-aware and content-based dynamic Voronoi page segmentation
DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
Overlapped text segmentation using Markov random field and aggregation
DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
Associating figures with descriptions for patent documents
DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
The BBN document analysis service: a platform for multilingual document translation
DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
Skew estimation of document images using bagging
IEEE Transactions on Image Processing
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
Movement invariants-based algorithm for medical image tilt correction
International Journal of Automation and Computing
Document seal detection using GHT and character proximity graphs
Pattern Recognition
Document image analysis: issues, comparison of methods and remaining problems
Artificial Intelligence Review
A new algorithm for segmenting warped text-lines in document images
Proceedings of the 2011 ACM Symposium on Applied Computing
Automatic localization of page segmentation errors
Proceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data
Text line segmentation for gray scale historical document images
Proceedings of the 2011 Workshop on Historical Document Imaging and Processing
Ancient printed documents indexation: a new approach
ICAPR'05 Proceedings of the Third international conference on Advances in Pattern Recognition - Volume Part I
Region analysis of business card images acquired in PDA using DCT and information pixel density
ACIVS'05 Proceedings of the 7th international conference on Advanced Concepts for Intelligent Vision Systems
Skew estimation and correction for form documents using wavelet decomposition
ICIAR'05 Proceedings of the Second international conference on Image Analysis and Recognition
Using a boosted tree classifier for text segmentation in hand-annotated documents
Pattern Recognition Letters
Learning segmentation of documents with complex scripts
ICVGIP'06 Proceedings of the 5th Indian conference on Computer Vision, Graphics and Image Processing
ICVGIP'06 Proceedings of the 5th Indian conference on Computer Vision, Graphics and Image Processing
Performance comparison of six algorithms for page segmentation
DAS'06 Proceedings of the 7th international conference on Document Analysis Systems
Arabic bank check analysis and zone extraction
ICIAR'12 Proceedings of the 9th international conference on Image Analysis and Recognition - Volume Part I
Automatic localization and correction of line segmentation errors
Proceeding of the workshop on Document Analysis and Recognition
Margin noise removal from printed document images
Proceeding of the workshop on Document Analysis and Recognition
Multilingual OCR research and applications: an overview
Proceedings of the 4th International Workshop on Multilingual OCR
Text line extraction for historical document images
Pattern Recognition Letters
Hi-index | 0.15 |
Page layout analysis is a document processing technique used to determine the format of a page. This paper describes the document spectrum (or docstrum), which is a method for structural page layout analysis based on bottom-up, nearest-neighbor clustering of page components. The method yields an accurate measure of skew, within-line, and between-line spacings and locates text lines and text blocks. It is advantageous over many other methods in three main ways: independence from skew angle, independence from different text spacings, and the ability to process local regions of different text orientations within the same image. Results of the method shown for several different page formats and for randomly oriented subpages on the same image illustrate the versatility of the method. We also discuss the differences, advantages, and disadvantages of the docstrum with respect to other lay-out methods.