Segmentation of page images using the area Voronoi diagram
Computer Vision and Image Understanding - Special issue on document image understanding and retrieval
Empirical Performance Evaluation Methodology and Its Application to Page Segmentation Algorithms
IEEE Transactions on Pattern Analysis and Machine Intelligence
Performance evaluation of document structure extraction algorithms
Computer Vision and Image Understanding - Special issue on empirical evaluation of computer vision algorithms
The Document Spectrum for Page Layout Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence
Two Geometric Algorithms for Layout Analysis
DAS '02 Proceedings of the 5th International Workshop on Document Analysis Systems V
Ground-truthing and benchmarking document page segmentation
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 2) - Volume 2
ICDAR 2003 Page Segmentation Competition
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
IBM Journal of Research and Development
Logical document conversion: combining functional and formal knowledge
Proceedings of the 2007 ACM symposium on Document engineering
The Diagonal Split: A Pre-segmentation Step for Page Layout Analysis and Classification
IbPRIA '09 Proceedings of the 4th Iberian Conference on Pattern Recognition and Image Analysis
Page frame detection for marginal noise removal from scanned documents
SCIA'07 Proceedings of the 15th Scandinavian conference on Image analysis
Context-aware and content-based dynamic Voronoi page segmentation
DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
Genre classification in automated ingest and appraisal metadata
ECDL'06 Proceedings of the 10th European conference on Research and Advanced Technology for Digital Libraries
Learning segmentation of documents with complex scripts
ICVGIP'06 Proceedings of the 5th Indian conference on Computer Vision, Graphics and Image Processing
Hi-index | 0.00 |
This paper presents a quantitative comparison of six algorithms for page segmentation: X-Y cut, smearing, whitespace analysis, constrained text-line finding, Docstrum, and Voronoi-diagram-based. The evaluation is performed using a subset of the UW-III collection commonly used for evaluation, with a separate training set for parameter optimization. We compare the results using both default parameters and optimized parameters. In the course of the evaluation, the strengths and weaknesses of each algorithm are analyzed, and it is shown that no single algorithm outperforms all other algorithms. However, we observe that the three best-performing algorithms are those based on constrained text-line finding, Docstrum, and the Voronoi-diagram.