Automated entry system for printed documents
Pattern Recognition
Introduction to statistical pattern recognition (2nd ed.)
Introduction to statistical pattern recognition (2nd ed.)
Text segmentation using Gabor filters for automatic document processing
Machine Vision and Applications - Special issue: document image analysis techniques
Page segmentation and classification
CVGIP: Graphical Models and Image Processing
Automated Evaluation of OCR Zoning
IEEE Transactions on Pattern Analysis and Machine Intelligence
The nature of statistical learning theory
The nature of statistical learning theory
Incorporating Language Syntax in Visual Text Recognition with a Statistical Model
IEEE Transactions on Pattern Analysis and Machine Intelligence
Feature Selection: Evaluation, Application, and Small Sample Performance
IEEE Transactions on Pattern Analysis and Machine Intelligence
Document Representation and Its Application to Page Decomposition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Adaptive confidence transform based classifier combination for Chinese character recognition
Pattern Recognition Letters
Empirical Performance Evaluation Methodology and Its Application to Page Segmentation Algorithms
IEEE Transactions on Pattern Analysis and Machine Intelligence
Machine-printed and hand-written text lines identification
Pattern Recognition Letters
Markov random field modeling in image analysis
Markov random field modeling in image analysis
Parameter-Free Geometric Document Layout Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence
Enhancement and Restoration of Digital Documents: Statistical Design of Nonlinear Algorithms
Enhancement and Restoration of Digital Documents: Statistical Design of Nonlinear Algorithms
Hybrid Contextural Text Recognition with String Matching
IEEE Transactions on Pattern Analysis and Machine Intelligence
The Document Spectrum for Page Layout Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence
Multiscale Segmentation of Unstructured Document Pages Using Soft Decision Integration
IEEE Transactions on Pattern Analysis and Machine Intelligence
Image Categorization Using Texture Features
ICDAR '97 Proceedings of the 4th International Conference on Document Analysis and Recognition
The Segmentation and Identification of Handwriting in Noisy Document Images
DAS '02 Proceedings of the 5th International Workshop on Document Analysis Systems V
A system for machine-written and hand-written character distinction
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 2) - Volume 2
Document Image Quality: Making Fine Discriminations
ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
A Two-State Markov Chain Model of Degraded Document Images
ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
Binarization of Low Quality Text Using a Markov Random Field Model
ICPR '02 Proceedings of the 16 th International Conference on Pattern Recognition (ICPR'02) Volume 3 - Volume 3
Zone Content Classification and its Performance Evaluation
ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Separating Handwritten Material from Machine Printed Text Using Hidden Markov Models
ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Adaptive Segmentation of Document Images
ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Estimation of morphological degradation model parameters
ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 2001. on IEEE International Conference - Volume 03
LIBSVM: A library for support vector machines
ACM Transactions on Intelligent Systems and Technology (TIST)
Financial Document Image Coding with Regions of Interest Using JPEG2000
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Boosting-based Transductive Learning for Text Detection
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Signal Processing
Handwritten Character Distinction Method Inspired by Human Vision Mechanism
Neural Information Processing
Pattern Recognition Methods for Querying and Browsing Technical Documentation
CIARP '08 Proceedings of the 13th Iberoamerican congress on Pattern Recognition: Progress in Pattern Recognition, Image Analysis and Applications
A stroke filter and its application to text localization
Pattern Recognition Letters
Accurate text localization in images based on SVM output scores
Image and Vision Computing
Proceedings of the 2010 ACM Symposium on Applied Computing
Overlapped text segmentation using Markov random field and aggregation
DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
Robust polynomial classifier using L1-norm minimization
Applied Intelligence
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
A system for licence plate recognition using a hierarchically combined classifier
International Journal of Intelligent Systems Technologies and Applications
Integrated Computer-Aided Engineering
Pixel accurate document image content extraction
Proceedings of the 2011 ACM Symposium on Applied Computing
Display text segmentation after learning best-fitted OCR binarization parameters
Expert Systems with Applications: An International Journal
Similarity-based training set acquisition for continuous handwriting recognition
Information Sciences: an International Journal
Using a boosted tree classifier for text segmentation in hand-annotated documents
Pattern Recognition Letters
Proceedings of the International Conference on Advances in Computing, Communications and Informatics
An algorithm for accuracy enhancement of license plate recognition
Journal of Computer and System Sciences
Hi-index | 0.15 |
Abstract--In this paper, we address the problem of the identification of text in noisy document images. We are especially focused on segmenting and identifying between handwriting and machine printed text because: 1) Handwriting in a document often indicates corrections, additions, or other supplemental information that should be treated differently from the main content and 2) the segmentation and recognition techniques requested for machine printed and handwritten text are significantly different. A novel aspect of our approach is that we treat noise as a separate class and model noise based on selected features. Trained Fisher classifiers are used to identify machine printed text and handwriting from noise and we further exploit context to refine the classification. A Markov Random Field-based (MRF) approach is used to model the geometrical structure of the printed text, handwriting, and noise to rectify misclassifications. Experimental results show that our approach is robust and can significantly improve page segmentation in noisy document collections.