Historical records of daily activities provide intriguing insights into the lives of our ancestors and are useful for demographic studies and genealogical research. Automatic processing of historical documents, however, has mostly focused on single works of literature and less on social records, which tend to have a distinct layout, structure, and vocabulary. Such information is usually collected by expert demographers who devote a lot of time to transcribing the records manually. This paper presents a new database, compiled from a collection of marriage license books, to support research in automatic handwriting recognition for historical documents containing social records. Marriage license books are documents that were used for centuries by ecclesiastical institutions to register marriage licenses. Books from this collection are handwritten and span nearly half a millennium, up to the beginning of the 20th century. In addition, a study is presented of the capability of state-of-the-art handwritten text recognition systems when applied to the presented database. Baseline results are reported for reference in future studies.