A Database for Handwritten Text Recognition Research
IEEE Transactions on Pattern Analysis and Machine Intelligence
Symbol Recognition: Current Advances and Perspectives
GREC '01 Selected Papers from the Fourth International Workshop on Graphics Recognition Algorithms and Applications
The IRESTE On/Off (IRONOFF) Dual Handwriting Database
ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
Structure in On-line Documents
ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Recognition of Cursive Roman Handwriting - Past, Present and Future
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
A survey of table recognition: Models, observations, transformations, and inferences
International Journal on Document Analysis and Recognition
Distinguishing Text from Graphics in On-Line Handwritten Ink
IWFHR '04 Proceedings of the Ninth International Workshop on Frontiers in Handwriting Recognition
IAM-OnDB - an On-Line English Sentence Database Acquired from Handwritten Text on a Whiteboard
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
UPX: A New XML Representation for Annotated Datasets of Online Handwriting Data
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Pixel-Accurate Representation and Evaluation of Page Segmentation in Document Images
ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 01
Hi-index | 0.00 |
In this paper we present a new database of online handwritten documents with different contents such as text, drawings, diagrams, formulas, tables, lists, and markings. It was designed to serve as a standard dataset for the development, training, testing and comparison of methods in the field of handwritten document analysis. The database can serve as a basis for layout analysis, and different segmentation and recognition tasks considering online or just offline information. Its size is 1,000 documents produced by approximately 200 writers including a total of 329,849 online strokes. Few constraints were imposed on the writers when creating the documents. Nonetheless, the database has a stable distribution of the different content types. A software tool was developed to allow easy access to the documents which are stored in InkML. In this paper we also present two experiments which show the challenge this database poses. They may figure as references for further research in this area.