This paper is about the reproduction of ancient texts with vectorised fonts. Whereas OCR is judged only by its recognition rate, a reproduction process does not necessarily require characters to be recognised at all. Our system aims to extract all characters from printed historic documents without using any knowledge of language, font, or writing system. It searches for the best prototypes and creates a document-specific font from these glyphs. For this goal, many common OCR preprocessing steps are no longer adequate. We describe the necessary changes to our system, which deals particularly with documents typeset in Fraktur. On the one hand, we describe algorithms that extract glyphs accurately enough for precise reproduction. On the other hand, we present classification results for the extracted Fraktur glyphs using different shape descriptors.