Computer Vision, Graphics, and Image Processing
A Survey of Methods and Strategies in Character Segmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Word spotting: indexing handwritten manuscripts
Intelligent multimedia information retrieval
An Off-Line Cursive Handwriting Recognition System
IEEE Transactions on Pattern Analysis and Machine Intelligence
Scale-Space Theory in Computer Vision
Scale-Space Theory in Computer Vision
Document page decomposition by the bounding-box project
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 2) - Volume 2
Gap metrics for word separation in handwritten lines
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 1) - Volume 1
Segmentation of the Date in Entries of Historical Church Registers
Proceedings of the 24th DAGM Symposium on Pattern Recognition
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Features for Word Spotting in Historical Manuscripts
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Tree Structure forWord Extraction from Handwritten Text Lines
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Text Extraction from Gray Scale Historical Document Images Using Adaptive Local Connectivity Map
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
A line-based representation for matching words in historical manuscripts
Pattern Recognition Letters
Integrated Computer-Aided Engineering
Retrieval of chinese calligraphic character image
PCM'04 Proceedings of the 5th Pacific Rim conference on Advances in Multimedia Information Processing - Volume Part I
Aligning transcripts to automatically segmented handwritten manuscripts
DAS'06 Proceedings of the 7th international conference on Document Analysis Systems
Natural language inspired approach for handwritten text line detection in legacy documents
LaTeCH '12 Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
An autonomous and intelligent expert system for residential water end-use classification
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
Indexing large archives of historical manuscripts, like the papers of George Washington, is required to allow rapid perusal by scholars and researchers who wish to consult the original manuscripts. Presently, such large archives are indexed manually. Since optical character recognition (OCR) works poorly with handwriting, a scheme based on matching word images called word spotting has been suggested previously for indexing such documents. The important steps in this scheme are segmentation of a document page into words and creation of lists containing instances of the same word by word image matching. We have developed a novel methodology for segmenting handwritten document images by analyzing the extent of "blobs" in a scale space representationof the image. We believe this is the first application of scale space to this problem. The algorithm has been applied to around 30 grey level images randomly picked from Different sections of the George Washington corpus of 6,400 handwritten document images. An accuracy of 77-96 percent was observed with an average accuracy of around 87 percent. The algorithm works well in the presence of noise, shine through and other artifacts which may arise due aging and degradation of the page over a couple of centuries or through the man made processes of photocopying and scanning.