A guided tour to approximate string matching
ACM Computing Surveys (CSUR)
Word Spotting: A New Approach to Indexing Handwriting
CVPR '96 Proceedings of the 1996 Conference on Computer Vision and Pattern Recognition (CVPR '96)
Transcript Mapping for Historic Handwritten Document Images
IWFHR '02 Proceedings of the Eighth International Workshop on Frontiers in Handwriting Recognition (IWFHR'02)
Features for Word Spotting in Historical Manuscripts
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Text Alignment with Handwritten Documents
DIAL '04 Proceedings of the First International Workshop on Document Image Analysis for Libraries (DIAL'04)
Holistic Word Recognition for Handwritten Historical Documents
DIAL '04 Proceedings of the First International Workshop on Document Image Analysis for Libraries (DIAL'04)
A Segmentation-free Approach for Keyword Search in Historical Typewritten Documents
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
An old greek handwritten OCR system based on an efficient segmentation-free approach
International Journal on Document Analysis and Recognition
Keyword-guided word spotting in historical printed documents using synthetic data and user feedback
International Journal on Document Analysis and Recognition
Further explorations in text alignment with handwritten documents
International Journal on Document Analysis and Recognition
Seam carving for content-aware image resizing
ACM SIGGRAPH 2007 papers
Combining Alignment Results for Historical Handwritten Document Analysis
ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Segmentation-free Word Spotting in Historical Printed Documents
ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Language-Independent Text Lines Extraction Using Seam Carving
ICDAR '11 Proceedings of the 2011 International Conference on Document Analysis and Recognition
Aligning transcripts to automatically segmented handwritten manuscripts
DAS'06 Proceedings of the 7th international conference on Document Analysis Systems
Robust text and drawing segmentation algorithm for historical documents
Proceedings of the 2nd International Workshop on Historical Document Imaging and Processing
Text line extraction for historical document images
Pattern Recognition Letters
Hi-index | 0.00 |
This work aims to simplify the tiresome manual comparison of two similar Arabic historical manuscripts. We developed a system that determines the difference between two manuscripts by comparing their components, while ignoring page breaks and different warping among consecutive rows; i.e., we treat each manuscript as one long row of components. We compare two components (blocks of pixels) by extracting features from the columns of their bounding rectangles. We adopted the edit distance, which is computed using dynamic time warping (DTW) on the feature domain, to measure similarity between components. The user selects the region to align in two manuscripts and the system return its alignment with visual clues that indicate the distance between the aligned components. In our current implementation, our system provides good results and requires less interaction for manuscripts at good quality that do not include touching components. We tested our system on different Arabic manuscripts of various qualities and received encouraging results.