On the Detection of Dominant Points on Digital Curves
IEEE Transactions on Pattern Analysis and Machine Intelligence
Key concepts in model selection: performance and generalizability
Journal of Mathematical Psychology
Recognition of Handwritten Cursive Arabic Characters
IEEE Transactions on Pattern Analysis and Machine Intelligence
Indexing Hierarchical Structures Using Graph Spectra
IEEE Transactions on Pattern Analysis and Machine Intelligence
Offline Arabic Handwriting Recognition: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence
Strategies for Large Handwritten Farsi/Arabic Lexicon Reduction
ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 01
Shape Code Based Lexicon Reduction for Offline Handwritten Word Recognition
DAS '08 Proceedings of the 2008 The Eighth IAPR International Workshop on Document Analysis Systems
Combining Slanted-Frame Classifiers for Improved HMM-Based Arabic Handwriting Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Using topic models for OCR correction
International Journal on Document Analysis and Recognition - Special Issue NOISY
A multi-scale framework for adaptive binarization of degraded document images
Pattern Recognition
IBN SINA: a database for research on processing and understanding of Arabic manuscripts images
DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
Holistic Urdu Handwritten Word Recognition Using Support Vector Machine
ICPR '10 Proceedings of the 2010 20th International Conference on Pattern Recognition
A Novel Lexicon Reduction Method for Arabic Handwriting Recognition
ICPR '10 Proceedings of the 2010 20th International Conference on Pattern Recognition
Shape Analysis of Elastic Curves in Euclidean Spaces
IEEE Transactions on Pattern Analysis and Machine Intelligence
Hi-index | 0.00 |
Automatic recognition of Arabic words is a challenging task and its complexity increases as the lexicon grows. In pre-modern documents, the vocabulary is unconstrained; therefore a lexicon-reduction strategy is needed to reduce the recognition computational complexity. This paper proposes a novel lexicon-reduction method for Arabic subwords based on their shapes' topology and geometry. First the sub-word shape's topological and geometrical information is extracted from its skeleton and encoded into a graph. Then the graph is converted into a topological signature vector (TSV) which preserves the graph structure. The lexicon is reduced based on the TSV distance between the lexicon sub-words' shapes and a query shape, by keeping the i nearest subwords. The value of i is selected according to a predetermined lexicon-reduction accuracy. The proposed framework has been tested on a database of pre-modern Arabic subword shapes with promising results.