Pattern Classification (2nd Edition)
Pattern Classification (2nd Edition)
Tree Structure forWord Extraction from Handwritten Text Lines
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Word Extraction from On-Line Handwritten Text Lines
ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 02
Handwriting Segmentation Contest
ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 02
Text line and word segmentation of handwritten documents
Pattern Recognition
Handwritten document image segmentation into text lines and words
Pattern Recognition
ICDAR 2009 Handwriting Segmentation Contest
ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Hi-index | 0.00 |
In this paper, we propose a system for Arabic online word extraction from handwritten text lines, a problem addressed for the first time for Arabic language as there is no public dataset of Arabic online handwritten texts available so far. We collected a dataset of unconstrained online handwritten sentences and used it to design and evaluate our system. First, our system classifies the white gaps between words connected components into either intra-word or interword gap according to some local and global online features extracted from each gap together with the groups of strokes encompassing the gap. The classifier is a polynomial kernel support vector machine (SVM) which decisions are used for initial word extraction. A post stage is added to the system to test the extracted words for under-segmentation and resolve this undersegmentation by reconsidering the gap type decisions for the stuck word. Classifiers decision fusion takes place by consulting five different classifiers (four SVM and a radial basis function neural network 'RBF NN') and feeding their decisions to a separate pre-trained SVM to make the final decision. Most stuck words are correctly detected and a lot of them have been correctly resolved. The post stage leads to remarkable error reduction compared to single classifiers performance. Promising results are achieved regarding the fact that the unconstrained Arabic handwriting nature adds more difficulties to the problem.