The Bleek and Lloyd collection contains 19th-century handwritten notebooks that document the language and culture of the |Xam-speaking people who lived in Southern Africa. Transcriptions of the text would improve access to this rich data; however, the complex diacritics used in the notebooks complicate transcription. Machine learning techniques could perform this transcription, but it is not known which techniques would produce the best results. This paper therefore reports on a comparison of three popular techniques applied to this problem: artificial neural networks (ANN), hidden Markov models (HMM), and support vector machines (SVM). An SVM-based classifier using histograms of oriented gradients as features achieved the best word recognition accuracy, 58.4%. Most feature extraction parameters did not have a large effect on recognition accuracy, and the SVM-based recognisers outperformed both the ANN- and HMM-based recognisers.
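As a rough illustration of the kind of feature named above, the sketch below computes a histogram of gradient orientations for a single image cell in plain Python. It is a simplified stand-in, not the feature extractor used in the paper: the function name `orientation_histogram`, the choice of 9 unsigned orientation bins, the central-difference gradients, and the L2 normalisation are all assumptions made for the example, and a full histogram-of-oriented-gradients descriptor would also tile the image into cells and normalise over overlapping blocks before passing the vector to a classifier such as an SVM.

```python
import math


def orientation_histogram(image, bins=9):
    """Histogram of unsigned gradient orientations for one image cell.

    `image` is a list of equal-length rows of grey-level values.
    Each interior pixel votes for the bin of its gradient direction
    (0 to 180 degrees), weighted by its gradient magnitude.
    """
    height, width = len(image), len(image[0])
    hist = [0.0] * bins
    bin_width = 180.0 / bins

    for y in range(1, height - 1):
        for x in range(1, width - 1):
            # Central differences approximate the image gradient.
            gx = image[y][x + 1] - image[y][x - 1]
            gy = image[y + 1][x] - image[y - 1][x]
            magnitude = math.hypot(gx, gy)
            if magnitude == 0:
                continue  # flat region: no orientation to vote for
            # Fold the direction into [0, 180) so opposite gradients share a bin.
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            index = min(int(angle / bin_width), bins - 1)
            hist[index] += magnitude

    # L2-normalise so the descriptor is less sensitive to overall contrast.
    norm = math.sqrt(sum(v * v for v in hist))
    return [v / norm for v in hist] if norm > 0 else hist


# A vertical edge has purely horizontal gradients, so all votes fall in bin 0.
vertical_edge = [[0, 0, 1, 1] for _ in range(4)]
descriptor = orientation_histogram(vertical_edge)
```

In a recogniser along these lines, descriptors like this one would be computed per cell, concatenated into one feature vector per word or character image, and used to train the classifier.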