Automatic labeling of handwritten mathematical symbols via expression matching
GbRPR'11 Proceedings of the 8th international conference on Graph-based representations in pattern recognition
CICM'12 Proceedings of the 11th international conference on Intelligent Computer Mathematics
A shape-based layout descriptor for classifying spatial relationships in handwritten math
Proceedings of the 2013 ACM symposium on Document engineering
Hi-index | 0.00 |
Although publicly available, ground-truthed corpora have proven useful for training, evaluating, and comparing recognition systems in many domains, the availability of such corpora for sketch recognizers, and math recognizers in particular, is currently quite poor. This paper presents a general approach to creating large, ground-truthed corpora for structured sketch domains such as mathematics. In the approach, random sketch templates are generated automatically using a grammar model of the sketch domain. These templates are transcribed manually, then automatically annotated with ground-truth. The annotation procedure uses the generated sketch templates to find a matching between transcribed and generated symbols. A large, ground-truthed corpus of handwritten mathematical expressions presented in the paper illustrates the utility of the approach.